AI Session 3 Lecture Note

The document outlines a hands-on practice session focused on financial market prediction using machine learning, specifically targeting the KOSPI index and incorporating sentiment analysis from Twitter data. It discusses the use of machine learning models like XGBoost, RNN, and LSTM for predicting stock prices, as well as the importance of preprocessing text data for sentiment analysis. The session aims to enhance stock price prediction by integrating sentiment scores into the predictive models.

Uploaded by

snowkimjeonil

Hands-on Practice on Financial AI Session

Session 3.
Financial Market Prediction Using Machine Learning

AI for Finance (IE471)

KAIST Financial Engineering Lab.

[3-1] Introduction
1 Binary Classification Problem in Financial Markets

Viewed in the simplest terms, each movement of a financial market index is either Up (≒ Bullish) or Down (≒ Bearish), so predicting the index can be framed as a binary classification problem.
2
[3-1] Idea
1 Predicting the Fluctuation of KOSPI of the Next Day Using Other Global Market Indices

Caution
▪ This row is the fluctuation of the next day (2016-01-06)
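The caution above is about label alignment: the features of one day are paired with the movement of the following day. A minimal sketch of that alignment in plain Python is given here. The dates, the index values, and the function name `next_day_labels` are made up for illustration and are not taken from the session's data or code.

```python
# Hypothetical (date, closing index) pairs; the numbers are made up for illustration.
closes = [
    ("2016-01-04", 100.0),
    ("2016-01-05", 101.5),
    ("2016-01-06", 101.0),
    ("2016-01-07", 102.2),
]

def next_day_labels(rows):
    """Label each day with the direction of the NEXT day's move (1 = Up, 0 = Down)."""
    labeled = []
    # Pair every day with the following day; the last day has no label and is dropped.
    for (day, close), (_, next_close) in zip(rows, rows[1:]):
        labeled.append((day, 1 if next_close > close else 0))
    return labeled

labels = next_day_labels(closes)
for day, label in labels:
    print(day, "Up" if label == 1 else "Down")
```

In this toy data the row dated 2016-01-05 carries the movement of 2016-01-06, which is the situation the caution describes.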

3
[3-1] Machine Learning Models
1 XGBoost

▪ Chen, T., & Guestrin, C. (2016, August). XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 785-794).

▪ In machine learning, boosting is an ensemble meta-algorithm used primarily to reduce bias, and also variance, in supervised learning. It refers to a family of machine learning algorithms that convert weak learners into strong ones.

▪ XGBoost is an algorithm that has recently been dominating applied machine learning and Kaggle competitions for structured or tabular data.

▪ XGBoost is an implementation of gradient boosted decision trees designed for speed and performance.
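To make "converting weak learners to strong ones" concrete, the following is a library-free sketch of gradient boosting with squared-error loss, where each weak learner is a one-split regression tree (a stump). It is not XGBoost itself, which adds regularization, second-order gradient information, and many systems-level optimizations. The toy data, the learning rate, and all function names are assumptions for illustration.

```python
def fit_stump(xs, residuals):
    """Fit a one-split regression tree (a weak learner) to the residuals."""
    best = None
    for threshold in sorted(set(xs)):
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        if not left or not right:
            continue  # a split must leave points on both sides
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        error = (sum((r - left_mean) ** 2 for r in left)
                 + sum((r - right_mean) ** 2 for r in right))
        if best is None or error < best[0]:
            best = (error, threshold, left_mean, right_mean)
    return best[1:]  # (threshold, left_mean, right_mean)

def stump_predict(stump, x):
    threshold, left_mean, right_mean = stump
    return left_mean if x <= threshold else right_mean

def boost(xs, ys, n_rounds, learning_rate=0.5):
    """Return (base, learning_rate, stumps) fitted by boosting on squared error."""
    base = sum(ys) / len(ys)
    preds = [base] * len(xs)
    stumps = []
    for _ in range(n_rounds):
        # For squared error, the negative gradient is simply the residual.
        residuals = [y - p for y, p in zip(ys, preds)]
        stump = fit_stump(xs, residuals)
        stumps.append(stump)
        preds = [p + learning_rate * stump_predict(stump, x)
                 for p, x in zip(preds, xs)]
    return base, learning_rate, stumps

def predict(model, x):
    base, learning_rate, stumps = model
    return base + learning_rate * sum(stump_predict(s, x) for s in stumps)

def mse(model, xs, ys):
    return sum((predict(model, x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

# Toy target with a "bump" that a single split cannot represent.
xs = [1, 2, 3, 4, 5, 6]
ys = [0, 0, 1, 1, 0, 0]
one_stump = boost(xs, ys, n_rounds=1)
many_stumps = boost(xs, ys, n_rounds=50)
print(mse(one_stump, xs, ys), mse(many_stumps, xs, ys))
```

A single stump can only cut the feature range once, so it cannot fit the bump in the middle. Adding stumps that each fit the remaining residual lowers the training error round by round.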

4
[3-1] Machine Learning Models
1 XGBoost

Source: StatQuest with Josh Starmer. XGBoost Part 2 (of 4): Classification (Retrieved: 2022.03.10.)
▪ https://www.youtube.com/watch?v=8b1JEDvenQU
5
[3-1] Code Description
1 Preparation

▪ For data preprocessing and visualization

▪ For constructing a machine learning model

▪ Setting random seeds so that the scores are reproducible

9
[3-1] Code Description
1 Preprocessing Data

10
[3-1] Code Description
1 Training a Classification Model and Evaluating Its Performance

▪ Training data

▪ Saving classification results

▪ Evaluating performance and illustrating the confusion matrix
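For the Up / Down task, the confusion matrix simply counts how predicted and actual directions line up. A minimal sketch in plain Python follows. The label lists are made up, and the function names are illustrative rather than taken from the session's script.

```python
def confusion_matrix(y_true, y_pred):
    """Count outcomes for a binary task where 1 = Up and 0 = Down."""
    counts = {"TP": 0, "FP": 0, "FN": 0, "TN": 0}
    for truth, pred in zip(y_true, y_pred):
        if truth == 1 and pred == 1:
            counts["TP"] += 1  # predicted Up, was Up
        elif truth == 0 and pred == 1:
            counts["FP"] += 1  # predicted Up, was Down
        elif truth == 1 and pred == 0:
            counts["FN"] += 1  # predicted Down, was Up
        else:
            counts["TN"] += 1  # predicted Down, was Down
    return counts

def accuracy(counts):
    total = sum(counts.values())
    return (counts["TP"] + counts["TN"]) / total

# Hypothetical actual and predicted directions for eight trading days.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
counts = confusion_matrix(y_true, y_pred)
print(counts, accuracy(counts))
```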

11
[3-2] Introduction
1 Sentiment Analysis?

Sentiment analysis is a natural language processing technique used to determine whether data is positive, negative, or neutral.

12
[3-2] Idea
1 Sentiment Analysis: Example

Ex. NAVER Sentiment Movie Corpus:

fin1234
- This movie is very funny! I recommend this!

eng5678
- The dubbed voice is so annoying.

By conducting sentiment analysis, we can classify text data into three emotions: positive, negative, and neutral.

13
[3-2] Idea
2 Sentiment Analysis: Idea

Question
Can we incorporate the positive / negative view of a particular stock into the prediction?

Goal of this session

We will add the positive / negative sentiment score to the stock price prediction model of the previous session.

14
[3-2] Machine Learning Models
1 RNN (Recurrent Neural Network)

▪ A recurrent neural network (RNN) is a class of artificial neural networks where connections between nodes form a directed graph along a temporal sequence.
▪ It is known to be well suited to data that arrives sequentially, such as time-series data.
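The recurrence can be sketched with a scalar hidden state: the same weights are applied at every time step, and each state depends on the current input and the previous state. This is a toy with hand-picked weights, not a trained model, and the names `rnn_forward`, `w_x`, `w_h`, and `b` are illustrative.

```python
import math

def rnn_forward(xs, w_x, w_h, b, h0=0.0):
    """Unroll a scalar RNN: h_t = tanh(w_x * x_t + w_h * h_{t-1} + b)."""
    h = h0
    states = []
    for x in xs:
        # The same weights are reused at every step of the sequence.
        h = math.tanh(w_x * x + w_h * h + b)
        states.append(h)
    return states

# A single non-zero input at the first step, followed by zeros.
states = rnn_forward([1.0, 0.0, 0.0], w_x=0.5, w_h=0.8, b=0.0)
print(states)
```

The later states stay non-zero even though the later inputs are zero, which shows how the hidden state carries information forward, and they shrink at every step, which hints at why long-range information fades in a vanilla RNN.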

An unrolled recurrent neural network

Source: Stanford University. CS231n: Convolutional Neural Networks for Visual Recognition. (Retrieved: 2022.03.10.)
▪ http://cs231n.stanford.edu/
15
[3-2] Machine Learning Models
2 LSTM (Long Short-Term Memory)

▪ Because of the vanishing gradient problem, vanilla RNN models are not used very often.

▪ Long Short-Term Memory (LSTM) (Hochreiter and Schmidhuber, 1997) is an improved RNN model that compensates for the vanishing gradient problem.

▪ LSTM models are very powerful in sequence prediction problems because they are able to store past information. This is important in our case because the previous price of a stock is crucial in predicting its future price.
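How an LSTM "stores past information" can be illustrated with one scalar cell: the forget gate decides how much of the old cell state to keep, and the input gate decides how much new information to write. The sketch below uses the standard LSTM gate equations with hand-picked weights. The parameter values and the name `lstm_step` are assumptions for illustration, not the session's PyTorch model.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def lstm_step(x, h_prev, c_prev, params):
    """One step of a scalar LSTM cell; params maps gate name -> (w_x, w_h, b)."""
    def pre_activation(name):
        w_x, w_h, b = params[name]
        return w_x * x + w_h * h_prev + b

    f = sigmoid(pre_activation("forget"))        # how much old memory to keep
    i = sigmoid(pre_activation("input"))         # how much new content to write
    o = sigmoid(pre_activation("output"))        # how much memory to expose
    g = math.tanh(pre_activation("candidate"))   # proposed new content
    c = f * c_prev + i * g                       # additive cell-state update
    h = o * math.tanh(c)
    return h, c

# Hand-picked gates: forget gate almost fully open, input gate almost closed.
params = {
    "forget": (0.0, 0.0, 10.0),
    "input": (0.0, 0.0, -10.0),
    "output": (0.0, 0.0, 0.0),
    "candidate": (1.0, 0.0, 0.0),
}
h, c = 0.0, 0.7
for x in [1.0] * 50:
    h, c = lstm_step(x, h, c, params)
print(c)
```

With these gate settings the cell state stays close to its initial value over 50 steps. In a trained LSTM the gates are learned, so the network can choose when to keep and when to overwrite its memory.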

16
[3-2] Machine Learning Models
2 LSTM (Long Short-Term Memory)

[Figure: diagrams of an RNN and an LSTM]

17
[3-2] Machine Learning Models
3 Package for Machine Learning: PyTorch

PyTorch is an open-source machine learning library based on the Torch library, used for applications such as computer vision and natural language processing, primarily developed by Meta's AI Research lab (FAIR).

Advantages

▪ It is easy to install.

▪ It consists of intuitive and concise code that is easy to understand and debug.

▪ It is highly compatible with Python libraries (NumPy, SciPy, Cython, and so on).

18
[3-2] Prior Research
1 Sentiment Analysis for Predicting Stock Prices: Mittal, A., & Goel, A. (2012).

What they did…

▪ They used Twitter data to predict public mood, and then used the predicted mood together with the previous days' DJIA (Dow Jones Industrial Average) values to predict stock market movements.
▪ They obtained 75.56% accuracy on the Twitter feeds and DJIA values from the period June 2009 to December 2009.

19
[3-2] Prior Research
2 Sentiment Analysis for Predicting Stock Prices: Nguyen, T. H., Shirai, K., & Velcin, J. (2015)

What they did…

▪ This paper evaluates the effectiveness of sentiment analysis in the stock prediction task through a large-scale experiment.
▪ Their method achieved 9.83% better accuracy than the historical price method, and 3.03% better than the human sentiment method.

20
[3-2] Data
1 Data

We will use…

▪ Tesla (TSLA) stock price data from 2nd January 2020 to 31st December 2020.

▪ 200 tweets per day including ‘TSLA (ticker symbol of Tesla)’, and ‘Tesla’ keywords from Twitter.

21
[3-2] Code Description
1 Original Stock Prediction Model (Python File: 3-2-1)

▪ For data preprocessing and visualization

▪ For constructing a machine learning model

22
[3-2] Code Description
3 Loading Dataset

▪ The loaded dataset has six columns.
▪ We will use five features: high price, low price, open price, close price, and volume.

23
[3-2] Code Description
4 Scaling and Converting Data

▪ We collected Tesla’s stock price data in 2020.

▪ We split these data into the training set and test set. We set the data from January 2020 to September 2020 as training data and the data from October 2020 to December 2020 as test data.
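A chronological split and min-max scaling can be sketched as follows. The prices are made up, and fitting the scaling range on the training period only is a design choice assumed here to avoid look-ahead. The session's script may scale differently. All function names are illustrative.

```python
def chronological_split(rows, test_start):
    """Split (ISO date, value) rows by date without shuffling."""
    train = [row for row in rows if row[0] < test_start]
    test = [row for row in rows if row[0] >= test_start]
    return train, test

def fit_min_max(values):
    """Return the (min, max) range that defines the scaling."""
    return min(values), max(values)

def scale(values, lo, hi):
    return [(v - lo) / (hi - lo) for v in values]

def unscale(scaled, lo, hi):
    """Reverse transformation: map scaled values back to the original units."""
    return [s * (hi - lo) + lo for s in scaled]

# Hypothetical closing prices; the numbers are made up for illustration.
rows = [
    ("2020-01-02", 80.0),
    ("2020-05-15", 160.0),
    ("2020-09-30", 400.0),
    ("2020-10-01", 450.0),
    ("2020-12-31", 700.0),
]
train, test = chronological_split(rows, test_start="2020-10-01")
lo, hi = fit_min_max([value for _, value in train])
train_scaled = scale([value for _, value in train], lo, hi)
test_scaled = scale([value for _, value in test], lo, hi)
print(train_scaled, test_scaled)
print(unscale(test_scaled, lo, hi))
```

Because the range comes from the training period, scaled test values can fall outside [0, 1] when prices move beyond the training range, as they do in this toy data.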

24
[3-2] Code Description
5 Constructing LSTM Model

Reference: https://9bow.github.io/PyTorch-tutorials-kr-0.3.1/beginner/blitz/autograd_tutorial.html

25
[3-2] Stock Price Prediction Using LSTM
6 Setting Hyperparameters and Training Data

Input_size (Input Size)

▪ 5 features (high price, low price, open price, close price, volume)
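With five features per day, each training sample for a sequence model is a window of consecutive days, so one sample has shape (sequence length, 5). A minimal sketch of building such windows follows. The sequence length of 3, the placeholder data, and the function name `make_windows` are assumptions for illustration, not the session's actual hyperparameters.

```python
def make_windows(features, targets, seq_len):
    """Turn daily rows into (window of seq_len days, next day's target) pairs."""
    xs, ys = [], []
    for start in range(len(features) - seq_len):
        xs.append(features[start:start + seq_len])   # seq_len rows of features
        ys.append(targets[start + seq_len])          # target on the following day
    return xs, ys

# Ten hypothetical days, each with 5 placeholder features
# (high, low, open, close, volume) and one target value.
features = [[float(day)] * 5 for day in range(10)]
targets = [float(day) for day in range(10)]

xs, ys = make_windows(features, targets, seq_len=3)
print(len(xs), len(xs[0]), len(xs[0][0]))  # samples, sequence length, input size
```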

26
[3-2] Code Description
7 Test

Reverse Transformation

27
[3-2] Code Description
1 NLTK (Python File: 3-2-2)

▪ NLTK (Natural Language Toolkit) is a platform for building Python programs that work with human language data.

▪ It provides easy-to-use interfaces to over 50 corpora and lexical resources such as WordNet, along with a suite of text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning, and wrappers for industrial-strength NLP libraries.

28
[3-2] Code Description
1 Cleaning the Raw Tweets (Python File: 3-2-2)

▪ For preprocessing text data

▪ For visualizing text data

29
[3-2] Code Description
1 Cleaning the Raw Tweets (Python File: 3-2-2)

Remove Special Characters

▪ Delete special characters such as #, !, ., and so on.

Lowercase all tweets

▪ For detecting stopwords
▪ In computing, stopwords are words that are filtered out before or after the processing of natural language data (texts).
▪ For example, words such as I, my, me, and over, as well as postpositions and suffixes, appear often in sentences but rarely contribute to actual semantic analysis.

Tokenize tweets
▪ Tokenization is the splitting of a string into several pieces (tokens).
▪ Ex. "Hello, World." → 'Hello', ',', 'World', '.'
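The cleaning steps above can be sketched with the standard library alone. The session's file uses NLTK, which is not reproduced here. The stopword set below is a tiny illustrative sample, the example tweet is made up, and splitting on whitespace is a simplification that works only because punctuation has already been removed.

```python
import re

# Tiny illustrative stopword set; a real list is much longer.
STOPWORDS = {"i", "my", "me", "over", "the", "is", "a", "this", "so"}

def clean_tweet(text):
    """Remove special characters, lowercase, tokenize, and drop stopwords."""
    text = re.sub(r"[^A-Za-z0-9\s]", " ", text)  # replace special characters with spaces
    text = text.lower()                          # lowercase so stopwords match
    tokens = text.split()                        # whitespace tokenization
    return [token for token in tokens if token not in STOPWORDS]

tokens = clean_tweet("I think #TSLA is over-hyped!!! My bad.")
print(tokens)
```

Lowercasing before the stopword check matters: without it, "I" and "My" would not match the lowercase entries in the stopword set.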

30
[3-2] Code Description
2 Lemmatizing Tweets (Python File: 3-2-2)

Lemmatize tweets
▪ Lemmatization in linguistics is the process of grouping together the inflected forms of a word so they can be analyzed as a single item, identified by the word's lemma, or dictionary form.
▪ Ex. watched → watch

31
[3-2] Code Description
3 Frequency Analysis (Python File: 3-2-2)
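Frequency analysis counts how often each token appears across the cleaned tweets. A minimal sketch with `collections.Counter` follows. The tokenized tweets are made up, and the function name `top_words` is illustrative.

```python
from collections import Counter

def top_words(tokenized_tweets, k):
    """Return the k most frequent tokens across all tweets with their counts."""
    counts = Counter(token for tweet in tokenized_tweets for token in tweet)
    return counts.most_common(k)

# Hypothetical tweets that have already been cleaned and tokenized.
tokenized_tweets = [
    ["tesla", "stock", "up"],
    ["tesla", "delivery", "record"],
    ["stock", "tesla"],
]
most_frequent = top_words(tokenized_tweets, k=2)
print(most_frequent)
```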

32
[3-2] Code Description
1 TextBlob (Python File: 3-2-3)

▪ TextBlob is based on NLTK and has many features to facilitate text processing.
▪ TextBlob’s output for a polarity task is a float within the range [-1.0, 1.0], where -1.0 is a negative polarity and 1.0 is positive. The score can also be equal to 0, which stands for a neutral evaluation of a statement that does not contain any words from the training set.

33
[3-2] Code Description
2 Calculate Sentiment Score with TextBlob (Python File: 3-2-3)

Dictionary-based Scores

Ex.
▪ Recommend = Positive Word (+1)
▪ Disappointed = Negative Word (-1)
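The dictionary-based idea can be sketched as a lookup: each known word carries a score, and the score of a text is the average over the words that were found. The entries for "recommend" and "disappointed" follow the example above, while "funny" and "annoying" are hypothetical additions. TextBlob's own lexicon and weighting are more elaborate and are not reproduced here.

```python
# Toy lexicon; only "recommend" and "disappointed" come from the slide example.
LEXICON = {
    "recommend": 1.0,
    "funny": 1.0,
    "disappointed": -1.0,
    "annoying": -1.0,
}

def dictionary_score(tokens):
    """Average the scores of known words; return 0.0 (neutral) if none are known."""
    hits = [LEXICON[token] for token in tokens if token in LEXICON]
    if not hits:
        return 0.0
    return sum(hits) / len(hits)

positive = dictionary_score(["this", "movie", "is", "very", "funny", "i", "recommend", "this"])
negative = dictionary_score(["the", "dubbed", "voice", "is", "so", "annoying"])
neutral = dictionary_score(["tesla", "opened", "today"])
print(positive, negative, neutral)
```

Averaging over matched words keeps the result in [-1.0, 1.0], and a text with no known words scores exactly 0, which is the neutral case discussed for TextBlob.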

34
[3-2] Code Description
2 Calculate Sentiment Score with TextBlob (Python File: 3-2-3)

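Average Sentiment Score Values

$$\text{Average sentiment score of a particular day} = \frac{1}{n}\sum_{i=1}^{n} \left(\text{sentiment score of tweet } \#i \text{ on that day}\right)$$

where $n$ is the number of tweets on that day.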

35
[3-2] Code Description
2 Calculate Sentiment Score with TextBlob (Python File: 3-2-3)

Remove Neutral Tweets

▪ Neutral tweets may cause the average daily sentiment score to be underestimated, because they pull the average toward 0.
▪ Sentiment scores that include too many neutral tweets may not help improve the stock price prediction results.
▪ Therefore, in this session we delete neutral tweets, that is, tweets whose sentiment scores are exactly 0.
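The effect of dropping neutral tweets on the daily average can be sketched in plain Python. The dates and scores are made up, and the function name `daily_average` is illustrative rather than taken from the session's script.

```python
from collections import defaultdict

def daily_average(scored_tweets, drop_neutral=True):
    """Average (date, score) pairs per day, optionally skipping scores of exactly 0."""
    by_day = defaultdict(list)
    for day, score in scored_tweets:
        if drop_neutral and score == 0:
            continue  # neutral tweet: excluded from the average
        by_day[day].append(score)
    return {day: sum(scores) / len(scores) for day, scores in by_day.items()}

# Hypothetical tweet scores for two days.
scored_tweets = [
    ("2020-01-02", 0.5),
    ("2020-01-02", 0.0),
    ("2020-01-02", 0.0),
    ("2020-01-02", -0.1),
    ("2020-01-03", 0.0),
    ("2020-01-03", 0.8),
]
with_neutral = daily_average(scored_tweets, drop_neutral=False)
without_neutral = daily_average(scored_tweets, drop_neutral=True)
print(with_neutral)
print(without_neutral)
```

One consequence of this sketch is that a day whose tweets are all neutral disappears from the result, so a later merge with price data needs a rule for such days.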

36
[3-2] Code Description
2 Calculate Sentiment Score with TextBlob (Python File: 3-2-3)

[Figure panels: Average Sentiment Scores before Removing Neutral Tweets; Average Sentiment Scores after Removing Neutral Tweets]

▪ We will use the average sentiment scores after removing neutral tweets to predict Tesla’s stock prices.

37
[3-2] Code Description
1 Difference from the Original Stock Price Prediction Model (Python File: 3-2-4)

38
[3-2] Code Description
1 Difference from the Original Stock Price Prediction Model (Python File: 3-2-4)

Changed dataset shape

Changed torch tensor size

39
[3-2] Code Description
1 Difference from the Original Stock Price Prediction Model (Python File: 3-2-4)

Changed input size
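The change in input size comes from appending the daily sentiment score to the five price features, giving six features per day. A minimal sketch follows. The data are made up, and using 0.0 for days without a sentiment score is an assumption made here, not something stated in the session.

```python
def add_sentiment_feature(price_rows, daily_sentiment, default=0.0):
    """Append each day's sentiment score to its [high, low, open, close, volume] row."""
    return [
        (day, features + [daily_sentiment.get(day, default)])
        for day, features in price_rows
    ]

# Hypothetical price rows and daily average sentiment scores.
price_rows = [
    ("2020-01-02", [86.1, 84.3, 84.9, 86.0, 47660.0]),
    ("2020-01-03", [90.8, 87.4, 88.1, 88.6, 88892.0]),
]
daily_sentiment = {"2020-01-02": 0.2}

combined = add_sentiment_feature(price_rows, daily_sentiment)
input_size = len(combined[0][1])
print(input_size)
```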

40
[3-2] Code Description
2 Comparison of two results

[Figure panels: Without sentiment analysis results; With sentiment analysis results]

▪ Using the sentiment analysis results may improve the prediction results.

▪ However, adding features can also lead to worse results than the original model, for example because the added sentiment index is not informative for training or because of overfitting.

41
[3-2] Results & Conclusion
2 Conclusion

▪ The recurrent neural network (RNN) model can be used as a time-series forecasting model.

▪ LSTM (Long Short-Term Memory) models improve on the RNN by mitigating the vanishing gradient problem.

▪ There can be various features to consider when we predict stock prices.

▪ Sentiment analysis is a natural language processing technique used to determine whether data is positive, negative, or neutral (narrow definition).

▪ Preprocessing the collected raw text data, such as tweets, makes sentiment analysis more efficient.

▪ We can conduct sentiment analysis in English using NLTK and TextBlob Python libraries.

42
Reference
▪ Chen, T., & Guestrin, C. (2016, August). XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 785-794).
▪ Kim, Y. B., Lee, S. H., Kang, S. J., Choi, M. J., Lee, J., & Kim, C. H. (2015). Virtual world currency value fluctuation prediction system based on user sentiment analysis. PloS one, 10(8), e0132944.
▪ Kuzminykh, N. (2020). Sentiment Analysis in Python With TextBlob. https://stackabuse.com/sentiment-analysis-in-python-with-textblob/
▪ Lee, K. (2017). Understanding RNN and LSTM. https://ratsgo.github.io/natural%20language%20processing/2017/03/09/rnnlstm/.
▪ Li, F., Krishna, R., and Xu, D. (2021). CS231n: Convolutional Neural Networks for Visual Recognition Stanford - Spring 2021 http://cs231n.stanford.edu/
▪ Mittal, A., & Goel, A. (2012). Stock prediction using Twitter sentiment analysis. Stanford University, CS229 (2011, http://cs229.stanford.edu/proj2011/GoelMittal-StockMarketPredictionUsingTwitterSentimentAnalysis.pdf), 15.
▪ Natural Language Toolkit (https://www.nltk.org/) (Retrieved on: 2022.03.10.)
▪ Nguyen, T. H., Shirai, K., & Velcin, J. (2015). Sentiment analysis on social media for stock movement prediction. Expert Systems with Applications, 42(24), 9603-9611.
▪ Olah, C. (2015). Understanding LSTM networks.
▪ Park, L. (2015). NAVER Sentiment Movie Corpus. https://github.com/e9t/nsmc.
▪ Singh, G. (2019). Updated Text Preprocessing techniques for Sentiment Analysis. https://towardsdatascience.com/updated-text-preprocessing-techniques-for-sentiment-analysis-549af7fe412a
▪ TextBlob (https://textblob.readthedocs.io/en/dev/#) (Retrieved on: 2022.03.10.)

43

Questions & Answers

▪ E-mail: [email protected]
