STOCK MARKET PRICE PREDICTION
CS725: Foundations of Machine Learning
SUBMITTED BY:
ANAND NAMDEV (163050068)
PARTH LATHIYA (163050095)
AWISHA MAKWANA (163050079)
1) Introduction :
Time series prediction is one of the broad topics in the field of machine learning; the values in a time series are indexed only by time. Stock market prediction is regarded as a challenging financial time series prediction task, since the stock market is highly nonlinear and nonparametric.
In addition, the stock market is affected by many macroeconomic factors such as political events, general economic conditions, investors' expectations, movements of other stock markets and the psychology of investors.
A variety of machine learning and statistical techniques have been studied, out of which three different models have been implemented to predict stock market direction: Artificial Neural Networks (ANN), Support Vector Classification (SVC) and the Auto Regressive Integrated Moving Average (ARIMA) model. A variant of the ARIMA model, ARMA, which differs from ARIMA only slightly, has also been implemented. ARIMA and ARMA are statistical approaches, while ANN and SVC are machine learning based methods. For each model, the parameter values were varied and performance measured in order to tune the hyper-parameters.
2) Objective :
The main objective of the project is to predict the direction of the stock market. The direction is defined to be upwards if the closing value of the index increases from the previous day, and downwards if the closing value of the current day is smaller than that of the previous day.
Various machine learning and statistical methods have been studied, and ANN, SVC and ARIMA models have been implemented to achieve this task.
3) Motivation :
The main motivation for this project, i.e. stock prediction, is the association of this problem with one of the most challenging domains, time-series analysis. In a time-series domain there are no explicit features associated with the output values (here the closing value). All values either vary with respect to time or depend on factors that cannot be converted into a continuous vector space. For example, for stock prediction one cannot directly measure the factors behind the ups and downs of the closing price, such as political upheaval or investors' interest in buying the stock, but one can examine these patterns over a period of time and predict whether a stock's value will increase or decrease.
4) Approaches :
The stock prediction task is approached here with two families of methods.
a) Statistical methods
b) Machine learning methods
a) Statistical Methods :
These are mathematical methods that describe the pattern in a time series with the help of mathematical functions such as the moving average and exponential moving average. One of the most popular, ARIMA, and its variant ARMA (ARIMA without differencing) have been implemented.
b) Machine learning method :
These are more sophisticated, supervised machine learning methods. Two of the most popular have been implemented:
a) Artificial Neural Networks
b) Support Vector Classification
5) ARIMA Model :
ARIMA stands for Autoregressive Integrated Moving Average. It is a regression-based model in which the closing values of the stock price are fitted by a linear combination of previous values and previous errors.
In ARIMA, p defines how many previous days' closing values are used and q defines how many previous days' error values are used; i is the number of times the series is differenced.
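In standard notation (the differencing order is usually written d; this report calls it i), the model fitted to the i-times-differenced series y'_t is

    y'_t = c + phi_1 y'_{t-1} + ... + phi_p y'_{t-p} + theta_1 e_{t-1} + ... + theta_q e_{t-q} + e_t,

where the phi_j are autoregressive coefficients on the previous p values, the theta_k are moving-average coefficients on the previous q error terms, and e_t is white noise.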
5.1) Data Description:
For the prediction of the stock price, we have used the Nikkei 225, a stock market index of Japan. The dataset consists of about 8000 rows of closing values, in the range of roughly 9,000 to 17,000, varying with respect to time.
The figure below shows the original data.
Figure 1: Original Data
The graph above shows that the series behaves very erratically, as stock prices typically do: there is no dependency on any feature except time.
In order to model such a time series, we first have to convert the data into a stationary series. The series above is highly non-stationary, since its mean and variance are far from constant over time. For the transformed series considered here, the mean and variance should stay as close to zero as possible.
The original data and several transformed versions of it are introduced below to study the effect of data stationarity on accuracy:
a) Original data
b) Log data
c) Moving average log difference
d) Exponential average log difference
e) Decomposed data
To convert the above time series into a stationary one, various transformations have been applied to the original data: taking the logarithm of the data, subtracting its moving average, subtracting its exponential average, and decomposing the series. For each resulting series the Dickey-Fuller test has been applied, which returns a p-value; the p-value should be as close to zero as possible for a stationary series.
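As a minimal sketch of how this test can be run, assuming the closing values live in a CSV file with a "Close" column (the file and column names here are hypothetical), statsmodels' adfuller can be applied to, for example, the moving average log difference:

    import numpy as np
    import pandas as pd
    from statsmodels.tsa.stattools import adfuller

    # Hypothetical input: daily Nikkei-225 closing values indexed by date.
    close = pd.read_csv("nikkei225.csv", index_col=0, parse_dates=True)["Close"]

    # Log-transform, then subtract a 10-day moving average (one of the
    # transformations listed above) to push the series towards stationarity.
    log_close = np.log(close)
    ma_diff = (log_close - log_close.rolling(window=10).mean()).dropna()

    # Augmented Dickey-Fuller test: the second return value is the p-value,
    # which should be close to zero for a stationary series.
    stat, pvalue = adfuller(ma_diff)[:2]
    print(f"ADF statistic = {stat:.3f}, p-value = {pvalue:.4f}")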
The table below shows the mean and variance for each data setting. It is clearly visible that the decomposed data has the mean and variance closest to zero, implying that this setting is the most stationary. On the other hand, the original data and the log data have very high mean and variance, which is a property of non-stationarity.
Table 1: Mean and Variance for each Dataset
The following figures show plots of the transformed series: the log data, the moving average data, the exponential data and the decomposed data.
The first is the moving average data. This series is more centered around zero and has fairly low mean and variance.
Figure 2: Moving Average Dataset
Below is the decomposed data.
Figure 3: Decomposed Dataset
5.2) Training of ARIMA :
For the training of the ARIMA model, various p, i and q values were tried by running the experiments extensively. For each combination of p, i and q, the AIC and BIC information criteria are calculated, and the combination for which these criteria are minimal is selected as the final p, i and q. The same procedure is followed for ARMA, except that the value of i is kept constant at 0.
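A minimal sketch of this search, reusing the ma_diff series from the earlier sketch (the grid ranges are illustrative, not necessarily the ones used in the experiments):

    from statsmodels.tsa.arima.model import ARIMA

    # Try every (p, i, q) on a small grid and keep the order with lowest AIC;
    # for the ARMA variant, i is simply fixed to 0.
    best_aic, best_order = float("inf"), None
    for p in range(4):
        for i in range(3):
            for q in range(4):
                try:
                    fit = ARIMA(ma_diff, order=(p, i, q)).fit()
                except Exception:
                    continue  # some orders fail to converge; skip them
                if fit.aic < best_aic:
                    best_aic, best_order = fit.aic, (p, i, q)
    print("best (p, i, q):", best_order, "with AIC:", best_aic)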
The figure below shows the prediction on the training data; the training data used here is the original data. The red curve is the original data and the blue curve is the predicted data.
Figure 4 : Training Accuracy for original data
Below is the prediction on the training data based on the model fitted to the decomposed data.
Figure 5 : Training Accuracy for decomposed data
5.3) Test Prediction :
For the test prediction, the next 10 values of the stock are predicted.
In the figure below, the red curve is the predicted value of the stock for the next 10 days according to the original data model. The plot also contains upper and lower bounds, which correspond to the 80% and 95% confidence intervals.
Figure 6 : 10-Days prediction according to Original Data
This is according to the moving average data.
Figure 7 : 10-Days prediction according to Moving Average Data
This is according to the final decomposed data.
Figure 8: 10-Days prediction according to final decomposed data
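As a sketch of how such 10-day forecasts with 80% and 95% bounds can be produced, continuing with the names from the earlier sketches:

    # Refit the selected order and forecast 10 steps ahead.
    fit = ARIMA(ma_diff, order=best_order).fit()
    forecast = fit.get_forecast(steps=10)
    print(forecast.predicted_mean)        # point forecasts for the next 10 days
    print(forecast.conf_int(alpha=0.20))  # 80% confidence bounds
    print(forecast.conf_int(alpha=0.05))  # 95% confidence bounds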
While predicting for each type of data model, an error measure called MASE (Mean Absolute Scaled Error) is calculated. MASE is an appropriate error measure when different data series are scaled differently, since it rescales the errors to a common scale.
Once the error value for each type of data model is calculated, the same procedure is carried out for the ARMA model, where i = 0. As can be observed, for both ARIMA and ARMA the error is lowest for the decomposed data and highest for the original data. This supports the claim that a model fitted to non-stationary data cannot achieve very good error.
Table 2 : Error Comparison on ARIMA & ARMA
The graph below plots the error against the type of data used, for both ARIMA and ARMA.
Figure 9 : Graph Error vs DataType (for ARIMA & ARMA)
6) ANN learning model :
ANNs have demonstrated their capability in financial modeling and prediction. In this project, a two-layered feedforward ANN model has been structured to predict stock price index movement. This ANN model consists of an input layer, two hidden layers and an output layer, each fully connected to the next. In our model there are 10 inputs, each of which is calculated from the attribute values in the dataset using the 10 functions specified in Table 3. These 10 inputs are the 10 neurons of the input layer. The output layer has 2 neurons for the two class outputs, the two patterns (0 or 1) of stock price direction. The architecture of the two-layered feedforward ANN is illustrated in Fig 10.
The number of neurons in the hidden layer has been determined empirically. In an ANN model
the neurons of a layer are linked to the neurons of the neighboring layers with connectivity
coefficients (weights). These weights are updated to classify the given input patterns correctly
for a given set of input-output pairs using a learning procedure. Initially the weights are assigned random values. The back-propagation learning algorithm is used to train the two-layered feedforward ANN structure in this project.
To evaluate the performance of the ANN model, absolute error is used. Gradient descent is used as the weight update algorithm to minimize this error. A sigmoid function is selected as the activation function on the hidden layers, while a softmax function is used on the output layer. The outputs of the model therefore vary between 0 and 1 and can be read as the probabilities of the input belonging to each class (decreasing or increasing direction).
Figure 10: ANN Model Architecture
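As a minimal sketch of such a network (the model in this project may be implemented differently; scikit-learn's MLPClassifier is used here purely as an illustration), assuming hypothetical arrays X_train/y_train and X_test/y_test holding the 10 indicator values and the 0/1 direction labels:

    from sklearn.neural_network import MLPClassifier

    # Two hidden layers with sigmoid (logistic) activation, trained by
    # gradient descent; the layer size, learning rate and number of epochs
    # are the hyper-parameters tuned in Table 4.
    model = MLPClassifier(hidden_layer_sizes=(20, 20),
                          activation="logistic",
                          solver="sgd",
                          learning_rate_init=0.1,
                          max_iter=100)
    model.fit(X_train, y_train)
    print("class probabilities:", model.predict_proba(X_test[:1]))
    print("test accuracy:", model.score(X_test, y_test))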
Data Preprocessing:
The data used has 4 attributes, which have been processed using the 10 formulations below:
1. Simple 10-day moving average: (C_t + C_{t-1} + ... + C_{t-9}) / 10
2. Weighted 10-day moving average: (n C_t + (n-1) C_{t-1} + ... + C_{t-9}) / (n + (n-1) + ... + 1), with n = 10
3. Momentum: C_t - C_{t-n}
4. Stochastic K%: ((C_t - LL_{t-n}) / (HH_{t-n} - LL_{t-n})) * 100
5. Stochastic D%: (sum_{i=0}^{n-1} K_{t-i}%) / n
6. RSI (Relative Strength Index): 100 - 100 / (1 + (sum_{i=0}^{n-1} Up_{t-i} / n) / (sum_{i=0}^{n-1} Dw_{t-i} / n))
7. MACD (moving average convergence divergence): MACD(n)_t = MACD(n)_{t-1} + (2 / (n + 1)) (DIFF_t - MACD(n)_{t-1})
8. Larry William's R%: ((H_n - C_t) / (H_n - L_n)) * 100
9. A/D (Accumulation/Distribution) Oscillator: (H_t - C_{t-1}) / (H_t - L_t)
10. CCI (Commodity Channel Index): (M_t - SM_t) / (0.015 D_t)
Table 3: Technical indicators and their formulas
C_t is the closing price, L_t the low price and H_t the high price at time t; DIFF_t = EMA(12)_t - EMA(26)_t, where EMA(k)_t = EMA(k)_{t-1} + a (C_t - EMA(k)_{t-1}) is the k-day exponential moving average with smoothing factor a = 2/(k+1); LL_{t-n} and HH_{t-n} denote the lowest low and highest high in the last n days, respectively; M_t = (H_t + L_t + C_t) / 3, SM_t is the simple moving average of M_t and D_t its mean deviation; Up_t is the upward price change and Dw_t the downward price change at time t.
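As a brief sketch, a few of these indicators can be computed directly with pandas, assuming a hypothetical DataFrame df with "High", "Low" and "Close" columns:

    import pandas as pd

    n = 10  # look-back window used by the indicators below

    # Momentum: C_t - C_{t-n}
    df["momentum"] = df["Close"] - df["Close"].shift(n)

    # Stochastic K%: (C_t - LL_{t-n}) / (HH_{t-n} - LL_{t-n}) * 100
    ll = df["Low"].rolling(n).min()   # lowest low over the window
    hh = df["High"].rolling(n).max()  # highest high over the window
    df["stoch_k"] = (df["Close"] - ll) / (hh - ll) * 100

    # Stochastic D%: n-day average of K%
    df["stoch_d"] = df["stoch_k"].rolling(n).mean()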
Hyperparameter Tuning:
The number of neurons (n) in the hidden layers, the value of the learning rate (lr) and the number of iterations (epochs) are ANN hyper-parameters that must be determined efficiently. Seven levels of n, nine levels of learning rate and ten levels of epochs were tested in the hyper-parameter tuning. The ANN parameters and their levels are summarized in Table 4.
Each parameter combination was applied to the training and holdout data sets, and the prediction accuracy of the models was evaluated using the absolute errors. The parameter combination that resulted in the best performance was selected as the best one for the corresponding model.
Parameter                              Levels
Number of neurons in hidden layer      10, 15, 20, ..., 40
Value of learning rate                 0.1, 0.2, ..., 0.9
Number of iterations (epochs)          10, 20, ..., 100
Table 4: ANN parameter levels tested in hyperparameter tuning
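A minimal sketch of this exhaustive search, reusing the illustrative MLPClassifier stand-in from above and hypothetical training/holdout splits:

    from itertools import product

    from sklearn.neural_network import MLPClassifier

    # Try every combination of the levels in Table 4 and keep the one with
    # the best holdout accuracy.
    best_params, best_acc = None, -1.0
    for n, lr, epochs in product(range(10, 45, 5),               # neurons
                                 [i / 10 for i in range(1, 10)],  # learning rate
                                 range(10, 110, 10)):             # epochs
        model = MLPClassifier(hidden_layer_sizes=(n, n), activation="logistic",
                              solver="sgd", learning_rate_init=lr, max_iter=epochs)
        model.fit(X_train, y_train)
        acc = model.score(X_holdout, y_holdout)
        if acc > best_acc:
            best_params, best_acc = (n, lr, epochs), acc
    print("best (n, lr, epochs):", best_params, "holdout accuracy:", best_acc)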
Some observations made during hyperparameter tuning:
Learning rate   Epochs   No. of neurons   Training accuracy   Testing accuracy
0.1             10       30               0.9113              0.8818
0.1             10       12               0.9134              0.8868
0.1             50       30               0.9141              0.8849
0.9             20       28               0.9146              0.8905
Table 5: Observed accuracies with different hyperparameter values
Figure 11 : Learning rate vs prediction performance
Advantages and Disadvantages:
Neural networks are advanced enough to detect complex relationships between inputs and outputs, which is an advantage of this model. Neural networks are not without their disadvantages, however: due to the complicated and advanced nature of the model, they are very difficult to design.
While the adaptability and sensitivity of a neural network are certainly advantages, they also come with problems. Given that a neural network will react to even the smallest change in the data, it can be very hard to model analytically. Running a neural network also requires a large amount of computing resources, making it expensive, and possibly impractical, for some companies and applications.
7) SVC
Support vector machines (SVM) are a family of algorithms that have been applied to classification, recognition, regression and time series problems. SVM emerged from research in statistical learning theory on how to control generalization and find an optimal trade-off between structural complexity and empirical risk.
An SVM classifies points by assigning them to one of two disjoint half-spaces, either in the pattern space or in a higher-dimensional feature space.
The main idea of the support vector machine is to construct a hyperplane as the decision surface such that the margin of separation between positive and negative examples is maximized.
For a training set of samples, with input vectors xi ∈ Rd and corresponding labels yi ∈ {+1,-1},
SVM learns how to classify objects into two classes.
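Formally, the soft-margin SVM solves the convex optimization problem

    minimize over w, b, xi:   (1/2) ||w||^2 + C sum_{i=1}^{m} xi_i
    subject to:               y_i (w . phi(x_i) + b) >= 1 - xi_i,   xi_i >= 0,

where phi is the feature map implicitly defined by the kernel, C is the regularization parameter trading margin width against training errors, and the xi_i are slack variables for points that violate the margin.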
The choice of kernel function is a critical decision for prediction efficiency. Both polynomial and radial basis functions were adopted in the experiments. Several levels of the degree of the polynomial function (d), the gamma constant of the radial basis function (γ) and the regularization parameter (C) were tested in the parameter setting experiments. The SVM parameters and their levels are summarized in the table below.
Parameter                       Levels (polynomial)      Levels (radial basis)
Gamma in kernel function (γ)    0, 0.1, 0.2, ..., 5.0    0, 0.1, 0.2, ..., 5.0
Regularization parameter (C)    1, 10, 100               1, 10, 100
Table 6: SVM parameter levels tested in parameter setting experiments
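As a minimal sketch of these experiments using scikit-learn's SVC, with abbreviated parameter grids and the hypothetical X_train/X_test splits from earlier:

    from sklearn.svm import SVC

    # RBF and degree-1 polynomial kernels over a few (gamma, C) levels
    # from Table 6 (sklearn requires gamma > 0, so 0 is skipped).
    for kernel in ("rbf", "poly"):
        for gamma in (0.1, 0.5, 2.5, 5.0):
            for C in (1, 10, 100):
                clf = SVC(kernel=kernel, degree=1, gamma=gamma, C=C)
                clf.fit(X_train, y_train)
                print(kernel, gamma, C, "test acc:", clf.score(X_test, y_test))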
No.  Kernel function   d   γ     C     Training   Testing   Average
1    RBF               -   2.5   100   0.9125     0.9038    0.9081
2    RBF               -   5.0   100   0.9125     0.9001    0.9063
3    RBF               -   3.1   100   0.9125     0.9041    0.9083
4    Linear            -   -     100   0.9036     0.8992    0.9014
5    Polynomial        1   3.5   100   0.9003     0.8982    0.8992
6    Polynomial        1   0.3   100   0.9033     0.9032    0.9032
7    Polynomial        1   0.5   100   0.9064     0.9002    0.9033
Table 7: Parameter combinations tested for the SVM model and their accuracies
The data sets were applied to the SVM models with the parameter combinations above, and the results are given in Table 7.
8) Advantages and disadvantages
Since the kernel implicitly contains a non-linear transformation, no assumptions about the functional form of the transformation that makes the data linearly separable are necessary. The transformation occurs implicitly, on a robust theoretical basis, and human expert judgment beforehand is not needed.
SVMs provide good out-of-sample generalization if the parameters C and γ (in the case of a Gaussian kernel) are chosen appropriately. This means that, by choosing an appropriate generalization grade, SVMs can be robust even when the training sample has some bias.
SVMs deliver a unique solution, since the optimality problem is convex. This is an advantage
compared to Neural Networks, which have multiple solutions associated with local minima and
for this reason may not be robust over different samples.
The disadvantage of SVMs is that the theory only really covers the determination of the parameters for a given setting of the regularization and kernel parameters and choice of kernel. In a way, the SVM moves the problem of over-fitting from optimizing the parameters to model selection. Unfortunately, kernel models can be quite sensitive to over-fitting the model selection criterion.
ARIMA combines auto-regression, which fits the current data point to a (usually linear) function of some prior data points, with moving averages, which sum several consecutive data points and use their mean to estimate the next value. Its advantage is that, with enough terms regressed and averaged, you can fit an approximation to almost any time series you like, to whatever precision you like.
The trouble, of course, is Slutzky's theorem: Slutzky showed that, using ARIMA-type computation, and perhaps adding a trend line or two, you can turn random noise into any time series you like. The point? It basically means that you may fit the data magnificently and still predict the future poorly.