
Siddharth Singireddy

Time Series

Homework 3

Question 1:

a) The series is [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]. At Lag-7 the value is 28, which is the sum of
the numbers from 1 to 7 [1+2+3+4+5+6+7 = 28]. At Lag-9 the value is 45, which is the sum of
the numbers from 1 to 9 [1+2+3+4+5+6+7+8+9 = 45]. At Lag-11 the value is 66, which is the
sum of the numbers from 1 to 11 [1+2+3+4+5+6+7+8+9+10+11 = 66].

b) The 8th order autocorrelation is the correlation between the 8th and 1st terms. In this
series, the 8th term is 8 and the 1st term is 1; their product is [8 × 1 = 8]. Therefore, the 8th
order autocorrelation is 8. The 10th order autocorrelation is the correlation between the 10th
and 3rd terms. In this series, the 10th term is 10 and the 3rd term is 3; their product is
[10 × 3 = 30]. Therefore, the 10th order autocorrelation is 30.

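These values can also be checked numerically against the standard sample autocorrelation. A
minimal sketch using statsmodels (note that a series of length 11 only supports lags up to 10,
so a lag-11 autocorrelation is undefined, there being no overlapping pairs):

import numpy as np
from statsmodels.tsa.stattools import acf

# The series from the question
x = np.array([1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11], dtype=float)

# Sample ACF for lags 0..10 (the maximum for a length-11 series)
r = acf(x, nlags=10)
for k in (7, 9):
    print(f"lag {k}: r = {r[k]:.3f}")  # lag 11 would be out of range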

Question 2:

a) The graph shows the residuals (errors) from a forecasting model. These residuals exhibit a
noticeable pattern, suggesting that they are not white noise. White noise residuals would
appear random without any discernible structure.

The histogram and Q-Q plot indicate that the residuals are not normally distributed. If they
were, the points on the Q-Q plot would align closely with the diagonal reference line; here
we see clear deviations from that line.
Given the non-random residuals and the departure from normality, the model may not be a
good fit for the data.
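A minimal sketch of these diagnostics, using hypothetical residuals standing in for the
model's actual errors:

import matplotlib.pyplot as plt
import numpy as np
from scipy import stats
from statsmodels.stats.diagnostic import acorr_ljungbox

# Hypothetical residuals standing in for the model's errors
rng = np.random.default_rng(0)
resid = rng.normal(size=100)

fig, axes = plt.subplots(1, 3, figsize=(12, 4))
axes[0].plot(resid)                       # residuals over time: look for structure
axes[0].set_title("Residuals")
axes[1].hist(resid, bins=20)              # histogram: compare to a bell shape
axes[1].set_title("Histogram")
stats.probplot(resid, dist="norm", plot=axes[2])  # Q-Q plot against the normal
plt.tight_layout()
plt.show()

# Ljung-Box test: small p-values indicate autocorrelated (non-white-noise) residuals
print(acorr_ljungbox(resid, lags=[10]))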

b) Normally distributed residuals are desirable because they indicate that the model’s errors
are random and unbiased. When residuals follow a normal distribution, the model has
captured the underlying patterns effectively.

Non-normal residuals can lead to biased forecasts. For instance, if the residuals are skewed,
the model may consistently overestimate or underestimate future values. Normality ensures
that the model’s assumptions align with reality, improving its reliability.

c) Small residuals suggest that the model fits the data well during training. However, this
doesn’t guarantee good forecasts.

Overfitting is a concern: a model can have small residuals by memorizing the training data
yet fail to generalize to new data, producing poor forecasts. While small residuals are
desirable, we must also consider the model’s ability to generalize beyond the training set.

d) Increasing model complexity isn’t always the solution. Adding complexity can lead to
overfitting, where the model fits noise rather than true patterns and performs well on
training data but poorly on unseen data. Instead of blindly increasing complexity, we should
focus on better model selection, feature engineering, and addressing biases in the data.

Question 3:

No, the residuals in the graphs do not appear to be uncorrelated and normally distributed.

 Residuals are considered uncorrelated if they show no pattern over time. In the graph,
the residuals plot shows a cyclical pattern, with positive residuals followed by
negative residuals. This suggests that the errors are correlated.
 Normally distributed residuals would follow a bell-shaped curve. The histogram of
the residuals doesn't appear to follow a bell-shaped curve, particularly at the tails.
This suggests the residuals are not normally distributed.

A few reasons why the residuals in the graphs appear correlated and violate the assumption
of normality:
a) No Autocorrelation:

 Imagine the residuals plotted like points on a graph, with time on the x-axis and the
residual value on the y-axis. Ideally, these points should be scattered randomly, with no
trend.
 In the graph, the residuals form a wave-like pattern; such a non-random pattern
suggests that the errors are correlated.
 This means the error made on one measurement might influence the error on the next
measurement, which is not ideal.

b) Normal Distribution:

 Imagine a bell-shaped curve. This is the shape a histogram of normally distributed
data would take.
 The residuals, when plotted as a histogram, should roughly follow this bell-shaped
curve.
 In the graph, the histogram has a different shape, with fatter tails than a bell curve,
indicating the residuals are not normally distributed.

Uncorrelated, normally distributed residuals are key assumptions behind many statistical
tests. If these assumptions are not met, the results of those tests may be unreliable or
misleading.
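These checks can be made concrete with an ACF plot of the residuals and a formal normality
test. A minimal sketch, using hypothetical cyclical residuals standing in for the ones in the
graphs:

import matplotlib.pyplot as plt
import numpy as np
from scipy import stats
from statsmodels.graphics.tsaplots import plot_acf

# Hypothetical wave-like residuals (cyclical signal plus noise)
rng = np.random.default_rng(1)
resid = np.sin(np.arange(120) / 5) + rng.normal(scale=0.5, size=120)

plot_acf(resid, lags=24)        # significant spikes => correlated residuals
plt.show()

stat, p = stats.shapiro(resid)  # Shapiro-Wilk: a small p-value => non-normal residuals
print(f"Shapiro-Wilk p-value: {p:.4f}")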

Question 4:

a) Figure 1 (36 random numbers):

 The autocorrelation function (ACF) shows a mix of positive and negative correlations.
 Some lags have noticeable correlations (e.g., lag 5), while others are close to zero.
 At first glance this looks unlike white noise, but with only 36 observations the critical
bounds are wide, and spikes of this size are still consistent with white noise.

Figure 2 (360 random numbers):

 The ACF is smoother and more evenly distributed around zero.
 Most lags have correlations close to zero.
 This graph more closely resembles white noise, though some small fluctuations remain.

Figure 3 (1,000 random numbers):

 The ACF is even smoother and more centred around zero.
 Almost all lags have correlations close to zero.
 This graph closely resembles white noise.

As the sample size increases, the ACF becomes smoother and the correlations approach zero.
All three series are in fact white noise; the larger samples simply make this easier to see.

The graphs differ because of sample size. Figure 1 (36 random numbers) shows mixed
correlations, Figure 2 (360 random numbers) has a smoother ACF with correlations closer to
zero, and Figure 3 (1,000 random numbers) closely resembles white noise with predominantly
near-zero correlations. As the sample size increases, the ACF becomes smoother and the
correlations approach zero, as expected for white noise.
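A minimal sketch reproducing this experiment, assuming the three figures come from
simulated white noise of the stated lengths:

import matplotlib.pyplot as plt
import numpy as np
from statsmodels.graphics.tsaplots import plot_acf

rng = np.random.default_rng(42)
fig, axes = plt.subplots(1, 3, figsize=(15, 4))
for ax, n in zip(axes, (36, 360, 1000)):
    plot_acf(rng.normal(size=n), lags=20, ax=ax)  # ACF of n white-noise draws
    ax.set_title(f"{n} random numbers")
plt.tight_layout()
plt.show()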

b) Critical Values:

 Critical values (confidence bounds) are determined by the sample size and the desired
confidence level (e.g., 95%). For white noise, the 95% bounds are approximately
±1.96/√T, where T is the sample size.
 As the sample size increases, the critical values become narrower (closer to zero).
 This is because larger samples provide more accurate estimates, leading to tighter
confidence intervals; the worked values after this list illustrate it.
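For the three figures, the approximate 95% bounds work out to:

±1.96/√36 ≈ ±0.33
±1.96/√360 ≈ ±0.10
±1.96/√1000 ≈ ±0.06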

Autocorrelation:

 Even though all the graphs show white noise, the sample size affects the precision of
the estimates.
 With more data points, the estimated autocorrelations become more stable and closer
to the true population values.
 Smaller sample sizes may exhibit more variability in the ACF due to random
fluctuations.

While all the graphs depict white noise, larger sample sizes provide more reliable estimates:
critical values narrow, the estimated autocorrelations stabilize closer to the true population
values, and the ACFs become smoother. Smaller samples exhibit more variability due to
random fluctuations, so their ACFs show larger spurious spikes even though the underlying
series is still white noise.

Question 5:

The provided plot illustrates characteristics such as trend, seasonality, or changing variance,
all of which suggest non-stationarity. A consistent upward or downward trend indicates that
the mean is not constant over time. Seasonality involves predictable, recurring patterns over
a fixed period, which also violates the stationarity assumption. Changing variance, where the
spread of the data points increases or decreases over time, likewise suggests non-stationarity.
Differencing the data can stabilize the mean by removing changes in the level of the series,
reducing trend and seasonality and potentially making the series stationary.

To make a series stationary, one common method is differencing, where you subtract the
previous value from the current value (the differenced series is y_t − y_(t−1)). This can
eliminate or reduce trend and seasonality, stabilizing the mean of the series over time. The
expected analysis is to examine each plot for these characteristics and then apply
differencing to check whether it achieves stationarity, as in the sketch below.

Question 6:

1: Read the Data:

import pandas as pd

# Read data from Excel, skipping the header row, with the date column as the index
data = pd.read_excel("retail.xlsx", skiprows=1, index_col=0, parse_dates=True)

Imported the retail data from the "retail.xlsx" file, skipping the first row, and set the date
column as the index.

2: Plotting the Time Series Data:

import matplotlib.pyplot as plt

# Plot the time series data
plt.figure(figsize=(10, 6))
plt.plot(data)
plt.title("Retail Data Time Series Plot")
plt.xlabel("Year")
plt.ylabel("Sales")
plt.show()

This visualized the retail sales data over time, letting us observe any trends, seasonality, or
patterns.

3: Plotting the Autocorrelation Function (ACF):

from statsmodels.graphics.tsaplots import plot_acf

# Plot ACF of the sales column (a 1-D series); pass ax so plot_acf draws on the
# sized figure instead of opening a new one
fig, ax = plt.subplots(figsize=(10, 6))
plot_acf(data.iloc[:, 0], lags=50, alpha=0.05, ax=ax)
plt.title("Autocorrelation Function (ACF) Plot")
plt.xlabel("Lag")
plt.ylabel("ACF")
plt.show()

Examined the ACF plot to see how quickly autocorrelation decreases as lag increases,
identifying any long-term dependencies.

4: Plotting the Partial Autocorrelation Function (PACF):

from statsmodels.graphics.tsaplots import plot_pacf

# Plot PACF of the sales column; pass ax so plot_pacf draws on the sized figure
fig, ax = plt.subplots(figsize=(10, 6))
plot_pacf(data.iloc[:, 0], lags=50, alpha=0.05, ax=ax)
plt.title("Partial Autocorrelation Function (PACF) Plot")
plt.xlabel("Lag")
plt.ylabel("PACF")
plt.show()

Analysed the PACF plot to identify direct effects of each lag on the current observation,
looking for significant spikes beyond the first few lags indicating long-term dependencies.

Putting it all together, the complete script:

import pandas as pd
import matplotlib.pyplot as plt
from statsmodels.graphics.tsaplots import plot_acf, plot_pacf

# Read data from Excel, skipping the header row, with the date column as the index
data = pd.read_excel("retail.xlsx", skiprows=1, index_col=0, parse_dates=True)
sales = data.iloc[:, 0]  # the single sales column as a 1-D series

# Plot the time series data
plt.figure(figsize=(10, 6))
plt.plot(sales)
plt.title("Retail Data Time Series Plot")
plt.xlabel("Year")
plt.ylabel("Sales")
plt.show()

# Plot ACF on its own sized figure
fig, ax = plt.subplots(figsize=(10, 6))
plot_acf(sales, lags=50, alpha=0.05, ax=ax)
ax.set_title("Autocorrelation Function (ACF) Plot")
ax.set_xlabel("Lag")
ax.set_ylabel("ACF")
plt.show()

# Plot PACF on its own sized figure
fig, ax = plt.subplots(figsize=(10, 6))
plot_pacf(sales, lags=50, alpha=0.05, ax=ax)
ax.set_title("Partial Autocorrelation Function (PACF) Plot")
ax.set_xlabel("Lag")
ax.set_ylabel("PACF")
plt.show()

Observations: the time series plot shows a clear trend, and the ACF and PACF plots show
slow decay and significant correlations at many lags, indicating strong dependence on past
values. In summary, we loaded the retail sales data from the Excel file with the date column
as the index, visualized the sales data to detect trends and patterns over time, checked how
quickly correlations decay as lag increases, and identified the direct influence of each lag on
the current observation, helping determine the appropriate differencing order.
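As a follow-up, one could difference the retail series and re-check the ACF. A minimal
sketch, continuing from the script above (reusing data, plt, and plot_acf):

# First-difference the retail series and re-examine the ACF
diff = data.iloc[:, 0].diff().dropna()
fig, ax = plt.subplots(figsize=(10, 6))
plot_acf(diff, lags=50, alpha=0.05, ax=ax)
ax.set_title("ACF of First-Differenced Retail Sales")
plt.show()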
