QB

The document outlines the syllabus for the T.Y. B. Tech. (AIDS) Semester VI course on Data Analytics and Modelling, detailing the course code, credits, and teaching scheme. It includes objectives, course outcomes, and a breakdown of units covering data definitions, analysis techniques, descriptive statistics, hypothesis testing, and exploratory data analysis. Additionally, it provides practice questions and topics for further study related to data analytics and statistical methods.


(Faculty of Science & Technology)

Syllabus of T. Y. B. Tech. (AIDS) Semester VI


Course Code: AID351
Course: Data Analytics and Modelling
Credits: 3-0-0
Teaching Scheme: Theory: 3 Hrs. / week
In Semester Examination-I: 15 Marks
In Semester Examination-II: 15 Marks
Continuous In-semester Evaluation: 10 Marks
Teacher Assessment: 10 Marks
End Semester Examination: 50 Marks
End Semester Examination (Duration): 2 Hrs.
Prerequisite: Data Engineering
Objectives:
1. Apply data transformation/modelling techniques.
2. Learn the probabilistic model of data science.
3. Understand the basic concepts of data analytics.
Course Outcomes:
CO1 Apply data management and indexing techniques to categorize and organize data for analysis. (III)
CO2 Utilize data transformation methods (Box-Cox, power, etc.) to prepare data for statistical analysis. (IV)
CO3 Calculate and interpret descriptive statistics (measures of central tendency, dispersion, shape) to summarize data characteristics. (IV)
CO4 Perform hypothesis testing using various statistical tests (Chi-square, Z-test, T-test) to assess data relationships. (IV)
CO5 Analyze variance and covariance between variables using techniques like ANOVA, MANOVA, and ANCOVA. (IV)
CO6 Apply exploratory data analysis (EDA) and inferential statistics to uncover patterns and draw conclusions from data. (V)
Unit-I: Data Definitions and Analysis Techniques (6 Hrs.)
Elements, Variables, and Data categorization, Levels of Measurement, Data management and indexing, Data Analysis types: Descriptive, Diagnostic, Predictive
Unit-II: Data transformation and standardization (6 Hrs.)
Box-Cox and power transforms, Freeman-Tukey (square root and arcsine) transforms, Log and Exponential transforms, Logit transforms, Normal transform
Unit-III: Descriptive Statistics (6 Hrs.)
Counts and specific values, Measure of central tendency, Measure of spread, Measure of distribution shape, Statistical indices, Moments, Key functions, Measures of complexity and model selection. Measures of location of dispersions.
Unit-IV: Basic Analysis Techniques (6 Hrs.)
Goodness of fit tests: Anderson-Darling, Chi-square test, Kolmogorov-Smirnov, Ryan-Joiner, Shapiro-Wilk, Jarque-Bera, Lilliefors; Z-test: test of single mean, standard deviation known, Test of the difference between two means, standard deviation known, test for proportions, P; T-tests: test of single mean, standard deviation not known, Test of the difference between two means, standard deviation not known, test of regression coefficients
Unit-V: Analysis of Variance and Covariance (6 Hrs.)
Variance test: Chi-square test of single variable, F-test of two variables, test of homogeneity; Wilcoxon rank-sum/Mann-Whitney U test; Sign test. Contingency Tables: Chi-square contingency table test, G contingency table test, Fisher's exact test, Measures of association, McNemar's test. ANOVA: Single factor or one-way ANOVA, Two factor or two-way and higher-way ANOVA, MANOVA, ANCOVA; Non-Parametric ANOVA: Kruskal-Wallis ANOVA, Friedman ANOVA test, Mood's median, Correlation analysis, Maximum likelihood test
Unit-VI: Exploratory Data Analysis and Statistics (6 Hrs.)
Inferential Statistics, Hypothesis testing, Univariate Analysis, Bivariate Analysis, Derived Metrics, Sampling and Sampling Distribution
2 Marks Practice Questions

1. Define with an example:
a. population in Statistics
b. variable in Statistics
c. independent variable in Statistics
d. dependent variable in Statistics
e. Count in statistics
f. EDA
g. Hypothesis testing
h. Univariate Analysis
i. Bivariate Analysis
j. Derived Metrics
k. Sampling
l. P value
m. Define the purpose of:
i. ANOVA test
ii. one-way ANOVA
iii. t test
iv. Z test
v. Chi-square test
vi. Anderson-Darling test
vii. Correlation analysis
viii. Maximum likelihood test
ix. Data Transformation
x. sampling in inferential statistics
xi. testing regression coefficients
2. What are the three categories of variables? Explain using suitable examples.
3. List the fundamental elements of data.
4. Describe the following with suitable example.
a. Descriptive analysis
b. Diagnostic analysis
c. Predictive analysis
d. Basic components of a data set
e. Data Categorization
f. Data Classification
g. Levels of Measurement (see https://www.questionpro.com/blog/nominal-ordinal-interval-ratio/)
5. Differentiate between Data Categorization and Data Classification.
6. Differentiate between qualitative and quantitative data.
7. Compare data management and data indexing techniques.
8. Define index numbers and explain their uses in statistics in detail.
9. List out the examples of nominal, ordinal, interval, and ratio variables.
10. List out the characteristics of nominal and ordinal data.
11. Describe with examples:
a. statistical techniques used in descriptive analysis.
b. Importance of data visualization in descriptive analysis
c. Importance of data visualization in predictive analysis
d. Role, uses, and importance of variables in data analysis
e. Importance and use of data management in statistical analysis
12. How do you handle missing values in count data?
13. Write the techniques for handling missing data, outliers, and data cleaning.
14. Explain the purpose of indexing in databases. How does indexing improve data
retrieval efficiency?
15. Define descriptive analysis. What are its goals, and which statistical measures are commonly used?
16. What is diagnostic analysis? How does it identify patterns, anomalies, or
relationships in data?
17. Explain predictive analysis. How does it use historical data to make future predictions?
18. How is the logit transformation related to logistic regression?
19. List out the similarities between the Box-Cox and power transforms.
20. What is the purpose of analyzing the shape of a distribution?
21. List out the characteristics of symmetric and skewed distribution.
22. What are the commonly used indices and their applications?
23. How does the Ryan-Joiner test detect deviations from a specified distribution?
24. List out the pros and cons of the Ryan-Joiner test.
25. Explain when the Kruskal-Wallis test is preferred over parametric ANOVA.
26. Explain the relation between the sampling distribution and the standard error with an example.
27. Differentiate between univariate and bivariate analysis.
28. What is the Variance test?
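
As an illustration for question 26, the link between a sampling distribution and the standard error can be checked with a small simulation. This is only a minimal sketch, not a model answer: the population values (mean 50, standard deviation 10), the sample size of 25, and the function names are all assumptions chosen for illustration.

```python
import math
import random
import statistics


def standard_error(sigma: float, n: int) -> float:
    """Theoretical standard error of the sample mean: sigma / sqrt(n)."""
    return sigma / math.sqrt(n)


def simulate_sample_means(mu: float, sigma: float, n: int,
                          trials: int, seed: int = 0) -> list[float]:
    """Draw `trials` samples of size `n` from a normal population
    and return the mean of each sample (the sampling distribution)."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    return [
        statistics.fmean(rng.gauss(mu, sigma) for _ in range(n))
        for _ in range(trials)
    ]


# Assumed population: mean 50, standard deviation 10; samples of size 25.
means = simulate_sample_means(mu=50.0, sigma=10.0, n=25, trials=5000)

theoretical = standard_error(10.0, 25)   # 10 / sqrt(25)
empirical = statistics.stdev(means)      # spread of the simulated sample means

print(f"theoretical SE = {theoretical:.3f}")
print(f"empirical SD of sample means = {empirical:.3f}")
```

The standard deviation of the simulated sample means should come out close to the theoretical standard error, which is the relation the question asks about.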

4 Marks / 8 Marks TH

1. Explain the following in detail with a suitable example (8/4 marks each):

a. logit transformation

b. Box-Cox transform

c. power transform

d. log transformation

e. Normal Transform/z-transform

f. exponential transformation

g. measures of dispersion

h. Ryan-Joiner test

i. chi-square goodness of fit test

j. t-test for a single mean when the population standard deviation is unknown.

k. t-test for comparing means of two independent samples when the population standard deviation is unknown, with Null and Alternate Hypothesis

l. Wilcoxon Rank Sum hypothesis test on a dataset

m. Kruskal-Wallis test

n. Bivariate Analysis

o. Univariate Analysis

p. arcsine transformation

q. Measures of central tendency

r. moments of a sample (all 4)

s. Anderson-Darling test

t. Z-test for a single mean when the population standard deviation is known

u. chi-square test of a single variable with Null and Alternate Hypothesis

v. One way ANOVA

w. two-way ANOVA

x. inferential statistics

y. hypothesis testing

z. Freeman-Tukey (square root) transformation

aa. F test with Null and Alternate Hypothesis

ab. Z test with Null and Alternate Hypothesis

2. Explain the pros and cons of the Normal Transform (z-transform) in detail.

3. What is Exploratory Data Analysis (EDA)? Explain any three key aspects of
EDA in detail.

4. Explain any two variance tests in detail with a suitable example.

5. Explain the application of the Maximum Likelihood Test in detail with a suitable example.

6. What are the common techniques used for analyzing a single variable in
Univariate analysis? Discuss any two such techniques in detail.

7. What are derived metrics, and what is their importance in data analysis? Explain with a suitable example.

8. What is hypothesis testing in statistics? Explain the implementation of any hypothesis test with an example.

9. Explain measures of distribution shape in statistics with a suitable example.

For numericals, practise the numerical problems available on the Internet for the topics in Units 2, 3, 4 and 5.
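
As one illustration of the kind of numerical these units involve, the sketch below works a Z-test for a single mean with the standard deviation known (Unit-IV). The figures (sample mean 52, hypothesised mean 50, standard deviation 5, sample size 25) and the function name are made up for illustration and are not taken from any exam paper.

```python
import math
from statistics import NormalDist


def z_test_single_mean(sample_mean: float, mu0: float,
                       sigma: float, n: int) -> tuple[float, float]:
    """Z-test for a single mean when the population standard deviation is known.

    H0: population mean == mu0
    H1: population mean != mu0 (two-sided)
    Returns the z statistic and the two-sided p-value.
    """
    standard_error = sigma / math.sqrt(n)
    z = (sample_mean - mu0) / standard_error
    # Two-sided p-value from the standard normal distribution.
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


# Illustrative data (assumed): n = 25, sample mean = 52, sigma = 5, H0: mu = 50.
z, p = z_test_single_mean(sample_mean=52.0, mu0=50.0, sigma=5.0, n=25)

alpha = 0.05  # assumed significance level
print(f"z = {z:.2f}, p = {p:.4f}")
print("Reject H0" if p < alpha else "Fail to reject H0")
```

The same steps (state the hypotheses, compute the standard error, compute the statistic, compare the p-value with the chosen significance level) carry over to the other tests in Units 4 and 5, with a different statistic and reference distribution.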

Do not completely depend upon the above practice questions.

All the best!!!
