SRI RAAJA RAAJAN COLLEGE OF ENGINEERING & TECHNOLOGY
Amaravathiputhur Post, Karaikudi Tk
Accredited by NAAC with B+ Grade
MODEL EXAM I – December 2024
Date: 14.12.24
Reg. No: Yr/Sem/Dept: II/III/AI&DS
Session: AN
AD3301 DATA EXPLORATION AND VISUALIZATION 100 Marks
ANSWER ALL QUESTIONS
PART A – (10*2 = 20 Marks)
1. What is meant by EDA? (AU.,ND 2022) C105.1 BTL1
2. How do you get a cross tabulation? State the purpose of data aggregation. (AU.,ND 2022) C105.1 BTL1
3. What is Matplotlib used for? (AU.,ND 2022) C105.2 BTL1
4. Is a histogram always a bar chart? Justify your answer. (AU.,ND 2023) C105.2 BTL1
5. What is the main purpose of univariate analysis? (AU.,ND 2023) C105.3 BTL1
6. What is the mathematical mean of the following numbers? (AU.,ND 2023) C105.3 BTL1
10, 25, 36, 45, 87, 21, 35, 65, 98, 54
7. What are the three common methods for performing bivariate analysis? (AU.,ND 2022) C105.4 BTL1
8. Outline the differences between univariate and bivariate data. (AU.,ND 2022) C105.4 BTL1
9. Show the characteristics of multivariate analysis. (AU,ND 2022) C105.5 BTL1
10. What is TSA in statsmodels? (AU,ND 2022) C105.5 BTL1
PART B – (5*13 = 65 Marks)
11. a) What is the primary purpose of EDA? What are the differences between EDA and
classical and Bayesian analysis? Discuss in detail. (AU,ND 2022) C105.1 BTL3
(OR)
b) Explain the various transformation techniques in EDA. (AU,ND 2022) C105.1 BTL3
12. a) How do you overplot a line on a scatter plot in Python? Illustrate with code. (AU,ND 2022) C105.2 BTL3
(OR)
b) Discuss how seaborn helps to visualize statistical relationships.
Illustrate with code and an example. (AU,ND 2022) C105.2 BTL3
13. a) What are scaling and standardization? When and why should a variable be standardized?
Illustrate with a suitable example. (AU,ND 2023) C105.3 BTL3
(OR)
b) Explain the smoothing techniques for time series data with a suitable
example. (AU,ND 2023) C105.3 BTL3
14. a) What is a table of frequency values for a bivariate distribution?
Explain which graph is used in the analysis of bivariate data. (AU,ND 2023) C105.4 BTL3
(OR)
b) How do you analyse a contingency table? Discuss. (AU,ND 2023) C105.4 BTL3
15. a) What are the characteristics of multivariate analysis?
How do you explain multivariate analysis? (AU,ND 2023) C105.5 BTL3
(OR)
b) What is meant by time series data? Describe its four components. (AU,ND 2022) C105.5 BTL3
PART C – (1*15 = 15 Marks)
16. a) What are the tools used for EDA? Give a case study on applying
EDA in a real business scenario. (AU,ND 2023) C105.1 BTL3
(OR)
b) Discuss data cleaning in detail
(missing data, outlier detection and treatment). (AU,ND 2023) C105.5 BTL3
Prepared by: Mrs.M.Bhuvaneswari, AP/AI&DS Approved by: