DATA SCIENCE IMP
unit 3
1) Explain Data Analytics life cycle with the help of diagram. [10]
2) List different phases in data analytics life cycle and explain Model Building phase in detail. [8]
3) What are different phases in data analytics life cycle? Explain Operationalize phase in detail. [10]
4) Explain Model building phase with its challenges.
5) Draw the diagram of the data analytics life cycle in big data and briefly explain the Model Planning phase. [6]
6) Write a note on the Data Preparation phase with its steps. [6]
7) Explain challenges in the Model building phase.
Unit 4
1) Explain association rules with example. [4]
2) Explain Python Libraries for Data Processing, Modeling and Data Visualization. [10]
3) Explain predictive, Descriptive, and Prescriptive data analysis. And also mention their difference. [4]
4) Write a short notes on Global Innovation Social Network and Analysis. [5]
6) Explain the use of logistic function in logistic regression in detail. List and explain the Types of Logistic
regression. [10]
7) Wirte short notes on ASM. [3]
8) What do you mean by Linear Regression? Elaborate the types. [6]
9) Explain the Apriori algorithm with an example. [6]
10) Write a short note on the following: [6]
i) FP growth
ii) Decision Tree Classification
11) Explain Data transformation using function and mapping. [6]
12) Write a short note on the following: [6]
i) Removing duplicates from the data set.
ii) Handling missing data.
Unit 5
a) What is clustering? With suitable example explain the steps involved in k-means algorithm. [7]
a) What is clustering? Explain hierarchical clustering with an example. [6]
b) Explain the Holdout method and Random Sub Sampling method. [6]
b) Discuss Holdout method and Random Sampling methods. [6]
c) Discuss parameter tuning and optimization. [5]
c) Wirte short note on [4]
i) Confusion matrix
ii) AVC- ROC curve
iii) Elbow plot
a) What do you mean by text analysis? Why text analysis need to be done? Explain the following text analysis
steps with suitable examples [11]
i) Part of speech (POS) tagging
ii) Lemmatization
iii) Stemming
b) Wirte short note on [6]
i) Time series Analysis
ii) TF- IDF.
Unit 6
1) What is data visualization? What are the different methods of data visualization explain in detail. [6]
2) What is data Visualization? List and explain any one type of data visualization. [6]
3) Explain in detail the Hadoop Ecosystem with suitable diagram. [11]
4) Describe the Data visualization tool “Tableau”. Explain its applications in brief. [6]
5) With a suitable example explain and draw a Box plot and explain its usages. [6]
6) With a suitable example explain Histogram and explain its usages. [5]
7) With a suitable example explain the Scatter plot and explain its usage.[5]
8) Discuss various challenges to Big data Visualization. [5]