Predictive Analytics
Unit 1
Fundamentals of Data Mining: Definition and importance of data mining,
KDD process model, CRISP-DM methodology, Types of data mining tasks,
Applications of data mining
Predictive Analytics Overview: What is predictive analytics? Relationship
between data mining and machine learning, Common applications, Key
challenges and issues
Data Types and Sources: Structured vs unstructured data, Mining different
data types, Basic data quality considerations
Unit 2
Data Understanding: Data quality assessment, Outlier detection methods,
Data collection and sampling techniques
Data Cleaning: Handling missing data, Data transformation techniques,
Discretization methods, Standardization and normalization
Exploratory Data Analysis: Univariate and multivariate analysis, Basic data
visualization techniques, Descriptive statistics (mean, standard deviation, percentiles),
Categorical data analysis
Unit 3
Model Selection: Data partitioning (train/test), Introduction to cross-validation
Basic Modeling Approaches: Simple and multiple linear regression, Logistic
regression concepts, Decision trees
Advanced Techniques Overview: Clustering fundamentals, Association
rules basics, Introduction to neural networks, Support Vector Machines (SVM)
Unit 4
Evaluation Metrics: Accuracy, MAE, RMSE, Confusion matrix, ROC, AUC
Overfitting and underfitting, Cross-validation, Ensemble learning and model
selection
Basics of model deployment and updating, Web mining