Exercise Programs
Topic Covered: Problem Definition
Data Cleaning, Handling Missing Values, and Visualization:
1. Identify and handle missing values in the air quality dataset, and perform data visualization to understand pollution trends.
2. Handle missing values in the Titanic dataset, and visualize survival patterns.
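As a starting point for the first exercise, the following is a minimal sketch only. It assumes that "the air quality dataset" refers to the airquality data frame that ships with base R, and that median imputation is an acceptable way to handle the gaps; neither is stated in the problem definition. The Titanic exercise is not covered here, because the Titanic object bundled with base R is a contingency table with no missing values, so a passenger-level dataset would have to be supplied separately.

```r
# Assumption: the built-in airquality data frame (daily New York readings, 1973)
data(airquality)
aq <- airquality

# Count the missing values in each column
na_counts <- colSums(is.na(aq))
print(na_counts)

# Assumed strategy: replace each missing value with the median of its column
for (col in c("Ozone", "Solar.R")) {
  aq[[col]][is.na(aq[[col]])] <- median(aq[[col]], na.rm = TRUE)
}

# Mean ozone per month after imputation, drawn as a simple trend plot
monthly_ozone <- tapply(aq$Ozone, aq$Month, mean)
barplot(monthly_ozone,
        xlab = "Month", ylab = "Mean ozone (ppb)",
        main = "Mean ozone by month")
```

Dropping incomplete rows with na.omit() is an alternative to imputation; which approach is preferable depends on how much data can be discarded.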
Classification:
1. Naïve Bayes Classifier: Apply the e1071 package to the Titanic dataset to predict survival.
2. Support Vector Machine (SVM): Train an SVM model using the e1071 package on the mtcars dataset.
3. k-Nearest Neighbors (k-NN) Classifier: Use the class package to classify handwritten digits from the MNIST dataset.
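The SVM exercise could be approached along the lines of the sketch below. The problem definition does not name a target variable, so the choice of transmission type (am) as the class label and of mpg, hp, and wt as predictors is an assumption made for illustration. The e1071 package is assumed to be installed already.

```r
library(e1071)  # assumed to be installed, e.g. via install.packages("e1071")

data(mtcars)
cars <- mtcars

# Assumed target: transmission type (0 = automatic, 1 = manual) as a factor
cars$am <- factor(cars$am, labels = c("automatic", "manual"))

# Train an SVM classifier with a radial kernel on three assumed predictors
fit <- svm(am ~ mpg + hp + wt, data = cars, kernel = "radial")

# Predict on the training data and tabulate predictions against actual classes
pred <- predict(fit, cars)
print(table(Predicted = pred, Actual = cars$am))
```

Because the predictions are made on the same rows used for training, the table gives an optimistic picture; holding out a test split would give a fairer estimate.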
Regression:
1. Multiple Linear Regression: Predict house prices using multiple independent variables from the Boston Housing dataset (MASS package).
2. Logistic Regression: Predict whether passengers survived on the Titanic dataset using glm().
3. Polynomial Regression: Fit a polynomial regression model on synthetic nonlinear data.
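For the polynomial regression exercise, a minimal sketch using only base R is given below. The cubic generating function, the noise level, and the seed are all assumptions chosen for illustration; any nonlinear function would serve.

```r
set.seed(42)  # assumed seed, only for reproducibility

# Synthetic nonlinear data: an assumed cubic trend plus Gaussian noise
x <- seq(-3, 3, length.out = 100)
y <- 1 + 2 * x - 0.5 * x^2 + 0.3 * x^3 + rnorm(100, sd = 1)
synthetic <- data.frame(x = x, y = y)

# Straight-line fit as a baseline, then a degree-3 polynomial fit
linear_fit <- lm(y ~ x, data = synthetic)
poly_fit <- lm(y ~ poly(x, 3, raw = TRUE), data = synthetic)

# Compare goodness of fit on the training data
print(summary(linear_fit)$r.squared)
print(summary(poly_fit)$r.squared)

# Plot the data with the fitted polynomial curve
plot(synthetic$x, synthetic$y, xlab = "x", ylab = "y",
     main = "Polynomial regression on synthetic data")
lines(synthetic$x, predict(poly_fit), lwd = 2)
```

Setting raw = TRUE keeps the coefficients on the scale of x, x^2, and x^3, which makes them easier to compare with the generating function; the default orthogonal polynomials give the same fitted values.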
Clustering:
1. Hierarchical Clustering: Perform hierarchical clustering on the mtcars dataset and create a dendrogram.
2. DBSCAN Clustering: Use the dbscan package to cluster noisy data points.
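The hierarchical clustering exercise can be sketched with base R alone, as shown below. Standardizing the variables, using Euclidean distance with complete linkage, and cutting the tree into three groups are assumptions; the problem definition leaves these choices open.

```r
data(mtcars)

# Standardize the columns so that no single variable dominates the distances
scaled <- scale(mtcars)

# Pairwise Euclidean distances, then agglomerative clustering (complete linkage)
d <- dist(scaled, method = "euclidean")
hc <- hclust(d, method = "complete")

# Draw the dendrogram
plot(hc, main = "Hierarchical clustering of mtcars", xlab = "", sub = "")

# Assumed choice: cut the tree into three clusters and show the cluster sizes
groups <- cutree(hc, k = 3)
print(table(groups))
```

Other linkage methods such as "average" or "ward.D2" can be passed to hclust() in the same way and may produce a noticeably different tree.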
Sample Lab Records
Each experiment must be written up with a problem definition, theoretical
background, and screenshots of the source program and its input/output. A
sample laboratory record is given below.
Laboratory Course Name
Term Work
Name of the Student: XYZ
Roll No: ##
Programme: MCA
Year: March 2025
Semester: 4
Experiment No: 1
Date:
Problem definition:
R Program:
OUTPUT: (Paste screenshots from RStudio).