MACHINE LEARNING LAB PROGRAMS
1. Install and set up Python and essential libraries like
NumPy and pandas.
2. Introduce sci-kit-learn as a machine learning library.
Scikit-learn ( Sklearn ) is the most useful and robust library for machine
learning in Python. It provides a selection of efficient tools for machine
learning and statistical modelling including classification, regression,
clustering and dimensionality reduction via a consistence interface in
Python .This library, which is largely written in Python, is built upon
NumPy, SciPy and Matplotlib.
Installation
If you already installed NumPy and Scipy, the following are the two
easiest ways to install scikit-learn –
Using pip
The following command can be used to install sci-kit-learn via pip
pip install -U scikit-learn
Features
Rather than focusing on loading, manipulating and summarising data,
Scikit-learn library is focused on modelling the data. Some of the most
popular groups of models provided by Sklearn are as follows –
Supervised Learning algorithms − Almost all the popular supervised
learning algorithms, like Linear Regression, Support Vector Machine
(SVM), Decision Tree etc., are the part of scikit-learn.
Unsupervised Learning algorithms − On the other hand, it also has all
the popular unsupervised learning algorithms from clustering, factor
analysis, PCA (Principal Component Analysis) to unsupervised neural
networks.
Clustering − This model is used for grouping unlabeled data.
Cross Validation − It is used to check the accuracy of supervised models
on unseen data.
3. Install and set up scikit-learn and other necessary tools.
4. Write a program to Load and explore the dataset of .CVS
and excel files using pandas.
5. Write a program to Visualize the dataset to gain insights
using Matplotlib or Seaborn by plotting scatter plots,
and bar charts.
6. Write a program to Handle missing data, encode
categorical variables, and perform feature scaling.
7. Write a program to implement a k-Nearest Neighbours
(k-NN) classifier using scikit-learn and Train the classifier on
the dataset and evaluate its performance.
8.Write a program to implement a linear regression model
for regression tasks and Train the model on a dataset with
continuous target variables.
9. Write a program to implement a decision tree classifier
using scikit-learn and visualize the decision tree and
understand its splits.
10. Write a program to Implement K-Means clustering and
Visualize clusters.