Environment Setup for Machine Learning Class
Windows
Follow all the steps exactly as shown in the video below:
https://www.youtube.com/watch?v=uOwCiZKj2rg&ab_channel=MichaelGalarnyk
MAC
https://www.youtube.com/watch?v=E4k38RIUKvo
Ubuntu
https://www.youtube.com/watch?v=R2PWuaR_rZg
Lecture
Lecture 1
Python Tutorial Links
1. Microsoft : https://www.youtube.com/playlist?list=PLlrxD0HtieHhS8VzuMCfQD4uJ9yne1mE6
2. Bangla Tutorial : https://www.youtube.com/playlist?list=PLGPedopQSAJAoVkMxbENx99s2I4DKYdj7
Lecture 2
1. See the attached file INTRO.pdf at the link below:
https://drive.google.com/file/d/1064U9EDhMjVdpw0WjNDxWJsipldIqfy8/view?usp=sharing
[reference: http://www.cs.toronto.edu/~urtasun/courses/CSC411_Fall16/01_intro.pdf ]
Lecture 3
1. Some Terminology:
https://developers.google.com/machine-learning/crash-course/framing/ml-terminology
2. TB2: Book page 7- 14 [Skip details of Batch and Online Learning]
NOTE: Page numbers correspond to the printed page numbers in the PDF
Lecture 4
TB2: Book page 15-23 [Skip details of Batch and Online Learning]
Try and practice the Chapter 1 code from here:
https://github.com/ageron/handson-ml2/blob/master/01_the_machine_learning_landscape.ipynb
Lecture 5
TB2 : Book page 23-30
Except [for the moment; we will come back to these later]: Regularization and Hyperparameter Tuning
Lecture 6
Understanding the pandas library:
https://www.youtube.com/watch?v=CmorAWRsCAw
https://www.youtube.com/watch?v=F6kmIpWWEdU
Dataframe Basics: http://jalammar.github.io/gentle-visual-intro-to-data-analysis-python-pandas/
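After going through the tutorials above, the basic DataFrame operations can be sketched as follows; the data here is made up purely for illustration and is not from any course dataset:

```python
import pandas as pd

# Made-up example data, just to exercise the basics from the links above.
df = pd.DataFrame({
    "name": ["Alice", "Bob", "Carol", "Dan"],
    "score": [85, 92, 78, 90],
    "group": ["A", "B", "A", "B"],
})

print(df.head())                             # first rows
print(df.describe())                         # summary stats for numeric columns
high = df[df["score"] > 80]                  # boolean filtering: rows with score > 80
print(high)
means = df.groupby("group")["score"].mean()  # aggregation: mean score per group
print(means)
```

These four operations (inspect, summarize, filter, group) cover most of what the Lecture 7 Iris project needs from pandas.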
Lecture 7
First Machine Learning Project using Iris Data set:
https://medium.com/gft-engineering/start-to-learn-machine-learning-with-the-iris-flower-classification-challenge-4859a920e5e3
Data Explanation : https://raqueeb.gitbook.io/scikit-learn/iris-dataset/scikit-learn-iris
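Before working through the article, a minimal end-to-end sketch of the Iris task may help orient you. The classifier choice below is an illustrative assumption; the linked tutorial's own approach may differ:

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Load the Iris dataset bundled with scikit-learn.
X, y = load_iris(return_X_y=True)

# Hold out 20% of the rows for testing.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

# A simple baseline classifier; any classifier from later lectures works here too.
model = LogisticRegression(max_iter=200)
model.fit(X_train, y_train)
acc = accuracy_score(y_test, model.predict(X_test))
print(f"test accuracy: {acc:.2f}")
```

The load / split / fit / evaluate steps shown here repeat in every project in this course.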
Lecture 8
TB2: Chapter 2 (End to End Machine Learning Project)
Page: 35-37, 46-51
Lecture 9
TB2: Chapter 2 (End to End Machine Learning Project)
Page: 51-55
Lecture 10
TB2: Chapter 2 (End to End Machine Learning Project)
Page: 56-62
Lecture 11
TB2: Chapter 2 (End to End Machine Learning Project)
Page: 62-64
Lecture 12 and 13
TB2: Chapter 2 (End to End Machine Learning Project)
Page: 65-73
Lecture 14
TB2: Chapter 2 (End to End Machine Learning Project)
Page: 73 - 80 (Without Grid Search)
Cross Validation:
https://drive.google.com/file/d/1eUdbQaxgzFMpUVuYYbpTBEAV3z8IsoRg/view
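The cross-validation idea can be sketched as follows; the classifier and dataset are illustrative choices, not necessarily those used in the lecture:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# 5-fold cross-validation: the data is split into 5 parts, and each part
# takes one turn as the validation set while the rest train the model.
scores = cross_val_score(DecisionTreeClassifier(random_state=0), X, y, cv=5)
print(scores)         # one accuracy score per fold
print(scores.mean())  # average over folds
```

Averaging over folds gives a more stable estimate of generalization than a single train/test split.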
Lecture 15 , 16 , 17
1. ML terms for evaluating machine learning models (all the links should be
considered as part of the syllabus)
Key Terms and Definitions
Accuracy:
https://developers.google.com/machine-learning/crash-course/classification/accuracy
a. https://towardsdatascience.com/accuracy-recall-precision-f-score-specificity-which-to-optimize-on-867d3f11124
b. https://developers.google.com/machine-learning/crash-course/classification/precision-and-recall
Confusion Matrix
c. https://www.dataschool.io/simple-guide-to-confusion-matrix-terminology/#:~:text=A%20confusion%20matrix%20is%20a,related%20terminology%20can%20be%20confusing.
d. https://manisha-sirsat.blogspot.com/2019/04/confusion-matrix.html
e. Exercise (very important):
https://developers.google.com/machine-learning/crash-course/classification/check-your-understanding-accuracy-precision-recall
2. Lecture of Sensitivity, Specificity and Area Under Curve
a. https://developers.google.com/machine-learning/crash-course/classification/roc-and-auc
b. https://www.youtube.com/watch?v=un6KTYMSzd4
c. https://www.youtube.com/watch?v=HXkrLmxNzUA
Exercise
https://developers.google.com/machine-learning/crash-course/classification/check-your-understanding-roc-and-auc
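The metrics covered in these lectures can be tried on made-up labels; counting the TP/FP/TN/FN cells by hand and comparing against the printed values is a useful exercise:

```python
from sklearn.metrics import (accuracy_score, confusion_matrix,
                             precision_score, recall_score)

# Made-up labels: 1 = positive class, 0 = negative class.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]

cm = confusion_matrix(y_true, y_pred)   # rows = actual, columns = predicted
acc = accuracy_score(y_true, y_pred)    # (TP + TN) / total
prec = precision_score(y_true, y_pred)  # TP / (TP + FP)
rec = recall_score(y_true, y_pred)      # TP / (TP + FN)
print(cm)
print(acc, prec, rec)
```

Here TP = 3, FN = 1, FP = 1, TN = 5, so accuracy is 0.8 and both precision and recall are 0.75.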
Lecture 18
Titanic Problem Description
Lecture 19
Grid Search --> TB2: Chapter 2 (End to End Machine Learning Project)
Page: 75-78
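The grid search idea from TB2 can be sketched with scikit-learn's GridSearchCV; the model, dataset, and parameter grid below are illustrative assumptions, not the book's own example:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# An illustrative grid; GridSearchCV fits every C/gamma combination with
# 5-fold cross-validation and keeps the combination that scores best.
param_grid = {"C": [0.1, 1, 10], "gamma": [0.01, 0.1, 1]}
search = GridSearchCV(SVC(), param_grid, cv=5)
search.fit(X, y)
print(search.best_params_)  # the winning combination
print(search.best_score_)   # its mean cross-validation accuracy
```

Because every combination is cross-validated, the cost grows multiplicatively with the number of values per parameter.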
Lecture 20 [KNN]
1. https://www.datacamp.com/community/tutorials/k-nearest-neighbor-classification-scikit-learn
2. https://www.tutorialspoint.com/machine_learning_with_python/machine_learning_with_python_knn_algorithm_finding_nearest_neighbors.htm
More on KNN training and testing phase
https://stackoverflow.com/questions/54505375/what-does-the-knn-algorithm-do-in-the-training-phase
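A minimal KNN sketch, echoing the Stack Overflow point that KNN's "training" phase mostly just stores the data; the dataset and split below are illustrative choices:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=1)

# fit() for KNN essentially stores the training data; the real work happens
# at predict time, when the 5 nearest stored points vote on each label.
knn = KNeighborsClassifier(n_neighbors=5)
knn.fit(X_train, y_train)
acc = knn.score(X_test, y_test)
print(acc)
```

This is why KNN is called a lazy learner: cheap training, comparatively expensive prediction.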
Lecture 21 [Decision Tree]
Decision Tree
1. https://www.datacamp.com/community/tutorials/decision-tree-classification-python
2. https://www.youtube.com/watch?v=PHxYNGo8NcI&t=124s&ab_channel=codebasics
3. https://www.bogotobogo.com/python/scikit-learn/scikt_machine_learning_Decision_Tree_Learning_Informatioin_Gain_IG_Impurity_Entropy_Gini_Classification_Error.php
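A short decision-tree sketch tying in the Gini-impurity idea from the third link; the dataset and depth limit are illustrative choices:

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_iris(return_X_y=True)

# criterion="gini" selects splits by Gini impurity (entropy is the other
# common choice); max_depth limits how deep the if/else tree can grow.
tree = DecisionTreeClassifier(criterion="gini", max_depth=3, random_state=0)
tree.fit(X, y)
print(export_text(tree))  # the learned splits as readable if/else rules
print(tree.score(X, y))   # accuracy on the training data
```

Limiting max_depth is a simple guard against overfitting; an unlimited tree can memorize the training set.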
Lecture 22 [Random Forest and PCA]
RANDOM FOREST
1. https://www.javatpoint.com/machine-learning-random-forest-algorithm
2. As Regressor: https://www.geeksforgeeks.org/random-forest-regression-in-python/
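A minimal random-forest sketch in scikit-learn, under illustrative dataset and parameter choices (the linked tutorials use their own examples):

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

# A random forest trains many decision trees, each on a bootstrap sample
# of the rows with a random subset of features per split, then combines
# their votes.
forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(X_train, y_train)
acc = forest.score(X_test, y_test)
print(acc)
```

Averaging many decorrelated trees reduces the variance that a single deep tree would have.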
PCA
What is PCA?
Principal Component Analysis, or PCA, is a statistical method used to reduce the
number of variables in a dataset. It does so by lumping highly correlated variables
together. Naturally, this comes at the expense of accuracy. However, if you have 50
variables and realize that 40 of them are highly correlated, you will gladly trade a little
accuracy for simplicity.
High dimensionality means that the dataset has a large number of features. The primary
problem associated with high dimensionality in machine learning is model overfitting,
which reduces the model's ability to generalize beyond the examples in the training
set. Richard Bellman described this phenomenon in 1961 as the Curse of
Dimensionality: many algorithms that work well in low dimensions become
intractable when the input is high-dimensional.
1. Also read from here:
https://datascienceplus.com/principal-component-analysis-pca-with-python/
2. sample code to show effectiveness of PCA:
https://colab.research.google.com/drive/1-6a02Ir87BNLSkM8uZ_-QZM23zZ2_WU0?usp=sharing
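To complement the links above, a minimal PCA sketch in scikit-learn; the digits dataset and the choice of 10 components are illustrative assumptions, not from the course materials:

```python
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

# The digits dataset has 64 pixel features per image; PCA projects them
# onto the 10 directions of greatest variance.
X, _ = load_digits(return_X_y=True)
pca = PCA(n_components=10)
X_reduced = pca.fit_transform(X)
print(X_reduced.shape)                      # far fewer columns than the original 64
print(pca.explained_variance_ratio_.sum())  # fraction of total variance kept
```

The explained-variance ratio quantifies the accuracy-for-simplicity trade described above: the closer its sum is to 1, the less information the reduction discards.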
Lecture 23 and 24 [Support Vector Machine, Ensemble Learning]
1. https://stackabuse.com/implementing-svm-and-kernel-svm-with-pythons-scikit-learn/
2. https://www.youtube.com/watch?v=N1vOgolbjSc&ab_channel=AliceZhao
Idea of the C and Gamma Parameters in SVM
C is the cost of misclassification.
A large C gives you low bias and high variance: low bias because you penalize the cost of
misclassification heavily, so the model bends to fit the training points.
A small C gives you higher bias and lower variance.
Gamma is the parameter of the Gaussian (RBF) kernel, used to handle non-linear classification. Consider these points:
If the two classes are not linearly separable in 2D, you want to transform them to a higher dimension
where they will be linearly separable. Imagine "raising" the green points; then you can separate
them from the red points with a plane (hyperplane).
To "raise" the points you use the RBF kernel; gamma controls the shape of the "peaks" where
you raise the points. In scikit-learn's convention, a large gamma gives you a pointed, narrow bump
in the higher dimensions, while a small gamma gives you a softer, broader bump.
So a large gamma will give you low bias and high variance, while a small gamma will give you
higher bias and low variance.
You usually find the best C and Gamma hyper-parameters using Grid-Search.
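The effect of C and gamma described above can be seen on a toy non-linear problem; everything below (dataset, parameter values) is illustrative, not a recommended setting:

```python
from sklearn.datasets import make_moons
from sklearn.svm import SVC

# Two interleaving half-moons: not linearly separable in 2D.
X, y = make_moons(n_samples=200, noise=0.15, random_state=0)

# Larger C and gamma let the RBF boundary bend to fit the training data
# more tightly (lower bias, higher variance). In practice you would pick
# these values with grid search, not by hand.
scores = {}
for C, gamma in [(0.1, 0.1), (1.0, 1.0), (10.0, 10.0)]:
    clf = SVC(kernel="rbf", C=C, gamma=gamma).fit(X, y)
    scores[(C, gamma)] = clf.score(X, y)  # accuracy on the training data
    print(C, gamma, scores[(C, gamma)])
```

Note that these are training scores: the tightest fit is not necessarily the best generalizer, which is exactly the bias-variance trade-off above.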
3. ENSEMBLE LEARNING: https://towardsdatascience.com/basic-ensemble-learning-random-forest-adaboost-gradient-boosting-step-by-step-explained-95d49d1e2725
Textbook
[ TB1 ] শূন্য থেকে পাইথন মেশিন লার্নিং : হাতেকলমে সাইকিট-লার্ন (দ্বিতীয় সংস্করণ) [Python Machine Learning from Scratch: Hands-on Scikit-Learn, 2nd Edition]
Book website : https://raqueeb.gitbook.io/scikit-learn/
https://www.rokomari.com/book/187277/shunno-theke-python-machine-learning--hate-kalame-scikit-learn--hatekolome-machine-learning-series--iris-dataset-project-
[ TB2 ] Hands-on Machine Learning with Scikit-Learn, Keras &
TensorFlow
https://drive.google.com/file/d/1sW8D9m30QYqmdou9ZwOp4Mo8YhY7exbh/view?usp=sharing
[ TB3 ] Machine Learning for Absolute Beginners
https://drive.google.com/file/d/1D43PKTrAZG6z6V43k2SZDRoeJQxw8N70/view?usp=sharing
Python Basics:
1. Microsoft Tutorial: https://www.youtube.com/watch?v=jFCNu1-Xdsw&list=PLlrxD0HtieHhS8VzuMCfQD4uJ9yne1mE6
2. Python in Bangla: https://www.youtube.com/watch?v=4QmifmQ7rHY&list=PLGPedopQSAJAoVkMxbENx99s2I4DKYdj7