Lecture 4 - Basics of ML
Prof. Ankit Gangwal
Assistant Professor, IIIT-H, India
Email: [email protected] | Web: CiaoAnkit.github.io
Introduction Prof. Ankit Gangwal 93
Announcements
Table of Contents
● What is ML?
● Why ML?
● Basic Workflow of ML
● Basic Terminology in ML
● Tasks involved in Supervised Learning
○ Regression
○ Classification
● Perceptron
● Activation Functions
● Multi-layer perceptron
Table of Contents
● Learning parameters of a neural network
● Loss Functions in neural networks
○ Mean square error
○ Cross entropy
● Backpropagation, mini-batch training, SGD
● Evaluating ML models
● Overfitting vs Underfitting
● Early Stopping
● Convolutional neural nets
● Vision CNN models for Image classification
● Miscellaneous ML topics
Basics of Machine Learning
What is Artificial Intelligence?
Basics of Machine Learning
What is Machine Learning?
Basics of Machine Learning
What is Machine Learning?
Machine learning (ML) is a field of study in artificial intelligence
concerned with the development and study of statistical algorithms
that can learn from data and generalize to unseen data, and thus
perform tasks without explicit instructions.[1]
Basics of Machine Learning
Deep Learning??
Basics of Machine Learning
AGI??
ANGI??
Why do we study ML?
● Because hard-coding is not always feasible
● We do not have a known algorithm for every problem
● Because, when intelligently designed, ML systems can scale
● Adaptability (e.g., Reinforcement Learning)
● Can be applied in almost every area!
○ Healthcare: biology, microbiology
○ Finance
○ Robotics (driving, drones)
○ NLP (ChatGPT, Claude)
○ Computer Vision
○ Graphs (2D and 3D)
Basic Workflow in ML
What is an ML model?
An ML model can be thought of as a function F(x).
Every function has an input and an output.
Take the example of recommending movies on Netflix. In this
scenario, the input is the history of all the movies you have
previously watched, and the output is a recommendation of a
movie you will like. So, ML is basically trying to find the best
possible estimate of F(x).
Basic Workflow in ML
How do we find the best estimate of F(x)?
Ans: We learn!
How do we learn?
Ans: By Training
Sounds familiar?
This is exactly how we humans learn anything.
Basic Workflow in ML
Let’s take the example of humans learning how to drive..
Day 1: Terrible
Day 2: Decent
Day 3: Good
Day 4: Expert
This is exactly how ML models learn!!
Basic Workflow in ML
Now let’s try to find the most important aspects of ‘learning how to drive’
Task/Goal: Learn to drive → Task
Medium to learn: Driving instructor → Algorithm
Experience: the longer you learn, the better you get → Data
This is literally how you train an ML model as well!
Basic Workflow in ML
Now, let's try to build a model which outputs 1 if the phrase
'Chubby cat' appears in a given paragraph…
Now, let's try to build a model which outputs 1 if a
'Chubby cat' appears in a given image…
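The contrast between the two tasks can be made concrete. The text version needs no ML at all: a single hard-coded rule solves it (a minimal sketch; the function name is illustrative). No comparable rule exists for raw image pixels, which is exactly where ML comes in.

```python
def contains_chubby_cat(paragraph: str) -> int:
    """Return 1 if the phrase 'chubby cat' appears in the paragraph, else 0."""
    return 1 if "chubby cat" in paragraph.lower() else 0

print(contains_chubby_cat("I adopted a chubby cat yesterday."))  # 1
print(contains_chubby_cat("I adopted a dog yesterday."))         # 0
```

For an image, there is no substring to search for: the same cat can appear at any position, scale, or lighting, so the mapping from pixels to the label must be learned from data.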
Some Terminology
Now, let’s define some terminology before we explore more
interesting topics…
Taxonomy of Data
Taxonomy of ML Tasks
Regression
Predicts continuous numerical values.
Ex: Stock market prediction
Housing price prediction
Blood sugar estimation
Estimated delivery time in e-commerce
Classification
Categorizes data into classes or groups.
Ex: Image classification
Video classification
Hate speech detection
Link prediction in social networks
Recommendation systems
Building a Neural network
● Now, our goal is to build a mathematical function F(x) such that
it is learnable and can process input.
● So, we basically want to mimic the human brain.
● We start with the smallest unit of the human brain.
Neuron
3 main parts:
● Dendrites
● Soma
● Axon terminal
This is similar to a function.
● Input
● Processing
● Output
Neuron
We can start with the simplest mathematical function which can take an
input, process it and produce an output.
This is obviously not complex enough to learn complicated functions.
Let's try to list some of the problems:
1. Multiple input features
2. Linearity
Now, let's try to fix these problems one by one.
Neuron
Before we fix the problems of the neuron, let's try to visualize what it does.
Multiple Input features
This is an easy fix. We can simply assign weights to every single feature.
We can simplify this representation by using matrices.
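As a minimal sketch (using NumPy; the weights and inputs are illustrative), the per-feature weighted sum collapses into a single dot product:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0])   # input features
w = np.array([0.5, -0.2, 0.1])  # one weight per feature
b = 0.3                         # bias term

# Weighted sum over all features, written as one matrix (dot) product
z = np.dot(w, x) + b
print(z)  # 0.5*1.0 - 0.2*2.0 + 0.1*3.0 + 0.3 ≈ 0.7
```

With many neurons, `w` becomes a matrix and the same one-line expression computes every neuron's weighted sum at once.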
Multiple input features - Visualization
Linearity
The capabilities of linear functions are limited.
To mitigate this issue, we introduce activation functions.
An activation function is a mathematical function applied to the output of a
neuron. It introduces non-linearity into the model, allowing the network to
learn and represent complex patterns in the data.
Examples include:
● Linear
● Sigmoid
● Tanh
● ReLU
● GeLU
● Softmax etc..
Activation Functions
Linear Activation Function
Activation Functions
Sigmoid Activation Function
Characterized by its S shape.
Output is restricted to the range (0, 1).
Activation Functions
Tanh Activation Function
A shifted version of the sigmoid, stretched along the y-axis.
Output is between -1 and 1.
Known for its zero-centered output.
Activation Functions
ReLU Activation Function
Output range is [0, +inf): negative inputs are clamped to 0.
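The activation functions discussed above are each a one-line NumPy expression (a minimal sketch; the sample inputs are illustrative):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))  # S-shaped, squashes output into (0, 1)

def tanh(x):
    return np.tanh(x)                # zero-centered, range (-1, 1)

def relu(x):
    return np.maximum(0.0, x)        # clamps negatives to 0, range [0, inf)

x = np.array([-2.0, 0.0, 2.0])
print(sigmoid(x))  # ≈ [0.119 0.5 0.881]
print(tanh(x))     # ≈ [-0.964 0. 0.964]
print(relu(x))     # [0. 0. 2.]
```

Note how each function bends a straight line: applying any of them after a weighted sum is what lets the network represent non-linear patterns.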
Neural Network
Now, let's try to piece all of them together to create the most basic form of a
neural network that can model non-linear functions.
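Piecing the weighted sum and the activation function together gives one hidden layer feeding one output neuron. A minimal sketch (NumPy; the layer sizes and random weights are illustrative):

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, x)

# Illustrative network: 3 inputs, 4 hidden units, 1 output
rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(4, 3)), np.zeros(4)  # hidden-layer parameters
W2, b2 = rng.normal(size=(1, 4)), np.zeros(1)  # output-layer parameters

x = np.array([1.0, 2.0, 3.0])
h = relu(W1 @ x + b1)  # hidden layer: weighted sum + non-linearity
y = W2 @ h + b2        # output layer: weighted sum
print(y.shape)         # (1,)
```

Without `relu`, the two matrix products would collapse into a single linear map; the activation in between is what makes the composition genuinely non-linear.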
Deep Neural Network
Deep Neural Network (Multilayer Perceptron)
Why do we need deeper neural networks?
Universal approximation theorem
In simple words, the universal approximation theorem says that neural
networks (with at least one hidden layer and enough hidden units) can
approximate any continuous function.
Deep Neural Network
Now, let’s try to build a simple feedforward neural network with
multiple hidden layers.
Forward Propagation
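The forward pass walked through on these slides is a loop over layers: an affine transform followed by an activation, repeated until the output layer. A minimal NumPy sketch (the layer sizes and weights are illustrative):

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, x)

def forward(x, weights, biases):
    """Propagate input x through the network, layer by layer."""
    a = x
    for i, (W, b) in enumerate(zip(weights, biases)):
        z = W @ a + b  # affine transform: weighted sum + bias
        # non-linearity on hidden layers; keep the final output linear
        a = relu(z) if i < len(weights) - 1 else z
    return a

rng = np.random.default_rng(1)
sizes = [3, 5, 4, 1]  # input, two hidden layers, output
weights = [rng.normal(size=(m, n)) for n, m in zip(sizes[:-1], sizes[1:])]
biases = [np.zeros(m) for m in sizes[1:]]

y = forward(np.array([1.0, -2.0, 0.5]), weights, biases)
print(y.shape)  # (1,)
```

Each iteration corresponds to one slide of the walkthrough: the activations of one layer become the inputs of the next.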
Designing Neural Networks
How do you design an MLP for Regression task?
How do you design an MLP for Classification task?
Training a Neural Network
Now, we have built a function capable of acting as an approximator for any
continuous function once appropriate weights are assigned to it.
How do we find these weights?
Revisiting how humans learn a task…
Whenever we make a mistake, we try to learn from it and adjust ourselves
accordingly.
For MLPs too, we need to define a metric that shows how far off we are from
the actual output.
Loss Functions
Regression
● Mean Square Error
● Mean Absolute Error
● Relative Mean Square Error
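The first two regression losses above are one-liners in NumPy (a minimal sketch; the sample targets and predictions are illustrative):

```python
import numpy as np

def mse(y_true, y_pred):
    return np.mean((y_true - y_pred) ** 2)   # Mean Square Error

def mae(y_true, y_pred):
    return np.mean(np.abs(y_true - y_pred))  # Mean Absolute Error

y_true = np.array([3.0, 5.0, 2.0])
y_pred = np.array([2.5, 5.0, 4.0])
print(mse(y_true, y_pred))  # (0.25 + 0 + 4) / 3 ≈ 1.417
print(mae(y_true, y_pred))  # (0.5 + 0 + 2) / 3 ≈ 0.833
```

Note the design trade-off: squaring in MSE punishes large errors much harder than MAE does, so MAE is the more robust choice when the data contains outliers.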
Loss Functions
Classification
● Binary Cross entropy loss
● Cross entropy loss
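Both classification losses compare predicted probabilities against the true labels. A minimal NumPy sketch (the labels and probabilities are illustrative):

```python
import numpy as np

def binary_cross_entropy(y_true, p):
    # y_true in {0, 1}; p = predicted probability of class 1
    return -np.mean(y_true * np.log(p) + (1 - y_true) * np.log(1 - p))

def cross_entropy(y_true_onehot, probs):
    # probs: one predicted class-probability distribution per sample (rows)
    return -np.mean(np.sum(y_true_onehot * np.log(probs), axis=1))

y = np.array([1, 0, 1])
p = np.array([0.9, 0.2, 0.7])
print(binary_cross_entropy(y, p))  # ≈ 0.228
```

The loss is small when the model assigns high probability to the correct class and grows without bound as the model becomes confidently wrong, which is exactly the training signal we want.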
Loss Functions
● Now, we have defined the metric for error.
● Next, we have to come up with an algorithm such that weights can be adjusted
in order to decrease this error.
● We can now rewrite this as an optimization problem.
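Once the problem is framed as optimization, the standard algorithm is gradient descent: repeatedly nudge the weights against the gradient of the loss. A minimal one-weight sketch (NumPy; the data and learning rate are illustrative):

```python
import numpy as np

# Fit a single weight w so that w * x approximates y, via gradient descent
x = np.array([1.0, 2.0, 3.0])
y = 2.0 * x                  # ground truth generated with w = 2

w = 0.0                      # initial guess
lr = 0.05                    # learning rate
for _ in range(200):
    y_pred = w * x
    loss = np.mean((y_pred - y) ** 2)     # MSE, our error metric
    grad = np.mean(2 * (y_pred - y) * x)  # dLoss/dw
    w -= lr * grad                        # step against the gradient
print(round(w, 3))  # converges to 2.0
```

For a full network the same update is applied to every weight at once, with the gradients supplied by backpropagation, which is covered next.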
Thank you!