How SVM Works: Support Vector Machine (SVM)

Support Vector Machine (SVM) is a supervised machine learning algorithm used for classification and regression, focusing on finding the optimal hyperplane that maximizes the margin between classes. It is effective in high-dimensional spaces and can handle non-linear boundaries through kernel functions. SVM is particularly useful in applications such as email filtering and image recognition due to its robust decision-making capabilities.


A Support Vector Machine (SVM) is a supervised machine learning algorithm commonly used for classification tasks, though it can also be applied to regression. SVM’s primary goal is to find a hyperplane (a boundary) that best separates data points of different classes; with the help of kernel functions it can do so even when the data is not linearly separable.

How SVM Works

SVM works by finding the optimal hyperplane that maximizes the margin — the
distance between the hyperplane and the closest data points from each class.
These closest points are called support vectors, and they define the margin's
boundaries. By maximizing the margin, SVM aims to create a model that
generalizes well to new, unseen data.

Key Concepts in SVM

1. Hyperplane: The decision boundary that separates different classes. In 2D it’s a line; in 3D it’s a plane; in higher dimensions it’s called a hyperplane.
2. Margin: The distance between the hyperplane and the nearest data points from each class. SVM maximizes this margin to make the classification as robust as possible.
3. Support Vectors: The data points closest to the hyperplane, which define the margin and influence its position. They are crucial for determining the optimal hyperplane (see the sketch below).
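
To make these three ideas concrete, here is a minimal sketch (not part of the original text) using scikit-learn's linear SVC on a small synthetic dataset; the make_blobs data and its parameters are illustrative choices. The fitted coefficients give the hyperplane, support_vectors_ lists the support vectors, and the margin width equals 2/||w||.

import numpy as np
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

# Two clearly separated clusters in 2D, so the "hyperplane" is simply a line
X, y = make_blobs(n_samples=60, centers=[[0, 0], [4, 4]], cluster_std=0.8, random_state=0)

model = SVC(kernel='linear', C=1.0)
model.fit(X, y)

w = model.coef_[0]       # normal vector of the hyperplane: w . x + b = 0
b = model.intercept_[0]  # offset of the hyperplane
margin_width = 2 / np.linalg.norm(w)   # distance between the two margin boundaries

print("Hyperplane: %.2f*x1 + %.2f*x2 + %.2f = 0" % (w[0], w[1], b))
print("Margin width:", margin_width)
print("Support vectors:\n", model.support_vectors_)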

Example of SVM

Imagine a scenario where we want to classify emails as either "Spam" or "Not Spam" based on features like the frequency of certain words or phrases. (A toy code sketch of this pipeline follows the steps below.)

1. Training Data: Suppose we have a dataset of emails with labels (Spam or Not Spam). We extract features from each email, such as word frequency, length, etc., which we plot in a high-dimensional space.
2. Finding the Hyperplane: SVM will analyze this feature space and find the optimal hyperplane that best separates "Spam" emails from "Not Spam" emails.
3. Maximizing the Margin: SVM maximizes the margin between this hyperplane and the nearest emails from each class (these emails are the support vectors).
4. Classifying New Emails: Once trained, the model can classify new emails by determining which side of the hyperplane they fall on: if an email falls on the "Spam" side of the hyperplane, it will be classified as spam, and vice versa.
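
A toy version of this spam pipeline might look like the sketch below; the example emails, their labels, and the choice of TfidfVectorizer for the word-frequency features are invented purely for illustration.

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.svm import SVC

emails = [
    "win a free prize now",           # spam
    "claim your free lottery money",  # spam
    "meeting agenda for tomorrow",    # not spam
    "lunch with the project team",    # not spam
]
labels = [1, 1, 0, 0]  # 1 = Spam, 0 = Not Spam

# Step 1: turn each email into word-frequency features
vectorizer = TfidfVectorizer()
X = vectorizer.fit_transform(emails)

# Steps 2-3: fit the hyperplane that separates the two classes with maximum margin
clf = SVC(kernel='linear')
clf.fit(X, labels)

# Step 4: classify a new email by which side of the hyperplane it falls on
new_email = vectorizer.transform(["free money prize"])
print("Spam" if clf.predict(new_email)[0] == 1 else "Not Spam")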

Why SVM is Effective

• SVM is particularly effective for high-dimensional spaces and is robust when there’s a clear margin of separation between classes.
• It works well even with small datasets and can handle non-linear boundaries by applying a kernel function that transforms the input data into a higher-dimensional space (see the sketch below).
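
As a rough illustration of the kernel idea, the sketch below compares a linear kernel with an RBF kernel on data that is not linearly separable; the make_circles dataset and its parameters are chosen here only for illustration.

from sklearn.datasets import make_circles
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Two concentric rings of points: no straight line can separate them
X, y = make_circles(n_samples=300, factor=0.3, noise=0.05, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

for kernel in ("linear", "rbf"):
    clf = SVC(kernel=kernel).fit(X_train, y_train)
    print(kernel, "accuracy:", clf.score(X_test, y_test))

# The RBF kernel implicitly maps the points into a higher-dimensional space
# where a separating hyperplane exists; the linear kernel cannot separate them.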

In summary, SVM is a powerful classifier that focuses on maximizing the margin between classes to create a robust decision boundary, making it suitable for tasks like email filtering, image recognition, and text categorization.

Here’s an example of using Support Vector Machine (SVM) in Python with scikit-learn for a binary classification task. In this case, we'll use the SVM to classify a dataset (e.g., predicting whether a flower is of type "setosa" or not based on petal and sepal measurements from the popular Iris dataset).

from sklearn import datasets
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC  # Support Vector Classifier
from sklearn.metrics import accuracy_score, classification_report

# Load dataset
iris = datasets.load_iris()
X = iris.data    # Features (sepal length, sepal width, petal length, petal width)
y = iris.target  # Target classes (setosa, versicolor, virginica)

# For binary classification, let's classify only two types (e.g., setosa vs. others)
# Adjust labels to make it binary: 1 for "setosa" and 0 for "not setosa"
y_binary = (y == 0).astype(int)  # Setosa as 1, others as 0

# Split the data into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y_binary, test_size=0.3, random_state=42)

# Initialize and train the Support Vector Classifier (SVC)
# Here, we'll use a linear kernel (other options: 'poly', 'rbf', 'sigmoid')
model = SVC(kernel='linear', C=1.0)  # C is the regularization parameter
model.fit(X_train, y_train)

# Make predictions on the test set
y_pred = model.predict(X_test)

# Evaluate the model
accuracy = accuracy_score(y_test, y_pred)
report = classification_report(y_test, y_pred)

# Output the accuracy and classification report
print("Accuracy:", accuracy)
print("Classification Report:\n", report)


1. What is a Support Vector Machine (SVM)?

• Support Vector Machine (SVM) is a supervised machine learning algorithm primarily used for classification tasks (though it can also be adapted for regression). It works by finding the optimal hyperplane that separates data points of different classes in a way that maximizes the distance (or margin) between the closest points of each class.

• Purpose: SVM aims to create a robust classification model that minimizes misclassification by creating the largest possible separation (margin) between classes. This separation helps the model generalize well to new data.

2. Concept of the Margin in SVM

• Margin: The margin in SVM is the distance between the hyperplane (decision boundary) and the closest data points of each class, known as support vectors. The margin represents the "buffer zone" around the hyperplane, separating classes.

• Significance of Maximizing the Margin: Maximizing the margin is crucial because:

o It increases the separation between classes, which helps the model be less sensitive to slight variations or noise in the data.

o A larger margin generally means better generalization, as the model is less likely to overfit.

o The larger the margin, the more "robust" the classifier is, as it reduces the risk of misclassifying points near the boundary (the formulation below makes this precise).
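
In symbols, this is the standard hard-margin formulation (a textbook statement, not quoted from the text above): the margin equals 2/||w||, so maximizing the margin amounts to minimizing ||w||.

% Hard-margin SVM: maximize the margin 2/||w|| by minimizing ||w||^2,
% while keeping every training point (x_i, y_i), with y_i in {-1, +1},
% on the correct side of the hyperplane w^T x + b = 0.
\min_{w,\, b} \ \tfrac{1}{2}\lVert w \rVert^{2}
\quad \text{subject to} \quad
y_i \, (w^{\top} x_i + b) \ge 1, \qquad i = 1, \dots, N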

3. What is a Hyperplane in SVM?

• Hyperplane: In the context of SVM, a hyperplane is a flat decision boundary that separates data points in different classes. It has one dimension less than the data space (for example, a line in 2D space, a plane in 3D space).

• Role in Separating Classes: The hyperplane is positioned such that it divides the data points in a way that maximizes the margin between classes. This separation is what SVM optimizes for, aiming to reduce classification errors by placing the hyperplane at an optimal position between the classes.

4. Support Vectors and Their Importance in SVM

• Support Vectors: Support vectors are the data points that lie closest to the hyperplane. These are the critical points that determine the position and orientation of the hyperplane.

• Influence on Hyperplane:

o Support vectors are essential because they define the margin's boundaries. If any support vector moves, it directly influences the placement of the hyperplane.

o Only the support vectors (not all data points) affect the hyperplane's positioning. This focus on a subset of data points makes SVM computationally efficient (the sketch below shows how to inspect them in scikit-learn).

o The support vectors ensure that the margin is maximized while maintaining correct classification. Their position on the edges of the margin makes the hyperplane as robust as possible without overfitting.
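
As a quick check of this point, the following sketch (reusing the setosa-vs-rest Iris setup from the code earlier) prints how few of the training points actually end up as support vectors.

from sklearn import datasets
from sklearn.svm import SVC

iris = datasets.load_iris()
X = iris.data
y = (iris.target == 0).astype(int)   # setosa vs. not setosa, as in the code above

model = SVC(kernel='linear', C=1.0).fit(X, y)

print("Training points:", len(X))                      # 150 in total
print("Support vectors per class:", model.n_support_)  # only a handful of points
print("Indices of the support vectors:", model.support_)
# Points that are not support vectors could be removed and the refitted
# hyperplane would stay the same; moving a support vector shifts it.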

In summary, SVM is a powerful classification tool that leverages the concept of a hyperplane and support vectors to create a high-margin, optimal separation between classes, leading to strong generalization on unseen data.

1. Linear Regression vs. Logistic Regression

• Linear Regression: Linear regression is a statistical method used to model the relationship between a continuous dependent variable and one or more independent variables. The objective is to find the best-fitting line (or hyperplane in multiple dimensions) that minimizes the difference between predicted and actual values, thus allowing for accurate predictions of a continuous outcome.

• Logistic Regression: Logistic regression, while also a type of regression, is used for classification tasks, particularly for binary outcomes (e.g., yes/no, 0/1). Instead of predicting a continuous value, logistic regression estimates the probability that an observation belongs to a particular class. It outputs values between 0 and 1 by applying a logistic (sigmoid) function to a linear combination of input features. (A short code sketch contrasting the two follows.)
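
A short sketch of the contrast, with tiny invented datasets (the house-price and study-hours numbers are made up for illustration): LinearRegression returns a continuous value, while LogisticRegression returns a class.

import numpy as np
from sklearn.linear_model import LinearRegression, LogisticRegression

# Linear regression: continuous target (house size in sq. ft. -> price)
sizes = np.array([[1000], [1500], [2000], [2500]])
prices = np.array([200000, 250000, 300000, 350000])
lin = LinearRegression().fit(sizes, prices)
print("Predicted price for 1800 sq. ft.:", lin.predict([[1800]])[0])    # a continuous value

# Logistic regression: binary target (hours studied -> pass/fail)
hours = np.array([[1], [2], [3], [4], [5], [6]])
passed = np.array([0, 0, 0, 1, 1, 1])
log_model = LogisticRegression().fit(hours, passed)
print("Predicted class for 3.5 hours:", log_model.predict([[3.5]])[0])  # 0 or 1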

2. Interpreting the Outputs of Linear and Logistic Regression Models

• Linear Regression:

o The output is a point estimate, which is a continuous value representing the predicted outcome.

o For instance, if predicting house prices, the model may output a single value (e.g., $250,000) as the estimated price based on the input features.

o The prediction is made by plugging the input values into the linear equation ŷ = β0 + β1x1 + β2x2 + ⋯ + βnxn.
• Logistic Regression:

o The output is a probability that an observation belongs to the positive class (class 1), with values ranging from 0 to 1.

o This probability is calculated by applying the logistic (sigmoid) function to the linear equation: p = 1 / (1 + e^-(β0 + β1x1 + ⋯ + βnxn)).

o Predictions are made by setting a threshold (e.g., 0.5): if the probability is above the threshold, the observation is classified into the positive class (1); otherwise, it’s classified into the negative class (0).

o Logistic regression also gives us the odds (ratio of probabilities) of an outcome occurring, which is particularly useful in interpreting model output in terms of risk or likelihood. (See the sketch below.)
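
A minimal, self-contained sketch of these three outputs, probability, threshold, and odds, again using invented study-hours data:

import numpy as np
from sklearn.linear_model import LogisticRegression

# Toy data: hours studied -> pass (1) / fail (0)
hours = np.array([[1], [2], [3], [4], [5], [6]])
passed = np.array([0, 0, 0, 1, 1, 1])
model = LogisticRegression().fit(hours, passed)

p = model.predict_proba([[3.5]])[0, 1]   # probability of the positive class (pass)
predicted_class = int(p >= 0.5)          # classify by thresholding at 0.5
odds = p / (1 - p)                       # odds of the outcome occurring

print("probability:", round(p, 3))
print("predicted class:", predicted_class)
print("odds:", round(odds, 3))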

3. Loss Functions in Linear and Logistic Regression

• Linear Regression – Mean Squared Error (MSE):

o Linear regression commonly uses the Mean Squared Error (MSE) as its loss function.

o MSE Formula: MSE = (1/N) Σᵢ (yᵢ − ŷᵢ)², where yᵢ is the actual value, ŷᵢ is the predicted value, and N is the number of observations.

o Why MSE is Appropriate: MSE measures the average squared difference between the actual and predicted values, penalizing large deviations more heavily. This approach fits linear regression well because it seeks to minimize error for continuous outcomes, ensuring the best line fit by reducing the total squared error.

• Logistic Regression – Log Loss (Cross-Entropy):

o Logistic regression uses log loss (or cross-entropy loss) as its loss function.

o Log Loss Formula: Log Loss = −(1/N) Σᵢ [yᵢ·log(ŷᵢ) + (1 − yᵢ)·log(1 − ŷᵢ)], where ŷᵢ is the predicted probability of class 1.

o Why Log Loss is Appropriate: Log loss measures how far each predicted probability diverges from the true binary outcome (0 or 1). It penalizes predictions based on the confidence of the incorrect prediction, encouraging the model to provide probabilities close to the true class labels. This fits logistic regression well since it is focused on probabilities and classification, rather than point estimates. (Both losses are computed in the sketch below.)
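
Both losses are available in scikit-learn's metrics module; the numbers below are invented purely to illustrate the calls.

from sklearn.metrics import mean_squared_error, log_loss

# Linear regression: continuous targets vs. continuous predictions
y_true = [250000, 300000, 180000]
y_pred = [240000, 310000, 200000]
print("MSE:", mean_squared_error(y_true, y_pred))

# Logistic regression: true binary labels vs. predicted probabilities of class 1
labels = [1, 0, 1, 1]
probs = [0.9, 0.2, 0.7, 0.4]
print("Log loss:", log_loss(labels, probs))
# A confident wrong prediction (e.g., label 1 predicted with probability 0.05)
# would increase the log loss far more than a cautious prediction near 0.5.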

Summary:

• Linear Regression provides continuous predictions and minimizes MSE to achieve the best fit.

• Logistic Regression predicts probabilities, uses log loss to focus on accurate class predictions, and assigns greater penalties for confident misclassifications, aiding in robust binary classification.
