1.4. Support Vector Machines - Scikit-Learn

Support vector machines (SVMs) are a supervised learning method used for classification and regression. SVMs work well in high dimensional spaces and in cases where the number of dimensions exceeds the number of samples. They use a subset of training points, called support vectors, to define decision boundaries, which makes them memory efficient. Different kernel functions, such as linear or radial basis function kernels, can be specified to fit nonlinear models. While effective, SVMs can overfit if the number of features is much greater than the number of samples, and they do not directly provide probability estimates. Scikit-learn supports both dense and sparse sample vectors for SVMs.

Uploaded by

Amit Deshai


10/13/23, 1:10 PM 1.4. Support Vector Machines — scikit-learn 1.3.1 documentation

1.4. Support Vector Machines

Support vector machines (SVMs) are a set of supervised learning methods used for classification, regression and outlier detection.

The advantages of support vector machines are:

Effective in high dimensional spaces.
Still effective in cases where the number of dimensions is greater than the number of samples.
Uses a subset of training points in the decision function (called support vectors), so it is also memory efficient.
Versatile: different kernel functions can be specified for the decision function. Common kernels are provided, but it is also possible to specify custom kernels.

The disadvantages of support vector machines include:

If the number of features is much greater than the number of samples, avoiding over-fitting when choosing kernel functions and the regularization term is crucial.
SVMs do not directly provide probability estimates; these are calculated using an expensive five-fold cross-validation (see Scores and probabilities, below).

The support vector machines in scikit-learn support both dense (numpy.ndarray and convertible to that by numpy.asarray) and sparse (any scipy.sparse) sample
vectors as input. However, to use an SVM to make predictions for sparse data, it must have been fit on such data. For optimal performance, use
C-ordered numpy.ndarray (dense) or scipy.sparse.csr_matrix (sparse) with dtype=float64.
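
As a minimal sketch of the sparse-input rule above, the snippet below fits an SVC on a scipy.sparse.csr_matrix and then predicts on sparse data. The toy points and the variable names (X_dense, X_sparse, X_new) are assumptions made up for illustration.

```python
import numpy as np
from scipy import sparse
from sklearn import svm

# Hypothetical toy data: four 2-D points, two classes, stored as float64.
X_dense = np.array([[0.0, 0.0], [1.0, 1.0], [0.0, 1.0], [2.0, 2.0]], dtype=np.float64)
y = np.array([0, 1, 0, 1])

# CSR is the sparse layout recommended in the text above.
X_sparse = sparse.csr_matrix(X_dense)

# Fit on sparse data so that the model can later predict on sparse data.
clf = svm.SVC(kernel="linear")
clf.fit(X_sparse, y)

# New samples are passed in the same sparse format.
X_new = sparse.csr_matrix(np.array([[2.0, 1.5]], dtype=np.float64))
pred = clf.predict(X_new)
```

Keeping training and prediction data in the same representation avoids the mismatch described in the paragraph above.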

1.4.1. Classification
SVC, NuSVC and LinearSVC are classes capable of performing binary and multi-class classification on a dataset.

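>>> # toy data and fit (assumed setup, consistent with the output below)
>>> from sklearn import svm
>>> X = [[0, 0], [1, 1]]
>>> y = [0, 1]
>>> clf = svm.SVC()
>>> clf.fit(X, y)
SVC()
>>> # get support vectors
>>> clf.support_vectors_
array([[0., 0.],
       [1., 1.]])
>>> # get indices of support vectors
>>> clf.support_
array([0, 1]...)
>>> # get number of support vectors for each class
>>> clf.n_support_
array([1, 1]...)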

Examples:
SVM: Maximum margin separating hyperplane
Non-linear SVM
SVM-Anova: SVM with univariate feature selection

1.4.1.1. Multi-class classification

SVC and NuSVC implement the “one-versus-one” approach for multi-class classification. In total, n_classes * (n_classes - 1) / 2 classifiers are constructed and each
one is trained on data from two classes. To provide a consistent interface with other classifiers, the decision_function_shape option allows monotonically transforming the
results of the “one-versus-one” classifiers to a “one-vs-rest” decision function of shape (n_samples, n_classes).

>>> X = [[0], [1], [2], [3]]
>>> Y = [0, 1, 2, 3]
>>> clf = svm.SVC(decision_function_shape='ovo')
>>> clf.fit(X, Y)
SVC(decision_function_shape='ovo')
>>> dec = clf.decision_function([[1]])
>>> dec.shape[1] # 4 classes: 4*3/2 = 6
6
>>> clf.decision_function_shape = "ovr"
>>> dec = clf.decision_function([[1]])
>>> dec.shape[1] # 4 classes
4

On the other hand, LinearSVC implements the “one-vs-the-rest” multi-class strategy, thus training n_classes models.

>>> lin_clf = svm.LinearSVC(dual="auto")
>>> lin_clf.fit(X, Y)
LinearSVC(dual='auto')

SVC (but not NuSVC) implements the parameter class_weight, which is set when the estimator is constructed. It’s a dictionary of the form {class_label : value}, where value is a floating point
number > 0 that sets the parameter C of class class_label to C * value. The figure below illustrates the decision boundary of an unbalanced problem, with and
without weight correction.
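
The effect of class_weight can be sketched on a synthetic unbalanced problem. The data set (two Gaussian blobs of 200 and 20 points) and the weight of 10 for the minority class are assumptions chosen for illustration, not recommended values.

```python
from sklearn import svm
from sklearn.datasets import make_blobs

# Hypothetical unbalanced data: 200 samples of class 0, 20 samples of class 1.
X, y = make_blobs(
    n_samples=[200, 20],
    centers=[[0.0, 0.0], [2.0, 2.0]],
    cluster_std=[1.5, 0.5],
    random_state=0,
    shuffle=False,
)

# Baseline: every class uses the same C.
plain = svm.SVC(kernel="linear", C=1.0).fit(X, y)

# Weighted: class 1 effectively uses C * 10, so errors on it cost more.
weighted = svm.SVC(kernel="linear", C=1.0, class_weight={1: 10}).fit(X, y)

# Count how many training points each model assigns to the minority class.
n_plain = int((plain.predict(X) == 1).sum())
n_weighted = int((weighted.predict(X) == 1).sum())
print(n_plain, n_weighted)
```

Comparing the two counts gives a rough, numerical view of how the weight shifts the decision boundary.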

SVC, NuSVC, SVR, NuSVR, LinearSVC, LinearSVR and OneClassSVM also implement weights for individual samples in the fit method through
the sample_weight parameter. Similar to class_weight, this sets the parameter C for the i-th example to C * sample_weight[i], which will encourage the classifier
to get these samples right. The figure below illustrates the effect of sample weighting on the decision boundary. The size of the circles is proportional to the sample
weights.
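
A minimal sketch of per-sample weighting follows. The random data, the choice of which samples to up-weight, and the factor of 5 are all assumptions for illustration.

```python
import numpy as np
from sklearn import svm

# Hypothetical data: two overlapping Gaussian clouds of 10 points each.
rng = np.random.RandomState(0)
X = np.r_[rng.randn(10, 2) + [1, 1], rng.randn(10, 2)]
y = np.array([1] * 10 + [-1] * 10)

# Start from uniform weights, then emphasize the first three samples.
sample_weight = np.ones(len(X))
sample_weight[:3] *= 5.0

# sample_weight is passed to fit, not to the constructor.
clf = svm.SVC(kernel="rbf", gamma=1.0)
clf.fit(X, y, sample_weight=sample_weight)
labels = clf.predict(X)
```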

Support Vector Regression (SVR) using linear and non-linear kernels

1.4.3. Density estimation, novelty detection

The class OneClassSVM implements a One-Class SVM which is used in outlier detection.

See Novelty and Outlier Detection for the description and usage of OneClassSVM.
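
As a rough illustration, OneClassSVM can be fitted on "normal" data only and then asked whether new points look similar. The training cloud, the test points, and the gamma and nu values below are assumptions for this sketch.

```python
import numpy as np
from sklearn.svm import OneClassSVM

# Hypothetical "normal" observations clustered around the origin.
rng = np.random.RandomState(42)
X_train = 0.3 * rng.randn(100, 2)

# One point close to the training cloud, one far away from it.
X_test = np.array([[0.0, 0.1], [4.0, 4.0]])

# Fit without labels; nu bounds the fraction of training errors.
oc = OneClassSVM(kernel="rbf", gamma=0.5, nu=0.1).fit(X_train)

# predict returns +1 for inliers and -1 for outliers.
labels = oc.predict(X_test)
```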

1.4.4. Complexity
Support Vector Machines are powerful tools, but their compute and storage requirements increase rapidly with the number of training vectors. The core of an SVM is a
quadratic programming problem (QP), separating support vectors from the rest of the training data. The QP solver used by the libsvm-based implementation scales
between $O(n_{features} \times n_{samples}^2)$ and $O(n_{features} \times n_{samples}^3)$ depending on how efficiently the libsvm cache is used in practice (dataset dependent). If the data is very sparse, $n_{features}$ should be replaced by the average
number of non-zero features in a sample vector.

For the linear case, the algorithm used in LinearSVC by the liblinear implementation is much more efficient than its libsvm-based SVC counterpart and can scale almost
linearly to millions of samples and/or features.

1.4.5. Tips on Practical Use

Avoiding data copy: For SVC, SVR, NuSVC and NuSVR, if the data passed to certain methods is not C-ordered contiguous and double precision, it will be copied
before calling the underlying C implementation. You can check whether a given numpy array is C-contiguous by inspecting its flags attribute.
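
The check mentioned above can be sketched as follows. The array X is a made-up example that is deliberately Fortran-ordered and single precision, so that both conditions fail before conversion.

```python
import numpy as np

# Hypothetical input: Fortran-ordered float32 array (not what libsvm expects).
X = np.asfortranarray(np.random.RandomState(0).rand(5, 3).astype(np.float32))

# Inspect the memory layout through the flags attribute.
is_c_contiguous = X.flags["C_CONTIGUOUS"]
is_double = X.dtype == np.float64

# Convert once, up front, to a C-ordered float64 array.
X_ready = np.ascontiguousarray(X, dtype=np.float64)
```

Doing the conversion once before repeated calls is one way to avoid paying for a copy on every call.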

For LinearSVC (and LogisticRegression) any input passed as a numpy array will be copied and converted to the liblinear internal sparse data representation
(double precision floats and int32 indices of non-zero components). If you want to fit a large-scale linear classifier without copying a dense numpy C-contiguous
double precision array as input, we suggest using the SGDClassifier class instead. The objective function can be configured to be almost the same as
the LinearSVC model.
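
One way to approximate a linear SVM objective with SGDClassifier is to combine the hinge loss with an L2 penalty, as sketched below. The data and the alpha value are assumptions for illustration; this is not an exact equivalent of any particular LinearSVC configuration.

```python
import numpy as np
from sklearn.linear_model import SGDClassifier

# Hypothetical data: two well-separated Gaussian clouds.
rng = np.random.RandomState(0)
X = np.r_[rng.randn(50, 2) + [3, 3], rng.randn(50, 2) - [3, 3]]
y = np.array([1] * 50 + [0] * 50)

# Hinge loss with an L2 penalty gives a linear SVM-style objective;
# alpha plays the role of the regularization strength.
sgd = SGDClassifier(
    loss="hinge", penalty="l2", alpha=1e-3, max_iter=1000, tol=1e-3, random_state=0
)
sgd.fit(X, y)
acc = sgd.score(X, y)
```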

Kernel cache size: For SVC, SVR, NuSVC and NuSVR, the size of the kernel cache has a strong impact on run times for larger problems. If you have enough RAM
available, it is recommended to set cache_size to a higher value than the default of 200 MB, such as 500 MB or 1000 MB.
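
For instance, the cache can be enlarged when constructing the estimator. The value of 1000 below is only an example and assumes that much RAM is actually available.

```python
from sklearn import svm

# cache_size is given in megabytes.
clf = svm.SVC(kernel="rbf", cache_size=1000)
reg = svm.SVR(kernel="rbf", cache_size=1000)
```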

Setting C: C is 1 by default and it’s a reasonable default choice. If you have a lot of noisy observations you should decrease it: decreasing C corresponds to more
regularization.

LinearSVC and LinearSVR are less sensitive to C when it becomes large, and prediction results stop improving after a certain threshold. Meanwhile,
larger C values will take more time to train, sometimes up to 10 times longer, as shown in [11].

Support Vector Machine algorithms are not scale invariant, so it is highly recommended to scale your data. For example, scale each attribute on the input vector

Different kernels are specified by the kernel parameter:

>>> linear_svc = svm.SVC(kernel='linear')
>>> linear_svc.kernel
'linear'
>>> rbf_svc = svm.SVC(kernel='rbf')
>>> rbf_svc.kernel
'rbf'

See also Kernel Approximation for a much faster and more scalable way to use RBF kernels.

1.4.6.1. Parameters of the RBF Kernel

When training an SVM with the Radial Basis Function (RBF) kernel, two parameters must be considered: C and gamma . The parameter C , common to all SVM kernels,
trades off misclassification of training examples against simplicity of the decision surface. A low C makes the decision surface smooth, while a high C aims at classifying
all training examples correctly. gamma defines how much influence a single training example has. The larger gamma is, the closer other examples must be to be affected.

Proper choice of C and gamma is critical to the SVM’s performance. One is advised to use GridSearchCV with C and gamma spaced exponentially far apart to choose
good values.
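
A minimal sketch of such a search is shown below. The iris data set and the exact grid bounds are assumptions for illustration; a real problem may need wider or finer grids.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# Exponentially spaced candidates for both parameters.
param_grid = {
    "C": np.logspace(-2, 3, 6),
    "gamma": np.logspace(-4, 1, 6),
}

# 5-fold cross-validated search over every (C, gamma) pair.
search = GridSearchCV(SVC(kernel="rbf"), param_grid, cv=5)
search.fit(X, y)

best_params = search.best_params_
best_score = search.best_score_
print(best_params, best_score)
```

A common follow-up is to run a second, finer grid around the best pair found by the coarse search.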

Examples:
RBF SVM parameters
Non-linear SVM

1.4.6.2. Custom Kernels

You can define your own kernels by either giving the kernel as a python function or by precomputing the Gram matrix.

Classifiers with custom kernels behave the same way as any other classifiers, except that:

Field support_vectors_ is now empty; only the indices of support vectors are stored in support_.
A reference (and not a copy) of the first argument in the fit() method is stored for future reference. If that array changes between the use
of fit() and predict(), you will have unexpected results.

Using Python functions as kernels
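
As a sketch, a kernel can be passed as a callable that takes two arrays of shape (n_samples_1, n_features) and (n_samples_2, n_features) and returns a matrix of shape (n_samples_1, n_samples_2). The function name my_kernel and the toy data are assumptions; the kernel here is a plain linear kernel.

```python
import numpy as np
from sklearn import svm


def my_kernel(X, Y):
    """Hypothetical custom kernel: the plain dot product (a linear kernel)."""
    return np.dot(X, Y.T)


# Toy data: two classes along the diagonal.
X = np.array([[0.0, 0.0], [1.0, 1.0], [2.0, 2.0], [3.0, 3.0]])
y = np.array([0, 0, 1, 1])

# Pass the callable itself as the kernel argument.
clf = svm.SVC(kernel=my_kernel)
clf.fit(X, y)

pred = clf.predict(np.array([[0.5, 0.5], [2.5, 2.5]]))
```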

Using the Gram matrix
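
A minimal sketch with kernel="precomputed" follows. The toy data and the choice of a linear kernel for the Gram matrix are assumptions. At fit time the matrix holds kernel values between training samples; at predict time it holds kernel values between test samples (rows) and training samples (columns).

```python
import numpy as np
from sklearn import svm

# Toy data: two classes along the diagonal.
X_train = np.array([[0.0, 0.0], [1.0, 1.0], [2.0, 2.0], [3.0, 3.0]])
y_train = np.array([0, 0, 1, 1])
X_test = np.array([[0.5, 0.5], [2.5, 2.5]])

# Gram matrix of the training set, shape (n_train, n_train).
gram_train = X_train @ X_train.T

clf = svm.SVC(kernel="precomputed")
clf.fit(gram_train, y_train)

# Kernel between test and training samples, shape (n_test, n_train).
gram_test = X_test @ X_train.T
pred = clf.predict(gram_test)
```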


Intuitively, we’re trying to maximize the margin (by minimizing $\|w\|^2 = w^T w$), while incurring a penalty when a sample is misclassified or within the margin boundary. Ideally, the
value $y_i (w^T \phi(x_i) + b)$ would be $\geq 1$ for all samples, which indicates a perfect prediction. But problems are usually not perfectly separable with a hyperplane, so we allow some
samples to be at a distance $\zeta_i$ from their correct margin boundary. The penalty term C controls the strength of this penalty, and as a result, acts as an inverse regularization
parameter (see note below).

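The dual problem to the primal is

$$\min_{\alpha} \frac{1}{2} \alpha^T Q \alpha - e^T \alpha \quad \text{subject to} \quad y^T \alpha = 0, \quad 0 \leq \alpha_i \leq C, \; i = 1, \ldots, n$$

where $e$ is the vector of all ones, and $Q$ is an $n$ by $n$ positive semidefinite matrix, $Q_{ij} \equiv y_i y_j K(x_i, x_j)$, where $K(x_i, x_j) = \phi(x_i)^T \phi(x_j)$ is the kernel. The terms $\alpha_i$ are called the dual coefficients, and they are
upper-bounded by $C$. This dual representation highlights the fact that training vectors are implicitly mapped into a higher (maybe infinite) dimensional space by the function $\phi$:
see kernel trick.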

Once the optimization problem is solved, the output of decision_function for a given sample $x$ becomes:

$$\sum_{i \in SV} y_i \alpha_i K(x_i, x) + b,$$

and the predicted class corresponds to its sign. We only need to sum over the support vectors (i.e. the samples that lie within the margin) because the dual coefficients are
zero for the other samples.

These parameters can be accessed through the attributes dual_coef_ which holds the product $y_i \alpha_i$, support_vectors_ which holds the support vectors,
and intercept_ which holds the independent term $b$.
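
The relationship between these attributes and decision_function can be checked numerically. The sketch below covers the binary case only, with an RBF kernel; the data and the gamma value are assumptions for illustration.

```python
import numpy as np
from sklearn import svm
from sklearn.metrics.pairwise import rbf_kernel

# Hypothetical binary problem: two Gaussian clouds.
rng = np.random.RandomState(0)
X = np.r_[rng.randn(20, 2) + [2, 2], rng.randn(20, 2) - [2, 2]]
y = np.array([1] * 20 + [0] * 20)

gamma = 0.5
clf = svm.SVC(kernel="rbf", gamma=gamma).fit(X, y)

X_new = np.array([[0.5, 0.0], [-1.0, 2.0]])

# Kernel values between new samples and support vectors, shape (n_new, n_SV).
K = rbf_kernel(X_new, clf.support_vectors_, gamma=gamma)

# Sum over support vectors of (y_i * alpha_i) * K(x_i, x), plus the intercept b.
manual = K @ clf.dual_coef_.ravel() + clf.intercept_

# Library result for comparison.
reference = clf.decision_function(X_new)
```

In the multi-class case dual_coef_ has a more involved layout, so this direct product applies only to two-class problems.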

Note: While SVM models derived from libsvm and liblinear use C as regularization parameter, most other estimators use alpha. The exact equivalence between the
amount of regularization of two models depends on the exact objective function optimized by the model. For example, when the estimator used is Ridge regression,
the relation between them is given as $C = \frac{1}{alpha}$.

LinearSVC

NuSVC

1.4.7.2. SVR
