Machine Learning
Lecturer: Ya-Mei Chang
Office: Room 446, ext 66117
[email protected]
Textbook
Title: An Introduction to Statistical Learning: with Applications in R, 2021
Authors: G. James, D. Witten, T. Hastie and R. Tibshirani
Reference Book
Title: The Elements of Statistical Learning: Data Mining, Inference, and Prediction
Authors: T. Hastie, R. Tibshirani and J. Friedman
Grading:
⚫ Attendance 10%
⚫ Coursework 30%
⚫ Midterm Exam 30%
⚫ Final Report 30%
Office hours:
Tue. 10:00~11:00
Thu. 10:00~11:00
What Is Statistical Learning?
⚫ An example: predicting a response Y (e.g., product sales) from a predictor X (e.g., TV advertising budget).
[Figure: scatter plot of the response Y against the predictor X]
More generally, suppose that we observe a quantitative response Y and p
different predictors, X1, X2, . . . , Xp. We assume that there is some relationship
between Y and X = (X1, X2, . . . , Xp), which can be written in the very general form

Y = f(X) + ε.

Here f is some fixed but unknown function of X1, . . . , Xp, and ε is a random
error term, which is independent of X and has mean zero. In this formulation, f
represents the systematic information that X provides about Y.
⚫ In essence, statistical learning refers to a set of approaches for estimating f.
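To make this concrete, here is a minimal R sketch (not from the text; the form of f, the sample size, and the error variance are illustrative assumptions) that simulates data from Y = f(X) + ε and estimates f with a smoother:

# Simulate from Y = f(X) + eps for a made-up f, then estimate f from the data
set.seed(1)
n <- 100
x <- runif(n, 0, 10)                 # predictor
f <- function(x) 2 + 3 * sin(x / 2)  # "true" f (an assumption for illustration)
eps <- rnorm(n, mean = 0, sd = 1)    # error term: independent of X, mean zero
y <- f(x) + eps
fit <- loess(y ~ x)                  # one possible nonparametric estimate of f
plot(x, y)
lines(sort(x), predict(fit)[order(x)], col = "red", lwd = 2)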
Linear Regression
⚫ Simple Linear Regression
➢ Assumption:
It assumes that there is approximately a linear relationship between X and
Y. Mathematically, we can write this linear relationship as

Y ≈ β0 + β1 X.

β0 and β1 are two unknown constants that represent the intercept and
slope terms in the linear model. Once we have used our training data to
produce estimates β̂0 and β̂1 for the model coefficients, we can predict
future sales on the basis of a particular value of TV advertising by
computing

ŷ = β̂0 + β̂1 x,

where ŷ indicates a prediction of Y on the basis of X = x.
➢ Estimating the Coefficients:
Let (x1, y1), (x2, y2), . . . , (xn, yn) represent n observation pairs, each of which
consists of a measurement of X and a measurement of Y. Let
ŷi = β̂0 + β̂1 xi be the prediction for Y based on the ith value of X. Then
ei = yi − ŷi represents the ith residual: the difference between the
ith observed response value and the ith response value that is predicted by
our linear model. We define the residual sum of squares (RSS) as

RSS = e1² + e2² + · · · + en²,

or equivalently

RSS = (y1 − β̂0 − β̂1 x1)² + (y2 − β̂0 − β̂1 x2)² + · · · + (yn − β̂0 − β̂1 xn)².

The least squares approach chooses β̂0 and β̂1 to minimize the RSS.
Using some calculus, one can show that the minimizers are

β̂1 = Σ (xi − x̄)(yi − ȳ) / Σ (xi − x̄)²,
β̂0 = ȳ − β̂1 x̄,

where the sums run over i = 1, . . . , n, and ȳ = Σ yi / n and x̄ = Σ xi / n are the sample means.
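As a quick check of these formulas, a small R sketch (simulated data; the true intercept and slope are illustrative assumptions) computes the least squares estimates by hand and compares them with lm():

set.seed(1)
x <- rnorm(50)
y <- 1 + 2 * x + rnorm(50)  # true intercept 1, slope 2 (made up)
beta1.hat <- sum((x - mean(x)) * (y - mean(y))) / sum((x - mean(x))^2)
beta0.hat <- mean(y) - beta1.hat * mean(x)
c(beta0.hat, beta1.hat)     # hand computation
coef(lm(y ~ x))             # should agree with the line above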
➢ Assessing the Accuracy of the Coefficient Estimates:
1. About μ̂ (the estimate of the population mean μ of Y):
its standard error satisfies SE(μ̂)² = σ²/n. In general, σ² is not known; σ is
estimated by the residual standard error (RSE), given by the formula
RSE = √(RSS / (n − 2)).
2. About β̂0,
SE(β̂0)² = σ² [1/n + x̄² / Σ (xi − x̄)²], and the 95% confidence interval
for β0 approximately takes the form

β̂0 ± 2 · SE(β̂0).

3. About β̂1,
SE(β̂1)² = σ² / Σ (xi − x̄)², and the 95% confidence interval for β1
approximately takes the form

β̂1 ± 2 · SE(β̂1).

Standard errors can also be used to perform hypothesis tests on the coefficients:
H0 : There is no relationship between X and Y
versus
Ha : There is some relationship between X and Y.
Mathematically, this corresponds to testing

H0 : β1 = 0 versus Ha : β1 ≠ 0,

which is carried out by computing the t-statistic t = (β̂1 − 0) / SE(β̂1); under H0
this follows a t-distribution with n − 2 degrees of freedom.
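A short R sketch (simulated data with made-up coefficients) verifying that the approximate interval β̂1 ± 2 · SE(β̂1) closely matches the exact interval from confint():

set.seed(1)
x <- rnorm(100)
y <- 3 + 0.5 * x + rnorm(100)    # made-up true model
fit <- lm(y ~ x)
se <- coef(summary(fit))["x", "Std. Error"]
coef(fit)["x"] + c(-2, 2) * se   # approximate 95% CI
confint(fit)["x", ]              # exact 95% CI (uses t quantiles)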
➢ Assessing the Accuracy of the Model:
1. RSE: RSE = √(RSS / (n − 2)) measures the lack of fit of the model, in the units of Y.
2. R² statistic: R² = (TSS − RSS) / TSS = 1 − RSS / TSS, where TSS = Σ (yi − ȳ)²
is the total sum of squares; R² is the proportion of variability in Y that can be
explained using X.
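Both measures are easy to compute by hand; a minimal sketch (same simulated data as above, an illustrative assumption):

set.seed(1)
x <- rnorm(100)
y <- 3 + 0.5 * x + rnorm(100)
fit <- lm(y ~ x)
rss <- sum(residuals(fit)^2)
tss <- sum((y - mean(y))^2)
sqrt(rss / (length(y) - 2))   # RSE by hand; compare summary(fit)$sigma
1 - rss / tss                 # R^2 by hand; compare summary(fit)$r.squared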
⚫ Multiple Linear Regression
➢ Assumption:
Y = β0 + β1 X1 + · · · + βp Xp + ε
➢ Estimating the Coefficients:
We choose β̂0, β̂1, . . . , β̂p to minimize the sum of squared residuals

RSS = Σ (yi − ŷi)² = Σ (yi − β̂0 − β̂1 xi1 − · · · − β̂p xip)²,

with the sums over i = 1, . . . , n.
➢ Some Important Questions
1. Is at least one of the predictors X1,X2, . . . , Xp useful in predicting
the response?
◼ We test the null hypothesis,
H0 : β1 = β2 = ···= βp = 0
versus
Ha : at least one βj is non-zero.
This hypothesis test is performed by computing the F-statistic

F = [(TSS − RSS) / p] / [RSS / (n − p − 1)],

where TSS = Σ (yi − ȳ)² and RSS = Σ (yi − ŷi)².
◼ Sometimes we wish to test whether a particular subset of q of the
coefficients is zero, say the last q, βp−q+1, . . . , βp, on the suspicion that
those variables may not be useful in prediction.
This corresponds to a null hypothesis
H0 : βp−q+1 = βp−q+2 = . . . = βp = 0.
In this case we fit a second model that uses all the variables except those
last q. Suppose that the residual sum of squares for that model is RSS0.
Then the appropriate F-statistic is

F = [(RSS0 − RSS) / q] / [RSS / (n − p − 1)]

(an anova() sketch of this test follows the list below).
2. Do all the predictors help to explain Y , or is only a subset of the
predictors useful?
◼ Variable selection: classical approaches include forward selection,
backward selection, and mixed selection (a backward-selection sketch
also follows this list).
3. How well does the model fit the data?
◼ As in simple regression, the RSE and the R² statistic are the most
common numerical measures of model fit.
4. Given a set of predictor values, what response value should we
predict, and how accurate is our prediction?
◼ We use a confidence interval to quantify the uncertainty around the
average response f(X), and a (wider) prediction interval to quantify the
uncertainty around an individual response Y = f(X) + ε.
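For question 1, the partial F-test for dropping the last q variables can be carried out in R with anova() on nested models; a minimal sketch using the Boston data from the lab below, with age and indus as the q = 2 candidate variables (an illustrative choice):

library(ISLR2)
full    <- lm(medv ~ ., data = Boston)                # all predictors
reduced <- lm(medv ~ . - age - indus, data = Boston)  # drop the last q = 2
anova(reduced, full)  # F-statistic tests H0: the dropped coefficients are 0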
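For question 2, one convenient way to do backward selection in R is step(); note this is an AIC-based shortcut, not the text's prescribed method:

library(ISLR2)
full <- lm(medv ~ ., data = Boston)
back <- step(full, direction = "backward", trace = 0)  # drop predictors by AIC
formula(back)  # the selected model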
Computer Session
➢ Simple Linear Regression
library (MASS)
library (ISLR2)
##
## Attaching package: 'ISLR2'
## The following object is masked from 'package:MASS':
##
## Boston
head (Boston)
## crim zn indus chas nox rm age dis rad tax ptratio lstat medv
## 1 0.00632 18 2.31 0 0.538 6.575 65.2 4.0900 1 296 15.3 4.98 24.0
## 2 0.02731 0 7.07 0 0.469 6.421 78.9 4.9671 2 242 17.8 9.14 21.6
## 3 0.02729 0 7.07 0 0.469 7.185 61.1 4.9671 2 242 17.8 4.03 34.7
## 4 0.03237 0 2.18 0 0.458 6.998 45.8 6.0622 3 222 18.7 2.94 33.4
## 5 0.06905 0 2.18 0 0.458 7.147 54.2 6.0622 3 222 18.7 5.33 36.2
## 6 0.02985 0 2.18 0 0.458 6.430 58.7 6.0622 3 222 18.7 5.21 28.7
lm.fit <- lm(medv~lstat, data=Boston)
attach(Boston)
lm.fit<- lm(medv~lstat)
lm.fit
##
## Call:
## lm(formula = medv ~ lstat)
##
## Coefficients:
## (Intercept) lstat
## 34.55 -0.95
summary(lm.fit)
##
## Call:
## lm(formula = medv ~ lstat)
##
## Residuals:
## Min 1Q Median 3Q Max
## -15.168 -3.990 -1.318 2.034 24.500
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 34.55384 0.56263 61.41 <2e-16 ***
## lstat -0.95005 0.03873 -24.53 <2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 6.216 on 504 degrees of freedom
## Multiple R-squared: 0.5441, Adjusted R-squared: 0.5432
## F-statistic: 601.6 on 1 and 504 DF, p-value: < 2.2e-16
names(lm.fit)
## [1] "coefficients" "residuals" "effects" "rank"
## [5] "fitted.values" "assign" "qr" "df.residual"
## [9] "xlevels" "call" "terms" "model"
coef(lm.fit)
## (Intercept) lstat
## 34.5538409 -0.9500494
confint(lm.fit)
## 2.5 % 97.5 %
## (Intercept) 33.448457 35.6592247
## lstat -1.026148 -0.8739505
predict(lm.fit, data.frame(lstat = (c(5, 10, 15) )),interval = "confidence")
## fit lwr upr
## 1 29.80359 29.00741 30.59978
## 2 25.05335 24.47413 25.63256
## 3 20.30310 19.73159 20.87461
predict(lm.fit, data.frame(lstat = (c(5, 10, 15) )),interval = "prediction")
## fit lwr upr
## 1 29.80359 17.565675 42.04151
## 2 25.05335 12.827626 37.27907
## 3 20.30310 8.077742 32.52846
plot(lstat,medv)
abline(lm.fit)
abline(lm.fit,lwd=3)
abline(lm.fit,lwd=3,col="red")
plot(lstat,medv, col = "red")
plot(lstat,medv, pch=20)
plot(lstat,medv,pch="+")
plot (1:20 , 1:20 , pch = 1:20)
par(mfrow = c(2, 2))
plot(lm.fit)
plot(predict(lm.fit), residuals(lm.fit))
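Beyond the default diagnostics, the lab in the text also examines studentized residuals and leverage statistics:

plot(predict(lm.fit), rstudent(lm.fit))  # studentized residuals vs fitted values
plot(hatvalues(lm.fit))                  # leverage statistic for each observation
which.max(hatvalues(lm.fit))             # observation with the largest leverage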
➢ Multiple Linear Regression
lm.fit <- lm(medv~lstat+age, data=Boston)
summary(lm.fit)
## Call:
## lm(formula = medv ~ lstat + age, data = Boston)
##
## Residuals:
## Min 1Q Median 3Q Max
## -15.981 -3.978 -1.283 1.968 23.158
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 33.22276 0.73085 45.458 < 2e-16 ***
## lstat -1.03207 0.04819 -21.416 < 2e-16 ***
## age 0.03454 0.01223 2.826 0.00491 **
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 6.173 on 503 degrees of freedom
## Multiple R-squared: 0.5513, Adjusted R-squared: 0.5495
## F-statistic: 309 on 2 and 503 DF, p-value: < 2.2e-16
lm.fit <- lm(medv~., data=Boston)
summary(lm.fit)
##
## Call:
## lm(formula = medv ~ ., data = Boston)
##
## Residuals:
## Min 1Q Median 3Q Max
## -15.1304 -2.7673 -0.5814 1.9414 26.2526
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 41.617270 4.936039 8.431 3.79e-16 ***
## crim -0.121389 0.033000 -3.678 0.000261 ***
## zn 0.046963 0.013879 3.384 0.000772 ***
## indus 0.013468 0.062145 0.217 0.828520
## chas 2.839993 0.870007 3.264 0.001173 **
## nox -18.758022 3.851355 -4.870 1.50e-06 ***
## rm 3.658119 0.420246 8.705 < 2e-16 ***
## age 0.003611 0.013329 0.271 0.786595
## dis -1.490754 0.201623 -7.394 6.17e-13 ***
## rad 0.289405 0.066908 4.325 1.84e-05 ***
## tax -0.012682 0.003801 -3.337 0.000912 ***
## ptratio -0.937533 0.132206 -7.091 4.63e-12 ***
## lstat -0.552019 0.050659 -10.897 < 2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## Residual standard error: 4.798 on 493 degrees of freedom
## Multiple R-squared: 0.7343, Adjusted R-squared: 0.7278
## F-statistic: 113.5 on 12 and 493 DF, p-value: < 2.2e-16
library (car)
## Loading required package: carData
vif(lm.fit)
## crim zn indus chas nox rm age dis
## 1.767486 2.298459 3.987181 1.071168 4.369093 1.912532 3.088232 3.954037
## rad tax ptratio lstat
## 7.445301 9.002158 1.797060 2.870777
lm.fit1<-lm( medv~.-age, data = Boston)
summary (lm.fit1)
## Call:
## lm(formula = medv ~ . - age, data = Boston)
##
## Residuals:
## Min 1Q Median 3Q Max
## -15.1851 -2.7330 -0.6116 1.8555 26.3838
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 41.525128 4.919684 8.441 3.52e-16 ***
## crim -0.121426 0.032969 -3.683 0.000256 ***
## zn 0.046512 0.013766 3.379 0.000785 ***
## indus 0.013451 0.062086 0.217 0.828577
## chas 2.852773 0.867912 3.287 0.001085 **
## nox -18.485070 3.713714 -4.978 8.91e-07 ***
## rm 3.681070 0.411230 8.951 < 2e-16 ***
## dis -1.506777 0.192570 -7.825 3.12e-14 ***
## rad 0.287940 0.066627 4.322 1.87e-05 ***
## tax -0.012653 0.003796 -3.333 0.000923 ***
## ptratio -0.934649 0.131653 -7.099 4.39e-12 ***
## lstat -0.547409 0.047669 -11.483 < 2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 4.794 on 494 degrees of freedom
## Multiple R-squared: 0.7343, Adjusted R-squared: 0.7284
## F-statistic: 124.1 on 11 and 494 DF, p-value: < 2.2e-16
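An equivalent way to refit without retyping the whole formula is update(); a short sketch that additionally drops indus, whose coefficient is also non-significant in the output above:

lm.fit2 <- update(lm.fit1, ~ . - indus)  # same model minus indus
summary(lm.fit2)$coefficients            # remaining coefficient estimates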