0% found this document useful (0 votes)

20 views4 pages

Multiple Regression for Analysts

Uploaded by

pedropinto8400

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views4 pages

Multiple Regression for Analysts

Uploaded by

pedropinto8400

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

MultipleRegression

March 14, 2024

1 Regressão múltipla
• E se mais de uma variável influenciar o que está sendo interessado?
• Exemplo: predizer o preço de um carro com base em seus vários atributos.
• Se também houver multiplas variáveis dependentes - coisas que estão tentando ser previstas
- isso é uma regressão multivariável.

1.0.1 Ainda usa least squares

• A unica diferença é que agora terá coeficientes diferentes para cada fator.
• Esses coeficientes implicam no quão importante cada fator realmente é, se os dados estiverem
normalizados.
• Se é livrado de variáveis que não influenciam.
• Ainda pode medir a adequação com r-squared.
• Precisa assumir que os diferentes fatores não são dependentes uns dos outros.

1.1 Pratica
[2]: import pandas as pd

df = pd.read_excel('cars.xls')

[3]: %matplotlib inline

import numpy as np
df1=df[['Mileage','Price']]
bins = np.arange(0,50000,10000)
groups = df1.groupby(pd.cut(df1['Mileage'],bins)).mean()
print(groups.head())
groups['Price'].plot.line()

Mileage Price
Mileage
(0, 10000] 5588.629630 24096.714451
(10000, 20000] 15898.496183 21955.979607
(20000, 30000] 24114.407104 20278.606252
(30000, 40000] 33610.338710 19463.670267
/tmp/ipykernel_12254/679127490.py:5: FutureWarning: The default of
observed=False is deprecated and will be changed to True in a future version of

1
pandas. Pass observed=False to retain current behavior or observed=True to adopt
the future default and silence this warning.
groups = df1.groupby(pd.cut(df1['Mileage'],bins)).mean()

[3]: <Axes: xlabel='Mileage'>

[4]: import statsmodels.api as sm

from sklearn.preprocessing import StandardScaler
scale = StandardScaler()

X = df[['Mileage', 'Cylinder', 'Doors']]

y = df['Price']

X[['Mileage', 'Cylinder', 'Doors']] = scale.fit_transform(X[['Mileage',␣

↪'Cylinder', 'Doors']].values)

X = sm.add_constant(X)

print (X)

est = sm.OLS(y, X).fit()

2
print(est.summary())

const Mileage Cylinder Doors

0 1.0 -1.417485 0.52741 0.556279
1 1.0 -1.305902 0.52741 0.556279
2 1.0 -0.810128 0.52741 0.556279
3 1.0 -0.426058 0.52741 0.556279
4 1.0 0.000008 0.52741 0.556279
.. … … … …
799 1.0 -0.439853 0.52741 0.556279
800 1.0 -0.089966 0.52741 0.556279
801 1.0 0.079605 0.52741 0.556279
802 1.0 0.750446 0.52741 0.556279
803 1.0 1.932565 0.52741 0.556279

[804 rows x 4 columns]

OLS Regression Results
==============================================================================
Dep. Variable: Price R-squared: 0.360
Model: OLS Adj. R-squared: 0.358
Method: Least Squares F-statistic: 150.0
Date: Thu, 14 Mar 2024 Prob (F-statistic): 3.95e-77
Time: 16:21:34 Log-Likelihood: -8356.7
No. Observations: 804 AIC: 1.672e+04
Df Residuals: 800 BIC: 1.674e+04
Df Model: 3
Covariance Type: nonrobust
==============================================================================
coef std err t P>|t| [0.025 0.975]
------------------------------------------------------------------------------
const 2.134e+04 279.405 76.388 0.000 2.08e+04 2.19e+04
Mileage -1272.3412 279.567 -4.551 0.000 -1821.112 -723.571
Cylinder 5587.4472 279.527 19.989 0.000 5038.754 6136.140
Doors -1404.5513 279.446 -5.026 0.000 -1953.085 -856.018
==============================================================================
Omnibus: 157.913 Durbin-Watson: 0.069
Prob(Omnibus): 0.000 Jarque-Bera (JB): 257.529
Skew: 1.278 Prob(JB): 1.20e-56
Kurtosis: 4.074 Cond. No. 1.03
==============================================================================

Notes:
[1] Standard Errors assume that the covariance matrix of the errors is correctly
specified.
/tmp/ipykernel_12254/1575598944.py:8: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

3
See the caveats in the documentation: https://pandas.pydata.org/pandas-
docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
X[['Mileage', 'Cylinder', 'Doors']] = scale.fit_transform(X[['Mileage',
'Cylinder', 'Doors']].values)

[5]: y.groupby(df.Doors).mean()

[5]: Doors
2 23807.135520
4 20580.670749
Name: Price, dtype: float64

[11]: scaled = scale.transform([[45000, 8, 4]])

scaled = np.insert(scaled[0], 0, 1)
print(scaled)
predicted = est.predict(scaled)
print(predicted)

[1. 3.07256589 1.96971667 0.55627894]

[27658.15707316]

ADHD Assessment
No ratings yet
ADHD Assessment
6 pages
Boss ME-10 Service Manual
50% (2)
Boss ME-10 Service Manual
23 pages
Linear Regression Guide for Data Analysts
No ratings yet
Linear Regression Guide for Data Analysts
16 pages
Multiple Regression1
No ratings yet
Multiple Regression1
27 pages
SiddharthShah 1032221195 DivC 50 DL LabAssignment2
No ratings yet
SiddharthShah 1032221195 DivC 50 DL LabAssignment2
7 pages
Exp 2 (Multiple Linear Regression)
No ratings yet
Exp 2 (Multiple Linear Regression)
6 pages
ML Unit
No ratings yet
ML Unit
23 pages
Python Data Analysis Guide
No ratings yet
Python Data Analysis Guide
171 pages
Experiment 4 ML
No ratings yet
Experiment 4 ML
9 pages
Multiple Linear Regression
No ratings yet
Multiple Linear Regression
3 pages
Linear Regression for Data Science
No ratings yet
Linear Regression for Data Science
30 pages
Multiple Linear Regression 3
No ratings yet
Multiple Linear Regression 3
68 pages
ML0101EN Reg Mulitple Linear Regression Co2 Py v1
No ratings yet
ML0101EN Reg Mulitple Linear Regression Co2 Py v1
5 pages
Linear Regression
100% (1)
Linear Regression
16 pages
Pro Yec To Machine Learning
No ratings yet
Pro Yec To Machine Learning
35 pages
PLSandPLSDA - Torino2021 - Federico Marini
No ratings yet
PLSandPLSDA - Torino2021 - Federico Marini
53 pages
Decision Tree
No ratings yet
Decision Tree
4 pages
Mlmultiplelinearregression 170919114353 PDF
No ratings yet
Mlmultiplelinearregression 170919114353 PDF
8 pages
1 Regression
No ratings yet
1 Regression
4 pages
Multi Regression
No ratings yet
Multi Regression
12 pages
Oil Export Indonesia
100% (1)
Oil Export Indonesia
12 pages
Assignment AI-ML
No ratings yet
Assignment AI-ML
13 pages
Exp - 6-Model Development - SDK - Ok
No ratings yet
Exp - 6-Model Development - SDK - Ok
11 pages
Exercises D'application Regression Analysis
No ratings yet
Exercises D'application Regression Analysis
4 pages
Assignment 2 ML
No ratings yet
Assignment 2 ML
11 pages
DSBDAL - Assignment No 4
No ratings yet
DSBDAL - Assignment No 4
15 pages
Experiment 7 ML Vtu
No ratings yet
Experiment 7 ML Vtu
5 pages
Linear Regression
No ratings yet
Linear Regression
3 pages
Zerox Ready
No ratings yet
Zerox Ready
21 pages
Multiple Linear Regression
No ratings yet
Multiple Linear Regression
10 pages
ML LN 3
No ratings yet
ML LN 3
44 pages
Introduction To R Program and Output
No ratings yet
Introduction To R Program and Output
6 pages
ML Linear Regression Trupesh Patel
No ratings yet
ML Linear Regression Trupesh Patel
23 pages
LR LogReg
No ratings yet
LR LogReg
53 pages
Multiple Regression Analysis
No ratings yet
Multiple Regression Analysis
10 pages
Machine Learning
No ratings yet
Machine Learning
10 pages
Xətti Reqressiya Modelinin Qurulması
No ratings yet
Xətti Reqressiya Modelinin Qurulması
4 pages
Linear Regression Analysis Report
No ratings yet
Linear Regression Analysis Report
12 pages
Linear Regression in Scikit-Learn (Sklearn) - An Introduction - Datagy
No ratings yet
Linear Regression in Scikit-Learn (Sklearn) - An Introduction - Datagy
22 pages
Python Linear Regression Guide
No ratings yet
Python Linear Regression Guide
1 page
ML Lab-3
No ratings yet
ML Lab-3
14 pages
En Tanagra Python StatsModels PDF
No ratings yet
En Tanagra Python StatsModels PDF
20 pages
Mtcars Dataset: Multilinear Regression Analysis
No ratings yet
Mtcars Dataset: Multilinear Regression Analysis
13 pages
Lecture 3
No ratings yet
Lecture 3
42 pages
ML Exp4
No ratings yet
ML Exp4
4 pages
S2 Linear Regression LKW 9march2025
No ratings yet
S2 Linear Regression LKW 9march2025
23 pages
Regression Model
No ratings yet
Regression Model
30 pages
Machine Learning for Data Scientists
No ratings yet
Machine Learning for Data Scientists
41 pages
Data Mining Lab: Regression & Clustering
No ratings yet
Data Mining Lab: Regression & Clustering
36 pages
Introduction To Management Science: Post Mid Sessions 2 & 3 November 4 and 6 2019
No ratings yet
Introduction To Management Science: Post Mid Sessions 2 & 3 November 4 and 6 2019
26 pages
Linear Regression - Numpy and Sklearn
No ratings yet
Linear Regression - Numpy and Sklearn
7 pages
GVPCOEW-Supervised ML - Linear Regression - DONE
No ratings yet
GVPCOEW-Supervised ML - Linear Regression - DONE
24 pages
Machine Learning - Develop Machine Learning Model - Regression
No ratings yet
Machine Learning - Develop Machine Learning Model - Regression
36 pages
Linear Regression Mca Lab - Jupyter Notebook
No ratings yet
Linear Regression Mca Lab - Jupyter Notebook
2 pages
Advanced Regression with IPL Data
No ratings yet
Advanced Regression with IPL Data
25 pages
ML Manoj
No ratings yet
ML Manoj
51 pages
Day.11 What Is Multiple Linear Regression
No ratings yet
Day.11 What Is Multiple Linear Regression
10 pages
Mod2 - Multiple Linear Regression
No ratings yet
Mod2 - Multiple Linear Regression
10 pages
INSY446 - 02 - Linear Model Part 1
No ratings yet
INSY446 - 02 - Linear Model Part 1
27 pages
Ash Regression
No ratings yet
Ash Regression
11 pages
Linear Regression with Boston Housing Data
No ratings yet
Linear Regression with Boston Housing Data
14 pages
2016 - The Episteme Journal of Linguistics and Literature Vol 2 No 3 - 2.nima Saragi Analysis of Conditional OBAMA Speech
No ratings yet
2016 - The Episteme Journal of Linguistics and Literature Vol 2 No 3 - 2.nima Saragi Analysis of Conditional OBAMA Speech
28 pages
Final Examination in Major 3214 - Language Research: Table of Specification
No ratings yet
Final Examination in Major 3214 - Language Research: Table of Specification
2 pages
EfkaPB2001 TDS
No ratings yet
EfkaPB2001 TDS
2 pages
SAT Suite Question Bank - Problem Solving and Data Analysis AnsResults
No ratings yet
SAT Suite Question Bank - Problem Solving and Data Analysis AnsResults
113 pages
The Things They Carry
No ratings yet
The Things They Carry
9 pages
PHY104 Electricity Lectures 2024RevisedFinal
No ratings yet
PHY104 Electricity Lectures 2024RevisedFinal
156 pages
BHU RET Geology 2020
0% (1)
BHU RET Geology 2020
41 pages
Cohesity License Terms Overview
No ratings yet
Cohesity License Terms Overview
5 pages
GEZE - Product Data Sheet - EN - 697800130822
No ratings yet
GEZE - Product Data Sheet - EN - 697800130822
3 pages
Beltscale Handbook 03 12 TL
No ratings yet
Beltscale Handbook 03 12 TL
8 pages
4A Lesson Plan in English Grade 2: Valencia City Central School
No ratings yet
4A Lesson Plan in English Grade 2: Valencia City Central School
3 pages
Law Firm Questions
No ratings yet
Law Firm Questions
5 pages
Malaysian School Counsellors' Challenges in Job Description, Job Satisfaction and Competency
No ratings yet
Malaysian School Counsellors' Challenges in Job Description, Job Satisfaction and Competency
7 pages
Northern Black Polished Ware in India
100% (1)
Northern Black Polished Ware in India
19 pages
CO2 Fire Suppression Systems Guide
100% (2)
CO2 Fire Suppression Systems Guide
21 pages
M4 L1Assessment For Learning Using Assessment To Classify Learning and Understanding
No ratings yet
M4 L1Assessment For Learning Using Assessment To Classify Learning and Understanding
5 pages
Listof C25 Batcheswith Times&Syllabus
No ratings yet
Listof C25 Batcheswith Times&Syllabus
4 pages
RTU Specification for SCADA Systems
100% (1)
RTU Specification for SCADA Systems
18 pages
MiniROVER Data Sheet 2013 Lo 1
No ratings yet
MiniROVER Data Sheet 2013 Lo 1
2 pages
Vickers Hardness Test
No ratings yet
Vickers Hardness Test
3 pages
Infectious Smile Gui
No ratings yet
Infectious Smile Gui
4 pages
National Conference Hybrid
No ratings yet
National Conference Hybrid
5 pages
Kaldor'S Growth Theory Nancy J. Wulwick
No ratings yet
Kaldor'S Growth Theory Nancy J. Wulwick
19 pages
Substation
No ratings yet
Substation
10 pages
Sociology of Families Change Continuity and Diversity 1st Edition Ciabattari Test Bankinstant Download
100% (9)
Sociology of Families Change Continuity and Diversity 1st Edition Ciabattari Test Bankinstant Download
49 pages
3RB30461XW1
No ratings yet
3RB30461XW1
7 pages
Irislocker
No ratings yet
Irislocker
23 pages
C13-Rating A
100% (1)
C13-Rating A
5 pages

Multiple Regression for Analysts

Uploaded by

Multiple Regression for Analysts

Uploaded by

MultipleRegression

March 14, 2024

1.0.1 Ainda usa least squares

[3]: %matplotlib inline

[3]: <Axes: xlabel='Mileage'>

[4]: import statsmodels.api as sm

X = df[['Mileage', 'Cylinder', 'Doors']]

X[['Mileage', 'Cylinder', 'Doors']] = scale.fit_transform(X[['Mileage',␣

est = sm.OLS(y, X).fit()

const Mileage Cylinder Doors

[804 rows x 4 columns]

[11]: scaled = scale.transform([[45000, 8, 4]])

[1. 3.07256589 1.96971667 0.55627894]

You might also like