MULTIPLE LINEAR
REGRESSION
PRESENTERS-RANJITHA MSC 1ST YR
RAMYA MSC 1ST YR
What is Regression?
Regression analysis is a fundamental statistical technique used to model the
relationship between a dependent variable and one or more independent
variables.
When do we use Regression?
1. Measure the influence of one or more variables on another variables
2. Prediction of one variable by one or more other variables
Sir Francis Galton is credited with introducing and elucidating the concept of
regression in biostatistics.
He coined the term "regression" in the late 1800s to describe the phenomenon
of children's heights tending to revert toward the average of the population, even
if their parents were unusually tall or short.
SIMPLE REGRESSION EQUATION
The general regression equation, in its simplest form, is Y = a + bX
• Y (Dependent Variable): This is the variable you're trying to predict or
explain.
• X (Independent Variable): This is the variable you're using to predict or
explain the dependent variable.
• a (Y-intercept): This is the point where the regression line crosses the y-
axis. It represents the value of Y when X is 0.
• b (Slope): This indicates how much Y changes for every unit increase in X
MULTIPLE LINEAR
REGRESSION
Multiple Linear Regression (MLR) is a statistical method used to examine
the relationship between one continuous dependent variable (outcome)
and two or more independent variables (predictors).
It extends simple linear regression (which involves just one predictor) by
allowing multiple predictors.
EXAMPLE
Research question: What factors influence systolic blood pressure?
• Dependent variable: Systolic Blood Pressure (continuous)
• Independent variables: Age, BMI, physical activity level, smoking status
PARAMETER SIMPLE LINEAR MULTIPLE LINEAR
REGRESSION REGRESSION
Definition Models the relationship Models the relationship between
between one dependent and one dependent and two or more
one independent variable independent variable
Equation Y = a + bX Y = βο+β1X1 + β2X2 + ... + ε
Complexity Simpler dealing with one More complex due to multiple
relationship relationships
Use Cases Suitable when there is one Suitable when multiple factors
clear predictor affect the outcome
Assumptions Linearity, Independence, Same as Simple linear
Homoscedasticity, Normality regression, with added concern
of Multicollinearity
Application Predicting weight based on Predicting house prices based on
height multiple factors
MLR MODEL EQUATION
KEY ASSUMPTIONS
1.Linearity – The relationship between each predictor and the outcome is
linear.
2.Independence – Observations are independent of each other.
3.Homoscedasticity – Constant variance of residuals across levels of the
predictors.
4.Normality of residuals – Residuals should be approximately normally
distributed.
5.No multicollinearity – Predictors should not be too highly correlated with
each other.