What does multiple linear regression model?
Models relationship between numeric DV & several IV (numeric or categorical).
What is another name for multiple linear regression?
Multivariate linear regression.
What is the multiple linear regression equation?
Y = B0 + B1X1 + B2X2 + … + BpXp + ε
What does the fitted line represent in multiple regression?
best-fit line through a multi-dimensional data space (multi-dimensional scatter plot).
What can the independent variables in a multiple regression model be?
Numeric or categorical.
What type of dependent variable does MLR require?
multiple linear regression
Numeric (continuous) dependent variable.
How many beta coefficients (β) are estimated in multiple linear regression?
One for each predictor variable.
What does each beta coefficient represent?
Estimated change (increase/decrease) in DV (Y) for each 1-unit increase in predictor variable
* assuming all other predictors are constant.
* controlling/adjusting for the effects of other predictors variables
What does “controlling for the effects of other predictors” mean?
Holding all other variables constant while estimating the unique effect of one predictor on the outcome.
What does 𝛽0 represent for MLR equation?
intercept - predicted value of 𝑌 when all 𝑋 variables are equal to 0.
What does
Y represent in the MLR equation?
Dependent (outcome) variable - numeric variable being predicted or explained.
What do coefficients B1, B2, …., Bp represent in MLR equation?
linear reg
regression coefficients - estimated change in 𝑌 for a 1-unit increase in each 𝑋, controlling for all other variables.
What does the term ε represent?
hat is the practical meaning of a positive β coefficient?
As predictor increases, DV increases, assuming other predictors remain constant.
What is the meaning of a negative β coefficient?
As exosure increases, DV decreases, holding other predictors constant.
What does it mean when a β coefficient is 0 or not statistically significant?
no evidence of an association between that predictor and the outcome after adjusting for others.
What is the form of the simple linear regression equation?
Y=β0+β1X+ε
How does multiple regression extend the equation of simple linear regression?
Y=β0+β1X+ε
By adding more predictors (β₂X₂, β₃X₃, etc.) to account for multiple influences on 𝑌
What stata command for a multiple linear regression equation?
regress dependent independent1 independent2 independent3
In a model with categorical predictors, how are dummy variables handled in Stata?
Prefix variable with i. (e.g., i.smoker)- this adds one β for each non-baseline category.
Example: Interpret β = 1.07 for weight (p < 0.001).
For every 1-kg increase in weight, systolic BP increases on average by 1.07 units, adjusting for height, age, sex, and smoking.
Example: Interpret β = −0.89 for height (p < 0.001).
For every 1-cm increase in height, systolic BP decreases by 0.89 units, controlling for other variables.
What are the two main uses of multiple regression?
Confounding vs mediator vs effect modifier?