Multiple linear regression Flashcards

(10 cards)

1
Q

What is the equation for multiple linear regression and what does it mean?

A

π˜Μ‚=𝐛_𝟎+𝐛_𝟏 𝑿_𝟏+𝐛_𝟐 𝑿_𝟐+𝐛_πŸ‘ 𝑿_πŸ‘+𝐛_πŸ’ 𝑿_πŸ’

YΜ‚ (Y hat) = The predicted outcome based on the regression model

X1, X2, X3, X4 = The predictor variables (the factors influencing Y)

𝑏_0 “(betaβˆ’zero)” = The intercept, representing the predicted value of Y when all predictors are 0

𝑏_1 (beta-one), 𝑏_2 (beta-two), 𝑏_3 (beta-three), 𝑏_4 (beta-four) represent the change in the predicted Y for each 1-unit increase in each predictor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

all the assumptions for multiple linear regression are the same, however there is one new one. What is that?

A

No multicollinearity
Multicollinearity = when predictor variables are strongly correlated with one another, means they are measuring the same thing

Can have little or moderate multicollinearity, but not strong

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How can we assess or detect multicollinearity?

A
  • High correlation coefficients
  • High variance inflation factors (VIFS)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the equation for VIF?

A

VIF = 1/(1 βˆ’π‘…π‘–2)

Where Ri2 is the R squared value obtained by regressing the ith predictor variable on the remaining predictor variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How do we calculate the VIF by hand?

A

we run a linear regression model for each of the predictors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the key concepts?

A

R-squared
F-value
Regression coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does R-squared measure?

A

The proportion of variance in the outcome variable that is predictable from the predictor variable
- ranges between 0 and 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is adjusted R-squared?

A

adjusts R-squared for the number of predictors in the model
- provides a more accurate measure of the goodness of fit/tells us more about model accuracy when more than one predictor variable is used
- always less than or equal to R squared

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Cohen’s f2 (f squared)

A

The effect size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How to interpret the effect size?

A

small = 0.02
medium = 0.15
large = 0.35

How well did you know this?
1
Not at all
2
3
4
5
Perfectly