regression
a method to understand how an outcome Y changes as predictors X1, X2, …, Xp vary
Regression finds the best-fitting line through the cloud of points
single regression formula
Y = B0 + B1X1 + E
B is beta
Y = outcome variable (DV)
X = predictor variables (IV)
B0 = intercept (model's predicted value of Y when X = 0)
Bi = slope coefficients (change in Y per unit change in Xi): how much the outcome changes for each 1-unit increase in Xi, holding all else constant
E = error term (unexplained variance)
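A minimal sketch of fitting this formula by ordinary least squares; the data (hours studied vs. exam score) is made up for illustration:

```python
import numpy as np

# Made-up data: hours studied (X) vs. exam score (Y)
X = np.array([1, 2, 3, 4, 5], dtype=float)
Y = np.array([52, 55, 61, 64, 68], dtype=float)

# Fit Y = B0 + B1*X by ordinary least squares
A = np.column_stack([np.ones_like(X), X])      # design matrix: [1, X]
(b0, b1), *_ = np.linalg.lstsq(A, Y, rcond=None)

print(b0, b1)                                  # intercept ≈ 47.7, slope ≈ 4.1
residuals = Y - (b0 + b1 * X)                  # E: the unexplained part
```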
R^2
coefficient of determination
represents the proportion of the variance in a dependent variable that is predictable from the independent variables in a regression model.
It indicates how well the regression line fits the data: a value of 1 means all data points fall perfectly on the line, and 0 means the line explains none of the variability.
ex: R^2 = 0.215; About 21.5% of burnout (Y) variation is explained by age (X)
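A sketch of how R^2 is computed from the residuals (the small dataset is invented):

```python
import numpy as np

X = np.array([1, 2, 3, 4, 5], dtype=float)       # made-up predictor
Y = np.array([52, 55, 61, 64, 68], dtype=float)  # made-up outcome

# OLS slope and intercept from covariance/variance
b1 = np.cov(X, Y, ddof=1)[0, 1] / np.var(X, ddof=1)
b0 = Y.mean() - b1 * X.mean()
Y_hat = b0 + b1 * X

# R^2 = 1 - (unexplained variation / total variation)
ss_res = np.sum((Y - Y_hat) ** 2)
ss_tot = np.sum((Y - Y.mean()) ** 2)
r_squared = 1 - ss_res / ss_tot
print(r_squared)
```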
residuals
the vertical distance between a data point and the regression line (the error for that point)
squared and summed, the residuals tell us how much error the model makes → we want them centered around 0 and the sum of squares as small as possible (least squares)
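Least squares in one picture: of all candidate lines, the OLS line has the smallest sum of squared residuals. A toy comparison (data made up):

```python
import numpy as np

X = np.array([1, 2, 3, 4, 5], dtype=float)
Y = np.array([52, 55, 61, 64, 68], dtype=float)

def sse(b0, b1):
    """Sum of squared residuals for the candidate line Y = b0 + b1*X."""
    return np.sum((Y - (b0 + b1 * X)) ** 2)

# For this data the OLS solution is b0 = 47.7, b1 = 4.1
print(sse(47.7, 4.1))   # the minimum
print(sse(47.7, 5.0))   # steeper line: more error
print(sse(45.0, 4.1))   # shifted line: more error
```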
The steeper the slope…
the stronger the relationship between the variables
(a larger beta)
assumptions of regression
Linearity: relationship between predictors and outcome is linear
Check: scatterplots, residual plots
Violation: curved relationships, U-shaped effects
Independence: observations are independent of each other
Violation: clustered data, repeated measures
Solution: multilevel modeling
Homoscedasticity: constant variance of residuals across predictor values
Check: residual vs fitted plots
Violation: funnel-shaped patterns
Normality: residuals are normally distributed
Check: Q-Q plots, histograms
Violation: skewed distributions
No multicollinearity: predictors are not highly correlated
Check: correlation matrix, VIF values
Rule: VIF < 5 (or < 10)
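A sketch of the VIF check using only numpy: regress each predictor on the others and compute 1/(1 − R²). The data is simulated, with x2 deliberately built from x1 so it trips the rule of thumb:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200
x1 = rng.normal(size=n)
x2 = 0.9 * x1 + rng.normal(scale=0.3, size=n)   # nearly collinear with x1
x3 = rng.normal(size=n)
X = np.column_stack([x1, x2, x3])

def vif(X, i):
    """VIF for column i: regress it on the other columns; VIF = 1 / (1 - R^2)."""
    y = X[:, i]
    A = np.column_stack([np.ones(len(y)), np.delete(X, i, axis=1)])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ beta
    r2 = 1 - resid.var() / y.var()
    return 1.0 / (1.0 - r2)

print([round(vif(X, i), 2) for i in range(3)])  # x1 and x2 high; x3 near 1
```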
key takeaways
Regression quantifies relationships between variables
Coefficients tell us the size and direction of effects
Confidence intervals show uncertainty in estimates
P-values indicate statistical significance
multiple regression
Multiple regression is an extension of simple regression — we’re simply adding more predictors to the model.
multiple regression formula
Y = B0 + B1X1 + B2X2 + … + BpXp + E
B is beta
How to interpret the coefficients
Each coefficient represents the expected change in the outcome for a one-unit change in that predictor, holding all other variables constant.
In multiple regression, each regression coefficient (β) represents the unique contribution of its predictor to the outcome, after controlling for the effects of the other predictors in the model
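A simulated sketch of "holding all else constant": generate an outcome from two predictors with known coefficients, then recover them with OLS. All names and numbers here are invented:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 500
age = rng.uniform(20, 60, n)            # hypothetical predictor 1
hours = rng.uniform(20, 60, n)          # hypothetical predictor 2

# True model: burnout = 2.0 + 0.05*age + 0.10*hours + noise
burnout = 2.0 + 0.05 * age + 0.10 * hours + rng.normal(scale=0.5, size=n)

A = np.column_stack([np.ones(n), age, hours])
(b0, b_age, b_hours), *_ = np.linalg.lstsq(A, burnout, rcond=None)

# b_age ≈ 0.05: expected change in burnout for one extra year of age,
# holding hours constant (and vice versa for b_hours)
print(b_age, b_hours)
```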
which predictor has the strongest effect
We can use standardized coefficients to compare the strength of the effects of the predictors
we need to know whether the coefficients were standardized (e.g., whether a scale/z-score step was applied)
without standardization, the coefficients are on different scales and we can't compare them to draw conclusions about relative effect size
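A sketch of why standardization matters: the raw coefficient on x2 below is larger, yet after z-scoring the predictors and the outcome, x1 turns out to have the stronger effect. Data and coefficients are invented:

```python
import numpy as np

def zscore(v):
    return (v - v.mean()) / v.std(ddof=1)

rng = np.random.default_rng(2)
n = 500
x1 = rng.normal(50, 10, n)     # wide scale (e.g. a 0-100 score)
x2 = rng.normal(0, 1, n)       # narrow scale
y = 3 * x1 + 5 * x2 + rng.normal(scale=5, size=n)

# Raw coefficients: 5 > 3, so x2 looks "stronger" -- but the scales differ.
# Standardize everything; the resulting betas are directly comparable.
Z = np.column_stack([zscore(x1), zscore(x2)])
beta_std, *_ = np.linalg.lstsq(Z, zscore(y), rcond=None)

print(beta_std)   # x1's standardized effect dominates
```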