MLR & Assumptions Flashcards by Charron Davis

What is statistical power?

The probability of detecting a true effect when the effect actually exists

How well did you know this?

Not at all

Perfectly

Why is reporting only a p-value insufficient?

Because it does not convey the magnitude of the effect or whether the study was adequately powered

How well did you know this?

Not at all

Perfectly

True / False: A statistically significant result always implies a practically important effect

False

How well did you know this?

Not at all

Perfectly

What is a Type I error?

Rejecting the null hypothesis when it is actually true (false positive)

How well did you know this?

Not at all

Perfectly

What values correspond to medium and large effects?

Medium:
𝑑 =0.5

Large:
𝑑 = 0.8

How well did you know this?

Not at all

Perfectly

Power analysis depends on sample size, α, and _______

Effect size

How well did you know this?

Not at all

Perfectly

What combination should be reported to strengthen scientific conclusions?

p-value

Effect size

Power analysis / sample size justification

How well did you know this?

Not at all

Perfectly

True / False: Increasing sample size increases statistical power

True

How well did you know this?

Not at all

Perfectly

According to Cohen, what is considered a small effect?

d=0.2

How well did you know this?

Not at all

Perfectly

What is leverage?

An observation with extreme X values

How well did you know this?

Not at all

Perfectly

Residual = observed value minus ______ value

Predicted (fitted)

How well did you know this?

Not at all

Perfectly

What is collinearity?

High correlation between independent variables

How well did you know this?

Not at all

Perfectly

What does a curved pattern in a residual plot indicate?

Violation of linearity

How well did you know this?

Not at all

Perfectly

True / False: A good residual plot should show a random horizontal band

True

How well did you know this?

Not at all

Perfectly

When does the intercept have no intrinsic meaning?

When predictors never take the value 0

How well did you know this?

Not at all

Perfectly

What are residuals in regression?

The unexplained error:

e𝑖 = y𝑖 − ŷ𝑖

How well did you know this?

Not at all

Perfectly

What does VIF = 1 indicate?

No collinearity among predictors

How well did you know this?

Not at all

Perfectly

What is multicollinearity?

High correlation among more than two predictors.

How well did you know this?

Not at all

Perfectly

What is an outlier in regression?

An observation with an extreme Y value.

How well did you know this?

Not at all

Perfectly

Why are bivariate correlations insufficient to detect multicollinearity?

Study These Flashcards

They cannot capture combined relationships among multiple predictors

Why are regression diagnostics essential?

Study These Flashcards

Because valid inference depends on assumptions being met

What is the tolerance statistic?

Study These Flashcards

The inverse of VIF; the proportion of variance not explained by other predictors.

T/F: Multicollinearity violates OLS assumptions

Study These Flashcards

False — but it makes estimation unreliable.

Why should the global F-test be run before individual t-tests?

Study These Flashcards

To control Type I error inflation.

T/F: Predictor variables must be completely uncorrelated to run MLR

False — perfect independence is unrealistic, but strong correlation causes problems.

What is multiple linear regression?

A method for modeling or predicting a continuous response variable using two or more independent variables with linear relationships

T/F: Multiple linear regression always improves model performance.

False — adding predictors can inflate R² without improving predictive value

What is the global (omnibus) F-test?

A test of whether any predictor contributes to explaining Y

State the hypotheses for the global F-test.

H₀: β₁ = β₂ = … = βₖ = 0 H₁: At least one βⱼ ≠ 0 (for j = 1, …, k)

What is a partial regression coefficient?

The effect of a predictor after controlling for all others.

T/F: The estimate of 𝜎^2 depends on the model specification

True

What does adjusted R² correct for?

The number of predictors in the model.

Why is multiple regression preferred over bivariate regression in practice?

It allows adjustment for confounders, improves precision, and better reflects real-world complexity.

T/F: Removing an insignificant predictor does not require refitting the model

False

Why can regression coefficients flip signs?

Omitted variables, suppression, or multicollinearity.

What does higher precision mean?

Smaller variance or standard error

In a curvilinear model, quadratic terms are written as ______.

𝑥^2

What test compares nested regression models?

Partial F-test

Fill in the blank – Partial regression coefficients measure the effect of a predictor ________ controlling for all other predictors

on the dependent variable

What is the main advantage of adding predictors in multiple regression?

Improves prediction/explanation by reducing unexplained variance and allowing control of extraneous variables

How is precision of a regression coefficient defined?

The inverse of its variance; smaller variance → higher precision.

What is suppression in regression?

A variable that increases the predictive power of other variables without being directly related to the dependent variable.

What does R² represent in multiple regression?

The proportion of variance in Y explained by the predictors.

What is the difference between a confidence interval and a prediction interval?

CI estimates variability in the mean response; PI estimates variability in individual predicted values.

Give an example of moderation in regression.

An interaction term: X×W→Y, e.g., diet program * BMI → weight loss.

T/F – Confounding is operationally present if: βcrude ≠ βadjusted

True

Give an example of mediation in regression.

A causal pathway: X→M→Y, e.g., physical activity → caloric expenditure → weight loss.

How is multicollinearity detected?

Using Variance Inflation Factor (VIF) or tolerance. VIF > 10 → high multicollinearity Tolerance = 1/VIF

What is the formula for the Variance Inflation Factor (VIF) of a predictor Xj in multiple regression?

VIFj = 1 / (1 - Rj²)

MLR & Assumptions Flashcards

(49 cards)