What is the multivariate linear regression model?
Yi = β0 + β1X1i + β2X2i + … + βkXki + ui. It studies how multiple explanatory variables jointly affect one dependent variable.
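The model above can be sketched with a small simulation: generate data from a known multivariate linear model and recover the coefficients by OLS. All variable names and parameter values here are illustrative.

```python
import numpy as np

# Simulate Yi = b0 + b1*X1i + b2*X2i + ui with known coefficients,
# then estimate them by OLS (least squares via numpy).
rng = np.random.default_rng(0)
n = 1000
X1 = rng.normal(size=n)
X2 = rng.normal(size=n)
u = rng.normal(scale=0.5, size=n)
Y = 1.0 + 2.0 * X1 - 3.0 * X2 + u

# Design matrix with an intercept column
X = np.column_stack([np.ones(n), X1, X2])
beta_hat, *_ = np.linalg.lstsq(X, Y, rcond=None)
print(beta_hat)  # approximately [1, 2, -3]
```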
What does the zero conditional mean assumption imply in the multivariate case?
E[ui | X1i, X2i, …, Xki] = 0. It ensures the error term is uncorrelated with all regressors, making OLS unbiased and consistent.
What are the four key Least Squares assumptions in the multivariate model?
1) Zero conditional mean: E[ui | X1i, …, Xki] = 0; 2) Data are i.i.d.; 3) Large outliers are unlikely (finite fourth moments); 4) No perfect multicollinearity.
What is the dummy variable trap?
Including all categories’ dummies (plus an intercept), which creates perfect multicollinearity and prevents OLS estimation. Solution: omit one base category.
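A quick sketch of the trap (illustrative data): with an intercept plus a dummy for every category, the dummy columns sum exactly to the intercept column, so the design matrix loses rank.

```python
import numpy as np

# Three categories, one dummy per category.
cats = np.array([0, 1, 2, 0, 1, 2, 0, 1])
D = (cats[:, None] == np.arange(3)).astype(float)

X_trap = np.column_stack([np.ones(len(cats)), D])       # intercept + ALL dummies
X_ok = np.column_stack([np.ones(len(cats)), D[:, 1:]])  # omit base category

rank_trap = np.linalg.matrix_rank(X_trap)  # 3 < 4 columns: perfect multicollinearity
rank_ok = np.linalg.matrix_rank(X_ok)      # 3 == 3 columns: full rank, OLS works
print(rank_trap, rank_ok)
```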
What is imperfect multicollinearity?
Regressors are highly, but not perfectly, correlated, making it difficult to distinguish individual effects.
Why is multicollinearity a problem?
It inflates the variances of estimated coefficients, yielding less precise estimates and smaller t-statistics.
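The variance inflation can be seen in a small Monte Carlo sketch (all parameters illustrative): the sampling standard deviation of the slope on X1 grows as the correlation between X1 and X2 rises.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 500

def beta1_sd(rho, reps=200):
    """Monte Carlo sd of the OLS slope on X1 when corr(X1, X2) = rho."""
    draws = []
    for _ in range(reps):
        X1 = rng.normal(size=n)
        X2 = rho * X1 + np.sqrt(1 - rho**2) * rng.normal(size=n)
        Y = 1 + X1 + X2 + rng.normal(size=n)
        X = np.column_stack([np.ones(n), X1, X2])
        b, *_ = np.linalg.lstsq(X, Y, rcond=None)
        draws.append(b[1])
    return np.std(draws)

sd_low = beta1_sd(0.0)    # uncorrelated regressors
sd_high = beta1_sd(0.95)  # highly correlated regressors
print(sd_low, sd_high)    # sd_high is several times larger
```

The theoretical ratio is the square root of the variance inflation factor, 1/(1 − ρ²).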
What is omitted variable bias (OVB)?
Bias that occurs when a relevant variable correlated with an included regressor is omitted and absorbed in the error term.
Which Least Squares assumption is violated when there is omitted variable bias?
The zero conditional mean assumption: E[ui | X1i, …, Xki] ≠ 0, because the omitted variable is absorbed into ui and is correlated with an included regressor.
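A simulation sketch of OVB (illustrative numbers): when a relevant variable X2 correlated with X1 is omitted, the short regression's slope converges to β1 + β2·δ, where δ is the slope from regressing X2 on X1.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 100_000
X1 = rng.normal(size=n)
X2 = 0.8 * X1 + rng.normal(size=n)        # X2 correlated with X1 (delta = 0.8)
Y = 1 + 2 * X1 + 3 * X2 + rng.normal(size=n)  # true beta1 = 2, beta2 = 3

# Short regression of Y on X1 alone: X2 is absorbed into the error term.
X_short = np.column_stack([np.ones(n), X1])
b_short, *_ = np.linalg.lstsq(X_short, Y, rcond=None)
print(b_short[1])  # approximately 2 + 3*0.8 = 4.4, not the true beta1 = 2
```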
Should we include all variables to avoid omitted variable bias? Why (not)?
No. Include relevant, predetermined regressors. Irrelevant variables increase variance, and ‘bad controls’ bias causal interpretation.
What are ‘bad controls’?
Variables that are outcomes of the treatment or regressors of interest (e.g., occupation when estimating the effect of education on earnings).
What is a joint hypothesis?
A hypothesis imposing multiple coefficient restrictions simultaneously (e.g., H0: β3 = β4 = 0).
Which test is used to test a joint hypothesis?
The F-test, which compares the fit of restricted vs. unrestricted models.
When should the F-test be used instead of the t-test?
Use an F-test for two or more simultaneous restrictions; use a t-test for a single restriction on one coefficient.
Why can’t t-tests be used for joint hypotheses?
Because the coefficient estimators can be correlated, and running separate t-tests does not control the overall rejection rate across multiple restrictions; the F-test accounts for their covariance and tests the restrictions jointly.
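The F-test idea can be sketched by computing the homoskedasticity-only F-statistic by hand (illustrative data): compare the sum of squared residuals of the restricted and unrestricted models.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 200
X1, X2, X3 = rng.normal(size=(3, n))
Y = 1 + 2 * X1 + 0.5 * X2 + rng.normal(size=n)  # X3 is truly irrelevant

def ssr(X, Y):
    """Sum of squared residuals from an OLS fit of Y on X."""
    b, *_ = np.linalg.lstsq(X, Y, rcond=None)
    resid = Y - X @ b
    return resid @ resid

ones = np.ones(n)
ssr_u = ssr(np.column_stack([ones, X1, X2, X3]), Y)  # unrestricted model
ssr_r = ssr(np.column_stack([ones, X1]), Y)          # H0: beta2 = beta3 = 0 imposed

q, k = 2, 3  # q restrictions, k regressors in the unrestricted model
F = ((ssr_r - ssr_u) / q) / (ssr_u / (n - k - 1))
print(F)  # large here, since beta2 = 0.5 in the true model: reject H0
```

Reject H0 when F exceeds the critical value of the F(q, n − k − 1) distribution (about 3.04 at the 5% level for these degrees of freedom).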
What is the conditional mean independence assumption?
With a variable of interest Xi and control variables W1i, …, Wri: E[ui | Xi, W1i, …, Wri] = E[ui | W1i, …, Wri]. Once the controls are included, the conditional mean of the error no longer depends on the variable of interest, so its coefficient can be interpreted causally even if the controls themselves are correlated with the error.
What is the difference between a control variable and a variable of interest?
A variable of interest’s coefficient measures the causal effect of interest; controls isolate this effect by holding other influences constant.
Do coefficients on control variables measure causal effects?
Not necessarily. They often capture associations used to adjust for confounding rather than direct causal effects.
What are the main measures of fit in a multivariate regression model?
R², Adjusted R², and the Standard Error of the Regression (SER).
When interpreting coefficients in log-linear models, what does β1 mean?
In the log-linear model ln(Yi) = β0 + β1Xi + ui, 100 × β1 gives the approximate percentage change in Y for a one-unit increase in X (the approximation is accurate when β1 is small).
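A quick numerical check of the approximation (illustrative value of β1): the exact percentage change is 100 × (exp(β1) − 1), which is close to 100 × β1 when β1 is small.

```python
import numpy as np

beta1 = 0.05
approx_pct = 100 * beta1                # the rule-of-thumb approximation: 5.0
exact_pct = 100 * (np.exp(beta1) - 1)   # exact percentage change: ~5.13
print(approx_pct, exact_pct)
```

For large coefficients the gap widens, so the exact formula should be preferred.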
What does model specification in practice involve?
Choosing a base model grounded in theory, testing alternative specifications, and checking robustness of coefficients to ensure results are not driven by model choice.