Topic_5_Static_Linear_Panel_Data_Models_Flashcards

Question 1

Q

What is panel data?

Answer

A

Panel data consist of observations for N cross-sectional units (e.g., individuals, firms) observed over T time periods. It combines both cross-sectional and time-series dimensions.

Question 2

Q

What are the main advantages of using panel data?

Answer

A

1) Controls for omitted variables that are time-invariant; 2) Increases precision through more observations; 3) Allows study of dynamic behavior; 4) Identifies effects not detectable in pure cross-sections or time series.

Question 3

Q

Write the general static panel data model and explain its components.

Answer

A

Y_it = α + X_itβ + η_i + u_it, where η_i captures unobserved, time-invariant individual effects and u_it is the idiosyncratic error term varying over i and t.

Question 4

Q

What problem arises if cov(X_it, η_i) ≠ 0 in the pooled OLS model?

Answer

A

The zero conditional mean assumption E[u_it|X_it] = 0 is violated, causing biased and inconsistent OLS estimates due to omitted variable bias.

Question 5

Q

What is the identifying assumption in static panel data models?

Answer

A

Strict exogeneity: E[u_it | X_i1, …, X_iT, η_i] = 0. This implies regressors are uncorrelated with past, current, and future idiosyncratic errors.

Question 6

Q

What happens if the strict exogeneity assumption is violated?

Answer

A

The within (FE) and first-difference estimators become inconsistent; results cannot be given a causal interpretation.

Question 7

Q

Explain the within (fixed effects) estimator using the within-transformation.

Answer

A

The within transformation demeans all variables: Y_it - Ȳ_i = β(X_it - X̄_i) + (u_it - ū_i). FE estimator: β̂_FE = (X̃’X̃)^(-1) X̃’Ỹ, where X̃ = X_it - X̄_i.

Question 8

Q

What is the main source of variation exploited by the fixed effects estimator?

Answer

A

Within-individual variation over time; it uses how changes in X_it within an individual affect Y_it while holding time-invariant factors constant.

Question 9

Q

Why are time-invariant regressors not identified in a fixed effects model?

Answer

A

Because the within-transformation removes all time-invariant characteristics (their variation is fully captured by η_i).

Question 10

Q

What is the formula for the first-difference (FD) estimator?

Answer

A

ΔY_it = βΔX_it + Δu_it, estimated by OLS. The FD estimator is consistent if E[(X_it - X_it−1)(u_it - u_it−1)] = 0 (strict exogeneity).

Question 11

Q

What are the main disadvantages of fixed effects estimation?

Answer

A

1) Cannot estimate time-invariant regressors; 2) Out-of-sample prediction impossible; 3) Loss of degrees of freedom; 4) May produce imprecise estimates if most variation is between individuals.

Question 12

Q

Write the random effects model and its key additional assumption.

Answer

A

Y_it = α + X_itβ + η_i + u_it, with E[X_it, η_i] = 0. The RE model assumes that individual effects are uncorrelated with regressors.

Question 13

Q

What is the main difference between the FE and RE models?

Answer

A

FE allows correlation between X_it and η_i, while RE assumes independence. Thus, RE is more efficient under E[X_it, η_i]=0, but inconsistent otherwise.

Question 14

Q

What is the Hausman test used for in panel data analysis?

Answer

A

To test whether RE is consistent. H0: E[η_i|X_it]=0 (both FE and RE consistent, RE efficient). H1: E[η_i|X_it]≠0 (only FE consistent).

Question 15

Q

What is the formula for the Hausman test statistic?

Answer

A

HW = (β̂_FE − β̂_RE)’ [Var(β̂_FE) − Var(β̂_RE)]⁻¹ (β̂_FE − β̂_RE) ~ χ²(k), where k is the number of regressors.

Question 16

Q

How can you test for the existence of individual fixed effects?

Answer

Study These Flashcards

A

Use an F-test: H0: η_1 = η_2 = … = η_N (pooled model). F = ((SSR₀−SSR₁)/(N−1)) / (SSR₁/(NT−N−K)). Reject H0 if F > critical value.

Question 17

Q

How can you test the validity of strict exogeneity in a FE model?

Answer

Study These Flashcards

A

Estimate Y_it = βX_it + γX_it+1 + η_i + u_it and test H0: γ = 0. If rejected, strict exogeneity is violated.

Question 18

Q

Why is the RE estimator more efficient than FE under the null hypothesis of the Hausman test?

Answer

Study These Flashcards

A

Because RE exploits both within- and between-individual variation, whereas FE only uses within variation.

Question 19

Q

In what cases are the FE, LSDV, and FD estimators identical?

Answer

Study These Flashcards

A

If T = 2 (two time periods), FE, LSDV, and FD estimators yield identical estimates.

Question 20

Q

What type of standard errors should be used in FE estimation and why?

Answer

Study These Flashcards

A

Clustered (HAC) standard errors, since they are robust to heteroskedasticity and serial correlation within individuals.

Topic_5_Static_Linear_Panel_Data_Models_Flashcards

(20 cards)