Panel data regression
several time points
several observations
several items
Sources of variation in Panel data regression
Adv Panel data
+ high internal & external validity
+ allows to control the impact of latent heterogeneity
+ more df –> higher efficacy in estimating
+ testing of more complex hypothesis possible
Disadv Panel data
+ high internal & external validity
+ allows to control the impact of latent heterogeneity
+ more df –> higher efficacy in estimating
+ testing of more complex hypothesis possible
Ways to analyze Panel data
fixed effects
random effecty
pooled OLS
fixed effects?
–> measure the effect of the explanatory variable, even if individual, time-constant heterogeneity is correlated with the explanatory variable
If we assume fixed effects, we impose time independent effects for each entity that are possibly correlated with the regressors.
How to deal with fixed effects alpha i?
OR –>within-groups fixed effects:
eliminating alphai by subtracting from each
variable for each unit its mean value (over time)
Interpretation of fixed effects
–> we lose n degress of freedom for the estimation of betak
(we either lose n observations or estaimate n additional parameters (dummy v))
Pooled OLS
take every observations of every time period as totally indpendent point
–> only possible to use, if you have no unobserved variables
Random effects
random effects works under the assumption that alphai are totally uncorrelated with Xj, purely random
Hausman Test
to assess whether to use fixed or random effects estimation
–> tests if the unobserved effects alphai are independent of the explanatory variables xj
which types of models can be used for Panel data?
OLS Logit Probit, Tobit autoregressive models other time series models
Panel regression is not so much an … method, but a type of … or …
Panel regression is not so much an ANALYSIS method, but a type of DATA SET
or DATA STRUCTURE