What is cross-sectional data?
Data on several individuals at a single point in time
What notation did we use for cross sectional data?
π¦π = π½0 + π½1π₯1π + π½2π₯2π + π£i
for which the subscript π denotes individual π.
What is Panel Data
Observe data for several individuals, and observe each individual at several points in time e.g. N individuals for T time periods
What notation do we use for panel data
π¦ππ‘ = π½0 + π½1π₯1ππ‘ + π½2π₯2ππ‘ + π£ππ‘.
Subscript ππ‘ denotes individual π at time π‘.
Hoiw do we control for time-invariant effects with the error term in panel data?
Decompose error term π£ππ‘ into** ππ, representing a time-invariant piece, and π’ππ‘, representing a
time-varying piece.**
ai is essentially the effect of being an individual
What is the concern with ai (time-invariant effects)
Unobserved and possibly correlated with our regressors –> confounder
Concern always there but discuss in panel data because data is strong enough to allow us to overcome the concern
How do we overcome the confounder of time invariant effects?
How do we do First Differences to overcome time-invariant effects?
For any variable, π€, define the notation Ξπ€ππ‘ β π€ππ‘ β π€π,π‘β1.
Ξπ¦ππ‘ = π½0 + π½1π₯1ππ‘ + π½2π₯2ππ‘ + ππ + π’ππ‘ β (π½0 + π½1π₯1π,π‘β1 + π½2π₯2π,π‘β1 + ππ + π’π,π‘β1)
Ξπ¦ππ‘ = π½1Ξπ₯1ππ‘ + π½2Ξπ₯2ππ‘ + Ξπ’ππ‘
ai is differenced away thus ai is no longer a confounder because it does not affect the difference across time periods in either treatment or outcome
Does only ai get removed by differencing?
No, any time invariant effects (constant, other variables) are removed
Cost of removing bias is that we lose all these time invariant effects
How do we normally overcome bias due to confounders
we take it out of the error term and include it directly in the model
we can do an analgous operation for time-invariant effects tthrough dummy variables (fixed effects)
How do we perform fixed effects to overcome time-invariant effects
we include the dummy variable, πΏπ for all individuals except one (we must exclude one
dummy variable to avoid perfect collinearity due to the dummy variable trap).
The individual without
a dummy variable in the model is often called the βomitted groupβ or βcomparison group.β
Interpret coefficients for the fixed effects formula
π½1 and π½2 are βthe average change in the outcome associated with π₯1 (or π₯2) increasing by 1, holding fixed all other π₯ and holding fixed who the individual is.β
π½0 is βthe expected outcome for the omitted group when all π₯ are 0.β
ππ is βthe average change in the outcome associated with being individual π compared to the omitted
group, holding fixed all π₯.
Can we include time-invariant regressors in a fixed effects regression
No, including the variable violates no perfect collinearity
The heuristic explanation is that the dummy variable, πΏπ, and effect ππ, capture the effect of all time-invariant characteristics of person π.
If a variable, π₯2π, does not change over time, we could lump its effect in with ππ, and do not need to separately estimate the effect.
What is the cost of removing time-invariant bias with fixed effects
Again can’t estimate the effect of any time-invariant variables
Any time invariant variables must be excluded
How do econometricians generally use the term fixed effects?
Use it to refer to any situation in which dummy variables are included all possible values of a variable
commonly applied to time periods
What are two way fixed effects?
When we control for both individual and time fixed effects
Is first differences or fixed effects more common
For the purpose of the exam, just know that first differences and fixed effects are two methods of overcoming the bias caused by ππ. Know the mechanics as described above.
In practice, fixed effects is more common because of the βsimplicityβ of implementation and because of the desirability of directly estimating the ππ.
what si the equation of the first difference estimator D
time invariant effects exlcuded
interpret fixed effects
dummy variables add the time-invarient effect - average change in y associated difference in y associated with being individual i compared to the excluded individual
How do fixed effects estimators work graphically
allows intercept to differ for each indv. and quantity the diff. in intercept for an indv. compared to control indv.
can we incl. regressors that do not change over tiem for an indv. when using the fixed effects estimator with dummy vairables
Incl. any time-invariant variable will fail no perfect collinearity when using dummy
what is more commonly used? Fixed effects or first differences?
Fixed effects more commonly used to measure the ‘fixed effects’ of ai. First differences, differences ai away
Simplicity of implementation too
What are time fixed effects
Including a time dummy that represents the effect of beign in time period t, since this will also induce bias
Write notation for a dummy variable fixed effects estimator with time and individual fixed effects