What is the linear regression formula?
Yi = a + ß1X1i + ß2X21 + ….,
Where:
Yi = Dependent variable for observation i
Xti = Independent variable for observation i
A,ß = coefficients to be estimated
Example: If independent variables are # of experience and job description:
Wage = a + ß# of years of experience + ß job description
What are the 5 outputs in R of linear regressions?
What are three types of data in panel regression?
What is a balanced panel?
same number of observations per unit
What is an unbalanced panel?
not same number of observations per unit
What is a pooled regression model?
running a linear OLS regression model with panel data
What are the two main problems with pooled regression models?
Give two reasons errors are often correlated within units in a pooled regression model?
What are the two types of variation within panel data?
Which type of variation is more credible to establish causality?
Within unit variation is more credible to establish how X causes & to change because, comparing the same unit against itself over time is more comprehensive to establish causal effects of X on Y
How does the regression formula look when using only within unit variation?
Yi = a + ß1X1i + … ßnXn1 + ci + ei, where:
Ci = a constant that is different for each unit to be considered an intercept different for each unit. This is called the fixed effect
What are three ways of runing regressions using within unit variation?
How do regressions of panel data work when using seperate dummy variables?
If one uses separate dummy variables for each unit, each dummy is just another line in the R output with the Estimate, Std. Error, t value and significance. This shows the effect of the variation within units.
What is the main drawback of using seperate dummy variables for each unit in panel data regressions?
Drawback of using separate dummy variables for each unit: not suitable for larger datasets. That is what you would use within transformation for.
What are the steps of within transformation regressions? And give three benefits of within transformation method?
What is meant with clustering standard errors?
Clustering standard errors: this is a solution for accounting for correlation in error terms. Clusters should be made on the same level as your fixed effects.
What are three characteristics of panel regression outcomes?
Give two things panel regression cannot be used for?