correlation and causation
Regression Analysis
A statistical method of identifying the relationship between one or multiple independent variables (x) and a dependent variable (y)
Types of data
Ordinary Least Square (OLS) estimation and the 5 assumptions for unbiasdness, consistency and efficiency.
To select which fitted line fits the data best.
1. the parameters have a linear relationship
2. random sampling
3. there is variation in x
4. the mean of the error should be 0
5. Homoscedasticity; error does not depend on x
Unbiased, consistent and efficient estimator
What can you use to evaluate the fit of a model to data?
R2 tells how well the model fits the data, between 0 and 1, 1 is perfect.
What is Omitted Variable Bias and what can you do to solve it?
When important variable that influence Y are omitted from the model. To solve use multiple linear regressions and add a sixth assumption: the independent variables must not be milticollinear.