Econometrix Flashcards

Question

What are measures to describe the central tendency of a dataset?

Answer 1

- Mean - Median - Mode

Answer 2

- Mode is at the peak - Median is in the direction of the skew - Mean is somewhere in the middle of the skew

Answer 3

- The difference between the min and the max (max-min) Measured through: - Standard deviation - Variance

Answer 4

- Low variance strong centrality - High variance low centrality

Answer 5

Values that fall far outside the general range of data

Answer 6

- Measures the strength and direction of a relationship between two numeric values Types: - Positive: When one grows the other does too - Negative: When one grows the other shrinks - Non: when one grows the other stays the same

Answer 7

- Apparent correlation due to coincidence or a hidden third variable

Answer 8

- effect of one variable on the other is not constant --> may require transformation or regression to analyse

Answer 9

- Matrix of data analysing the relationship between two or more categorical variables

Answer 10

not starting the axis at 0 ---> leads to deception

Answer 11

When the mean is skewed by few outliers --> average pay top 1% earn a lot

Answer 12

The variable affected

Answer 13

The variable affecting another variable

Answer 14

E(y|x) or μy∣x or ^I (in estimation) - the conditional mean is the expected average value of y when x is fixed

Answer 15

β1 or b1 - The expected value of y when x = 0

Answer 16

β2 or b2 -The change in conditional mean of y when x changes by 1 unit

Answer 17

E(y|x) = β1 + β2x + ε ----> ^i=b1+b2xi+ ε - E(y|x) = conditional mean - β1 = intercept - β2 = slope - x = value of x at that point - ε = error term

Answer 18

ε represents all other factors influencing y other than x

Answer 19

SR1: y=β1+β2x+ε --> the value of y, for each value of x is: SR2: E(y)=β1+β2x --> the expected value of the random error is 0 meaning that: SR3: Var(ε)=σ2=Var(y) --> the variance of random error is equal to that of y

Answer 20

- Models explain y equally well at all levels of x - All levels have bell shaped curve with same variance

Answer 21

- The model fits some ranges of x better than others - Variability of y depends on x

Answer 22

SR4: The covariance between any pair of random errors is: - Cov(εi,εj)=0 (i !=j) -->errors from different observations are unrelated SR5: The variable x is not random and must take at least 2 values SR6: (optional) The values of ε are normally distributed about their mean

Answer 23

Under the assumption's of SR1-SR5 --> Model is correctly specified --> Errors behave nicely --> Regressors are exogenous (not related to x) Then: No other linear unbiased estimator can systematically do better than OLS in terms of precision.

Answer 24

- Ordinary Least Squared --> A rule to determine the "best" line through the data to estimate β1 and β2

Answer 25

To fit a line to the data values we should make the sum of the squares of the vertical distances from each point to the line as small as possible. Meaning: - Difference between actual y and line y at value x is squared (because they can be negative and positive) - should be as low as possible

Answer 26

The proportion of variation in y explained by x in your model --> closer to 1 = better

Answer 27

How much y changes if x increases by 1 unit First intercept then coefficient

Answer 28

How many standard errors away from 0 the estimate is |t|>2 = significant

Answer 29

the p-value -> if there where no effect, what is the probability of seeing an effect this large by coincidence? p< 0.10 Weak/moderate evidence p< 0.05 significant

Answer 30

The residuals show how much the model got wrong --> difference between slope y and actual y -->positive = under-predicted -->negative = over-predicted

Answer 31

- Residuals are scattered randomly - Residuals are entered around 0 - Roughly the same spread everywhere - No patterns

Answer 32

- Residual spread increases with increasing fitted values

Answer 33

- Systematic patterns - systematically above and below 0

Answer 34

The null hypothesis --> H0:β2=0 - Nothing special is going on, everything we see is due to random noise

Answer 35

The alternative hypothesis --> H0:β2!=0 - Opposite of null hypothesis --> something is going on

Answer 36

1. Two-sided: H1:β2!=0 2. One-sided: - H1:β2<0 - H1:β2>0

Answer 37

The range of values of the t-statistic in which we reject H0

Answer 38

0.01 0.05 0.10

Answer 39

One-tail test with alternative

Answer 40

Either: 𝐻1 ∶ 𝛽𝑘 > 𝑐 --> We reject H0 when the t-statistic is larger than the critical value for the level of significance 𝜶 or 𝐻1 ∶ 𝛽𝑘 < 𝑐 --> We reject H0 when the t-statistic is smaller than the critical value for the level of significance 𝜶

Answer 41

𝐻1 ∶ 𝛽𝑘 ≠ 𝑐 --> We reject H0 when the t-statistic is either larger or smaller than the critical value for the level of significance 𝜶

Answer 42

1. Null hypothesis 2. Alternative hypothesis 3. Test statistic (t-statistic) 4. Rejection region 5. Conclusion

Answer 43

The point where only 5% lie above or below it --> the cutoff point for the null hypothesis

Answer 44

t= b2−c / se(b2) slope - H0 value / standard deviation

Answer 45

y = β₁ + β₂x₂ + β₃x₃ + … + β_k x_k + ε

Answer 46

when there are two factors being tested, the effect of one without the other changing

Answer 47

it usually rises

Answer 48

1.Logs: x --> ln(x) 2. powers: x --> x2

Answer 49

ln(y)=β1+β2ln(x)

Answer 50

ln(y)=β1+β2x

Answer 51

y=β1+β2ln(x)

Answer 52

y=β1+β2x^2+β3x

Answer 53

1. If x increases by 1%, how much does y change? 2. Does each extra unit of x matter less than the previous one?

Answer 54

--> compares the percentage change in y to the percentage change in x If x increases by 1%, y increases by B2 percent

Answer 55

B1: Baseline of y B2: Elasticity (increase in 1% --> B2% increase in y)

Answer 56

- When each percent increase leads to a percent increase in y E.g. each percent of income increase leads to y% increase in food spending

Answer 57

When each unit increase leads to a percentage increase E.g. 1 year of education leads to y% increase in income

Answer 58

When each percent increase leads t a unit increase e.g. 1% income increase leads to y increase in food spending

Answer 59

B2/100 = effect on y if 1% change in x

Answer 60

B2*100 = effect on y if 1 unit change in x

Answer 61

B2 =effect on y% if 1% change in x

Answer 62

-Convenient percentage/elasticity interpretation - Mitigates outliers - Helps with normality and homoscedasticity

Answer 63

- Wages - Salaries - Sales - Market value - Population

Answer 64

- Variables measured in years - Ratios and percentages - Zero or negative values

Answer 65

𝑃𝑅𝐼𝐶𝐸 = 𝛽1 + 𝛽2x + 𝛽3x2 + 𝜀

Answer 66

Allows the effect of x on y to change with the level of x

Answer 67

B3=0 --> Marginal effect of X is constant --> linear shape B3>0 --> Marginal effect of X increases with X --> U-shaped B3<0 --> Marginal effect of X decreases with X --> reverse U-shaped

Answer 68

β3X2 Terms that allow us to depend the effect of one variable on the outcome on another variable

Answer 69

δD --> Dummy or binary variables --> variables that take only 2 values, 0 or 1 (True/False)

Answer 70

- Turning qualitative content int quantitative variables

Answer 71

1. As a regressor --> Y=β1+δD+β2x+e 2. As interaction term --> Y=β1+δD+β2x+γ(B2x × D)+e

Answer 72

Level: change in outcome compare to baseline Log: percentage change relative to baseline

Answer 73

When for every possibility there is a dummy leading to there not being a baseline

Answer 74

Models that allow to explore choices and decisions are labelled binary choice models: - Linear probability model - Probit/Logit models

Answer 75

When we model individuals and firms choices

Answer 76

A model where y captures a choice that individuals make Y= ( 1 if first alternative, 0 if second alternative)

Answer 77

binary if Y= 2 values

Answer 78

A 1-unit change in x leads to a B% increase in y

Answer 79

- Easy to add duties, interactions and controls - Coefficients are directly interpretable as change in probability

Answer 80

- Model is heteroskedastic by design as error term changes with X - Can get p-values lower than 0 or greater than 1 -

Answer 81

- They adjust the change of x on probability so it doesn't exceed 100% or get below 0% --> therefore the effect of x on y changes depending on where on the curve you are

Answer 82

They use 𝐺(𝛽0 + 𝛽1𝑥1 + ⋯ + 𝛽𝑘𝑥𝑘) which cramps the score of linear probability models on a scale of 0 to 1, preventing numbers above 1 or below 0

Answer 83

The likelihood function

Answer 84

logit = 4Blpm Bprobit = 2.5Blpm logit=1.6Bprobit

Answer 85

-Level-Level: linear relationship -Log-Level: y grows proportionally -Level-Log: diminishing effects of x -Log-Log: Both grow proportionally -Quadratic: Effect of x on y depends on x —> you expect an optimum or turning point

Econometrix Flashcards

(110 cards)