Effect of data missing at random on bias and other properties
If data are missing (completely) at random, estimates stay unbiased; we only lose sample size, so standard errors grow.
Effect of data missing based on a cutoff of the x value
Since the slope of the regression line is the same across the whole domain of x, restricting the sample to a smaller domain of x leaves the slope estimate unbiased.
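A quick simulation (a sketch with made-up data, not taken from these notes) illustrates that dropping observations based on an x cutoff leaves the slope estimate essentially unchanged:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100_000
x = rng.normal(0, 1, n)
y = 2.0 * x + rng.normal(0, 1, n)  # true slope = 2

def ols_slope(x, y):
    # simple-regression OLS slope = cov(x, y) / var(x)
    return np.cov(x, y)[0, 1] / np.var(x, ddof=1)

full = ols_slope(x, y)

keep = x > 0.5                     # drop observations below an x cutoff
trunc = ols_slope(x[keep], y[keep])
print(f"full sample: {full:.3f}, x-truncated sample: {trunc:.3f}")
```

Both estimates land near the true slope of 2; the truncated estimate is just noisier because fewer observations remain.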
Effect of data missing based on a cutoff of the y value
Error is represented by the vertical distance between a point and the line.
Small x values need a large positive error to meet the threshold, so as x increases the error term decreases on average → an omitted factor in ui that changes on average when x changes → a confounder, so the slope estimate is biased.
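The same simulation setup (again a sketch with made-up data), but now truncating on y, shows the bias: points with small x survive the cutoff only when they have a large positive error, so the error becomes correlated with x and the fitted slope is attenuated.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100_000
x = rng.normal(0, 1, n)
y = 2.0 * x + rng.normal(0, 1, n)  # true slope = 2

def ols_slope(x, y):
    # simple-regression OLS slope = cov(x, y) / var(x)
    return np.cov(x, y)[0, 1] / np.var(x, ddof=1)

full = ols_slope(x, y)

# keep only observations above a y cutoff: small-x points survive
# only with a large positive error, so error and x become correlated
keep = y > 0.5
trunc = ols_slope(x[keep], y[keep])
print(f"full sample: {full:.3f}, y-truncated sample: {trunc:.3f}")
```

Unlike the x-cutoff case, the y-truncated slope comes out well below 2.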
How to test if MAR is believable
Look at summary statistics of the other variables and compare them between observations with missing and non-missing values.
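A minimal sketch of that check, using invented variables (an observed covariate "age" and two hypothetical missingness patterns): under MAR the covariate looks similar in both groups, while selective missingness shows up as a clear gap.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 10_000
age = rng.normal(40, 10, n)          # fully observed covariate

# Scenario A: missingness is random (30% chance for everyone)
miss_mar = rng.random(n) < 0.3
# Scenario B: older respondents are more likely to have missing values
miss_sel = rng.random(n) < np.clip((age - 20) / 60, 0, 1)

# compare the observed covariate between missing and non-missing groups
gap_mar = age[miss_mar].mean() - age[~miss_mar].mean()
gap_sel = age[miss_sel].mean() - age[~miss_sel].mean()
print(f"MAR mean-age gap: {gap_mar:.2f}, selective mean-age gap: {gap_sel:.2f}")
```

A near-zero gap is consistent with MAR; a large gap suggests missingness depends on observables (and possibly unobservables).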
What is ‘internal validity’?
Estimate can be interpreted as a causal effect for the population that is used in the study
no issues (no confounders, no attenuation bias, no bias due to y cutoffs, no simultaneity/reverse causality, no bad controls)
External validity
The estimate is representative of the effect for another population.
Nearly always an assumption; checked by producing estimates in various settings and checking whether the effects are comparable.
What is standardising a variable?
Standardising is a form of normalising where we subtract the mean from a variable and divide by its standard deviation.
Useful when the units of a variable cannot be easily understood.
When standardising just x1, what is β1* interpreted as?
𝛽1∗ is interpreted as “the average change in 𝑦 that is associated with 𝑥1 increasing by 1 standard deviation.”
When standardising just y, what is β1* interpreted as?
𝛽1∗ is interpreted as “the average number of standard deviations that 𝑦 changes by that is associated with 𝑥1 increasing by 1.”
When standardising both x1 and y, what is β1* interpreted as?
“the average number of standard deviations that 𝑦 changes by that is associated with 𝑥1 increasing by 1 standard deviation.”
Do we have to subtract the mean and divide by the standard deviation?
No; for the interpretation of the slope it is sufficient to divide by the standard deviation, since subtracting the mean only changes the intercept.
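A small check with simulated data (a sketch, not from the notes): shifting x by its mean does not change the OLS slope, so dividing by the standard deviation alone already gives the standardised slope, which equals the raw slope times sd(x).

```python
import numpy as np

rng = np.random.default_rng(2)
n = 5_000
x = rng.normal(10, 3, n)
y = 1.5 * x + rng.normal(0, 2, n)

def ols_slope(x, y):
    # simple-regression OLS slope = cov(x, y) / var(x)
    return np.cov(x, y)[0, 1] / np.var(x, ddof=1)

b_raw = ols_slope(x, y)
b_full = ols_slope((x - x.mean()) / x.std(ddof=1), y)  # subtract mean, divide by sd
b_div = ols_slope(x / x.std(ddof=1), y)                # divide by sd only
print(f"raw: {b_raw:.3f}, z-scored x: {b_full:.3f}, sd-scaled x: {b_div:.3f}")
```

The two transformed slopes are identical (shifting x only moves the intercept), and both equal b_raw · sd(x).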