18. Missing Data, Internal Validity, External Validity Flashcards

(19 cards)

1
Q

Effect of data missing at random on bias and other

A
  • No concerns of bias
  • Just a smaller sample –> increase s.e.
  • OLS estimator is still unbiased
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Effect of data of missing based on a cutoff of the x value

A
  • No Bias
  • smaller sample –> higher s.e.e
  • reduces var(x) –> higher s.e.

As the slope of the regression line is the same across the domain of all x, we just have a smaller domain but still the same slope

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Effect of data missing based on a cutoff of the y value

A
  • Causes Bias

Error is represented by vertical distance between a point and the line.

small x values need a large positive error to meet threshold, thus as x increases, error term decreases on average –> omitted variable in ui that changes on avg. when x changes –> confounder

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

how to test if MAR believable

A

look at summary statistics of other variables and compare those with missing andn non-missing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is ‘internal validity?

A

Estimate can be interpreted as a causal effect for the population that is used in the study

no issues (confounders, attenuation bias, bias due to y cutoffs, no simultanaeity/reverse causality, no bad control)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

External validity

A

estimate is represenative of the effect for another population

nearly always an assumption, checked by creating estimates in various settings and checking if effects are comparable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Effect of data missing at random on bias and other

A
  • No concerns of bias
  • Just a smaller sample
  • OLS estimator is still unbiased
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Effect of data of missing based on a cutoff of the x value

A
  • No Bias

As the slope of the regression line is the same across the domain of all x, we just have a smaller domain but still the same slope

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Effect of data missing based on a cutoff of the y value

A
  • Causes Bias

Error is represented by vertical distance between a point and the line.

small x values need a large positive error to meet threshold, thus as x increases, error term decreases on average –> omitted variable in ui that changes on avg. when x changes –> confounder

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is ‘internal validity?

A

Estimate can be interpreted as a causal effect for the population that is used in the study

no issues (confounders, attenuation bias, bias due to y cutoffs, no simultanaeity/reverse causality, no bad control)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

External validity

A

estimate is represenative of the effect for another population

nearly always an assumption, checked by creating estimates in various settings and checking if effects are comparable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is standardising a variable?

A

standardising is a form of normalising where we

  • u (mean)
  • / (divide) by s.d.

useful for when units cannot be easily understood

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

When standardiisng just x1, what is B1 interpreted as

A

𝛽1∗ is interpreted as “the average change in 𝑦 that is associated with 𝑥1 increasing by 1 standard deviation.”

16
Q

standardising just y, what is B1 interpreted as

A

𝛽1∗ is interpreted as “the average number of standard deviations that 𝑦 changes by that is associated with 𝑥1 increasing by 1.”

17
Q

standardising both x1 and y, what is B1 interpreted as?

A

“the average number of standard deviations that 𝑦 changes by that is associated with 𝑥1 increasing by 1 standard deviation.”

18
Q

Do we have to subtract the mean and dividie by the standard deviation

A

No for the interpretation it is sufficient to divide by the standard deviation