2 - Impact Evaluation Flashcards

Question 1

Q

4 different forms of evaluation:

Answer

A

Ex-ante appraisal (potential?)
Programmatic evaluation
Comprehensive expenditure review
Impact analysis (some are content just to help out, but studies that investigate whether a project actually worked and had good impacts are increasingly popular).

Question 2

Q

Why evaluate? Objectives:

Answer

A

Lesson learning (has it done what it was supposed to? For beneficiaries, the program, the organization, the world).
Accountability
Results-based management (use results to improve, e.g. test on a small scale and then scale up)

Question 3

Q

The logical framework/model of evaluation:

Answer

A

Needs - e.g. literacy is too low in rural India.
Inputs - e.g. monitoring of teacher attendance and activity.
Output - parents visit schools daily and report.
Outcome - teachers attend school more regularly and teaching quality improves.
Impact - hopefully a higher rate of literacy.
Long-term goal - improved educational outcomes and career opportunities.

Question 4

Q

Different levels of program evaluation:

Answer

A

Needs assessment (who is the target population and what do they need?)
Program assessment (how does the program address the needs, and what are the prerequisites and shortcomings?)
Process evaluation (are the things delivered? built? This does not assess impact, only process).
Impact evaluation (what this whole lecture is about; it leads to the questions: why and when does it work? Can we scale up?)
Cost-benefit analysis

Question 5

Q

Theory of change:

Answer

A

ToC analyses how inputs lead to intended outcomes/impacts. Identify the causal steps, which underlying assumptions need to hold, what data we need, etc.

Question 6

Q

Different types of correlation:

Answer

A

Causation: X -> Y.
Reverse causality: Y -> X.
Simultaneity: Y -> X and X -> Y.
Spurious correlation/omitted-variable (OV) bias: Z -> X and Z -> Y, with a third variable Z affecting both.
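The spurious-correlation case can be illustrated with a small simulation. This is only a sketch under assumed conditions: the data are simulated, the coefficients are invented, and the helper `pearson` is a hypothetical name written here for illustration. X and Y never affect each other, yet they correlate because both depend on Z.

```python
import random

def pearson(a, b):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / (var_a * var_b) ** 0.5

rng = random.Random(0)  # fixed seed so the run is repeatable
n = 5000

# Z is the third variable; X and Y each depend on Z but not on each other.
z = [rng.gauss(0, 1) for _ in range(n)]
x = [zi + rng.gauss(0, 1) for zi in z]
y = [zi + rng.gauss(0, 1) for zi in z]

corr_xy = pearson(x, y)

# Remove Z's contribution (coefficient 1 by construction) and correlate the rest.
x_resid = [xi - zi for xi, zi in zip(x, z)]
y_resid = [yi - zi for yi, zi in zip(y, z)]
corr_resid = pearson(x_resid, y_resid)

print(f"corr(X, Y) = {corr_xy:.2f}")
print(f"corr(X, Y) after removing Z = {corr_resid:.2f}")
```

In this setup the raw correlation should be clearly positive, while the correlation after removing Z should be close to zero.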

Question 7

Q

Counterfactual:

Answer

A

We need a comparison group that tells us what would have happened if we had NOT implemented the program. This can never be done perfectly, since we don’t have two identical worlds. But we do our best to find a good enough counterfactual so that we can measure the impact (the difference between T and C). This helps us establish causality.

Question 8

Q

What is the basic formula for measuring impacts?

Answer

A

Take the difference between a unit’s outcome with the program and its outcome without it:
Yi(1) - Yi(0)
But, as we cannot observe the same unit in both states, we must take the average impact:
E(Yi(1)) - E(Yi(0)).
In practice this is estimated as the expected value for the T group minus the expected value for the C group.
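As a minimal sketch of that last step, the average impact can be estimated as a difference in group means. The function name `average_impact` and the scores are hypothetical, invented purely for illustration.

```python
from statistics import mean

def average_impact(treated_outcomes, control_outcomes):
    """Estimate E(Y(1)) - E(Y(0)) as mean outcome in T minus mean outcome in C."""
    return mean(treated_outcomes) - mean(control_outcomes)

# Hypothetical literacy test scores, invented for illustration only.
treated = [62, 70, 68, 75, 65]   # mean 68
control = [60, 64, 61, 66, 59]   # mean 62

impact = average_impact(treated, control)
print(impact)  # 6
```

This difference only equals the true average impact if C is a valid counterfactual for T.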

Question 9

Q

What is the bias of the impact measurement?

Answer

A

The bias is:
E(Y(0)|T) - E(Y(0)|C).
So it’s the difference between the untreated outcome of those in the treatment group (what they would have had without the treatment) and the outcome of the control group, who by definition do not receive the treatment.
If we have a perfect counterfactual, this bias = 0.
This bias (B) arises because we can only estimate the ATE by comparing groups, and the groups may differ even without the program.
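One way to see where the bias term comes from is to add and subtract E(Y(0)|T) in the observed difference between the groups. The derivation below is a standard decomposition, written in the notation of this card; the labels under the braces are added here for readability.

```latex
\begin{align*}
\underbrace{E[Y(1)\mid T] - E[Y(0)\mid C]}_{\text{observed difference}}
  &= E[Y(1)\mid T] - E[Y(0)\mid T] + E[Y(0)\mid T] - E[Y(0)\mid C] \\
  &= \underbrace{E[Y(1)\mid T] - E[Y(0)\mid T]}_{\text{effect on the treated}}
   + \underbrace{E[Y(0)\mid T] - E[Y(0)\mid C]}_{\text{bias}}
\end{align*}
```

When T and C would have had the same outcome without the program, the second term is zero and the observed difference equals the effect.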

Question 10

Q

3 techniques for impact evaluation:

Answer

A

Experimental design with randomisation (RCT)
Matching methods (PSM)
Difference in difference
Others are in the book… see notes.

Question 11

Q

Random sampling and assignment:

Answer

A

When we randomly select a sample from a population and then randomly assign some of the sample to T and the rest to C.
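The two steps can be sketched with Python's standard `random` module. The function name `sample_and_assign`, the population of unit IDs, and the 50/50 split are assumptions made for this illustration.

```python
import random

def sample_and_assign(population, sample_size, seed=None):
    """Draw a random sample, then randomly split it into T and C halves."""
    rng = random.Random(seed)
    sample = rng.sample(population, sample_size)  # step 1: random sampling
    rng.shuffle(sample)                           # step 2: random assignment
    half = sample_size // 2
    return sample[:half], sample[half:]

population = list(range(1000))  # hypothetical unit IDs
treatment, control = sample_and_assign(population, sample_size=100, seed=42)

print(len(treatment), len(control))  # 50 50
```

Random sampling supports generalising to the population; random assignment is what makes T and C comparable.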

Question 12

Q

RCT

Answer

A

Randomised controlled trial. By using random sampling and random assignment, we create a relevant comparison group.
There shouldn’t be any systematic differences between the groups, so no bias -> T and C would have the same outcome Y in the absence of the program.

Question 13

Q

Is it ethical to randomise?

Answer

A

Not always. If the program involves large benefits for the treated, then why should my neighbour get those benefits but not me? Just by luck? If we had the chance to show who needed it the most, maybe it would have been me. But self-selection destroys the properties of a relevant counterfactual…

Question 14

Q

ATE=

Answer

A

Average treatment effect

Question 15

Q

Issues with RCT:

Answer

A

External validity (specific context)
Hawthorne effects - changed behavior among those who are observed.
John Henry effect - changed behavior among the control group (they work harder).
Contamination/spillover
Dropout or attrition
Partial equilibrium - measuring short-term effects.

Question 16

Q

PSM

Answer

A

Propensity score matching. Find a comparison group that is similar in observable characteristics, and assume that the unobservables are also similar across treated and untreated.

Question 17

Q

When use PSM?

Answer

A

When an RCT is not possible, e.g. in ex-post situations where the program is already implemented, or when an RCT is too expensive.

Question 18

Q

PSM method’s 3 steps:

Answer

A

Use surveys to select several characteristics X (age, income, etc.) that help predict participation. Estimate the probability of participation, p(X). This is someone’s propensity score.
Match treated to untreated units using p(X), as closely as possible.
Impact = average difference in outcomes between the matched groups.
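Steps 2 and 3 can be sketched as follows, assuming the propensity scores from step 1 have already been estimated (for example with a logit model, which is not shown here). The function name `psm_impact`, the choice of nearest-neighbour matching with replacement, and all numbers are assumptions for illustration; other matching rules exist.

```python
def psm_impact(treated, untreated):
    """Nearest-neighbour matching on the propensity score, with replacement.

    Each unit is a (propensity_score, outcome) pair.
    """
    differences = []
    for p_t, y_t in treated:
        # Step 2: find the untreated unit whose score is closest to p_t.
        p_c, y_c = min(untreated, key=lambda unit: abs(unit[0] - p_t))
        differences.append(y_t - y_c)
    # Step 3: impact = average outcome difference across matched pairs.
    return sum(differences) / len(differences)

# Hypothetical (p(X), outcome) pairs, invented for illustration.
treated = [(0.8, 20.0), (0.6, 15.0), (0.3, 12.0)]
untreated = [(0.75, 16.0), (0.55, 13.0), (0.35, 11.0), (0.1, 9.0)]

impact = psm_impact(treated, untreated)
print(round(impact, 2))  # 2.33
```

Because the average is taken over treated units, this sketch estimates the effect on the treated. The untreated unit with score 0.1 is never used, since no treated unit is close to it.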

Question 19

Q

PSM issues:

Answer

A

Requires a lot of data to find relevant characteristics.
Strict assumption that the unobservables are also similar.

Question 20

Q

Difference-in-difference

Answer

A

No random sample is available, maybe because the program was aimed at helping a certain group.

Look at beneficiaries before and after the program.
Also look at the change over time among non-beneficiaries as the counterfactual (subtracting one difference from the other creates a diff-in-diff).

Question 21

Q

Diff-in-diff ATE=

Answer

A

[E(Yt1|T) - E(Yt0|T)] - [E(Yc1|C) - E(Yc0|C)].

So simply the difference between the changes over time for the two groups.
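A minimal sketch of this calculation, with a hypothetical function name `diff_in_diff` and invented outcome values:

```python
from statistics import mean

def diff_in_diff(t_before, t_after, c_before, c_after):
    """Change over time in T minus change over time in C."""
    change_t = mean(t_after) - mean(t_before)
    change_c = mean(c_after) - mean(c_before)
    return change_t - change_c

# Hypothetical outcomes, invented for illustration.
t_before = [50, 54, 52]   # mean 52
t_after = [60, 66, 63]    # mean 63
c_before = [48, 50, 52]   # mean 50
c_after = [52, 55, 55]    # mean 54

did = diff_in_diff(t_before, t_after, c_before, c_after)
print(did)  # 7
```

Here T improved by 11 and C by 4, so 7 is attributed to the program, provided C's change is a valid counterfactual for T's.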

Question 22

Q

Key assumption of DiD

Answer

A

Parallel trends - the groups change at the same pace before the program starts, because then we can assume that if the treated group had not had the program, its outcomes would have changed by the same amount as the untreated group’s.
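One informal way to probe this assumption is to compare how the two groups changed across pre-program periods. The sketch below assumes several pre-program observations per group are available; the function name `pre_trend_gap` and the data are hypothetical. Similar pre-trends are suggestive only, since the assumption concerns what would have happened after the program started, which cannot be observed.

```python
from statistics import mean

def pre_trend_gap(t_periods, c_periods):
    """T's average per-period change minus C's, using pre-program periods only."""
    def avg_change(periods):
        means = [mean(p) for p in periods]
        return (means[-1] - means[0]) / (len(means) - 1)
    return avg_change(t_periods) - avg_change(c_periods)

# Hypothetical outcomes for three pre-program periods, invented for illustration.
t_periods = [[39, 41], [42, 44], [45, 47]]   # period means 40, 43, 46
c_periods = [[34, 36], [37, 39], [40, 42]]   # period means 35, 38, 41

gap = pre_trend_gap(t_periods, c_periods)
print(gap)  # 0.0
```

A gap near zero means the groups moved in parallel before the program, even though their levels differ.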

Question 23

Q

Issues of DiD

Answer

A

Do the parallel trends actually hold? Maybe they are affected by spurious correlation?
In practice, time-varying unobserved heterogeneity could have been taken care of ex ante in the program design, to ensure that T and C areas share similar pre-program characteristics. But that is not possible now…
(If the groups were not similar before, the measured impact is biased, since Y will be affected by those differences.)

(23 cards)