What is censored data?
What is truncated data?
Data are truncated when the data set does not include observations in the analysis that are beyond a boundary value. Having a value beyond the boundary eliminates that individual from being in the analysis.
What is the Tobit model used for?
The Tobit model is used in situations in which the dependent variable y is censored
What are corner solutions?
The dependent variable y has bunches at certain points (typically zero) due to individual behaviour (corner solution)
Here, the term “censoring” is also used for corner solutions.
What are the Tobit model assumptions for the dependent variable?
The latent variable y* = xβ + ϵ ϵ | x ~ N(0; σ²) iid
is assumed to :
How can you express “left-censoring” or “censoring from below” (at zero) for observed variable y as a function of latent variable y*?
What is the expected value of the observed variable y in a Tobit model?
What is the probability that y is positive in censored data?
with
In the tobit model, what is the expected value of y, given that y is positive?
In the tobit model, what is the expected value of y?
What is the partial effect of continuous regressor xK on the observed variable y in the Tobit model?
What is the partial effect of continuous regressor xK on the latent variable y* in the Tobit model?
Which partial effect is more interesting for which case of censored data?
What is the average partial effect (APE) of a continuous variable xKon y?
What is the average partial effect (APE) of a discrete variable xKon y?
How would you interpret this average partial effect of the variable ‘kids05 = number of kids below 6’ on ‘hours = hours worked per year’?
The effect of an additional kid<6 ranges from -855.7 to -13.78, and the Average Partial Effect (APE) is -526.2 One additional child (younger of six) reduces averagely the yearly working time by approximately 526 hours.
Can you compare these coefficients?
Coefficients of OLS vs. Tobit are only comparable in
signs, not in magnitudes! To compare magnitudes, we have to compare e.g. APEs (for OLS: the coefficient, for Tobit: the APE on observed value)
How can you solve the problem of taking the logarithm of censored data (where 0 is a frequent value)?