What is the multivariate MSE?
Var(theta^hat) + (bias(theta hat))(bias(theta hat)’
What is the multivariate normal distribution?
f(x) = 2 pi ^(-n/2) |𝝨|^(1/2) exp(-(1/2) (x-mu)’ 𝝨^-1(x-mu))
With mu = vector of mu
And Sigma = diagonal matrix of variances
How can a (multivariate) normal distribution be rewritten to a chi squared distribution?
(x - μ)’Σ^-1(x-μ) or
If x is N(0, I.n) distributed and matrix A is symmetric idempotent of rank r then
x’Ax
How can a Normal and chi squared distribution be rewritten to a students t-distribution?
x/sqrt(w/n) is t(n) distributed, this converges to a normal distribution
How can two chi squared distributions be rewritten to a F-distribution?
(v/m)/(w/n) is F(m, n) distributed (given that v and w are independent.
How can a Student’s t-distribution be rewritten to a F-distribution?
t^2 = F(1, n) distribution
What can be said about the limit of a F-distribution?
If w/n -> inf then F(m, n) is (chi squared (m))/m distributed
What is the derivation of the Least Squares estimation?
ɸ(β) = y’y = y’Xβ - β’X’y + β’X’Xβ
= y’y - 2y’Xβ + β’X’Xβ (because y’Xβ = β’X’y)
derivative of ɸ(β) with respect to β = -2y’X + 2β’X’X = 0
gives
X’Xb = X’y, so b = (X’X)-1X’y
What is the residual maker matrix, and why is it named like that?
What is the formula for b?
(X’X)-1X’y
What is the least absolute deviations estimator?
Instead of minimizing the sum on ui2, we now minimize the sum of the absolute value of ui.
What are the advantages and disadvantages of the least absolute deviations?
Advantages:
Disadvantages:
How is R2 derived?
(yhat‘yhat)/(yhat‘yhat) = 1 - (e’e)/(yhat‘yhat)
Ay = AXb + Ae = AXB + e, so then
y’Ay = b’X’AXb + e’e (a.k.a. SST (a.k.a. TSS) = SSE + SSR)
(AB)’ = … ?
B’A’
(AB)-1 = … ?
B-1A-1
What is ɩ?
It’s a vector of ones.