What is the Lasso optimisation problem?
Can drop the constant term to make the algebra easier
What is the prediction error of the Lasso solution?
What is Theorem 2: the Slow Rate of the prediction error of the Lasso solution?
What is the proof of the Theorem 2?
CHECK: Why does the proof he uses not contain the Y-bar part of the proof?
What can we note about the slow-rate of convergence of Lasso?
What is some useful notation we will use for column and component extraction operations?
How do we denote β* as a sparse vector, though using signal coordinates ?
What can we do once we have set up these sets in an artificial sort of situation?
How does this relate to Lasso, and what do we need to do this?
What is Definition 2: The compatibility condition?
As the infimum of all δ in RP such that δS is not equal to to the zero vector and the L1 norm of the noise coodinates is not too big compared to the L1 of the signal coordinates
numerator:
* Xδ –> average magnitude over all the N coordinates
denominator:
* δ2 –> looking at the signal part of delta
* as |S| is the size of S, the denominator is roughly the average of the S coordinates
IF XTX/n has a minimum eigenvalue cmin 0, what do we know about the compatibility condition? How do we verify this?
GO THROUGH THIS
around eigenvector and c_min being the answer
12 Feb 30mins
How does this change in high dimensions?
Theorem 3 (fast rate) of convergence of Lasso?
What is the proof of Theorem 3 (fast rate) of convergence of Lasso?
How do we know that the compatibility condition is a property of the design matrix?
Why does Lasso not have a closed-form solution, unlike ridge regression?
What is the definition of a subgradient of a convex function?
What do we call the set of subgradients of f at x?
What is Proposition 3 regarding the value of ∂f(x) given f is convex and differentiable at x ∈ int(C)
What is the subgradient of:
f: x I–> |x|
Proposition 4: Let f1, f2: Rd –> R be convex
a) for any α > 0, what is ∂(αf1)(x)
b) ∂(f1 + f2))(x)
c) h(x) = f1(Ax +b) what is ∂h(x)
Proposition 5: What is the (simplified) KKT condition?
What is the proof of Proposition 5: What is the (simplified) KKT condition?
What is Proposition 6 relating to the subdifferential of the l1-norm ||.||?
What is the proof of Proposition 6
12(2) Feb 20 mins–> go through the proof
What is the theorem that fully characterises Lasso solutions?
What is the proof of Theorem 4, the fully characterised Lasso solution?
EXPAND AND FILL IN THIS PROOF