When do generalized MDPs converge?
Add note about non expansions
Why does Q-Learning converge.
a
What is Convergence
a
What does Q-learning converges to?
Q*
Non Expansion
a
Contraction Mapping
a
Generalized MDP
a
Control within TD
Action chosen by the learner.
List types of non expansions.