what is hierarchical reinforcement learning?
the granularity of abstractions is larger than the fine grain of the primitive actions of the environment (taking a train instead of individual steps)
advantages of hierarchical methods
disadvantages of hierarchical methods
what is the options framework?
Whenever a state is reached that is a subgoal, then, apart from following a primitive action (main policy), you can follow the option policy, a macro action consisting of a different subpolicy specially aimed at satisfying the subgoal in one large step. In this way macros are incorporated into the reinforcement learning framework.
what is an option?
a group of actions with a termination condition
what is an option?
a group of actions with a termination condition. They take in environment observations and output actions until a
termination condition is met.
what are the tree elements of an option π?
what are macros?
any group of actions, possibly open-ended
what is intrinsic motivation?
An inner drive to explore, named so to contrast it with classic extrinsic motivation (the conventional RL reward signal). They are related to reward signals for achieving subgoals
How do multi agent and hierarchical reinforcement learning fit together?
agents often work together in teams or other hierarchical structure
what is so special about Montezuma’s Revenge?
it is a difficult situation to learn for RL, because is has little reward signal and the reward signal is delayed. It consists of long stretches in which the agent has to walk without the reward changing.