Reward of a state
Immediate feedback from entering a state.
Utility of a state
Long term feedback from entering a state including future reward based on the policy.
Markovian Property
Only the present matters / Your transition state only depends on the current state.