The issue of planning is one that is common to Artificial
Intelligence methods: an agent must devise a sequence of actions that
leads from an initial state to a goal state. However, in the context
of RL, planning is often required to be done under uncertainty. This
arises from several sources: incomplete knowledge of states and
actions, and potentially conflicting objectives.