“…The most widely explored approach is Programming by Demonstration (PbD) which involves taking demonstrations of a task as input and inferring the goal of the task or a policy that can be used to accomplish the task [9], [10]. A majority of the work focuses on directly learning a policy or modeling higher level actions from lower level control signals [11], [12], [13], [14], while some explore learning task goals or task structure represented in various ways [15], [16], [17], [18], [19]. Most closely related to our work, Akgun et al explored simultaneous learning of actions and goals by demonstration, focusing on manipulation tasks, such as closing a box and pouring beans into a bowl [20].…”