“…Frameworks for learning from language-based communication have been previously proposed. Common approaches include: reducing the learning problem to reinforcement learning [16,18,21,37,39,55], grounding language to demonstration [6,9,34,35,36,38,45,59], or devising EM-based algorithms to parse language into logical forms [30,41]. The first approach may discard useful learning signals from language feedback and inherits the limitations of RL algorithms.…”