“…Similarly, the value of following a trajectory n, V n i (b i ), can be computed by iteratively applying the Bellman equation and following *This work was partially supported by Sony AI. 1 The authors are with School of Computer Science, Carnegie Mellon University {anahitam, mmv, maxim}@cs.cmu.edu. the remaining trajectory [2].…”