“…There has been a rich body of work [1, 3, 5, 7, 10-12, 17, 21, 26] that has focused on generating animations from still images. While [3,7,12,17,21,26] focus on uncontrollable image-to-video synthesis, attempts [1,2,5,10,11] have been made for controllable image-to-video synthesis with the user-provided direction of the motion of the objects in the images. While these methods provide some control to the user, they suffer from certain drawbacks.…”