“…The Generator is applied sequentially, producing the output frames one after the other, until the entire output sequence has been created. Similar to Head2Head [15], the Generator consists of two identical encoders, operating in parallel, as well as a decoder. The first encoder receives the concatenated NMFC and eye images $X_{t-2:t}$, while the second is given the two previously generated frames $\tilde{Y}_{t-2:t-1}$.…”
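
The sequential procedure in the excerpt can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the names `generate_sequence`, `encode`, `toy_generator`, and `blank` are hypothetical, frames are stood in for by flat lists of floats, and the toy encoder and decoder are simple averages rather than networks. The excerpt does not say how the first frames are handled or how the two encoder outputs are merged, so the start-of-sequence padding (repeating the first conditioning image, using a blank frame for missing outputs) and the averaging in the decoder are assumptions made only for illustration.

```python
from typing import Callable, List

Frame = List[float]  # stand-in for an image; a real model would use tensors


def generate_sequence(
    conditions: List[Frame],
    generator: Callable[[List[Frame], List[Frame]], Frame],
    blank: Frame,
) -> List[Frame]:
    """Produce output frames one after the other.

    conditions[t] plays the role of the concatenated NMFC and eye image
    for frame t.
    """
    outputs: List[Frame] = []
    for t in range(len(conditions)):
        # X_{t-2:t}: the current conditioning image and the two before it.
        # Assumption: repeat the first image when t - 2 or t - 1 is negative.
        x_window = [conditions[max(i, 0)] for i in range(t - 2, t + 1)]
        # Y~_{t-2:t-1}: the two previously generated frames.
        # Assumption: use a blank frame where no output exists yet.
        y_prev = [outputs[i] if i >= 0 else blank for i in range(t - 2, t)]
        outputs.append(generator(x_window, y_prev))
    return outputs


def encode(frames: List[Frame]) -> Frame:
    """Toy encoder: element-wise mean over the frames in the window."""
    return [sum(values) / len(frames) for values in zip(*frames)]


def toy_generator(x_window: List[Frame], y_prev: List[Frame]) -> Frame:
    """Two encoders with the same structure, followed by a decoder."""
    feat_x = encode(x_window)  # first encoder: conditioning images
    feat_y = encode(y_prev)    # second encoder: previous outputs
    # Toy decoder (assumption): average the two feature vectors.
    return [0.5 * (a + b) for a, b in zip(feat_x, feat_y)]


if __name__ == "__main__":
    frames = generate_sequence([[1.0], [2.0], [3.0]], toy_generator, blank=[0.0])
    print(frames)
```

The loop makes the dependency explicit: each call consumes outputs from the two preceding steps, which is why the frames cannot be produced in parallel along the time axis.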