“…Subsequent work has proposed two decoders for modeling latent representations and facial attributes [20], selective transfer units [35], and geometry-aware flow [76] to further improve editing fidelity. Yao et al. [73] extend facial attribute editing to video sequences via latent transformation and an identity preservation loss, which Xu et al. [70] further improve by incorporating flow-based consistency. More recent works propose 3D-aware generative models to achieve view-consistent synthesis [8,10,49,55,63,67,71].…”