“…Differently from the signal models used in video spatial prediction, deep generative models are much more complex and highly non-linear, and can potentially approximate any class of signal provided that enough training samples are available to learn the original data distribution. In the last couple of years, these models have been applied to image compression [3,4,5,6,7,8,9], producing image representations able to provide, in some cases, equivalent or better visual quality than traditional image codecs [10].…”