“…Due to the extreme non-linearity of the networks in both the generator and the discriminator, it is highly unlikely that the training objective of GANs can be convex-concave. In particular, even if the generator and the discriminator are linear functions over prescribed feature mappings-such as the neural tangent kernel (NTK) feature mappings [3,8,9,17,18,32,35,40,41,47,51,54,65,69,92,97] -the training objective can still be non-convex-concave. 1 Even worse, unlike supervised learning where some non-convex learning problems can be shown to have no bad local minima [44], to the best of our knowledge, it still remains unclear what the qualities are of those critical points in GANs except in the most simple setting when the generator is a one-layer neural network [42,62].…”