“…The differences become apparent when we consider not only the method used but also the type of input that the method accepts. These input constraints can be Graph-based, taking bubble diagrams as input (Hu et al., 2020; Nauata et al., 2021; Wu et al., 2019); Language-based, taking linguistic descriptions as input to the generative model (Chen et al., 2020; Galanos, 2021); or Pixel-based, using pixel color as a constraint on the generative model, while information such as shape, orientation, or area can be further determined (Chaillou, 2020; Peters, 2018; Rahbar et al., 2019).…”