2020
DOI: 10.1016/j.imavis.2020.103886
GANILLA: Generative adversarial networks for image to illustration translation

Abstract: In this paper, we explore illustrations in children's books as a new domain in unpaired image-to-image translation. We show that although the current state-of-the-art image-to-image translation models successfully transfer either the style or the content, they fail to transfer both at the same time. We propose a new generator network to address this issue and show that the resulting network strikes a better balance between style and content. There are no well-defined or agreed-upon evaluation metrics for unpair…

Cited by 51 publications (40 citation statements). References 33 publications.
“…The animation images are also segmented manually according to the same semantic classes of the photo images. The previous models [1], [2], [20], [21], [36]- [38] were retrained with these datasets for fair comparisons. All images were resized to 256 × 256 for training.…”
Section: A. Implementation Details
confidence: 99%
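The preprocessing step quoted above (resizing all training images to 256 × 256) can be sketched in a few lines. This is a minimal nearest-neighbour version assuming H×W×C numpy arrays; the cited works almost certainly use a library resampler (bicubic or similar), so treat this as an illustration of the step, not their implementation:

```python
import numpy as np

def resize_nn(img: np.ndarray, size: int = 256) -> np.ndarray:
    """Nearest-neighbour resize of an HxWxC image to size x size."""
    h, w = img.shape[:2]
    # map each output row/column back to its nearest source index
    rows = np.arange(size) * h // size
    cols = np.arange(size) * w // size
    return img[rows[:, None], cols[None, :]]

# demo on a synthetic 480x640 RGB photo
photo = np.random.randint(0, 256, (480, 640, 3), dtype=np.uint8)
resized = resize_nn(photo)  # shape (256, 256, 3)
```

In practice a training pipeline would apply this uniformly to both the photo and the animation domains so the generator always sees fixed-size inputs.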
“…Fig. 7 compares the qualitative results of our method with the CycleGAN [1], DiscoGAN [21], DualGAN [20], MUNIT [36], CartoonGAN [2], U-GAT-IT [37], and GANILLA [38]. It can be seen by comparing the results object-by-object that our method produces the best results with the fine details of the target anime object styles.…”
Section: A. Implementation Details
confidence: 99%
“…Chen et al [116] propose an adversarial gated network, called Gated-GAN, to transfer multiple styles with a single model built from three modules: an encoder, a gated transformer, and a decoder. GANILLA [117] is a novel framework proposed to strike a better balance between content and style. Style transfer is the process of rendering the content of an image in a specific style while preserving that content, as shown in Figure 6.…”
Section: Style Transfer
confidence: 99%
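The content/style balance discussed above can be made concrete with the classic explicit style statistic: the Gram matrix of a feature map, which captures channel correlations (style) while discarding spatial layout (content). GANILLA and Gated-GAN learn style adversarially rather than via Gram matrices, so the sketch below is only an intuition aid, not either paper's method:

```python
import numpy as np

def gram_matrix(features: np.ndarray) -> np.ndarray:
    """Gram matrix of a CxHxW feature map: channel-wise correlations
    that summarize texture/style independently of spatial position."""
    c, h, w = features.shape
    f = features.reshape(c, h * w)
    return f @ f.T / (h * w)

def style_distance(fa: np.ndarray, fb: np.ndarray) -> float:
    """Frobenius distance between the Gram matrices of two feature maps:
    small when the two images share style statistics."""
    return float(np.linalg.norm(gram_matrix(fa) - gram_matrix(fb)))

# two feature maps with identical style statistics have distance zero
fa = np.random.rand(8, 16, 16)
d_same = style_distance(fa, fa)
```

Because the Gram matrix throws away *where* features occur, a model can match it perfectly while destroying content, which is exactly the imbalance the quoted passage says GANILLA tries to avoid.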
“…Metrics of success for image-to-image translation usually evaluate the quality of generated images using either a limited number of test images or user studies. Evaluation on a limited test set must consider both style and content simultaneously, which is difficult to do [117]. In addition, user studies rest on human judgment, which is a subjective metric [1].…”
Section: Lack of Evaluation Metrics
confidence: 99%
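One widely used automatic proxy for the quality judgment discussed above is the Fréchet Inception Distance (FID), which compares Gaussian fits to feature embeddings of real and generated images. It is not a metric proposed in the GANILLA paper (which notes the lack of agreed-upon metrics); the sketch below assumes the mean and covariance of the embeddings have already been computed:

```python
import numpy as np
from scipy.linalg import sqrtm

def fid(mu1: np.ndarray, cov1: np.ndarray,
        mu2: np.ndarray, cov2: np.ndarray) -> float:
    """Frechet distance between two Gaussians N(mu1, cov1), N(mu2, cov2):
    ||mu1 - mu2||^2 + Tr(cov1 + cov2 - 2 (cov1 cov2)^{1/2})."""
    covmean = sqrtm(cov1 @ cov2)
    if np.iscomplexobj(covmean):  # numerical noise can leave tiny imaginary parts
        covmean = covmean.real
    diff = mu1 - mu2
    return float(diff @ diff + np.trace(cov1 + cov2 - 2.0 * covmean))

# identical distributions have distance zero
score = fid(np.zeros(4), np.eye(4), np.zeros(4), np.eye(4))
```

Even FID only captures distributional realism, not whether content survived translation, so it does not fully resolve the style-versus-content evaluation problem the quoted passage raises.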