Photographic Text-to-Image Synthesis with a Hierarchically-Nested Adversarial Network

Zhang, Zizhao; Xie, Yuanpu; Yang, Lin

doi:10.1109/cvpr.2018.00649

Cited by 273 publications

(251 citation statements)

References 48 publications

(86 reference statements)

Supporting

Mentioning

251

Contrasting

Order By: Relevance

“…For fair comparison, the multi-scale resolution settings are the same as used with the experiments on the CelebA dataset. In particular, we use 32 × 32 and 64 × 64 resolution scales for StackGAN [15] training and 16 × 16, 32 × 32 and 64 × 64 multiple resolution scales for HDGAN [17] and StackGAN++ [16] as well as our proposed method. The disCVAE method produces reconstructions which are blurry.…”

Section: B Lfw Dataset Resultsmentioning

confidence: 99%

“…Previous conditional GAN-based approaches such as GAN-INT-CLS [14] and StackGAN [15] also produce poor quality results due to the model collapse during training. Recent StackGAN++ and HDGAN works generate plausible facial images (HDGAN is better at the color Methods FID score HDGAN [17] 114.912 StackGAN++ [16] 35.988 Single-scale (proposed method) 37.381 Proposed method 30.566 diversity). The previous work Attribute2Sketch2Face, which is a combination of CVAE and GAN, is also able to generate facial images with corresponding attributes.…”

Section: B Lfw Dataset Resultsmentioning

confidence: 99%

“…The CelebA-HQ dataset [50] is a high-quality version of the CelebA dataset, which consists of 30,000 images with 1024 × 1024 resolution. Due to GPU and memory limitations, we conduct experiments on 256 × 256 resolution images and compare the performance with StackGAN++ [16] and HDGAN [17]. The reason why we chose these two baselines is due to their capability to deal with high resolution images.…”

Section: Celeba-hq Dataset Resultsmentioning

confidence: 99%

“…Xu et al [18] proposed an attentiondriven method to improve the synthesis results. Zhang et al [17] (HDGAN) adopted a multi-adversarial loss to improve the synthesis by leveraging more effective image and text information at multi-scale layers.…”

Section: Background and Related Workmentioning

confidence: 99%

“…4. In order to learn the discrimination in both image content and semantics, we adopt the triplet matching training strategy [14], [16], [17], [53]. Specifically, given sketch attributes, the discriminator is trained by using the following triplets: (i) real-sketch and real-sketch-attributes, (ii) synthesized-sketch and real-sketch-attributes, and (iii) wrong-sketch (real sketch but mismatching attributes) and same real-sketch-attributes.…”

Section: A Stage 1: Attribute-to-sketchmentioning

confidence: 99%

See 4 more Smart Citations

Facial Synthesis From Visual Attributes via Sketch Using Multiscale Generators

Patel

2020

IEEE Trans. Biom. Behav. Identity Sci.

View full text Add to dashboard Cite

Automatic synthesis of faces from visual attributes is an important problem in computer vision and has wide applications in law enforcement and entertainment. With the advent of deep generative convolutional neural networks (CNNs), attempts have been made to synthesize face images from attributes and text descriptions. In this paper, we take a different approach, where we formulate the original problem as a stagewise learning problem. We first synthesize the facial sketch corresponding to the visual attributes and then we generate the face image based on the synthesized sketch. The proposed framework, is based on a combination of two different Generative Adversarial Networks (GANs) -(1) a sketch generator network which synthesizes realistic sketch from the input attributes, and (2) a face generator network which synthesizes facial images from the synthesized sketch images with the help of facial attributes. Extensive experiments and comparison with recent methods are performed to verify the effectiveness of the proposed attributebased two-stage face synthesis method.

show abstract

Section: B Lfw Dataset Resultsmentioning

confidence: 99%

Section: B Lfw Dataset Resultsmentioning

confidence: 99%

Section: Celeba-hq Dataset Resultsmentioning

confidence: 99%

Section: Background and Related Workmentioning

confidence: 99%

Section: A Stage 1: Attribute-to-sketchmentioning

confidence: 99%

See 3 more Smart Citations

Facial Synthesis From Visual Attributes via Sketch Using Multiscale Generators

Patel

2020

IEEE Trans. Biom. Behav. Identity Sci.

View full text Add to dashboard Cite

show abstract

Efficient land desertification detection using a deep learning‐driven generative adversarial network approach: A case study

Zerrouki

Dairi

Harrou

et al. 2021

Concurrency and Computation

View full text Add to dashboard Cite

Summary Precisely detecting land cover changes aids in improving the analysis of the dynamics of the landscape and plays an essential role in mitigating the effects of desertification. Mainly, sensing desertification is challenging due to the high correlation between desertification and like‐desertification events (e.g., deforestation). An efficient and flexible deep learning approach is introduced to address desertification detection through Landsat imagery. Essentially, a generative adversarial network (GAN)‐based desertification detector is designed and for uncovering the pixels influenced by land cover changes. In this study, the adopted features have been derived from multi‐temporal images and incorporate multispectral information without considering image segmentation preprocessing. Furthermore, to address desertification detection challenges, the GAN‐based detector is constructed based on desertification‐free features and then employed to identify atypical events associated with desertification changes. The GAN‐detection algorithm flexibly learns relevant information from linear and nonlinear processes without prior assumption on data distribution and significantly enhances the detection's accuracy. The GAN‐based desertification detector's performance has been assessed via multi‐temporal Landsat optical images from the arid area nearby Biskra in Algeria. This region is selected in this work because desertification phenomena heavily impact it. Compared to some state‐of‐the‐art methods, including deep Boltzmann machine (DBM), deep belief network (DBN), convolutional neural network (CNN), as well as two ensemble models, namely, random forests and AdaBoost, the proposed GAN‐based detector offers superior discrimination performance of deserted regions. Results show the promising potential of the proposed GAN‐based method for the analysis and detection of desertification changes. Results also revealed that the GAN‐driven desertification detection approach outperforms the state‐of‐the‐art methods.

show abstract

A novel hybrid augmented loss discriminator for text‐to‐image synthesis

Gan

Liu

et al. 2020

Int J Intell Syst

View full text Add to dashboard Cite

Photographic Text-to-Image Synthesis with a Hierarchically-Nested Adversarial Network

Cited by 273 publications

References 48 publications

Facial Synthesis From Visual Attributes via Sketch Using Multiscale Generators

Facial Synthesis From Visual Attributes via Sketch Using Multiscale Generators

Efficient land desertification detection using a deep learning‐driven generative adversarial network approach: A case study

A novel hybrid augmented loss discriminator for text‐to‐image synthesis

Contact Info

Product

Resources

About