Seungjoo Yoo scite author profile

Despite recent advancements in deep learning-based automatic colorization, they are still limited when it comes to few-shot learning. Existing models require a significant amount of training data. To tackle this issue, we present a novel memory-augmented colorization model Memo-Painter that can produce high-quality colorization with limited data. In particular, our model is able to capture rare instances and successfully colorize them. We also propose a novel threshold triplet loss that enables unsupervised training of memory networks without the need of class labels. Experiments show that our model has superior quality in both few-shot and one-shot colorization tasks.

show abstract

Coloring with Words: Guiding Image Colorization Through Text-Based Palette Generation

Bahng

Yoo

Cho

et al. 2018

View full text Add to dashboard Cite

This paper proposes a novel approach to generate multiple color palettes that reflect the semantics of input text and then colorize a given grayscale image according to the generated color palette. In contrast to existing approaches, our model can understand rich text, whether it is a single word, a phrase, or a sentence, and generate multiple possible palettes from it. For this task, we introduce our manually curated dataset called Palette-and-Text (PAT). Our proposed model called Text2Colors consists of two conditional generative adversarial networks: the text-topalette generation networks and the palette-based colorization networks. The former captures the semantics of the text input and produce relevant color palettes. The latter colorizes a grayscale image using the generated color palette. Our evaluation results show that people preferred our generated palettes over ground truth palettes and that our model can effectively reflect the given palette when colorizing an image.

show abstract

Exploring Unlabeled Faces for Novel Attribute Discovery

Bahng¹,

Chung

Yoo

et al. 2020

View full text Add to dashboard Cite

MEGAN: Mixture of Experts of Generative Adversarial Networks for Multimodal Image Generation

Park¹,

Yoo²,

Bahng³

et al. 2018

View full text Add to dashboard Cite

Recently, generative adversarial networks (GANs) have shown promising performance in generating realistic images. However, they often struggle in learning complex underlying modalities in a given dataset, resulting in poor-quality generated images. To mitigate this problem, we present a novel approach called mixture of experts GAN (MEGAN), an ensemble approach of multiple generator networks. Each generator network in MEGAN specializes in generating images with a particular subset of modalities, e.g., an image class. Instead of incorporating a separate step of handcrafted clustering of multiple modalities, our proposed model is trained through an end-to-end learning of multiple generators via gating networks, which is responsible for choosing the appropriate generator network for a given condition. We adopt the categorical reparameterization trick for a categorical decision to be made in selecting a generator while maintaining the flow of the gradients. We demonstrate that individual generators learn different and salient subparts of the data and achieve a multiscale structural similarity (MS-SSIM) score of 0.2470 for CelebA and a competitive unsupervised inception score of 8.33 in CIFAR-10.

show abstract

MEGAN: Mixture of Experts of Generative Adversarial Networks for Multimodal Image Generation

Park¹,

Yoo²,

Bahng³

et al. 2018

Preprint

View full text Add to dashboard Cite

Recently, generative adversarial networks (GANs) have shown promising performance in generating realistic images. However, they often struggle in learning complex underlying modalities in a given dataset, resulting in poor-quality generated images.To mitigate this problem, we present a novel approach called mixture of experts GAN (MEGAN), an ensemble approach of multiple generator networks. Each generator network in MEGAN specializes in generating images with a particular subset of modalities, e.g., an image class. Instead of incorporating a separate step of handcrafted clustering of multiple modalities, our proposed model is trained through an end-to-end learning of multiple generators via gating networks, which is responsible for choosing the appropriate generator network for a given condition. We adopt the categorical reparameterization trick for a categorical decision to be made in selecting a generator while maintaining the flow of the gradients. We demonstrate that individual generators learn different and salient subparts of the data and achieve a multiscale structural similarity (MS-SSIM) score of 0.2470 for CelebA and a competitive unsupervised inception score of 8.33 in CIFAR-10.

show abstract

Coloring with Words: Guiding Image Colorization Through Text-based Palette Generation

Bahng¹,

Yoo²,

Cho³

et al. 2018

Preprint

View full text Add to dashboard Cite

Coloring With Limited Data: Few-Shot Colorization via Memory-Augmented Networks

Yoo¹,

Bahng²,

Chung³

et al. 2019

Preprint

View full text Add to dashboard Cite

Exploring Unlabeled Faces for Novel Attribute Discovery

Bahng¹,

Chung²,

Yoo³

et al. 2019

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Seungjoo Yoo

Coloring With Limited Data: Few-Shot Colorization via Memory Augmented Networks

Coloring with Words: Guiding Image Colorization Through Text-Based Palette Generation

Exploring Unlabeled Faces for Novel Attribute Discovery

MEGAN: Mixture of Experts of Generative Adversarial Networks for Multimodal Image Generation

MEGAN: Mixture of Experts of Generative Adversarial Networks for Multimodal Image Generation

Coloring with Words: Guiding Image Colorization Through Text-based Palette Generation

Coloring With Limited Data: Few-Shot Colorization via Memory-Augmented Networks

Exploring Unlabeled Faces for Novel Attribute Discovery

Contact Info

Product

Resources

About