Kihyuk Sohn scite author profile

Convolutional neural network-based approaches for semantic segmentation rely on supervision with pixel-level ground truth, but may not generalize well to unseen image domains. As the labeling process is tedious and labor intensive, developing algorithms that can adapt source ground truth labels to the target domain is of great interest. In this paper, we propose an adversarial learning method for domain adaptation in the context of semantic segmentation. Considering semantic segmentations as structured outputs that contain spatial similarities between the source and target domains, we adopt adversarial learning in the output space. To further enhance the adapted model, we construct a multi-level adversarial network to effectively perform output space domain adaptation at different feature levels. Extensive experiments and ablation studies are conducted under various domain adaptation settings, including synthetic-to-real and cross-city scenarios. We show that the proposed method performs favorably against state-of-the-art methods in terms of accuracy and visual quality.

Attribute2Image: Conditional Image Generation from Visual Attributes

Yan et al. 2016

Abstract. This paper investigates a novel problem of generating images from visual attributes. We model the image as a composite of foreground and background and develop a layered generative model with disentangled latent variables that can be learned end-to-end using a variational auto-encoder. We experiment with natural images of faces and birds and demonstrate that the proposed models are capable of generating realistic and diverse samples with disentangled latent representations. We use a general energy minimization algorithm for posterior inference of latent variables given novel images. Therefore, the learned generative models show excellent quantitative and visual results in the tasks of attribute-conditioned image reconstruction and completion.

CutPaste: Self-Supervised Learning for Anomaly Detection and Localization

et al. 2021

ReMixMatch: Semi-Supervised Learning with Distribution Alignment and Augmentation Anchoring

Berthelot, Carlini, Cubuk et al. 2019

Preprint

Towards Large-Pose Face Frontalization in the Wild

et al. 2017

Despite recent advances in face recognition using deep learning, severe accuracy drops are observed for large pose variations in unconstrained environments. Learning pose-invariant features is one solution, but it needs expensively labeled large-scale data and carefully designed feature learning algorithms. In this work, we focus on frontalizing faces in the wild under various head poses, including extreme profile views. We propose a novel deep 3D Morphable Model (3DMM) conditioned Face Frontalization Generative Adversarial Network (GAN), termed FF-GAN, to generate neutral head pose face images. Our framework differs from both traditional GANs and 3DMM-based modeling. Incorporating 3DMM into the GAN structure provides shape and appearance priors for fast convergence with less training data, while also supporting end-to-end training. The 3DMM-conditioned GAN employs not only the discriminator and generator loss but also a new masked symmetry loss to retain visual quality under occlusions, along with an identity loss to recover high-frequency information. Experiments on face recognition, landmark localization and 3D reconstruction consistently show the advantage of our frontalization method on in-the-wild face datasets.

This work was supported by a research gift from NEC Labs to Michigan State University. Detailed results and resources are available at: http://cvlab.cse.msu.edu/project-face-frontalization.html.

[Figure: FF-GAN overview, showing 3DMM coefficients, generator, discriminator and recognition engine, with pose-variant and extreme-pose inputs and their frontalized outputs.]

Kihyuk Sohn

Learning to Adapt Structured Output Space for Semantic Segmentation
