2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020
DOI: 10.1109/cvpr42600.2020.00771
|View full text |Cite
|
Sign up to set email alerts
|

Deep Image Spatial Transformation for Person Image Generation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
175
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
2
2

Relationship

1
8

Authors

Journals

citations
Cited by 181 publications
(191 citation statements)
references
References 23 publications
0
175
0
Order By: Relevance
“…The performance may be vulnerable to parsing errors. Some other methods tackle this task by proposing efficient spatial transformation modules [17,24,26,27,29,33]. Siarohin et al [26] introduce deformable skip connections to spatially transform the source neural textures with a set of affine transformations.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The performance may be vulnerable to parsing errors. Some other methods tackle this task by proposing efficient spatial transformation modules [17,24,26,27,29,33]. Siarohin et al [26] introduce deformable skip connections to spatially transform the source neural textures with a set of affine transformations.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, advances in computer vision fields have made tremendous progress in generating realistic images [2,7,13,14]. Some algorithms [19,21,24,26] are proposed to automatically synthesize person images from references using learning-based methods. Formally, the pose-guided person image synthesis task aims to synthesize person images by transforming the poses of reference images according to the given modifications while preserving the Session 25: Multimedia Art, Entertainment and Culture MM '21, October 20-24, 2021, Virtual Event, China reference identities.…”
Section: Introductionmentioning
confidence: 99%
“…At an early stage, multiple deep-learning structures, such as GAN [30] and variational autoencoders (VAE) [31], were attempted. Because features in a real scene are difficult to represent with a latent tensor of fixed length, recent models have been based on a GAN with an attention mechanism in the generator [32], [33]. In addition, adversarial learning has also shown its effectiveness with many other tasks, such as aging faces [34], elevation data simulation [35], 3D imagebased unconditional geostatistical simulation [36], particle interaction [37] and ultrasonic image generation [38].…”
Section: Gan Models Related To Lucc Predictionmentioning
confidence: 99%
“…On this basis, Men et al [64] put forward a new network architecture with style block connections and a human parser to separate the attributes and encode them respectively. In order to deal with person image spatial transformation problems, Ren et al [65] combine flow-based operations with attention mechanisms and the model consists of a Global Flow Field Estimator and a Local Neural Texture Renderer. Furthermore, [66], [67] also use an unsupervised manner to tackle this task via end-to-end training.…”
Section: Pose-guided Person Image Generationmentioning
confidence: 99%