2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00239
|View full text |Cite
|
Sign up to set email alerts
|

PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization

Abstract: Recent advances in image-based 3D human shape estimation have been driven by the significant improvement in representation power afforded by deep neural networks. Although current approaches have demonstrated the potential in real world settings, they still fail to produce reconstructions with the level of detail often present in the input images. We argue that this limitation stems primarily form two conflicting requirements; accurate predictions require large context, but precise predictions require high res… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
990
1
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 1,073 publications
(1,052 citation statements)
references
References 98 publications
2
990
1
1
Order By: Relevance
“…skirts or dresses. Implicit function based representations [71,56,46,48,31] might be beneficial to deal with different topologies, but they do not allow control. Although it is remarkable that our model can predict the occluded appearance of the person, the model struggles to predict high frequency detail and complex texture patterns.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…skirts or dresses. Implicit function based representations [71,56,46,48,31] might be beneficial to deal with different topologies, but they do not allow control. Although it is remarkable that our model can predict the occluded appearance of the person, the model struggles to predict high frequency detail and complex texture patterns.…”
Section: Discussionmentioning
confidence: 99%
“…Hence, in this work, we learn from complete texture maps obtained from 3D registrations. 3D person reconstruction from images While promising, recent methods for 3D person reconstruction either require video as input [6,7,8], scans [74], do not allow control over pose, shape and clothing [48,56], focus only on faces [72,32,63,57,47,62], or only on garments [68].…”
Section: Related Workmentioning
confidence: 99%
“…Varol et al [35] proposed a synthetic human dataset for monocular model based human segmentation and depth estimation. However, synthetic data trained models suffer from limitations on real world images in high-frequency depth estimation of the human body [29]. [13] introduced another synthetic human dataset to train multi-view surface estimation network.…”
Section: Related Workmentioning
confidence: 99%
“…Existing state-of-the-art virtual try-on systems require a depth camera for tracking and overlay the human body with the t garment. Saito et al [27] introduced a novel pixel-aligned implicit function, which spatially aligns the pixel-level information of the input image with the shape of the 3D object, for deep learning-based 3D shape and texture inference of clothed humans from a single input image. But this method should train an encoder to learn individual feature vectors for each pixel of an image, which is timeconsuming.…”
Section: Image-basedmentioning
confidence: 99%