2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.01982
|View full text |Cite
|
Sign up to set email alerts
|

PINA: Learning a Personalized Implicit Neural Avatar from a Single RGB-D Video Sequence

Abstract: Figure 1. We propose PINA, a method to acquire personalized and animatable neural avatars from RGB-D videos. Left: our method uses only a single sequence, captured via a commodity depth sensor. The depth frames are noisy and contain only partial views of the body. Middle: Using global optimization, we fuse these partial observations into an implicit surface representation that captures geometric details, such as loose clothing. The shape is learned alongside a pose-independent skinning field, supervised only v… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 42 publications
(21 citation statements)
references
References 63 publications
0
15
0
Order By: Relevance
“…These approaches use SMPL as a prior to unpose the human body across multiple frames by transforming the rays from observation space to canonical space which is then rendered using a NeRF. PINA [18] learns a SDF and a learned deformation field to create an animatable avatar from an RGB-D image sequence. Chen et al [15] used a polygon rasterization pipeline to speed up NeRF rendering as a post-process; however, their approach does not reduce NeRF training times.…”
Section: Human Reconstruction Via Optimizationmentioning
confidence: 99%
See 2 more Smart Citations
“…These approaches use SMPL as a prior to unpose the human body across multiple frames by transforming the rays from observation space to canonical space which is then rendered using a NeRF. PINA [18] learns a SDF and a learned deformation field to create an animatable avatar from an RGB-D image sequence. Chen et al [15] used a polygon rasterization pipeline to speed up NeRF rendering as a post-process; however, their approach does not reduce NeRF training times.…”
Section: Human Reconstruction Via Optimizationmentioning
confidence: 99%
“…Rendering the visual hull of both objects gives us the same set of binary silhouettes for all camera angles, making it ambiguous for any optimization scheme to recover a unique mesh from the set of silhouettes. This ambiguity necessitates the use of auxiliary inputs, e.g., depth or normals [18,69,47]. Now consider the same scenario, but with the same overlaid texture on both objects.…”
Section: A Motivating Toy Problemmentioning
confidence: 99%
See 1 more Smart Citation
“…To overcome the topology and resolution limitations of meshes, other representations, including point clouds [35,37,65], implicit surfaces [12,47,52,55,58,60], and radiance fields [32,45,49,56,63], have been explored. In particular, neural implicit surface representations have emerged as a powerful tool to model 3D (clothed) human shapes [6,15,17,20,21,30,41,50,51,62,66,67] due to their topological flexibility and resolution independence. Recent work [12,52,58] uses implicit surfaces to learn human avatars for a single subject, wearing a specific garment.…”
Section: Related Workmentioning
confidence: 99%
“…This approach has been shown to generalize to arbitrary poses. While SNARF [12] only models the major body bones, other works have focused on creating implicit models of the face [21,64], the hands [16], or how to model humans that appear in garments [19] and how to additionally capture appearance [47,52]. Although neural implicit avatars hold great promise, to date no model exists that holistically captures the body and all the parts that are important for human expressiveness jointly.…”
mentioning
confidence: 99%