2023
DOI: 10.1145/3592455
|View full text |Cite
|
Sign up to set email alerts
|

NeRSemble: Multi-view Radiance Field Reconstruction of Human Heads

Abstract: We focus on reconstructing high-fidelity radiance fields of human heads, capturing their animations over time, and synthesizing re-renderings from novel viewpoints at arbitrary time steps. To this end, we propose a new multi-view capture setup composed of 16 calibrated machine vision cameras that record time-synchronized images at 7.1 MP resolution and 73 frames per second. With our setup, we collect a new dataset of over 4700 high-resolution, high-framerate sequences of more than 220 human heads, from which w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 33 publications
0
2
0
Order By: Relevance
“…Last but not least, several datasets are used in audio‐driven face synthesis: Examples are Fanelli et al . [FGR*10], VOCAset [CBL*19], Voxceleb [NCXZ20] and NeRSemble [KQG*23b]. They provide video clips [FGR*10, NCXZ20, KQG*23b], dense and deformable 3D facial geometry [FGR*10, CBL*19] or deformable NeRFs [KQG*23b] and the corresponding synchronised audio [FGR*10,CBL*19,NCXZ20,KQG*23b].…”
Section: Datamentioning
confidence: 99%
See 2 more Smart Citations
“…Last but not least, several datasets are used in audio‐driven face synthesis: Examples are Fanelli et al . [FGR*10], VOCAset [CBL*19], Voxceleb [NCXZ20] and NeRSemble [KQG*23b]. They provide video clips [FGR*10, NCXZ20, KQG*23b], dense and deformable 3D facial geometry [FGR*10, CBL*19] or deformable NeRFs [KQG*23b] and the corresponding synchronised audio [FGR*10,CBL*19,NCXZ20,KQG*23b].…”
Section: Datamentioning
confidence: 99%
“…[FGR*10], VOCAset [CBL*19], Voxceleb [NCXZ20] and NeRSemble [KQG*23b]. They provide video clips [FGR*10, NCXZ20, KQG*23b], dense and deformable 3D facial geometry [FGR*10, CBL*19] or deformable NeRFs [KQG*23b] and the corresponding synchronised audio [FGR*10,CBL*19,NCXZ20,KQG*23b]. The number of sequences in these datasets varies from hundreds [CBL*19] (12 subjects) and thousands (14 [FGR*10] and 220 [KQG*23b] subjects) to over 10 6 [NCXZ20] (7 k subjects).…”
Section: Datamentioning
confidence: 99%
See 1 more Smart Citation