2021
DOI: 10.48550/arxiv.2110.09936
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

NeuralDiff: Segmenting 3D objects that move in egocentric videos

Abstract: Figure 1: Given a video sequence captured from an egocentric viewpoint, we segment all the objects that the actor/observer interacts with. We achieve this by means of a neural architecture, NeuralDiff, that learns to decompose each frame into a static background and a dynamic foreground, comprising the manipulated objects, which seldomly move in the sequence, and the actor's body, which moves continually and heavily occludes the scene. The neural network contains three streams that, via different inductive bia… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 30 publications
(73 reference statements)
0
2
0
Order By: Relevance
“…The application of neural fields for "high-level" semantic tasks remains an open problem. Examples of these problems include understanding 3D scene layout [WYN21], 3D scene interaction [LMW * 21a], and grouping of data into more meaningful entities [TLV21]. Furthermore, neural fields have focused on a single data modality, but exploring the fusion of multiple modalities could be a fruitful topic of research.…”
Section: Discussionmentioning
confidence: 99%
“…The application of neural fields for "high-level" semantic tasks remains an open problem. Examples of these problems include understanding 3D scene layout [WYN21], 3D scene interaction [LMW * 21a], and grouping of data into more meaningful entities [TLV21]. Furthermore, neural fields have focused on a single data modality, but exploring the fusion of multiple modalities could be a fruitful topic of research.…”
Section: Discussionmentioning
confidence: 99%
“…Thus far, neural fields have primarily been used to solve "lowlevel" (e.g., image synthesis) and "mid-level" (e.g., 3D reconstruction) tasks. We believe that "high-level" tasks that involve grouping of data into more meaningful concepts is a fruitful direction of future work [TLV21]. A common limitation of many neural fields is their inability to generalize well to new data.…”
Section: Discussion and Conclusion 14 Discussionmentioning
confidence: 99%