2018
DOI: 10.1007/978-3-030-01225-0_44
|View full text |Cite
|
Sign up to set email alerts
|

Scaling Egocentric Vision: The "Equation missing" Dataset

Abstract: First-person vision is gaining interest as it offers a unique viewpoint on people's interaction with objects, their attention, and even intention. However, progress in this challenging domain has been relatively slow due to the lack of sufficiently large datasets. In this paper, we introduce EPIC-KITCHENS, a large-scale egocentric video benchmark recorded by 32 participants in their native kitchen environments. Our videos depict non-scripted daily activities: we simply asked each participant to start recording… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

2
926
1
1

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 482 publications
(931 citation statements)
references
References 44 publications
2
926
1
1
Order By: Relevance
“…Dataset. Our previous work, EPIC Kitchens [8], offers a unique opportunity to test domain adaptation for finegrained action recognition, as it is recorded in 32 environments. Similar to previous works for action recognition [14,19], we evaluate on pairs of domains.…”
Section: Implementation Detailsmentioning
confidence: 99%
See 3 more Smart Citations
“…Dataset. Our previous work, EPIC Kitchens [8], offers a unique opportunity to test domain adaptation for finegrained action recognition, as it is recorded in 32 environments. Similar to previous works for action recognition [14,19], we evaluate on pairs of domains.…”
Section: Implementation Detailsmentioning
confidence: 99%
“…In testing, as in [58], we use an average over 5 temporal windows, equidistant within the segment. We use the RGB and Optical Flow frames provided publicly [8]. The output of F is the result of the final average pooling layer of I3D, with 1024 dimensions.…”
Section: Implementation Detailsmentioning
confidence: 99%
See 2 more Smart Citations
“…In particular, our contributions can be summarized as follows: (i) We provide an extensive evaluation and comparison with published methods of the proposed multimodal architecture on the EPIC-Kitchens dataset [12] (ii) In addition to action performance, we provide for the first time a detailed results on the object and verb components. The rest of the paper is organized as follows.…”
Section: Introductionmentioning
confidence: 99%