2020
DOI: 10.48550/arxiv.2010.14982
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Toyota Smarthome Untrimmed: Real-World Untrimmed Videos for Activity Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(9 citation statements)
references
References 0 publications
0
9
0
Order By: Relevance
“…Action detection has received a lot of interest in recent years Dai et al (2021a); Huang et al (2020); Dai et al (2019a); Piergiovanni & Ryoo (2018). In this work, we focus on densely labelled action detection for handling videos with additional temporal relationships between different action classes Yeung et al (2018);Sigurdsson et al (2016); Dai et al (2020). Different from the sparsely labelled detection methods which output a sparse set of action snippets Caba Heilbron et al ( 2015 2019) frameworks apply temporal filters over snippet-wise features with a snippet-level classification, therefore, it interprets the sequence of images to a sequence of predictions.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Action detection has received a lot of interest in recent years Dai et al (2021a); Huang et al (2020); Dai et al (2019a); Piergiovanni & Ryoo (2018). In this work, we focus on densely labelled action detection for handling videos with additional temporal relationships between different action classes Yeung et al (2018);Sigurdsson et al (2016); Dai et al (2020). Different from the sparsely labelled detection methods which output a sparse set of action snippets Caba Heilbron et al ( 2015 2019) frameworks apply temporal filters over snippet-wise features with a snippet-level classification, therefore, it interprets the sequence of images to a sequence of predictions.…”
Section: Related Workmentioning
confidence: 99%
“…Datasets. We evaluate our method on three densely labelled action detection datasets: Charades Sigurdsson et al ( 2017), TSU Dai et al (2020) and MultiTHUMOS Yeung et al (2018). These datasets contain videos of different types: (1) sports and daily living videos, (2) short and long videos.…”
Section: Experimental Settingsmentioning
confidence: 99%
“…Action detection has received a lot of interest in recent years [10,12,18,26,44,46]. In this work, we focus on action detection in densely labelled videos [8,35,45]. The early attempts on modeling complex temporal relations are to use anchor-based methods [5,43], although dense action distribution requires large amount of anchors.…”
Section: Related Workmentioning
confidence: 99%
“…The input to the MS-TCT action detection network is an untrimmed video that can last for a very long duration [8]. Processing long videos in both spatial and temporal dimensions challenge the current computation resources.…”
Section: Visual Encodingmentioning
confidence: 99%
See 1 more Smart Citation