2013
DOI: 10.1016/j.cviu.2013.01.013
|View full text |Cite
|
Sign up to set email alerts
|

A survey of video datasets for human action and activity recognition

Abstract: Vision-based human action and activity recognition has an increasing importance among the computer vision community with applications to visual surveillance, video retrieval and human-computer interaction. In recent years, more and more datasets dedicated to human action and activity recognition have been created. The use of these datasets allows us to compare different recognition systems with the same input data. The survey introduced in this paper tries to cover the lack of a complete description of the mos… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
183
0
1

Year Published

2014
2014
2022
2022

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 382 publications
(184 citation statements)
references
References 142 publications
0
183
0
1
Order By: Relevance
“…There is a large volume of literature on capturing spatiotemporal dependencies among visual cues for activity recognition [1,25,5]. Representative models include Dynamic Bayesian Networks [28,27], Hidden Conditional Random Fields (HCRFs) [24,17], hierarchical graphical models [17,16,15,22], AND-OR graphs [21,2], and Logic Networks [19,4].…”
Section: Related Workmentioning
confidence: 99%
“…There is a large volume of literature on capturing spatiotemporal dependencies among visual cues for activity recognition [1,25,5]. Representative models include Dynamic Bayesian Networks [28,27], Hidden Conditional Random Fields (HCRFs) [24,17], hierarchical graphical models [17,16,15,22], AND-OR graphs [21,2], and Logic Networks [19,4].…”
Section: Related Workmentioning
confidence: 99%
“…but these systems have a pervasiveness issue: the only place where the activity is recognized is in the user's home or where the sensors are located. Another kind of research venue focuses on the usage of cameras for the recognition of gestures [20][21][22]. This is especially suitable for security (e.g.…”
Section: Activity Recognition Systems For Eldersmentioning
confidence: 99%
“…Therefore, other datasets were created such as the CAVIAR, ETISEO, CASIA Action, MSR Action, HOLLYWOOD, UCF datasets, Olympic Sports and HMDB51, BEHAVE, TV Human Interaction, UT-Tower, UT-Interaction, etc. Please refer to [37] for a complete list of the currently available datasets in HMA. Due to the advancement of the technology, using networks of multiple cameras for monitoring public places such as airports, shopping malls, etc.…”
Section: [35] L Chen H Wei and J Ferrymanmentioning
confidence: 99%