2017
DOI: 10.1177/1729881417709079
|View full text |Cite
|
Sign up to set email alerts
|

Collecting public RGB-D datasets for human daily activity recognition

Abstract: Human daily activity recognition has been a hot spot in the field of computer vision for many decades. Despite best efforts, activity recognition in naturally uncontrolled settings remains a challenging problem. Recently, by being able to perceive depth and visual cues simultaneously, RGB-D cameras greatly boost the performance of activity recognition. However, due to some practical difficulties, the publicly available RGB-D data sets are not sufficiently large for benchmarking when considering the diversity o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 36 publications
0
3
0
Order By: Relevance
“…Generally, the data have been compared with those collected by other kinds of sensors like the inertial sensor in [45] to test multimodal approaches and develop multimodal datasets [46], [47]. In particular, the diffusion of the Kinect sensor and its accessibility, has allowed the creation of different datasets on the basis of gesture, primitive movements, action and activities as in [48] and [49]. This sensor has also been used as a camera for monitoring emotions related to pain in a rehabilitation context [50].…”
Section: A Related Workmentioning
confidence: 99%
“…Generally, the data have been compared with those collected by other kinds of sensors like the inertial sensor in [45] to test multimodal approaches and develop multimodal datasets [46], [47]. In particular, the diffusion of the Kinect sensor and its accessibility, has allowed the creation of different datasets on the basis of gesture, primitive movements, action and activities as in [48] and [49]. This sensor has also been used as a camera for monitoring emotions related to pain in a rehabilitation context [50].…”
Section: A Related Workmentioning
confidence: 99%
“…They also developed a spatio temporal AND-OR graph for recognizing actions. Hanbo Wu et al [15] combined the above three datasets with MSR Daily Activity 3D and Cornell activity data set (CAD) and created a new dataset with 4528 samples of seven major categories. These actions have 46 subcategories and those had been performed by 74 persons.…”
Section: Literature Surveymentioning
confidence: 99%
“…Part II, which is composed of five papers, is related to heterogonous multimodal methods. The first paper 12 assembles a large-scale RGB-D activity data set by merging five public RGB-D data sets that differ from each other on many aspects such as length of actions, nationality of subjects, or camera angles. This data set provides a new benchmark for research purposes.…”
Section: The Papersmentioning
confidence: 99%