2019
DOI: 10.1016/j.jvcir.2019.01.024
|View full text |Cite
|
Sign up to set email alerts
|

Learning spatiotemporal representations for human fall detection in surveillance video

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 59 publications
(31 citation statements)
references
References 48 publications
0
24
0
Order By: Relevance
“…Though such a limitation is imposed, literature shows that traditional machine learning, based on support vector machines, hidden Markov models, and decision trees are still very active in the field of fall detection that uses individual wearable non-visual or ambient sensors (e.g., accelerometer) (Wang et al, 2017a , b ; Chen et al, 2018 ; Saleh and Jeannès, 2019 ; Wu et al, 2019 ). For visual sensors the trend has been moving toward deep learning for convolutional neural networks (CNN) (Adhikari et al, 2017 ; Kong et al, 2019 ; Han et al, 2020 ), or LSTM (Shojaei-Hashemi et al, 2018 ). Deep learning is a sophisticated learning framework that besides the mapping function (as mentioned above and used in traditional machine learning), it also learns the features (in a hierarchy fashion) that characterize the concerned classes (e.g., falls and no falls).…”
Section: Fall Detection Using Individual Sensorsmentioning
confidence: 99%
“…Though such a limitation is imposed, literature shows that traditional machine learning, based on support vector machines, hidden Markov models, and decision trees are still very active in the field of fall detection that uses individual wearable non-visual or ambient sensors (e.g., accelerometer) (Wang et al, 2017a , b ; Chen et al, 2018 ; Saleh and Jeannès, 2019 ; Wu et al, 2019 ). For visual sensors the trend has been moving toward deep learning for convolutional neural networks (CNN) (Adhikari et al, 2017 ; Kong et al, 2019 ; Han et al, 2020 ), or LSTM (Shojaei-Hashemi et al, 2018 ). Deep learning is a sophisticated learning framework that besides the mapping function (as mentioned above and used in traditional machine learning), it also learns the features (in a hierarchy fashion) that characterize the concerned classes (e.g., falls and no falls).…”
Section: Fall Detection Using Individual Sensorsmentioning
confidence: 99%
“…Then, the VGG-16-based Con-vNet takes dynamic image as input and outputs the scores of four phases: standing, falling, fallen, and not moving. In reference [34], a three-stream Convolutional Neural Network is used to model the spatio-temporal representations in videos. The inputs to the three-stream Convolutional Neural Network are silhouettes, motion history images, and dynamic images.…”
Section: The Overview Of Proposed Methodsmentioning
confidence: 99%
“…Then, the extracted C3D features are fed into an LSTM-based attention network. In references [21,[33][34][35], the proposed models detect fall at frame level, therefore, can only detect fall only in the temporal dimension. In essence, the four methods refs.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…It increases the user's complexity and reduces the user's experience in repeated wearing. Kong et al [27] propose an effective fall detection method based on computer vision-based framework, which learned to take full advantage of the appearance and motion information. Leila et al [28] propose a machine vision-based system combined with Support Vector Machine (SVM) classifier, which has high sensitivity and specificity of 100% and 97.5% respectively.…”
Section: Introductionmentioning
confidence: 99%