The existing deep learning methods for human fall detection have difficulties to distinguish falls from similar daily activities such as lying down because of not using the 3D network. Meanwhile, they are not suitable for mobile devices because they are heavyweight methods and consume a large number of memories. In order to alleviate these problems, a two-stream approach to fall detection with the MobileVGG is proposed in this paper. One stream is based on the motion characteristics of the human body for detection of falls, while the other is an improved lightweight VGG network, named the MobileVGG, put forward in the paper. The MobileVGG is constructed as a lightweight network model through replacing the traditional convolution with a simplified and efficient combination of point convolution, depth convolution and point convolution. The residual connection between layers is designed to overcome the gradient disappeared and the obstruction of gradient reflux in the deep model. The experimental results show that the proposed two-stream lightweight fall classification model outperforms the existing methods in distinguishing falls from similar daily activities such as lying and reducing the occupied memory. Therefore, it is suitable for mobile devices. INDEX TERMS Deep learning, fall detection, motion characteristics, the two-stream model, the MobileVGG.
In face recognition, searching a person's face in the whole picture is generally too time-consuming to ensure highdetection accuracy. Objects similar to the human face or multi-view faces in low-resolution images may result in the failure of face recognition. To alleviate the above problems, a real-time face recognition method based on pre-identification and multiscale classification is proposed in this study. The face area is segmented based on the proportion of human faces in the pedestrian area to reduce the search range, and faces can be robustly detected in complicated scenarios such as heads moving frequently or with large angles. To accurately recognise small-scale faces, the authors propose the multi-scale and multi-channel shallow convolution network, which combines a multi-scale mechanism on the feature map with a multi-channel convolution network for real-time face recognition. It performs face matching only in the pre-identified face areas instead of the whole image, therefore it is more efficient. Experimental results showed that the proposed real-time face recognition method detects and recognises faces correctly, and outperforms the existing methods in terms of effectiveness and efficiency.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.