An image attention model and its application to image quality assessment are discussed in this paper. The attention model is based on rarity quantification, which is related to self-information to attract the attention in an image. It is relatively simpler than the others but results in taking more consideration of global contrasts between a pixel and the whole image. The visual attention model is used to develop a local distortion predictor, named color visual differences predictor (CVDP), in color images in order to effectively detect luminance and color distortions.
This paper proposes a method for detecting a helmet for the safety of workers from risk factors and a mask worn indoors and verifying a worker's identity while wearing a helmet and mask for security. The proposed method consists of a part for detecting the worker's helmet and mask and a part for verifying the worker's identity. An algorithm for helmet and mask detection is generated by transfer learning of Yolov5's s-model and m-model. Both models are trained by changing the learning rate, batch size, and epoch. The model with the best performance is selected as the model for detecting masks and helmets. At a learning rate of 0.001, a batch size of 32, and an epoch of 200, the s-model showed the best performance with a mAP of 0.954, and this was selected as an optimal model. The worker's identification algorithm consists of a facial feature extraction part and a classifier part for the worker's identification. The algorithm for facial feature extraction is generated by transfer learning of Facenet, and SVM is used as the classifier for identification. The proposed method makes trained models using two datasets, a masked face dataset with only a masked face, and a mixed face dataset with both a masked face and an unmasked face. And the model with the best performance among the trained models was selected as the optimal model for identification when using a mask. As a result of the experiment, the model by transfer learning of Facenet and SVM using a mixed face dataset showed the best performance. When the optimal model was tested with a mixed dataset, it showed an accuracy of 95.4%. Also, the proposed model was evaluated as data from 500 images of taking 10 people with a mobile phone. The results showed that the helmet and mask were detected well and identification was also good.
This paper proposes a human action recognition algorithm which can be efficiently applied to a real-time intelligent surveillance system. This method models the background, obtains the difference image between input image and the modeled background image, extracts the silhouette of human object from input image, and recognizes human action by using coordinates of object, directions of that and accumulated moving regions of that. The human actions recognized in this study amount to a total of 8 type of actions, which include walking, raising an arm (left, right), raising a leg (left, right), sitting and crouching. The proposed method has been experimented for 8 different movements using 4 people using video input of a webcam and it has shown good results in terms of recognizing human action.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.