As a popular research direction in computer vision, deep learning technology has promoted breakthroughs in the field of object detection. In recent years, the combination of object detection and the Internet of Things (IoT) has been widely used in the fields of face recognition, pedestrian detection, unmanned driving, and customs detection. With the development of object detection, two different detection algorithms, one-stage, and two-stage have gradually formed. This paper mainly introduces the one-stage object detection algorithm. Firstly, the development process of the convolutional neural network is briefly reviewed, Then, the current mainstream one-stage object detection model is summarized. Based on YOLOv1, it is continuously optimized, and the improvements and shortcomings are summarized in detail. Finally, a summary is made based on the difficulties and challenges of one-stage object detection algorithms.
With the advent of big data era and the enhancement of computing power, Deep Learning has swept the world. Based on Convolutional Neural Network (CNN) image classification technique broke the restriction of classical image classification methods, becoming the dominant algorithm of image classification. How to use CNN for image classification has turned into a hot spot. After systematically studying convolutional neural network and in-depth research of the application of CNN in computer vision, this research briefly introduces the mainstream structural models, strengths and shortcomings, time/space complexity, challenges that may be suffered during model training and associated solutions for image classification. This research also compares and analyzes the differences between different methods and their performance on commonly used data sets. Finally, the shortcomings of Deep Learning methods in image classification and possible future research directions are discussed.
In this age of artificial intelligence, facial expression recognition is an essential pool to describe emotion and psychology. In recent studies, many researchers have not achieved satisfactory results. This paper proposed an expression recognition system based on ResNet-152. Statistical analysis showed our method achieved 96.44% accuracy. Comparative experiments show that the model is better than mainstream models. In addition, we briefly described the application of facial expression recognition technology in the IoT (Internet of things).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.