To address the problems of less semantic information and low measurement accuracy when the SSD (single shot multibox detector) algorithm detects small targets, an MPH-SSD (multiscale pyramid hybrid SSD) algorithm that integrates the attention mechanism and multiscale double pyramid feature enhancement is proposed in this paper. In this algorithm, firstly, the SSD algorithm is used to extract the feature map of small targets, and the shallow feature enhancement module is added to expand the receptive field of the shallow feature layer so as to enrich the semantic information in the feature layer for small targets and improve the expression ability of shallow features. The processed shallow feature layer and deep feature layer are fused at multiple scales, and the semantic information and location information are fused together to obtain a feature map with rich information. Secondly, the cascaded double pyramid structure is used to transfer from the deep layer to the shallow layer so that the context information between different feature layers can be effectively transferred and the feature information can be further strengthened. The hybrid attention mechanism can retain more context information in the network, adaptively adjust the feature map after addition and fusion, and reduce the background interference. The experimental analysis of MPH-SSD algorithm on Pascal VOC and MS COCO datasets shows that the map of this algorithm is 87.7% and 51.1%, respectively. The results show that the MPH-SSD algorithm can make better use of the feature information in the shallow feature layer in the process of small target detection and has better detection performance for small targets.
Because the lack of semantic information exchange between characteristic layers, the SSD (Single Shot multibox Detector) algorithm has insufficient detection performance. To address this problem, a detection algorithm called VPE-SSD (Visual Path Enhancement SSD) by incorporating a visual expansion mechanism and path syndication proposed in this paper. Firstly, a visual expansion mechanism is added to the shallow characteristic layer to increase the perceptual field. This enables the semantic information in the shallow layer to be more fully utilized by the network. It can also achieve the purpose of enhancing the expressiveness of the shallow feature layer. Then, the processed deep and shallow characteristic layers are fed into the path syndication module for bi-directional fusion. This improves the global information of the feature layers and generates multi-scale global feature maps. Next, to enhance the detailed information of deep characteristics and improve their expression, the deep characteristic enhancement module is applied to the last three characteristic maps. Finally, using the blended attention module to reduce the negative interference and improve the expression of characteristic maps during target detection. The experimental analysis of the VPE-SSD algorithm is conducted on VOC and COCO, and the mAP is 83.4% and 48.4%. From the result, VPE-SSD algorithm can make better use of the different size characteristic information which helps to improve the performance of the algorithm.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.