2021
DOI: 10.3390/s21217422
|View full text |Cite
|
Sign up to set email alerts
|

Small Object Detection in Traffic Scenes Based on YOLO-MXANet

Abstract: In terms of small objects in traffic scenes, general object detection algorithms have low detection accuracy, high model complexity, and slow detection speed. To solve the above problems, an improved algorithm (named YOLO-MXANet) is proposed in this paper. Complete-Intersection over Union (CIoU) is utilized to improve loss function for promoting the positioning accuracy of the small object. In order to reduce the complexity of the model, we present a lightweight yet powerful backbone network (named SA-MobileNe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 27 publications
(13 citation statements)
references
References 34 publications
(37 reference statements)
0
13
0
Order By: Relevance
“…Overfitting may occur during the training of a deep CNN. An effective way to avoid overfitting is to increase the amount of training data [ 43 , 44 , 45 ]. However, the acquisition of data requires considerable human and financial resources, and in some cases, images cannot be obtained.…”
Section: Methodsmentioning
confidence: 99%
“…Overfitting may occur during the training of a deep CNN. An effective way to avoid overfitting is to increase the amount of training data [ 43 , 44 , 45 ]. However, the acquisition of data requires considerable human and financial resources, and in some cases, images cannot be obtained.…”
Section: Methodsmentioning
confidence: 99%
“…IIHNet is a convolution-based network based on three key concepts: (i) information fusion; (ii) information exchange between different resolutions and modules; and (iii) a multiscale network. Furthermore, [123] proposed a lightweight network known as YOLO-MXANet which uses a powerful backbone based on the MobileNext [124] named SA-MobileNeXt, as a mean to incorporate both spatial and channel attention. Along with the addition of another scale from the shallower layers to improve the performance of SOD, the number of parameters was markedly reduced from 61.5 M to 13.8 M. The authors in [125] proposed a single stage SODNet composed of an adaptively spatial parallel convolution module (ASPConv) and a fast multi-scale fusion module (FMF) to optimize the spatial information extraction and to fuse the spatial and semantic information.…”
Section: Feature Learningmentioning
confidence: 99%
“…These works achieved high detection accuracy, but the detection speeds when used on traffic scenes demonstrated the limitations of these methods. He et al [15] used a one-stage detector called YOLO-MXANet to perform small object detection in traffic scenes to improve the detection speed. Based on mask R-CNN, the mask scoring (MS) R-CNN approach [16] uses a mask IoU head to learn the predicted mask quality and then obtain a new network structure; this approach combines the characteristics of the example with the corresponding predictive mask to enable regression to the mask IoU.…”
Section: A Object Detectionmentioning
confidence: 99%