UAV-YOLOv8: A Small-Object-Detection Model Based on Improved YOLOv8 for UAV Aerial Photography Scenarios

Wang, Gang; Chen, Yanfei; An, Pei; Hu, Hong; Hu, Jia; Huang, Tiange

doi:10.3390/s23167190

Cited by 164 publications

(44 citation statements)

References 59 publications

(68 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The ERGW-net consists of a backbone, neck, and head. The backbone leverages modified CSPDarknet53 [29] with a new block called iRepblock, which combines the advantages of InceptionNet [30] and ResNet [31] to improve feature acquisition while decreasing computational demands. The neck fuses and categorizes the infrared image features, and a new loss function is provided at the head to improve the network's ability to process small road objects from aerial infrared images.…”

Section: Methodsmentioning

confidence: 99%

“…The role of the backbone is to extract features from images. To improve the overall performance of the backbone, we propose a new Rep-style backbone structure based on modified CSPDarknet53 [29] from YOLOv8. In other words, we provide a Rep-style capability that supports the modified CSPDarknet53 by orchestrating ResNet, InceptionNet, and efficient RepVGG ConvNet capabilities [32].…”

Section: Backbonementioning

confidence: 99%

See 1 more Smart Citation

An Efficient Rep-Style Gaussian–Wasserstein Network: Improved UAV Infrared Small Object Detection for Urban Road Surveillance and Safety

Aibibu,

Lan,

Zeng

et al. 2023

Remote Sensing

View full text Add to dashboard Cite

Owing to the significant application potential of unmanned aerial vehicles (UAVs) and infrared imaging technologies, researchers from different fields have conducted numerous experiments on aerial infrared image processing. To continuously detect small road objects 24 h/day, this study proposes an efficient Rep-style Gaussian–Wasserstein network (ERGW-net) for small road object detection in infrared aerial images. This method aims to resolve problems of small object size, low contrast, few object features, and occlusions. The ERGW-net adopts the advantages of ResNet, Inception net, and YOLOv8 networks to improve object detection efficiency and accuracy by improving the structure of the backbone, neck, and loss function. The ERGW-net was tested on a DroneVehicle dataset with a large sample size and the HIT-UAV dataset with a relatively small sample size. The results show that the detection accuracy of different road targets (e.g., pedestrians, cars, buses, and trucks) is greater than 80%, which is higher than the existing methods.

show abstract

Section: Methodsmentioning

confidence: 99%

Section: Backbonementioning

confidence: 99%

An Efficient Rep-Style Gaussian–Wasserstein Network: Improved UAV Infrared Small Object Detection for Urban Road Surveillance and Safety

Aibibu,

Lan,

Zeng

et al. 2023

Remote Sensing

View full text Add to dashboard Cite

show abstract

“…The improved YOLOv8 network [16] incorporates the characteristics of small target objects in real-world scenarios by introducing blur and noise operations. Subsequently, the network introduces the Asymptotic Feature Pyramid Network (AFPN) [17] to highlight the impact of key layer features after feature fusion, addressing direct interaction issues between non-adjacent layers.UAV-YOLOv8 [18] has optimized YOLOv8 in several aspects. Firstly, it adopts Wise-IoU (WIoU) [19] v3 as the bounding box regression loss and employs a judicious gradient assignment strategy to focus the model more on samples with common quality, thereby improving the model's localization capability.…”

Section: Related Workmentioning

confidence: 99%

DHC-YOLO: Improved YOLOv8 for Lesion Detection in Brain Tumors, Colon Polyps, and Esophageal Cancer

Ren,

Song,

et al. 2024

Preprint

View full text Add to dashboard Cite

The detection of lesions in various diseases remains a challenging task in medical image processing, given the diverse morphologies, sizes, and boundaries of lesions associated with different illnesses. In this paper, we propose an advanced lesion detection model named DHC-YOLO, which integrates Multi-Scale Dilated attention (MSDA) and multi-head self-attention (MHSA) within the YOLOv8 network. The method also introduces an enhanced feature fusion through the Concatenation (Concat) operation in the Feature Pyramid Networks (FPN) structure of YOLOv8. The DHC-YOLO model achieves superior performance in lesion detection by effectively aggregating semantic information across various scales in the attended receptive field, reducing redundancy in self-attention mechanisms without the need for complex operations or additional computational costs. The incorporation of MHSA enhances the network’s ability to extract diverse features, and the Concat operation in FPN improves multi-scale feature fusion. Our evaluations on brain tumor, colonic polyp, and esophageal cancer datasets demonstrate the superiority of our method over baseline YOLOv8 and several state-of-the-art object detection models. Specifically, on the brain tumor dataset, DHC-YOLO achieves mAP50 and mAP50:95 scores of 88.3% and 73.5%, respectively; on the colonic polyp dataset, the scores are 88.8% and 67.2%; and on the esophageal cancer dataset, the scores are 51.3% and 20.7%. These compelling results underscore the robust performance of DHC-YOLO in lesion detection tasks.

show abstract

“…Furthermore, processing the context region of targets instead of simple pixel-by-pixel processing during training [24] yields an efficient multi-scale training approach. Another study [25] has demonstrated improved detection performance through the utilization of relevant information across different feature maps.…”

Section: Related Workmentioning

confidence: 99%

Enhancing Small Object Detection in Aerial Images: A Novel Approach with PCSG Model

An,

Duanmu,

et al. 2024

Aerospace

View full text Add to dashboard Cite

Generalized target detection algorithms perform well for large- and medium-sized targets but struggle with small ones. However, with the growing importance of aerial images in urban transportation and environmental monitoring, detecting small targets in such imagery has been a promising research hotspot. The challenge in small object detection lies in the limited pixel proportion and the complexity of feature extraction. Moreover, current mainstream detection algorithms tend to be overly complex, leading to structural redundancy for small objects. To cope with these challenges, this paper recommends the PCSG model based on yolov5, which optimizes both the detection head and backbone networks. (1) An enhanced detection header is introduced, featuring a new structure that enhances the feature pyramid network and the path aggregation network. This enhancement bolsters the model’s shallow feature reuse capability and introduces a dedicated detection layer for smaller objects. Additionally, redundant structures in the network are pruned, and the lightweight and versatile upsampling operator CARAFE is used to optimize the upsampling algorithm. (2) The paper proposes the module named SPD-Conv to replace the strided convolution operation and pooling structures in yolov5, thereby enhancing the backbone’s feature extraction capability. Furthermore, Ghost convolution is utilized to optimize the parameter count, ensuring that the backbone meets the real-time needs of aerial image detection. The experimental results from the RSOD dataset show that the PCSG model exhibits superior detection performance. The value of mAP increases from 97.1% to 97.8%, while the number of model parameters decreases by 22.3%, from 1,761,871 to 1,368,823. These findings unequivocally highlight the effectiveness of this approach.

show abstract

UAV-YOLOv8: A Small-Object-Detection Model Based on Improved YOLOv8 for UAV Aerial Photography Scenarios

Cited by 164 publications

References 59 publications

An Efficient Rep-Style Gaussian–Wasserstein Network: Improved UAV Infrared Small Object Detection for Urban Road Surveillance and Safety

An Efficient Rep-Style Gaussian–Wasserstein Network: Improved UAV Infrared Small Object Detection for Urban Road Surveillance and Safety

DHC-YOLO: Improved YOLOv8 for Lesion Detection in Brain Tumors, Colon Polyps, and Esophageal Cancer

Enhancing Small Object Detection in Aerial Images: A Novel Approach with PCSG Model

Contact Info

Product

Resources

About