2024
DOI: 10.1109/access.2024.3363413
|View full text |Cite
|
Sign up to set email alerts
|

Enhancing UAV Aerial Image Analysis: Integrating Advanced SAHI Techniques With Real-Time Detection Models on the VisDrone Dataset

M. Muzammul,
Abdulmohsen Algarni,
Yazeed Yasin Ghadi
et al.

Abstract: This research presents a groundbreaking approach in aerial image analysis by integrating the Real-Time Detection and Recognition (RT-DETR-X) model with the Slicing Aided Hyper Inference (SAHI) methodology, utilizing the VisDrone-DET dataset. Aimed at enhancing the efficiency of drone technology across a spectrum of applications, including water conservancy, geological exploration, and military operations, this study focuses on harnessing the real-time, end-to-end object detection capabilities of RT-DETR-X. Cha… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 31 publications
0
1
0
Order By: Relevance
“…Comparing the improved YOLOv5 with state-of-the-art aerial image detection algorithms, such as the recent research from Zhejiang University—combining the real-time detection and recognition (RT-DETR-X) model with the SAHI (slice-assisted hyper-inference) method [ 44 ] for detection using the VisDrone-DET dataset—the improved YOLOv5 slightly lags behind the RT-DETR-X model, but YOLOv5 has superior processing speed and a higher FPS rate, which is crucial for real-time surveillance applications.…”
Section: Conclusion and Future Workmentioning
confidence: 99%
“…Comparing the improved YOLOv5 with state-of-the-art aerial image detection algorithms, such as the recent research from Zhejiang University—combining the real-time detection and recognition (RT-DETR-X) model with the SAHI (slice-assisted hyper-inference) method [ 44 ] for detection using the VisDrone-DET dataset—the improved YOLOv5 slightly lags behind the RT-DETR-X model, but YOLOv5 has superior processing speed and a higher FPS rate, which is crucial for real-time surveillance applications.…”
Section: Conclusion and Future Workmentioning
confidence: 99%