2021 2nd Information Communication Technologies Conference (ICTC) 2021
DOI: 10.1109/ictc51749.2021.9441643
|View full text |Cite
|
Sign up to set email alerts
|

Object Detection in Complex Road Scenarios: Improved YOLOv4-Tiny Algorithm

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 19 publications
(8 citation statements)
references
References 6 publications
0
8
0
Order By: Relevance
“…A real-time object recognition system that can recognize multiple objects within an image frame. YOLO has evolved into new versions over time, e.g., YOLOv2, YOLOv3, and YOLOv4 [25]. YOLOv4 is an object detection algorithm that evolves the YOLOv3 model.…”
Section: Proposed Two-stage Deep-learning Based Designmentioning
confidence: 99%
See 1 more Smart Citation
“…A real-time object recognition system that can recognize multiple objects within an image frame. YOLO has evolved into new versions over time, e.g., YOLOv2, YOLOv3, and YOLOv4 [25]. YOLOv4 is an object detection algorithm that evolves the YOLOv3 model.…”
Section: Proposed Two-stage Deep-learning Based Designmentioning
confidence: 99%
“…Moreover, the performances for average accuracy and frames per second of YOLOv4 are increased compared to YOLOv3. YOLOv4-tiny [25,26] is a compressed version of YOLOv4. Based on YOLOv4, it is proposed to simplify the network structure, reduce parameters and enable development on embedded devices, and YOLOv4-tiny based model performs faster training and faster detection by comparison with YOLOv4.…”
Section: Proposed Two-stage Deep-learning Based Designmentioning
confidence: 99%
“…The role of Patch Merging is similar to the maximum pooling layer of CNN, but the maximum pooling used by CNN to achieve down sampling will discard some information, so using Patch Merging can increase the accuracy of the model. Another advantage of using Swin Transformer is that the core point of the algorithm uses the Swin Transformer Block, which consists of Window Multi-Head Self-Attention (W-MSA) [31][32][33] and Shifted-Window Multi-Head Self-Attention (SW-MSA) [34][35][36], as shown in Fig. 3.…”
Section: Backbone Selectionmentioning
confidence: 99%
“…Hence, for the purpose of drone detection, a deep convolutional neural networkbased model known as YOLO (You Only Look Once), essentially a state-of-theart object detection model, is chosen and trained on a dataset of drone images. The parameters of the model have been tuned in such a way so as to better YOLOv3-tiny [2], YOLOv4 [5], YOLOv4-tiny [31], and we compare them on the basis of some performance metrics to choose the one best suited for our problem.…”
Section: Drone Detectionmentioning
confidence: 99%