2021
DOI: 10.3390/info13010005
|View full text |Cite
|
Sign up to set email alerts
|

Object Detection of Road Assets Using Transformer-Based YOLOX with Feature Pyramid Decoder on Thai Highway Panorama

Abstract: Due to the various sizes of each object, such as kilometer stones, detection is still a challenge, and it directly impacts the accuracy of these object counts. Transformers have demonstrated impressive results in various natural language processing (NLP) and image processing tasks due to long-range modeling dependencies. This paper aims to propose an exceeding you only look once (YOLO) series with two contributions: (i) We propose to employ a pre-training objective to gain the original visual tokens based on t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(14 citation statements)
references
References 32 publications
0
12
0
Order By: Relevance
“…RFBnet is constructed by SSD with VGG as the backbone network. It mainly adds dilated convolution on the basis of inception, thus effectively increasing the receptive field. Faster‐RCNN: it removes the selective search to generate the region proposal instead of using RPN (region proposal network) to generate the region candidate box. YOLO series: we re‐implement the models including YOLOv3 [27], YOLOv4 [6],Scaled‐YOLOv4‐large‐p5 [27], PP‐YOLO‐r18vd [24], YOLOX‐s [25], YOLOv5s [26], YOLOX‐tiny [25], YOLOv3‐tiny [27], YOLOv4‐tiny [27] and PP‐YOLO‐tiny [24] using the same data splits and experimental setting.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…RFBnet is constructed by SSD with VGG as the backbone network. It mainly adds dilated convolution on the basis of inception, thus effectively increasing the receptive field. Faster‐RCNN: it removes the selective search to generate the region proposal instead of using RPN (region proposal network) to generate the region candidate box. YOLO series: we re‐implement the models including YOLOv3 [27], YOLOv4 [6],Scaled‐YOLOv4‐large‐p5 [27], PP‐YOLO‐r18vd [24], YOLOX‐s [25], YOLOv5s [26], YOLOX‐tiny [25], YOLOv3‐tiny [27], YOLOv4‐tiny [27] and PP‐YOLO‐tiny [24] using the same data splits and experimental setting.…”
Section: Methodsmentioning
confidence: 99%
“…Recently, edge-device based objects detection has been raised more attention form the community boosted by the huge demand from the hundreds of millions edge devices, such as, vehicles, smart phones etc. The typical edge-device based object detectors are PP-YOLO [24], YOLOX [25], YOLOv5s [26], YOLOv4-tiny [27]. PP-YOLO replaced part of the convolution layer with deformable convolution, deleted the dropout of the backbone part then changed it to the detection head part And use a matrix NMS that runs faster than traditional NMS.…”
Section: Edge-device Based Object Detection Methodsmentioning
confidence: 99%
“…It achieved 50.0% AP on COCO dataset at a speed of 68.9 FPS on Tesla V100, exceeding YOLOv5-L by 1.8% AP. The system presented by Teerapong et al [42] uses YOLOX algorithm to detect road assets on the highways. To avoid using multiple cameras to detect objects from all four sides of the vehicle, the authors developed a solution that uses a 360-degree spherical camera to detect objects and other approaching vehicles.…”
Section: Related Workmentioning
confidence: 99%
“…Currently it is not used much in articles. Panboonyuen et al (2021) utilizing pre-training Vision Transformer (ViT) as a backbone, apply Feature Pyramid Network (FPN) decoder detection of Road Assets, It significantly outperforms other state-of-the-art (SOTA) detectors. Zhang et al (2021) used the YOLOX algorithm to detect vehicle targets in UAV images, and through a self-made dataset, the detection results surpassed traditional algorithms.…”
Section: Introductionmentioning
confidence: 99%