2022
DOI: 10.3390/s22134833
|View full text |Cite
|
Sign up to set email alerts
|

DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection

Abstract: Object detection plays a vital role in autonomous driving systems, and the accurate detection of surrounding objects can ensure the safe driving of vehicles. This paper proposes a category-assisted transformer object detector called DetectFormer for autonomous driving. The proposed object detector can achieve better accuracy compared with the baseline. Specifically, ClassDecoder is assisted by proposal categories and global information from the Global Extract Encoder (GEE) to improve the category sensitivity a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
19
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 24 publications
(19 citation statements)
references
References 52 publications
0
19
0
Order By: Relevance
“…Weighted full convolution layers are used in R-FCN approaches [14] to discover ROI and detect the category of objects as well as the information of their surroundings. With the use of deep learning algorithms, object detection approaches also seem promising for autonomous vehicles [15] and traffic scene object detection [16]. The You Only Look Once (YOLO) architecture processes 155 frames per second in real-time cases to produce quicker results.…”
Section: Related Workmentioning
confidence: 99%
“…Weighted full convolution layers are used in R-FCN approaches [14] to discover ROI and detect the category of objects as well as the information of their surroundings. With the use of deep learning algorithms, object detection approaches also seem promising for autonomous vehicles [15] and traffic scene object detection [16]. The You Only Look Once (YOLO) architecture processes 155 frames per second in real-time cases to produce quicker results.…”
Section: Related Workmentioning
confidence: 99%
“…They tested the method along several augmentation techniques aim to have a more robust traffic sign detection under light condition changes. In a similar manner, DetecFormer ( Liang et al, 2022c ) was introduced by fusing local and global information in a global context encoder with the same purpose of traffic scene detection.…”
Section: Two-dimensional Object Detectionmentioning
confidence: 99%
“…It leverages a self-supervised learning framework seamlessly integrated with a robust optimization method. Unlike approaches tailored for specific applications, such as object detection, which often enhance deep learning generalization through domainspecific augmentations [34] or particular modifications to transformer architecture [35], our focus lies in assessing the broader efficacy of self-supervised learning representations in domain generalization.…”
Section: Related Workmentioning
confidence: 99%