2022
DOI: 10.48550/arxiv.2203.03605
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
225
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 142 publications
(225 citation statements)
references
References 0 publications
0
225
0
Order By: Relevance
“…Following, Deformable DETR [51] develops a sparse attention module named deformable attention to fasten the convergence speed of DETR. Sharing the same spirit, many researchers [9,26,48,29] proposed various schemes to speed up the convergence of DETR. More recently, Wang et al pointed out that DETR has the issue of data hunger and proposed to solve it by augmenting the supervision.…”
Section: Related Workmentioning
confidence: 99%
“…Following, Deformable DETR [51] develops a sparse attention module named deformable attention to fasten the convergence speed of DETR. Sharing the same spirit, many researchers [9,26,48,29] proposed various schemes to speed up the convergence of DETR. More recently, Wang et al pointed out that DETR has the issue of data hunger and proposed to solve it by augmenting the supervision.…”
Section: Related Workmentioning
confidence: 99%
“…Other methods have introduced transformer architectures [38] that base their workflow on attention mechanisms and have obtained remarkable experimental results with a promising future [39,40].…”
Section: Network Modelsmentioning
confidence: 99%
“…Transformer-based networks were successfully applied in various computer vision tasks and held impressive results. Mask DINO [11] extends DINO [12] by adding a new branch to perform mask prediction for panoptic, instance and semantic segmentation. Content query embeddings from DINO [12] are used to perform mask classification for all segmentation tasks.…”
Section: Semantic Instance Segmentationmentioning
confidence: 99%
“…Mask DINO [11] extends DINO [12] by adding a new branch to perform mask prediction for panoptic, instance and semantic segmentation. Content query embeddings from DINO [12] are used to perform mask classification for all segmentation tasks. QueryInst [13] proposes a query-based end-to-end instance segmentation with parallel supervision on six dynamic mask heads.…”
Section: Semantic Instance Segmentationmentioning
confidence: 99%