2021
DOI: 10.48550/arxiv.2104.13534
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

PAFNet: An Efficient Anchor-Free Object Detector Guidance

Ying Xin,
Guanzhong Wang,
Mingyuan Mao
et al.

Abstract: Object detection is a basic but challenging task in computer vision, which plays a key role in a variety of industrial applications. However, object detectors based on deep learning usually require greater storage requirements and longer inference time, which hinders its practicality seriously. Therefore, a trade-off between effectiveness and efficiency is necessary in practical scenarios. Considering that without constraint of pre-defined anchors, anchorfree detectors can achieve acceptable accuracy and infer… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 35 publications
(57 reference statements)
0
2
0
Order By: Relevance
“…It uses Gaussian kernels in both object localization and size regression, which allows the network to encode more training samples and accelerate the training process. PAFNet [25] extended TTFNet by using a better pre-trained model and combining several existing tricks, such as exponential moving average [26] and CutMix [27].…”
Section: Object Detection Based On Deep Learningmentioning
confidence: 99%
“…It uses Gaussian kernels in both object localization and size regression, which allows the network to encode more training samples and accelerate the training process. PAFNet [25] extended TTFNet by using a better pre-trained model and combining several existing tricks, such as exponential moving average [26] and CutMix [27].…”
Section: Object Detection Based On Deep Learningmentioning
confidence: 99%
“…Wang [10] introduced a pyramid structure into the transformer framework, using a progressive shrinking strategy to control the scale of feature maps. While these models demonstrate outstanding detection accuracy, they heavily rely on powerful GPUs to achieve rapid detection speed [11]. This poses a significant challenge in achieving a balance between accuracy and inference speed on mobile devices with limited computational resources [12][13][14].…”
Section: Introductionmentioning
confidence: 99%
“…Many prior works have performed this in two stages [17,24,25,34,37]: the localization stage finds bounding boxes of potential objects, which are then fed to a classification stage that classifies each object within a finite set of categories. In Faster R-CNN [34], for example, the classification stage is conditioned on the boxes generated from a region proposal network (RPN) and have achieved great success on several datasets; however, despite their success, these networks have shown limitations in detecting objects with large sizing variability and require exhaustive tuning for cross-dataset generalization [4,39,46]. Furthermore, by conditioning the classification stage on localization, the proposals generated by the localization stage tend to overfit the training class categories, so they tend not to be accurate on objects unavailable in the training set [11,14].…”
Section: Introductionmentioning
confidence: 99%