Proceedings of the 28th ACM International Conference on Multimedia 2020
DOI: 10.1145/3394171.3413970
|View full text |Cite
|
Sign up to set email alerts
|

Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation

Abstract: Despite the previous success of object analysis, detecting and segmenting a large number of object categories with a long-tailed data distribution remains a challenging problem and is less investigated. For a large-vocabulary classifier, the chance of obtaining noisy logits is much higher, which can easily lead to a wrong recognition. In this paper, we exploit prior knowledge of the relations among object categories to cluster fine-grained classes into coarser parent classes, and construct a classification tre… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
41
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 76 publications
(48 citation statements)
references
References 41 publications
0
41
0
Order By: Relevance
“…Object detection and instance segmentation has attracted increasing attention in the long-tailed learning community [19], [35], [68], [78], [103], [126], [159], [160], where most existing studies are conducted based on LVIS and COCO. In addition to these widelyused benchmarks, many other applications have also been explored, including urban scene understanding [26], [161], unmanned aerial vehicle detection [27], point cloud segmentation [162], [163].…”
Section: Image Detection and Segmentationmentioning
confidence: 99%
“…Object detection and instance segmentation has attracted increasing attention in the long-tailed learning community [19], [35], [68], [78], [103], [126], [159], [160], where most existing studies are conducted based on LVIS and COCO. In addition to these widelyused benchmarks, many other applications have also been explored, including urban scene understanding [26], [161], unmanned aerial vehicle detection [27], point cloud segmentation [162], [163].…”
Section: Image Detection and Segmentationmentioning
confidence: 99%
“…Later on, a series of works tried to alleviate classification bias. One line of works tried to improve the sample strategy [4,10,42], while another major line of works focus on loss engineering. Equalization Loss [32] and its improvements [31] down-weight the negative gradients for tail classes from head classes, while droploss [17] further takes the gradients from background into consideration.…”
Section: Related Workmentioning
confidence: 99%
“…Similarly, ACSL [38] only penalize negative classes over threshold. Separating the categories into some small groups [21,42] and simple calibration [26,47] helps, too. [28,36] modified the original soft-max function by embedding the distribution prior, achieving success.…”
Section: Related Workmentioning
confidence: 99%
“…Object detection is one of the most fundamental and challenging tasks in computer vision. In recent years, with the development of deep convolutional neural networks (CNN), many high-performance object detectors have been proposed [28,17,26,27,15,31]. In general, the current popular detectors can be divided into two categories: two-Figure 1.…”
Section: Introductionmentioning
confidence: 99%