2022
DOI: 10.1109/tpami.2022.3197152
|View full text |Cite
|
Sign up to set email alerts
|

Detecting Rotated Objects as Gaussian Distributions and Its 3-D Generalization

Abstract: Existing detection methods commonly use a parameterized bounding box (BBox) to model and detect (horizontal) objects and an additional rotation angle parameter is used for rotated objects. We argue that such a mechanism has fundamental limitations in building an effective regression loss for rotation detection, especially for high-precision detection with high IoU (e.g. 0.75). Instead, we propose to model the rotated objects as Gaussian distributions. A direct advantage is that our new regression loss regardin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 31 publications
(14 citation statements)
references
References 85 publications
0
14
0
Order By: Relevance
“…To solve the discontinuous boundaries issue originated by the angular periodicity or corner ordering, Yang et al proposed the Circular Smooth Label model (CSL) [21] and the Densely Coded Labels model (DCL) [22], which transform the angular prediction task from a regression to a classification problem. Although these methods have been broadly used for detecting oriented objects in aerial/satellite images [15], [16], we are unaware of existing datasets that provide OBB annotations for brightfield microscopic images. As mentioned before, using OBBs might severely reduce the annotation burden when compared to segmentation approaches, and still provide enough geometrical information to enable cell confluence and polarity estimation, which are broadly used on biological applications [1]- [3], [8].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…To solve the discontinuous boundaries issue originated by the angular periodicity or corner ordering, Yang et al proposed the Circular Smooth Label model (CSL) [21] and the Densely Coded Labels model (DCL) [22], which transform the angular prediction task from a regression to a classification problem. Although these methods have been broadly used for detecting oriented objects in aerial/satellite images [15], [16], we are unaware of existing datasets that provide OBB annotations for brightfield microscopic images. As mentioned before, using OBBs might severely reduce the annotation burden when compared to segmentation approaches, and still provide enough geometrical information to enable cell confluence and polarity estimation, which are broadly used on biological applications [1]- [3], [8].…”
Section: Related Workmentioning
confidence: 99%
“…1a. Meanwhile, the literature regarding object detection with OBBs is recent and scarce, and focuses mostly on niche applications such as aerial/satellite imagery [15], [16]. Although HBB and OBB detectors share the same main concepts in terms of network architecture (e.g., both can be achieved using two- or one-stage methods), moving from HBBs to OBBs adds some challenges, such as the ambiguous parametrization of OBBs, the difficulty in regressing angular information, and the adaptation of anchor-based methods [16].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Qian et al [24] directly regressed the four vertices of a quadrangle and devised a modulated rotation loss to optimize the regression process, largely eliminating the problems of angular periodicity and bounding box boundary discontinuity [2,22]. Apart from the above strategies, a series of works [34,35] model the rotated objects as Gaussian distributions and build the regression loss with certain distributional distance measurements, e.g., Wasserstein distance and Kullback-Leibler Divergence.…”
Section: Oriented Bounding Boxesmentioning
confidence: 99%
“…This paper proposes a loss function based on KFIoU: compressibility IoU (CPIoU). The process of rotating IoU calculation is strengthened as follows: in Figure 4, the GT box and predicted box are first converted into Gaussian distribution [9] 𝐺 , 𝐺 ;Kalman filter [6] was used to calculate the Gaussian distribution 𝐺 of the overlapping region between these two boxes.…”
Section: Compression Loss Functionmentioning
confidence: 99%