2022
DOI: 10.1007/978-3-031-19769-7_38
|View full text |Cite
|
Sign up to set email alerts
|

RBP-Pose: Residual Bounding Box Projection for Category-Level Pose Estimation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
18
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 28 publications
(18 citation statements)
references
References 46 publications
0
18
0
Order By: Relevance
“…And then, they formulate the pose estimation problem as a camera-and world-space correspondence learning problem which explicitly aligns the coordinates [30]. Although considerable progress has been attained with prior-based methods [3,40,35], the requirements of collecting a large amount of ground-truth 3D models of target objects for obtaining the 3D prior and supervising training the prior deformation module hinders their practical applicability. This motivates us to investigate the mechanism that makes prior-based methods effective.…”
Section: Prior-based Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…And then, they formulate the pose estimation problem as a camera-and world-space correspondence learning problem which explicitly aligns the coordinates [30]. Although considerable progress has been attained with prior-based methods [3,40,35], the requirements of collecting a large amount of ground-truth 3D models of target objects for obtaining the 3D prior and supervising training the prior deformation module hinders their practical applicability. This motivates us to investigate the mechanism that makes prior-based methods effective.…”
Section: Prior-based Methodsmentioning
confidence: 99%
“…Fan et al [9] adopt a shape prior guided reconstruction network and a discriminator network to learn high-quality canonical representations. Zhang et al [40] use the shape priors as the indicator to predict pose and zero-mean residual vectors which encapsulate the spatial cues of the pose and enable geometry-guided consistency terms. Zhang et al [39] learn dense correspondences between input images and the canonical shape prior via surface embedding.…”
Section: Prior-based Methodsmentioning
confidence: 99%
“…Symmetry axis. Following FS-Net [29] some approaches [32,53,54] tailor the orientation representation to the predefined axis-symmetries. Specifically, these methods parameterize the rotation by two axes (similar to [69]), which is sufficient to construct a rotation matrix (i.e., the third vector must be orthogonal to the first two).…”
Section: Symmetry Handlingmentioning
confidence: 99%
“…Furthermore, self-supervised learning is employed by augmenting the input point sets in two different ways, which enables the use of a consistency loss when no ground truth is available. [54] generally follows FS-Net [29] in directly regressing the pose in a correspondence-free manner, but extends it with shape reconstruction and various additional losses. In particular, the pose is estimated by two parallel branches: first, by a simple regression branch (like FS-Net [29]); second, by predicting the deformation and assignment (like SPD [29]) and subsequently the residual pointwise bounding box projections (hence RBP; see [32] for details on pointwise bounding box projection).…”
Section: Asm-net Akizuki and Hashimotomentioning
confidence: 99%
“…robotics manipulation [2], [3], [4], and scene understanding [5]. The bottleneck in this task is scale-ambiguous distance perception [6], [7], [8] induced by the perspective projection onto the image plane.…”
mentioning
confidence: 99%