2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2017
DOI: 10.1109/cvprw.2017.73
|View full text |Cite
|
Sign up to set email alerts
|

3D Pose Regression Using Convolutional Neural Networks

Abstract: Abstract3D pose estimation is a key component of many important computer vision tasks such as autonomous navigation and 3D scene understanding. Most state-of-the-art approaches to 3D pose estimation solve this problem as a pose-classification problem in which the pose space is discretized into bins and a CNN classifier is used to predict a pose bin. We argue that the 3D pose space is continuous and propose to solve the pose estimation problem in a CNN regression framework with a suitable representation, data a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
85
0

Year Published

2018
2018
2019
2019

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 78 publications
(86 citation statements)
references
References 12 publications
1
85
0
Order By: Relevance
“…To avoid this problem, the newer PoseCNN architecture [38] is trained to predict 6D object pose from a single RGB image in multiple stages, by decoupling the translation and rotation predictors. A geodesic loss function more suitable for optimizing over 3D rotations have been suggested in [23]. Another way to address this issue has recently emerged.…”
Section: Related Workmentioning
confidence: 99%
“…To avoid this problem, the newer PoseCNN architecture [38] is trained to predict 6D object pose from a single RGB image in multiple stages, by decoupling the translation and rotation predictors. A geodesic loss function more suitable for optimizing over 3D rotations have been suggested in [23]. Another way to address this issue has recently emerged.…”
Section: Related Workmentioning
confidence: 99%
“…Compared to most previous works [33,25,17], our method does not require the intermediate prediction of 2D/3D key points. In addition, we assume a full perspective model, which is a more challenging setting than previous works of estimating discrete/continuous viewpoint angles (azimuth) [4] or recovering the rotation matrices only [14]. Our expected goal is that by projecting the fine-grained 3D model according to the regressed pose estimation, the projection can align well with the object in the 2D image.…”
Section: Fine-grained 2d Imagementioning
confidence: 99%
“…The number of discretization bins is chosen as b = 500 in our experiments. An alternative would be to directly regress each scalar using methods similar to [15,27], but we did not pursue that here.…”
Section: Implementation Detailsmentioning
confidence: 99%