2018
DOI: 10.48550/arxiv.1807.07226
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Monocular Object Orientation Estimation using Riemannian Regression and Classification Networks

Abstract: We consider the task of estimating the 3D orientation of an object of known category given an image of the object and a bounding box around it. Recently, CNN-based regression and classification methods have shown significant performance improvements for this task. This paper proposes a new CNN-based approach to monocular orientation estimation that advances the state of the art in four different directions. First, we take into account the Riemannian structure of the orientation space when designing regression … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 25 publications
0
3
0
Order By: Relevance
“…4 (c) shows an example. CNNs have been employed for 3D pose estimation from a single image [56]. Simultaneous depth and 3D orientation estimation of optical microrobots can be achieved via CNNs as well [53] (see Fig.…”
Section: Machine Learning-based Methodsmentioning
confidence: 99%
“…4 (c) shows an example. CNNs have been employed for 3D pose estimation from a single image [56]. Simultaneous depth and 3D orientation estimation of optical microrobots can be achieved via CNNs as well [53] (see Fig.…”
Section: Machine Learning-based Methodsmentioning
confidence: 99%
“…Several clear avenues for improvement exist. Predictions might be made on a pixel-wise basis [13] to improve spatial accuracy, and pose binning [44] might improve accuracy. The Cross Entropy Method could sample around proposals for assessment with a Q function [12].…”
Section: B Physical Implications and Future Workmentioning
confidence: 99%
“…For example, CNNs have been employed for the estimation of the 3D pose of an object from a single image. 25 A CNN-based method for estimating the 3D pose and depth of optically transparent microrobots has been developed. 26 However, for a neural network to be able to estimate the 3D pose of different microrobots, a large database should be collected, which consists of microscopic images of different microrobots with the combination of different depth values and poses.…”
mentioning
confidence: 99%