2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.01020
|View full text |Cite
|
Sign up to set email alerts
|

Tangent Space Backpropagation for 3D Transformation Groups

Abstract: We address the problem of performing backpropagation for computation graphs involving 3D transformation groups SO(3), SE(3), and Sim(3). 3D transformation groups are widely used in 3D vision and robotics, but they do not form vector spaces and instead lie on smooth manifolds. The standard backpropagation approach, which embeds 3D transformations in Euclidean spaces, suffers from numerical difficulties. We introduce a new library, which exploits the group structure of 3D transformations and performs backpropaga… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 22 publications
(8 citation statements)
references
References 26 publications
0
8
0
Order By: Relevance
“…Both the Fuzzy Metaballs and PyTorch3D optimization use an axis-angle, 3 parameter rotation estimation. These is some evidence suggesting PyTorch Autograd for SO(3) might be unstable at times in its native form [69]. Lastly, we suspect that the same scale invariance issues we address in Section 7.1 may exist for the PyTorch3D baselines.…”
Section: E Softrasterizer Performancementioning
confidence: 74%
“…Both the Fuzzy Metaballs and PyTorch3D optimization use an axis-angle, 3 parameter rotation estimation. These is some evidence suggesting PyTorch Autograd for SO(3) might be unstable at times in its native form [69]. Lastly, we suspect that the same scale invariance issues we address in Section 7.1 may exist for the PyTorch3D baselines.…”
Section: E Softrasterizer Performancementioning
confidence: 74%
“…It is based mainly on three works: The quaternion-based accurate pose representation for multi-source spatial data, the fast and accurate 2D/3D spatial relationship calculation of spatial objects and the high-speed asynchronous method for vector visualization. In the camera's quaternion-based pose transformation, any increment (adjacent pose transformation relation) is calculated on the tangent space SE(3) [98] at the identity matrix, and the obtained increment is exponentially mapped back to the global spatial pose of the moving AR device. This smoothed difference property of quaternions avoids singularities and ensures that small transformation matrices can also be represented, supporting smooth expression of differences between arbitrary directions.…”
Section: Discussionmentioning
confidence: 99%
“…Lie groups are widely used in robotics and vision to represent 2D/3D positions and rotations [32]. Due to their non-Euclidean geometry, it is difficult to apply them to deep learning, which primarily operates with Euclidean tensors, but recently there is growing interest in making them compatible [23,[52][53][54][55][56]. LieTorch [53] generalizes automatic differentiation on the Lie group tangent space through local parameterization around the identity, but the implementation is complex since every operation requires a custom kernel.…”
Section: Application Agnostic Interfacementioning
confidence: 99%
“…Due to their non-Euclidean geometry, it is difficult to apply them to deep learning, which primarily operates with Euclidean tensors, but recently there is growing interest in making them compatible [23,[52][53][54][55][56]. LieTorch [53] generalizes automatic differentiation on the Lie group tangent space through local parameterization around the identity, but the implementation is complex since every operation requires a custom kernel. In contrast, Theseus computes common Lie group operators, e.g., the exponential and logarithm map, inverse, composition, etc., in closed form, and provides their corresponding analytical derivatives on the tangent space.…”
Section: Application Agnostic Interfacementioning
confidence: 99%