2019
DOI: 10.48550/arxiv.1910.02550
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ClearGrasp: 3D Shape Estimation of Transparent Objects for Manipulation

Abstract: Transparent objects are a common part of everyday life, yet they possess unique visual properties that make them incredibly difficult for standard 3D sensors to produce accurate depth estimates for. In many cases, they often appear as noisy or distorted approximations of the surfaces that lie behind them. To address these challenges, we present ClearGrasp -a deep learning approach for estimating accurate 3D geometry of transparent objects from a single RGB-D image for robotic manipulation. Given a single RGB-D… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 52 publications
0
5
0
Order By: Relevance
“…We chose the ClearPose dataset [ 16 ] to test our approach because it contains several real image scenes, in total over 350,000 annotated images, with many transparent, symmetric objects within cluttered scenes and annotated ground-truth segmentation maps and ground-truth poses. Other known available real-world datasets are ClearGrasp [ 27 ] and TransCG [ 28 ], which use plastic glasses instead of realistic real glass and include almost no glass clutter, making them uninteresting for our comparison. The complete ClearPose dataset includes several sets of scenes of different types, for example, normal objects placed jumbled on a table but also objects with filled liquors or with an additional translucent cover.…”
Section: Datasetmentioning
confidence: 99%
“…We chose the ClearPose dataset [ 16 ] to test our approach because it contains several real image scenes, in total over 350,000 annotated images, with many transparent, symmetric objects within cluttered scenes and annotated ground-truth segmentation maps and ground-truth poses. Other known available real-world datasets are ClearGrasp [ 27 ] and TransCG [ 28 ], which use plastic glasses instead of realistic real glass and include almost no glass clutter, making them uninteresting for our comparison. The complete ClearPose dataset includes several sets of scenes of different types, for example, normal objects placed jumbled on a table but also objects with filled liquors or with an additional translucent cover.…”
Section: Datasetmentioning
confidence: 99%
“…In Equation ( 5), µ denotes the mean, and σ denotes the variance, while σ xy denotes the cross-correlation between x and y. In addition, the constants c 1 = (255 × 0.01) 2 and c 2 = (255 × 0.03) 2 are present to ensure numerical stability [11,31].…”
Section: Metricsmentioning
confidence: 99%
“…As is well known, deep neural networks require vast amounts of (mostly labelled) data to achieve good results. Specifically, the impressive results of deep learning on challenging computer vision tasks, such as depth estimation, surface normal estimation and segmentation have leveraged (mostly free) high-quality (and sometimes synthetic) datasets [1][2][3][4]. In contrast, underwater image data is expensive to acquire due to the equipment and transportation costs involved.…”
Section: Introductionmentioning
confidence: 99%
“…Training using CGI and simulation data is a promising approach for solving these limitations. 39 A major obstacle for this approach is the lack of research on generating photorealistic CGI images of chemical systems. The approach presented here can easily be used on UV and IR images, which can add another layer of information to the visible light images.…”
Section: ■ Conclusionmentioning
confidence: 99%