2022
DOI: 10.48550/arxiv.2208.03792
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Domain Randomization-Enhanced Depth Simulation and Restoration for Perceiving and Grasping Specular and Transparent Objects

Abstract: Commercial depth sensors usually generate noisy and missing depths, especially on specular and transparent objects, which poses critical issues to downstream depth or point cloud-based tasks. To mitigate this problem, we propose a powerful RGBD fusion network, Swin-DRNet, for depth restoration. We further propose Domain Randomization-Enhanced Depth Simulation (DREDS) approach to simulate an active stereo depth system using physically based rendering and generate a large-scale synthetic dataset that contains 13… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 32 publications
0
3
0
Order By: Relevance
“…ClearGrasp [1] employed depth completion for use with pose estimation on robotic grasping tasks, where they trained three DeepLabv3+ [19] models to learn transparency mask, surface normal, and boundary, respectively. Follow-on studies developed different approaches for depth completion, including implicit functions [20], NeRF reconstruction [2], combined point cloud and depth features [14], adversarial learning [21], multi-view geometry [22], RGB image completion [3], and sim2real transfer [6]. Weng et al [4] used transfer learning from the RGB to the depth sensor domain without completing raw depth.…”
Section: A Transparent Object Visual Perception For Manipulationmentioning
confidence: 99%
See 2 more Smart Citations
“…ClearGrasp [1] employed depth completion for use with pose estimation on robotic grasping tasks, where they trained three DeepLabv3+ [19] models to learn transparency mask, surface normal, and boundary, respectively. Follow-on studies developed different approaches for depth completion, including implicit functions [20], NeRF reconstruction [2], combined point cloud and depth features [14], adversarial learning [21], multi-view geometry [22], RGB image completion [3], and sim2real transfer [6]. Weng et al [4] used transfer learning from the RGB to the depth sensor domain without completing raw depth.…”
Section: A Transparent Object Visual Perception For Manipulationmentioning
confidence: 99%
“…However, these techniques require high-quality depth input provided by opaque objects with Lambertian light reflectance. One recent work by Dai et al [6] proposed a data generation system that simulates the noise on non-Lambertian surfaces of active stereo depth cameras and demonstrated its usage in category-level pose estimation and robotic grasping.…”
Section: B Opaque Object Category-level Pose Estimationmentioning
confidence: 99%
See 1 more Smart Citation