2022
DOI: 10.1109/lra.2022.3184779
|View full text |Cite
|
Sign up to set email alerts
|

Neural Scene Representation for Locomotion on Structured Terrain

Abstract: We propose a learning-based method to reconstruct the local terrain for locomotion with a mobile robot traversing urban environments. Using a stream of depth measurements from the onboard cameras and the robot's trajectory, the algorithm estimates the topography in the robot's vicinity. The raw measurements from these cameras are noisy and only provide partial and occluded observations that in many cases do not show the terrain the robot stands on. Therefore, we propose a 3D reconstruction model that faithfull… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 14 publications
(16 citation statements)
references
References 27 publications
0
13
0
Order By: Relevance
“…It ingests noisy and heavily occluded point clouds of the scene coming from depth cameras and LiDAR to produce a 3D estimate of the terrain centered around the robot as well as a compact latent representation of the scene. Similar to ( 37 ), the point cloud data are spatially and temporally processed using a fully convolutional encoder-decoder network architecture. The encoder takes in the point cloud and compresses it into a compact representation that is used by the decoder to complete the missing information and filter out noise.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…It ingests noisy and heavily occluded point clouds of the scene coming from depth cameras and LiDAR to produce a 3D estimate of the terrain centered around the robot as well as a compact latent representation of the scene. Similar to ( 37 ), the point cloud data are spatially and temporally processed using a fully convolutional encoder-decoder network architecture. The encoder takes in the point cloud and compresses it into a compact representation that is used by the decoder to complete the missing information and filter out noise.…”
Section: Methodsmentioning
confidence: 99%
“…In each occupied voxel, a feature describes the position of the centroid of the points that fall within that voxel, and the features of unoccupied voxels were set to 0. Despite the sparse implementation used in ( 37 ), the library did not scale well to the typical batch sizes required for reinforcement learning. Unexpectedly, a dense formulation can handle such a large batch size with satisfactory speeds, but this comes at the cost of high memory requirements (approximately 45 GB of GPU memory for a batch size of 4096).…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Contrary to explicit representations, implicitly defined, continuous, differentiable shape representations parameterized by MLP have emerged as a powerful paradigm. It is memory-efficient and easily deals with a wide variety of surface topologies without resolution limitation, enabling downstream tasks ranging from robotic perception [9] and 3D reconstruction to navigation [1]. Recently, research about RGBD-based incremental implicit mappings has made significant progress [18], [32], but they are all used for indoor scene reconstruction.…”
Section: Introductionmentioning
confidence: 99%