Martin R. Oswald scite author profile

We propose a novel method for instance label segmentation of dense 3D voxel grids. We target volumetric scene representations which have been acquired with depth sensors or multi-view stereo methods and which have been processed with semantic 3D reconstruction or scene completion methods. The main task is to learn shape information about individual object instances in order to accurately separate them, including connected and incompletely scanned objects. We solve the 3D instance-labeling problem with a multi-task learning strategy. The first goal is to learn an abstract feature embedding which groups voxels with the same instance label close to each other while separating clusters with different instance labels from each other. The second goal is to learn instance information by estimating directional information of the instances' centers of mass densely for each voxel. This is particularly useful to find instance boundaries in the clustering post-processing step, as well as for scoring the quality of segmentations for the first goal. Both synthetic and real-world experiments demonstrate the viability of our approach. Our method achieves state-ofthe-art performance on the ScanNet 3D instance segmentation benchmark [4].

show abstract

NICE-SLAM: Neural Implicit Scalable Encoding for SLAM

Zhu

Peng

Larsson

et al. 2022

200

View full text Add to dashboard Cite

show abstract

A comprehensive shape analysis pipeline for stereoscopic measurements of particulate populations in suspension

et al. 2017

View full text Add to dashboard Cite

RoutedFusion: Learning Real-Time Depth Map Fusion

Weder

Schönberger

Pollefeys

et al. 2020

View full text Add to dashboard Cite

2017 ICCV Challenge: Detecting Symmetry in the Wild

Funk

Lee

Oswald

et al. 2017

View full text Add to dashboard Cite

Learning Priors for Semantic 3D Reconstruction

Cherabier

Schönberger

Oswald

et al. 2018

View full text Add to dashboard Cite

Image-Based 3D Modeling via Cheeger Sets

Töppe

Oswald

Cremers

et al. 2011

View full text Add to dashboard Cite

We propose a novel variational formulation for generating 3D models of objects from a single view. Based on a few user scribbles in an image, the algorithm automatically extracts the object silhouette and subsequently determines a 3D volume by minimizing the weighted surface area for a fixed user-specified volume. The respective energy can be efficiently minimized by means of convex relaxation techniques, leading to visually pleasing smooth surfaces within a matter of seconds. In contrast to existing techniques for single-view reconstruction, the proposed method is based on an implicit surface representation and a transparent optimality criterion, assuring high-quality 3D models of arbitrary topology with a minimum of user input.

show abstract

Semantically Informed Multiview Surface Refinement

Blaha

Rothermel

Oswald

et al. 2017

View full text Add to dashboard Cite

We present a method to jointly refine the geometry and semantic segmentation of 3D surface meshes. Our method alternates between updating the shape and the semantic labels. In the geometry refinement step, the mesh is deformed with variational energy minimization, such that it simultaneously maximizes photo-consistency and the compatibility of the semantic segmentations across a set of calibrated images. Label-specific shape priors account for interactions between the geometry and the semantic labels in 3D. In the semantic segmentation step, the labels on the mesh are updated with MRF inference, such that they are compatible with the semantic segmentations in the input images. Also, this step includes prior assumptions about the surface shape of different semantic classes. The priors induce a tight coupling, where semantic information influences the shape update and vice versa. Specifically, we introduce priors that favor (i) adaptive smoothing, depending on the class label; (ii) straightness of class boundaries; and (iii) semantic labels that are consistent with the surface orientation. The novel mesh-based reconstruction is evaluated in a series of experiments with real and synthetic data. We compare both to state-of-the-art, voxel-based semantic 3D reconstruction, and to purely geometric mesh refinement, and demonstrate that the proposed scheme yields improved 3D geometry as well as an improved semantic segmentation.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Martin R. Oswald

3D Instance Segmentation via Multi-Task Metric Learning

NICE-SLAM: Neural Implicit Scalable Encoding for SLAM

A comprehensive shape analysis pipeline for stereoscopic measurements of particulate populations in suspension

RoutedFusion: Learning Real-Time Depth Map Fusion

2017 ICCV Challenge: Detecting Symmetry in the Wild

Learning Priors for Semantic 3D Reconstruction

Image-Based 3D Modeling via Cheeger Sets

Semantically Informed Multiview Surface Refinement

Contact Info

Product

Resources

About