Johann Sawatzky scite author profile

Inferring the 3D geometry and the semantic meaning of occluded surfaces is a very challenging task. Recently, a first end-to-end learning approach has been proposed that completes a scene from a single depth image. The approach voxelizes the scene and predicts for each voxel whether it is occupied and, if so, its semantic class label. In this work, we propose a two-stream approach that leverages depth information and semantic information, the latter inferred from the RGB image, for this task. The approach constructs an incomplete 3D semantic tensor, which uses a compact three-channel encoding for the inferred semantic information, and uses a 3D CNN to infer the complete 3D semantic tensor. In our experimental evaluation, we show that the proposed two-stream approach substantially outperforms the state of the art for semantic scene completion.
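As a rough illustration of the "incomplete 3D semantic tensor" described above, the following sketch scatters labelled 3D points into a voxel grid and encodes each class as a three-channel vector. The abstract does not say how the three-channel encoding is defined, so the palette lookup here is an assumption, and the names `voxelize_labels`, `encode_three_channel`, and `palette` are hypothetical. The 3D CNN that completes the tensor is not shown.

```python
import numpy as np

def voxelize_labels(points, labels, grid_shape, voxel_size):
    """Scatter labelled 3D points into a dense label grid (0 = empty or unknown)."""
    grid = np.zeros(grid_shape, dtype=np.int64)
    # Convert metric coordinates to integer voxel indices.
    idx = np.floor(points / voxel_size).astype(np.int64)
    # Drop points that fall outside the grid.
    inside = np.all((idx >= 0) & (idx < np.array(grid_shape)), axis=1)
    idx, kept = idx[inside], labels[inside]
    # If several points share a voxel, one of their labels is kept.
    grid[idx[:, 0], idx[:, 1], idx[:, 2]] = kept
    return grid

def encode_three_channel(label_grid, palette):
    """Look up a 3-vector per class id (assumed encoding); returns shape (3, X, Y, Z)."""
    return np.moveaxis(palette[label_grid], -1, 0)

# Toy example: two points inside a 2x2x2 grid, one point outside it.
points = np.array([[0.1, 0.1, 0.1], [0.9, 0.1, 0.1], [5.0, 0.0, 0.0]])
labels = np.array([1, 2, 1])
palette = np.array([[0.0, 0.0, 0.0],   # class 0: empty
                    [1.0, 0.0, 0.0],   # class 1 (hypothetical code)
                    [0.0, 1.0, 0.0]])  # class 2 (hypothetical code)

label_grid = voxelize_labels(points, labels, (2, 2, 2), voxel_size=0.5)
semantic_tensor = encode_three_channel(label_grid, palette)
```

Voxels that no point reaches stay at class 0, which is what makes the tensor incomplete: the occluded regions are exactly the ones a completion network would have to fill in.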


Weakly Supervised Affordance Detection

Sawatzky

Srikantha

Gall

2017


Harvesting Information from Captions for Weakly Supervised Semantic Segmentation

Sawatzky

Banerjee

Gall

2019


Since acquiring pixel-wise annotations for training convolutional neural networks for semantic image segmentation is time-consuming, weakly supervised approaches that only require class tags have been proposed. In this work, we propose another form of supervision, namely image captions as they can be found on the Internet. These captions have two advantages. They do not require additional curation, as is the case for the clean class tags used by current weakly supervised approaches, and they provide textual context for the classes present in an image. To leverage such textual context, we deploy a multi-modal network that learns a joint embedding of the visual representation of the image and the textual representation of the caption. The network estimates text activation maps (TAMs) for class names as well as compound concepts, i.e. combinations of nouns and their attributes. The TAMs of compound concepts describing classes of interest substantially improve the quality of the estimated class activation maps, which are then used to train a network for semantic segmentation. We evaluate our method on the COCO dataset, where it achieves state-of-the-art results for weakly supervised image segmentation.
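One simple way to picture a text activation map is as the similarity between a text embedding and the visual feature at each image location in a shared embedding space. The abstract does not state how the TAMs are actually computed, so the cosine-similarity formulation below is an assumption, and `text_activation_map` and its inputs are hypothetical names with toy values.

```python
import numpy as np

def text_activation_map(pixel_features, text_embedding):
    """Cosine similarity between each spatial feature and one text embedding.

    pixel_features: array of shape (H, W, D) in the joint embedding space.
    text_embedding: array of shape (D,) for a class name or compound concept.
    Returns an (H, W) map with negative similarities clipped to zero.
    """
    eps = 1e-8  # guards against division by zero for all-zero features
    f = pixel_features / (np.linalg.norm(pixel_features, axis=-1, keepdims=True) + eps)
    t = text_embedding / (np.linalg.norm(text_embedding) + eps)
    return np.clip(f @ t, 0.0, None)

# Toy 2x2 feature map with 2-dimensional embeddings.
features = np.array([[[1.0, 0.0], [0.0, 1.0]],
                     [[1.0, 1.0], [-1.0, 0.0]]])
concept = np.array([1.0, 0.0])  # stand-in embedding for one concept

tam = text_activation_map(features, concept)
```

Locations whose features point in the same direction as the concept embedding get values near 1, orthogonal or opposing ones get 0, so the map highlights regions that match the text.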


Two Stream 3D Semantic Scene Completion

Garbade

Chen

Sawatzky

et al. 2018

Preprint


Ex Paucis Plura: Learning Affordance Segmentation from Very Few Examples

Sawatzky

Garbade

Gall

2019


