2018 International Conference on Content-Based Multimedia Indexing (CBMI) 2018
DOI: 10.1109/cbmi.2018.8516271
|View full text |Cite
|
Sign up to set email alerts
|

Overcoming Missing and Incomplete Modalities with Generative Adversarial Networks for Building Footprint Segmentation

Abstract: The integration of information acquired with different modalities, spatial resolution and spectral bands has shown to improve predictive accuracies. Data fusion is therefore one of the key challenges in remote sensing. Most prior work focusing on multi-modal fusion, assumes that modalities are always available during inference. This assumption limits the applications of multi-modal models since in practice the data collection process is likely to generate data with missing, incomplete or corrupted modalities. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 18 publications
(17 citation statements)
references
References 29 publications
0
14
0
Order By: Relevance
“…The typical generative networks include generative adversarial network(GAN) [8] and variational auto encoder [11]. For modality missing problem, a group of methods [2,3,19,26] utilize GAN or its varieties including cGAN [15] and cycleGAN [33] to generate the data of missing modalities. The CRA [21] use cascaded residual autoencoder adapted from stacked denoising autoencoder [24] to calculate the residual and reconstruct the corrupted multimodal data sequence.…”
Section: Generative Networkmentioning
confidence: 99%
“…The typical generative networks include generative adversarial network(GAN) [8] and variational auto encoder [11]. For modality missing problem, a group of methods [2,3,19,26] utilize GAN or its varieties including cGAN [15] and cycleGAN [33] to generate the data of missing modalities. The CRA [21] use cascaded residual autoencoder adapted from stacked denoising autoencoder [24] to calculate the residual and reconstruct the corrupted multimodal data sequence.…”
Section: Generative Networkmentioning
confidence: 99%
“…This may be attributed to the characteristics of the generative model or we have not found how to use GAN in some special applications. In [104], GAN was used to translate the input of RGB image into a synthetic representation of the missing one (synthetic depth), and make an improvement to semantic segmentation of building footprints with missing modalities. In reference [103], to compensate for the poor quality of the point clouds, in generator, the proposed method adds the crossmodal guidance from the side-output features of the RGB stream to the decoder network of depth stream.…”
Section: B Point Clouds and Optical Image Fusionmentioning
confidence: 99%
“…In some applications, the data collection process is likely to generate data with missing, incomplete, or corrupted modalities. In [140], the authors focus on semantic segmentation of building footprints with missing modalities, in which GAN are effectively used to synthesis the missing or incomplete data in the depth map.…”
Section: A Missing Data Reconstructionmentioning
confidence: 99%
“…The issue of a completely missing modality is emphasized in [12], where a hallucination network is used to recreate features from the missing modality using the available modality. Bischke et al [13] extend this idea by using GANs to instead recreate the missing modality and feeding into a segmentation network along with the original modality. [14] uses a different approach to handle missing modality for medical imaging fusion by training with dropout of modalities and using late fusion, which allows for independent training of separate modalities.…”
Section: Missing Modalities In Fusionmentioning
confidence: 99%