2019 IEEE International Conference on Image Processing (ICIP) 2019
DOI: 10.1109/icip.2019.8803050
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Task Learning for Segmentation of Building Footprints with Deep Neural Networks

Abstract: The increased availability of high resolution satellite imagery allows to sense very detailed structures on the surface of our planet. Access to such information opens up new directions in the analysis of remote sensing imagery. However, at the same time this raises a set of new challenges for existing pixel-based prediction methods, such as semantic segmentation approaches. While deep neural networks have achieved significant advances in the semantic segmentation of high resolution images in the past, most of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
149
0
3

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 210 publications
(170 citation statements)
references
References 38 publications
1
149
0
3
Order By: Relevance
“…We used state-of-the-art pyramid sampling pooling (Zhao et al, 2017) to aggregate spatial context and found that this architecture outperformed fully convolutional networks (Maggiori et al, 2017b) and Mask-RCNNs (Ohleyer, 2018) on building footprint segmentation from very highresolution images. We showed that building footprint predictions obtained by only using publicly-available mediumresolution radar and optical satellite images in Multi 3 Net almost performs on par with building footprint segmentation models that use very high-resolution satellite imagery (Bischke et al, 2017). Building on this result, we used Multi 3 Net to segment flooded buildings, fusing multiresolution, multisensor, and multitemporal satellite imagery, and showed that full-fusion outperformed alternative fusion approaches.…”
Section: Resultsmentioning
confidence: 84%
“…We used state-of-the-art pyramid sampling pooling (Zhao et al, 2017) to aggregate spatial context and found that this architecture outperformed fully convolutional networks (Maggiori et al, 2017b) and Mask-RCNNs (Ohleyer, 2018) on building footprint segmentation from very highresolution images. We showed that building footprint predictions obtained by only using publicly-available mediumresolution radar and optical satellite images in Multi 3 Net almost performs on par with building footprint segmentation models that use very high-resolution satellite imagery (Bischke et al, 2017). Building on this result, we used Multi 3 Net to segment flooded buildings, fusing multiresolution, multisensor, and multitemporal satellite imagery, and showed that full-fusion outperformed alternative fusion approaches.…”
Section: Resultsmentioning
confidence: 84%
“…It is also worth noting that our strategy succeeds while CRF did not improve classification results on this dataset as reported in Marmanis et al (2017). Bischke et al (2017). We report the overall accuracy (OA) and the intersection over union (IoU).…”
Section: Methodsmentioning
confidence: 75%
“…Inspired by this recent idea, we introduce a distance transform regression loss in a multi-task learning framework, which acts as a natural regularizer for semantic segmentation. This idea was tested independently from us in Bischke et al (2017), although only for building footprint extraction using a quantized distance transform that was roughly equivalent to standard multi-class classification task. Our method is simpler as it directly works on the distance transform using a true regression.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…And the decoder network first use max-pooling indices generated from the corresponding encoder to enhance the location information. In [18], Bischke et al use SegNet with a new cascaded multi-task loss to further preserve semantic segmentation boundaries in high resolution satellite images.…”
Section: Segnetmentioning
confidence: 99%