2020
DOI: 10.3390/rs12101544
|View full text |Cite
|
Sign up to set email alerts
|

U-Net-Id, an Instance Segmentation Model for Building Extraction from Satellite Images—Case Study in the Joanópolis City, Brazil

Abstract: Currently, there exists a growing demand for individual building mapping in regions of rapid urban growth in less-developed countries. Most existing methods can segment buildings but cannot discriminate adjacent buildings. Here, we present a new convolutional neural network architecture (CNN) called U-net-id that performs building instance segmentation. The proposed network is trained with WorldView-3 satellite RGB images (0.3 m) and three different labeled masks. The first is the building mask; the second is … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
16
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 42 publications
(19 citation statements)
references
References 25 publications
(36 reference statements)
1
16
0
Order By: Relevance
“…Reinforcing the findings of Uhl et al [25] specifically, our study highlights the value of large, quality training datasets for training DL semantic segmentation algorithms to recognize features in topographic maps. Lastly, our study reinforces the documented strong performance of the UNet semantic segmentation method for extracting features and classifying pixels from a wide variety of data sources to support varying mapping tasks (for example, [54,55,[62][63][64][65][66][67]69,70,[91][92][93]). Such techniques, including future advancements and modifications, may eventually replace traditional ML methods, such as random forests (RF) and support vector machines (SVM), as operational standards in the field [46,[52][53][54][55].…”
Section: Discussionsupporting
confidence: 75%
See 1 more Smart Citation
“…Reinforcing the findings of Uhl et al [25] specifically, our study highlights the value of large, quality training datasets for training DL semantic segmentation algorithms to recognize features in topographic maps. Lastly, our study reinforces the documented strong performance of the UNet semantic segmentation method for extracting features and classifying pixels from a wide variety of data sources to support varying mapping tasks (for example, [54,55,[62][63][64][65][66][67]69,70,[91][92][93]). Such techniques, including future advancements and modifications, may eventually replace traditional ML methods, such as random forests (RF) and support vector machines (SVM), as operational standards in the field [46,[52][53][54][55].…”
Section: Discussionsupporting
confidence: 75%
“…UNet and other semantic segmentation methods have been applied to a variety of feature extraction and classification problems and have also been applied to a variety of geospatial and remotely sensed data. For example, modifications of UNet have been applied to the mapping of general land cover change [62], coastal wetlands [63], palm trees [64], cloud and cloud shadows [65], urban buildings and change detection [66][67][68], roads [69], and landslides [70]. Generally, UNet and other FCNs have shown great promise due to their ability to model complex spatial patterns and context while generating data abstractions that generalize well to new data [54,55].…”
Section: Deep Learning Semantic Segmentationmentioning
confidence: 99%
“…With the development of DCNNs in recent years, many algorithms have been proposed for processing remote sensing images [25][26][27][28][29][30][31][32]. The fully convolutional network [33] (FCN) replaces the fully connected layers with convolutional layers, making it possible for large-scale dense prediction.…”
Section: Related Workmentioning
confidence: 99%
“…Promising building footprint detection approaches have been proposed in the literature. Wagner et al [40] presented a modified U-Net capable of discriminating between adjacent buildings. To incorporate the structure information of buildings, Hui et al [41] opted for a multi-task learning strategy, replacing the vanilla U-Net encoder with an Xception module.…”
Section: Semantic Segmentation Of Buildings and Roadsmentioning
confidence: 99%