“…Sampling focuses therefore on target objects only; no background or negative samples are needed, which is an advantage over other approaches in a time‐critical, operational setting. Mask R‐CNN has been successfully applied in similar satellite image analysis tasks, for example for sparse and multi‐sized object detection in VHR images (Wang, Tao, Wang, Wang, & Li, 2019), building extraction (Wen et al., 2019) or within the DeepGlobe Building Extraction Challenge (Zhao, Kang, Jung, & Sohn, 2018).…”