In recent years, building change detection has made remarkable progress through using deep learning. The core problems of this technique are the need for additional data (e.g., Lidar or semantic labels) and the difficulty in extracting sufficient features. In this paper, we propose an end-to-end network, called the pyramid feature-based attention-guided Siamese network (PGA-SiamNet), to solve these problems. The network is trained to capture possible changes using a convolutional neural network in a pyramid. It emphasizes the importance of correlation among the input feature pairs by introducing a global co-attention mechanism. Furthermore, we effectively improved the long-range dependencies of the features by utilizing various attention mechanisms and then aggregating the features of the low-level and co-attention level; this helps to obtain richer object information. Finally, we evaluated our method with a publicly available dataset (WHU) building dataset and a new dataset (EV-CD) building dataset. The experiments demonstrate that the proposed method is effective for building change detection and outperforms the existing state-of-the-art methods on high-resolution remote sensing orthoimages in various metrics.
Change detection based on remote sensing images plays an important role in the field of remote sensing analysis, and it has been widely used in many areas, such as resources monitoring, urban planning, disaster assessment, etc. In recent years, it has aroused widespread interest due to the explosive development of artificial intelligence (AI) technology, and change detection algorithms based on deep learning frameworks have made it possible to detect more delicate changes (such as the alteration of small buildings) with the help of huge amounts of remote sensing data, especially high-resolution (HR) data. Although there are many methods, we still lack a deep review of the recent progress concerning the latest deep learning methods in change detection. To this end, the main purpose of this paper is to provide a review of the available deep learning-based change detection algorithms using HR remote sensing images. The paper first describes the change detection framework and classifies the methods from the perspective of the deep network architectures adopted. Then, we review the latest progress in the application of deep learning in various granularity structures for change detection. Further, the paper provides a summary of HR datasets derived from different sensors, along with information related to change detection, for the potential use of researchers. Simultaneously, representative evaluation metrics for this task are investigated. Finally, a conclusion of the challenges for change detection using HR remote sensing images, which must be dealt with in order to improve the model’s performance, is presented. In addition, we put forward promising directions for future research in this area.
Automatic extraction of region objects from high-resolution satellite imagery presents a great challenge, because there may be very large variations of the objects in terms of their size, texture, shape, and contextual complexity in the image. To handle these issues, we present a novel, deep-learning-based approach to interactively extract non-artificial region objects, such as water bodies, woodland, farmland, etc., from high-resolution satellite imagery. First, our algorithm transforms user-provided positive and negative clicks or scribbles into guidance maps, which consist of a relevance map modified from Euclidean distance maps, two geodesic distance maps (for positive and negative, respectively), and a sampling map. Then, feature maps are extracted by applying a VGG convolutional neural network pre-trained on the ImageNet dataset to the image X, and they are then upsampled to the resolution of X. Image X, guidance maps, and feature maps are integrated as the input tensor. We feed the proposed attention-guided, multi-scale segmentation neural network (AGMSSeg-Net) with the input tensor above to obtain the mask that assigns a binary label to each pixel. After a post-processing operation based on a fully connected Conditional Random Field (CRF), we extract the selected object boundary from the segmentation result. Experiments were conducted on two typical datasets with diverse region object types from complex scenes. The results demonstrate the effectiveness of the proposed method, and our approach outperforms existing methods for interactive image segmentation.
Highway markings (HMs) are representative elements of inventory digitalization in highway scenes. The accurate position, semantics, and maintenance information of HMs provide significant support for the intelligent management of highways. This article presents a robust and efficient approach for extracting, reconstructing, and degrading analyzing HMs in complex highway scenes. Compared with existing road marking extraction methods, not only can extract HMs in presence of wear and occlusion from point clouds, but we also perform a degradation analysis for HMs. First, the HMs candidate area is determined accurately by sophisticated image processing. Second, the prior knowledge of marking design rules and edge-based matching model that leverages the standard geometric template and radiometric appearance of HMs is used for accurately extracting and reconstructing solid lines and nonsolid markings of HMs, respectively. Finally, two degradation indicators are constructed to describe the completeness of the marking contour and consistency within the marking. Comprehensive experiments on two existing highways revealed that the proposed methods achieved an overall performance of 95.4% and 95.4% in the recall and 93.8% and 95.5% in the precision for solid line and nonsolid line markings, respectively, even with imperfect data. Meanwhile, a database can be established to facilitate agencies' efficient maintenance.
Building subclass segmentation, aimed at predicting classes of buildings (high-rise zone, low-rise zone, single highrise, and single low-rise) from satellite images, is beneficial in numerous applications, including human geography, urban planning, and humanitarian aid. However, problems such as complex scenes and similar characteristics of different building categories make it difficult for general models to balance the accuracy of localization and classification in building subclass segmentation. Therefore, this paper proposes a novel network for building subclass segmentation called building subclass segmentation network (BSSNet), which uses two subnetworks to divide and conquer the problem. The first network guides the building locations through binary building segmentation, called localization network. The spatial gradient fusion module in the localization network improves the binary segmentation result by supervising the spatial gradient map of prediction. The second network is a classification network, which predicts building subclasses. Intermediate features of the second network are optimized by contrastive learning loss to improve feature consistency. Finally, predictions of the two networks are combined to obtain the final result. The experimental results demonstrate that our BSSNet can perform significant improvements on the Hainan dataset we produced and the xBD dataset. In particular, the BSSNet achieves the best performance compared to current methods on the Hainan dataset.
The Jiaodong Peninsula hosts the main large gold deposits and was the first gold production area in China; multisource and multiscale geoscience datasets are available. The area is the biggest drilling mineral-exploration zone in China. This study used three-dimensional (3D) modeling, geology, and ore body and alteration datasets to extract and synthesize mineralization information and analyze the exploration targeting in the Zhaoxian gold deposit in the northwestern Jiaodong Peninsula. The methodology and results are summarized as follows: The regional Jiaojia fault is the key exploration criterion of the gold deposit. The compression torsion characteristics and concave–convex section zones in the 3D deep environment are the main indicators of mineral exploration using 3D geological and ore-body modeling in the Zhaoxian gold deposit. The hyperspectral detailed measurement, interpretation, and data mining used drill-hole data (>1000 m) to analyze the vectors and trends of the ore body and ore-forming fault and the alteration-zone rocks in the Zhaoxian gold deposit. The short-wave infrared Pos2200 values and illite crystallinity in the alteration zone can be used to identify 3D deep gold mineralization and potential targets for mineral exploration. This research methodology can be globally used for other deep mineral explorations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.