Active learning, which has a strong impact on processing data prior to the classification phase, is an active research area within the machine learning community, and is now being extended for remote sensing applications. To be effective, classification must rely on the most informative pixels, while the training set should be as compact as possible. Active learning heuristics provide capability to select unlabeled data that are the "most informative" and to obtain the respective labels, contributing to both goals. Characteristics of remotely sensed image data provide both challenges and opportunities to exploit the potential advantages of active learning. We provide an overview of active learning methods, then review the latest techniques proposed to cope with the problem of interactive sampling of training pixels for classification of remotely sensed data with support vector machines. We discuss remote sensing specific approaches dealing with multi-source and spatially and time-varying data, and provide examples for high dimensional hyperspectral and very high resolution multispectral imagery.
The shear volumes of data generated from earth observation and remote sensing technologies continue to make major impact; leaping key geospatial applications into the dual data and compute intensive era. As a consequence, this rapid advancement poses new computational and data processing challenges. We implement a novel remote sensing data flow (RESFlow) for advanced machine learning and computing with massive amounts of remotely sensed imagery. The core contribution is partitioning massive amount of data based on the spectral and semantic characteristics for distributed imagery analysis. RESFlow takes advantage of both a unified analytics engine for large-scale data processing and the availability of modern computing hardware to harness the acceleration of deep learning inference on expansive remote sensing imagery. The framework incorporates a strategy to optimize resource utilization across multiple executors assigned to a single worker. We showcase its deployment across computationally and data-intensive on pixel-level labeling workloads. The pipeline invokes deep learning inference at three stages; during deep feature extraction, deep metric mapping, and deep semantic segmentation. The tasks impose compute intensive and GPU resource sharing challenges motivating for a parallelized pipeline for all execution steps. By taking advantage of Apache Spark, Nvidia DGX1, and DGX2 computing platforms, we demonstrate unprecedented compute speed-ups for deep learning inference on pixel labeling workloads; processing 21,028 Terrabytes of imagery data and delivering an output maps at area rate of 5.245sq.km/sec, amounting to 453,168 sq.km/day -reducing a 28 day workload to 21 hours.
Buildings in the developing world are inadequately mapped. Lack of such critical geospatial data adds unnecessary challenges to locating and reaching a large segment of the world’s most vulnerable population, impeding sustainability goals ranging from disaster relief to poverty reduction. Use of volunteered geographic information (VGI) has emerged as a widely accepted source to fill such voids. Despite its promise, availability of building maps for developing countries significantly lags behind demand. We present a new approach, coupling deep convolutional neural networks (CNNs) with VGI for automating building map generation from high-resolution satellite images for Kano state, Nigeria. Specifically, we trained a CNN with VGI building outlines of limited quality and quantity and generated building maps for a 50,000 km2 area. Resulting maps are in strong agreement with existing settlement maps and require a fraction of the manual input needed for the latter. The VGI-based maps will provide support across multiple facets of socioeconomic development in Kano state, and demonstrates potential advancements in current mapping capabilities in resource constrained countries.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.