The spatial structural patterns of geochemical backgrounds are often ignored in geochemical anomaly recognition, leading to the ineffective recognition of valuable anomalies in geochemical prospecting. In this contribution, a multi-convolutional autoencoder (MCAE) approach is proposed to deal with this issue, which includes three unique steps: (1) a whitening process is used to minimize the correlations among geochemical elements, avoiding the diluting of effective background information embedded in redundant data; (2) the Global Moran’s I index is used to determine the recognition domain of the background spatial structure for each element, and then the domain is used for convolution window size setting in MCAE; and (3) a multi-convolutional autoencoder framework is designed to learn the spatial structural pattern and reconstruct the geochemical background of each element. Finally, the anomaly score at each sampling location is calculated as the difference between the whitened geochemical features and the reconstructed features. This method was applied to the southwestern Fujian Province metalorganic belt in China, using the concentrations of Cu, Mn, Pb, Zn, and Fe2O3 measured from stream sediment samples. The results showed that the recognition domain determination greatly improved the quality of anomaly recognition, and MCAE outperformed several existing methods in all aspects. In particular, the anomalies from MCAE were the most consistent with the known Fe deposits in the area, achieving an area under the curve (AUC) of 0.89 and a forecast area of 17%.
Extreme temperature events are becoming more frequent due to global warming, and have critical effects on natural ecosystems, social and economic spheres, human production and life. Predicting changes in temperature extremes and trends under future climate scenarios helps to assess the impact of climate change accurately. Based on climate observations from 54 meteorological stations in the North China Plain and the projection data from seven general circulation models (GCMs) from the Coupled Model Intercomparison Project phase 6 (CMIP6), this paper researches nine representative extreme temperature indices under four typical climate scenarios. The aim is to reveal the temporal and spatial variations in extreme temperature indices in the North China Plain during the past (1971–2010) and the future (2061–2100). The results show that: using a support vector machine (SVM) to perform regression analysis on the multi-GCMs prediction results, the root mean square error (RMSE) and relative standard deviation (RSD) of the multi-model ensemble simulations obtained by the SVM method are lower than those of the arithmetic mean method and can better match the trend of the historical extreme temperature index; the extreme high temperature index is predicted to show a significant upward trend in the future, while the extreme low temperature index will decrease significantly; and there are significant spatial differences in the extreme temperature index in both historical and future periods, with the extreme temperature index under the high radiation forcing scenario (SSP585) showing the most considerable variation and the most significant spatial differences.
Anomaly identification is important to ensure the safe and stable operation of oil pipelines and prevent leaks. Leak identification is performed to divide abnormal samples from normal oil transfer samples in monitoring data, and it is a dichotomous problem. However, the traditional machine learning binary classification method is no longer suitable for identifying leak anomalies in complex production environments. The main problem is that leaks in production environments are very rare, and traditional methods cannot directly identify the leaking pattern with their generalizability. The recognized normal pattern lacks the ability to adapt to dynamic environmental changes and an artificial adjustment of the pump frequency, instrument calibration, and other monitoring data mutations. These are known as false anomalies, and they are difficult to distinguish from true anomalies. This results in a lower recall rate for leak anomaly identification and a higher rate of false positives. To solve this problem, this study proposes a leak anomaly recognition method based on the distinction between true and false anomalies. A one-class SVM is used to learn the normal working mode of oil pipelines, and the model is used to screen out suspected pipeline anomalies, namely, true and false anomalies. It increases the morphological difference between true and false anomaly curves by superimposing multisource data and uses similarity clustering to discover anomaly patterns that indicate leak events. The results show that the leakage anomaly recall rate is 100%, and the false anomaly exclusion rate is 83.49%. This method achieves real-time and efficient monitoring of pipeline leaking events in complex production environments, and it is practical for the application of machine learning methods in production environments.
In urban environmental management and public health evaluation efforts, there is an urgent need for fine-grained urban air quality monitoring. However, the high price and sparse distribution of air quality monitoring equipment make it difficult to develop effective and comprehensive fine-scale monitoring at the city scale. This has also led to air quality estimation methods based on incomplete monitoring data, which lack the ability to detect urban air quality differences within a neighborhood. To address this problem, this study proposes a refined urban air quality estimation method that fuses multisource spatio-temporal data. Based on the fact that urban air quality is easily affected by social activities, this method integrates meteorological data with urban social activity data to form a comprehensive environmental data set. It uses the spatio-temporal feature extraction model to extract the multi-source spatio-temporal features of the comprehensive environmental data set. Finally, the improved cascade forest algorithm is used to fit the relationship between the multisource spatio-temporal features and the air quality index (AQI) to construct an air quality estimation model, and the model is used to estimate the hourly PM2.5 index in Beijing on a 1 km × 1 km grid. The results show that the estimation model has excellent performance, and its goodness-of-fit (R2) and root mean square error (RMSE) reach 0.961 and 17.47, respectively. This method effectively achieves the assessment of urban air quality differences within a neighborhood and provides a new strategy for preventing information fragmentation and improving the effectiveness of information representation in the data fusion process.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.