Change points are abrupt alterations in the distribution of sequential data. A change-point detection (CPD) model aims at quick detection of such changes. Classic approaches perform poorly for semi-structured sequential data because of the absence of adequate data representation learning. To deal with it, we introduce a principled differentiable loss function that considers the specificity of the CPD task. The theoretical results suggest that this function approximates well classic rigorous solutions. For such loss function, we propose an end-to-end method for the training of deep representation learning CPD models. Our experiments provide evidence that the proposed approach improves baseline results of change point detection for various data types, including real-world videos and image sequences, and improve representations for them. * This work is supported by the Russian Science Foundation (project 20-71-10135) Preprint. Under review.
During the directional drilling, a bit may sometimes go to a nonproductive rock layer due to the gap about 20m between the bit and high-fidelity rock type sensors. The only way to detect the lithotype changes in time is the usage of Measurements While Drilling (MWD) data. However, there are no general mathematical modeling approaches that both well reconstruct the rock type based on MWD data and correspond to specifics of the oil and gas industry. In this article, we present a data-driven procedure that utilizes MWD data for quick detection of changes in rock type. We propose the approach that combines traditional machine learning based on the solution of the rock type classification problem with change detection procedures rarely used before in Oil&Gas industry. The data come from a newly developed oilfield in the north of western Siberia. The results suggest that we can detect a significant part of changes in rock type reducing the change detection delay from 20 to 1.8 meters and the number of false positive alarms from 43 to 6 per well.
Majority of the accidents while drilling have a number of premonitory symptoms notable during continuous drilling support. Experts can usually recognize such symptoms, however, we are not aware of any system that can do this job automatically. We have developed a Machine learning algorithm which allows detecting anomalies using the drilling support data (drilling telemetry). The algorithm automatically extracts patterns of premonitory symptoms and then recognizes them during drilling. The machine learning model is based on Gradient Boosting decision trees. The model analyzes real time drilling parameters within a sliding 4-hour window. For each measurement, the model calculates the probability of an accident and warns about anomaly of particular type, if the probability exceeds the selected threshold. Our training sample comes from 20+ oilfields and consists of sections related to 80+ accidents of the following types: stuck pipe, mud loss, gas-oil-water show, washout of pipe string, failure of drilling tool, packing formation, that occurred while drilling, trip-in, trip-out, reaming. We have designed the prediction model to work during drilling new wells and to distinguish the normal drilling process from the faulty one. One can configure the anomaly threshold to balance amount of false alarms and the number of missed accidents. To evaluate quality of the model we measure such data science metrics as ROC AUC score and confusion matrices. While testing model can identify 24 accident from 30 with high confidence, whereas for the others there is still a room for improvement. Our findings suggest that including more accidents of underrepresented types will improve quality. Other data science metrics also support aptitude of the model. Finally, having data from multiple heterogeneous oilfields, we expect that the model will generalize well to new ones. This paper presents a good practice of development and implementation of a data-driven model for automatic supervision of continuous drilling. In particular, the model described in the paper will assist specialists with drilling accidents prediction, optimize their work with data and reduce the nonproductive time associated with the accidents by up to 20%.
A change points detection aims to catch an abrupt disorder in data distribution. Common approaches assume that there are only two fixed distributions for data: one before and another after a change point. Real-world data are richer than this assumption. There can be multiple different distributions before and after a change. We propose an approach that works in the multiple-distributions scenario. Our approach learn representations for semi-structured data suitable for change point detection, while a common classifiers-based approach fails. Moreover, our model is more robust, when predicting change points. The datasets used for benchmarking are sequences of images with and without change points in them.
One of the first steps during the investigation of geological objects is the interwell correlation. It provides information on the structure of the objects under study, as it comprises the framework for constructing geological models and assessing hydrocarbon reserves. Today, the detailed interwell correlation relies on manual analysis of well-logging data. Thus, it is time-consuming and of a subjective nature. The essence of the interwell correlation constitutes an assessment of the similarities between geological profiles. There were many attempts to automate the process of interwell correlation by means of rule-based approaches, classic machine learning approaches, and deep learning approaches in the past. However, most approaches are of limited usage and inherent subjectivity of experts. We propose a novel framework to solve the geological profile similarity estimation based on a deep learning model. Our similarity model takes well-logging data as input and provides the similarity of wells as output. The developed framework enables (1) extracting patterns and essential characteristics of geological profiles within the wells and (2) model training following the unsupervised paradigm without the need for manual analysis and interpretation of well-logging data. For model testing, we used two open datasets originating in New Zealand and Norway. Our data-based similarity models provide high performance: the accuracy of our model is 0.926 compared to 0.787 for baselines based on the popular gradient boosting approach. With them, an oil&gas practitioner can improve interwell correlation quality and reduce operation time.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.