The platform will undergo maintenance on Sep 14 at about 9:30 AM EST and will be unavailable for approximately 1 hour.
2021
DOI: 10.1155/2021/6155663
|View full text |Cite
|
Sign up to set email alerts
|

Total Organic Carbon Content Prediction in Lacustrine Shale Using Extreme Gradient Boosting Machine Learning Based on Bayesian Optimization

Abstract: The total organic carbon (TOC) content is a critical parameter for estimating shale oil resources. However, common TOC prediction methods rely on empirical formulas, and their applicability varies widely from region to region. In this study, a novel data-driven Bayesian optimization extreme gradient boosting (XGBoost) model was proposed to predict the TOC content using wireline log data. The lacustrine shale in the Damintun Sag, Bohai Bay Basin, China, was used as a case study. Firstly, correlation analysis wa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 33 publications
0
2
0
Order By: Relevance
“…The calculation efficiency of the XGBoost model generally decreases with increasing numbers of independent variables in the sample. When considering more independent variables, several variables are randomly selected to reorganize the learning samples so that the model can quickly process smaller learning samples (Liu et al, 2021). XGBoost avoids overfitting with high probability during the training process, thereby ensuring its reliability.…”
Section: Selection Of Machine Learning Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The calculation efficiency of the XGBoost model generally decreases with increasing numbers of independent variables in the sample. When considering more independent variables, several variables are randomly selected to reorganize the learning samples so that the model can quickly process smaller learning samples (Liu et al, 2021). XGBoost avoids overfitting with high probability during the training process, thereby ensuring its reliability.…”
Section: Selection Of Machine Learning Methodsmentioning
confidence: 99%
“…For each step, the loss function values must be calculated, and the objective function to obtain f(x) must be optimized. Finally, an optimal ensemble model is obtained based on the additive method (Liu et al, 2021). K-fold cross-validation was selected to optimize the model parameters in the present study.…”
Section: Figurementioning
confidence: 99%
“…To overcome the drawbacks of the current mathematical regression methods, the machine learning technique has been recently introduced for predicting TOC content [18,[20][21][22][23][24][25][26][27][28][29][30][31][32]. In these published works, several versions of machine learning models have been developed for TOC content estimation or other properties, including Bayesian regression, random forest (RF), fuzzy logic, neural network, support vector regression (SVR), decision tree and XGBoost, among others.…”
Section: Introductionmentioning
confidence: 99%