2022
DOI: 10.1038/s41598-022-13498-2
|View full text |Cite
|
Sign up to set email alerts
|

Feature selection for global tropospheric ozone prediction based on the BO-XGBoost-RFE algorithm

Abstract: Ozone is one of the most important air pollutants, with significant impacts on human health, regional air quality and ecosystems. In this study, we use geographic information and environmental information of the monitoring site of 5577 regions in the world from 2010 to 2014 as feature input to predict the long-term average ozone concentration of the site. A Bayesian optimization-based XGBoost-RFE feature selection model BO-XGBoost-RFE is proposed, and a variety of machine learning algorithms are used to predic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 23 publications
(10 citation statements)
references
References 21 publications
(18 reference statements)
0
10
0
Order By: Relevance
“…Furthermore, it often outperforms other ML algorithms, which is why it has won several Kaggle competitions. However, it has a few limitations, including its high number of hyperparameters, making it difficult to tune [86], [87].…”
Section: Procedurementioning
confidence: 99%
“…Furthermore, it often outperforms other ML algorithms, which is why it has won several Kaggle competitions. However, it has a few limitations, including its high number of hyperparameters, making it difficult to tune [86], [87].…”
Section: Procedurementioning
confidence: 99%
“…Due to the intrinsic properties of eXtreme Gradient Boosting (XGBoost), feature selection in this research was based on the importance scores of the features. This technique has been widely used in various applications and has demonstrated satisfactory performance (Chen et al 2020a, Dai et al 2022, Zhang et al 2022. Consequently, the XGBoost model was used to fit the sample data.…”
Section: Feature Selectionmentioning
confidence: 99%
“…The Bayesian optimization framework is used to establish and update the probabilistic surrogate model based on previous evaluations of the objective function [ 47 ], and to actively select the evaluation points with the most global “potential” through the acquisition function. Bayesian optimization can effectively use prior information to judge the uncertainty of the unknown region and obtain the optimal solution within a few evaluations.…”
Section: Thermal Conductivity Identification Based On An Optimization...mentioning
confidence: 99%