2023
DOI: 10.3389/fdata.2023.1038283
|View full text |Cite
|
Sign up to set email alerts
|

A large-scale machine learning study of sociodemographic factors contributing to COVID-19 severity

Abstract: Understanding sociodemographic factors behind COVID-19 severity relates to significant methodological difficulties, such as differences in testing policies and epidemics phase, as well as a large number of predictors that can potentially contribute to severity. To account for these difficulties, we assemble 115 predictors for more than 3,000 US counties and employ a well-defined COVID-19 severity measure derived from epidemiological dynamics modeling. We then use a number of advanced feature selection techniqu… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 81 publications
0
0
0
Order By: Relevance
“…The RFR and XGBoost models were selected as the most relevant models for feature importance evaluation, frequently utilized by researchers [9], [20], [22], [23], [27], [35]. The K-means-Coefficient of Variance sensitivity analysis was developed as a more sensitive novel machine learning approach, and the Ordinary Least Squares Multifactor Regression methodology was introduced as a validation model.…”
Section: Discussion and Limitationsmentioning
confidence: 99%
See 1 more Smart Citation
“…The RFR and XGBoost models were selected as the most relevant models for feature importance evaluation, frequently utilized by researchers [9], [20], [22], [23], [27], [35]. The K-means-Coefficient of Variance sensitivity analysis was developed as a more sensitive novel machine learning approach, and the Ordinary Least Squares Multifactor Regression methodology was introduced as a validation model.…”
Section: Discussion and Limitationsmentioning
confidence: 99%
“…Other researchers focused their work on assessing non-clinical factors, such as demographics, travel, environmental factors (temperature, relative humidity, atmospheric pollutants, etc. ), capacity and health related county-level factors, vulnerable population scores, national socio-economic factors, and different epidemiological data [12], [30], [31], [33], [34], [35], [36].…”
Section: Introductionmentioning
confidence: 99%
“…It's worth noting that we incorporated the cubic root of the predictors as a predictive parameter. This was done to mitigate data skewness and bring the distribution closer to normal [100]. Henceforth, the results presented for the turbidity modeling are based on the selected parameters presented in Figures 2 and 3, and the cubic root of each predictor.…”
Section: Parameter Selectionmentioning
confidence: 99%