2020
DOI: 10.1101/2020.09.18.20197319
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Uncovering clinical risk factors and prediction of severe COVID-19: A machine learning approach based on UK Biobank data

Abstract: Background: COVID-19 is a major public health concern. Given the extent of the pandemic, it is urgent to identify risk factors associated with severe disease. Accurate prediction of those at risk of developing severe infections is also important clinically. Methods: Based on the UK Biobank (UKBB data), we built machine learning(ML) models to predict the risk of developing severe or fatal infections, and to evaluate the major risk factors involved. We first restricted the analysis to infected subjects, then p… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 13 publications
(13 citation statements)
references
References 52 publications
(66 reference statements)
0
13
0
Order By: Relevance
“…A number of studies focused on prediction of severity or mortality have noted that the age is one of the top features that helps to predict the severity of cases [10][11][12][13]. In our study, age was ranked among top 10 features across all 25 features used in our prediction model.…”
Section: Experimental Setup and Resultsmentioning
confidence: 97%
See 1 more Smart Citation
“…A number of studies focused on prediction of severity or mortality have noted that the age is one of the top features that helps to predict the severity of cases [10][11][12][13]. In our study, age was ranked among top 10 features across all 25 features used in our prediction model.…”
Section: Experimental Setup and Resultsmentioning
confidence: 97%
“…Similarly, Wong and So [11] also used XGB with another dataset to predict the severe and the death cases and identify the risk factors associated with COVID-19. e dataset was retrieved from United Kingdom Biobank (UKBB) and includes 93 different variables collected between 16 March 2020 and 19 July 2020.…”
Section: Introductionmentioning
confidence: 99%
“…An advantage of using ML models is that non-linear and complex interactions can be considered, which may improve predictive performance over logistic models. We employed the same set of predictors as our previous work, and followed the same analysis strategy of hyper-parameter tuning and cross-validation to obtain predicted probabilities (please refer to 18 for details). Beta-calibration 19 was performed and the resulting average AUC was 0.622.…”
Section: Methodsmentioning
confidence: 99%
“…We took reference to the approach described in 16 to analyze the data with IPW. Following our recent work 18 which aims to predict COVID-19 severity with machine learning (ML), here we also employed an ML model (XGboost) to predict Pr(tested) based on a range of factors. An advantage of using ML models is that non-linear and complex interactions can be considered, which may improve predictive performance over logistic models.…”
Section: Inverse Probability Weighting Of the Probability Of Being Tementioning
confidence: 99%
See 1 more Smart Citation