Machine learning is becoming increasingly prominent in healthcare. Although its benefits are clear, growing attention is being given to how machine learning may exacerbate existing biases and disparities. In this study, we introduce an
adversarial training framework that is capable of mitigating biases that may have been acquired through data collection or magnified during model development. For example, if one class is
over-represented, or errors and inconsistencies in clinical practice are reflected in the training data, then a model can learn and perpetuate these biases. To evaluate our adversarial training framework, we used the statistical definition of equalized odds. We evaluated our model for the task of rapidly predicting COVID-19 for patients presenting to
hospital emergency departments, and aimed to mitigate regional (hospital) and ethnic biases present. We trained our framework on a large, real-world COVID-19 dataset and demonstrated that
adversarial training improves outcome fairness (with respect to equalized odds), while still achieving clinically effective
screening performance (NPV > 0.98). We compared our method to the benchmark set by related previous work, and performed prospective and external validation on four independent hospital
cohorts. Our method can be generalized to any outcome, model, and definition of fairness.
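For reference, equalized odds requires the model's prediction to be conditionally independent of the protected attribute given the true outcome. Writing the prediction, true COVID-19 status, and protected attribute (here, hospital or ethnic group) as the usual random variables, the binary case reads:

\[
P(\hat{Y} = 1 \mid A = a, Y = y) = P(\hat{Y} = 1 \mid A = a', Y = y)
\quad \text{for all groups } a, a' \text{ and } y \in \{0, 1\},
\]

i.e., equal true positive rates and equal false positive rates across groups.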
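The training mechanism can be illustrated with a minimal sketch, assuming PyTorch; the class names, architectures, and `train_step` function below are illustrative assumptions, not the authors' implementation. A predictor is fit to the COVID-19 outcome while an adversary tries to recover the protected attribute from the predictor's output; giving the adversary the true label as well targets equalized odds rather than demographic parity.

```python
# Minimal sketch of adversarial training for equalized odds (assumes PyTorch).
# Names and architectures are illustrative, not the authors' implementation.
import torch
import torch.nn as nn

class Predictor(nn.Module):
    """Maps patient features to a COVID-19 risk logit."""
    def __init__(self, n_features: int):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(n_features, 32), nn.ReLU(),
                                 nn.Linear(32, 1))

    def forward(self, x):
        return self.net(x).squeeze(-1)  # logit of P(Y = 1 | x)

class Adversary(nn.Module):
    """Tries to recover the protected attribute (e.g. hospital or ethnic
    group) from the prediction *and* the true label; conditioning on the
    label is what targets equalized odds rather than demographic parity."""
    def __init__(self, n_groups: int):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(2, 32), nn.ReLU(),
                                 nn.Linear(32, n_groups))

    def forward(self, y_logit, y_true):
        return self.net(torch.stack([y_logit, y_true], dim=-1))

def train_step(pred, adv, opt_p, opt_a, x, y, group, lam=1.0):
    """One alternating update; y is a float {0,1} tensor, group is long."""
    bce, ce = nn.BCEWithLogitsLoss(), nn.CrossEntropyLoss()
    # 1) Adversary step: learn to predict the group from (prediction, label).
    opt_a.zero_grad()
    adv_loss = ce(adv(pred(x).detach(), y), group)
    adv_loss.backward()
    opt_a.step()
    # 2) Predictor step: fit the outcome while making the adversary fail,
    #    i.e. minimize the task loss minus lam times the adversary loss.
    opt_p.zero_grad()
    y_logit = pred(x)
    loss = bce(y_logit, y) - lam * ce(adv(y_logit, y), group)
    loss.backward()
    opt_p.step()
    return loss.item()
```

In practice the two updates alternate over mini-batches, and the weight `lam` controls the trade-off between fairness and predictive performance.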