The rapid spread of the coronavirus disease COVID-19 has imposed clinical and financial burdens on hospitals and governments attempting to provide patients with medical care and implement disease-controlling policies. The transmissibility of the disease was shown to be correlated with the patient’s viral load, which can be measured during testing using the cycle threshold (Ct). Previous models have utilized Ct to forecast the trajectory of the spread, which can provide valuable information to better allocate resources and change policies. However, these models combined other variables specific to medical institutions or came in the form of compartmental models that rely on epidemiological assumptions, all of which could impose prediction uncertainties. In this study, we overcome these limitations using data-driven modeling that utilizes Ct and previous number of cases, two institution-independent variables. We collected three groups of patients (n = 6296, n = 3228, and n = 12,096) from different time periods to train, validate, and independently validate the models. We used three machine learning algorithms and three deep learning algorithms that can model the temporal dynamic behavior of the number of cases. The endpoint was 7-week forward number of cases, and the prediction was evaluated using mean square error (MSE). The sequence-to-sequence model showed the best prediction during validation (MSE = 0.025), while polynomial regression (OLS) and support vector machine regression (SVR) had better performance during independent validation (MSE = 0.1596, and MSE = 0.16754, respectively), which exhibited better generalizability of the latter. The OLS and SVR models were used on a dataset from an external institution and showed promise in predicting COVID-19 incidences across institutions. These models may support clinical and logistic decision-making after prospective validation.
The transmission of the contagious COVID-19 is known to be highly dependent on individual viral dynamics. Since the cycle threshold (Ct) isthe only semi-quantitative viral measurement that could reflect infectivity, we utilized Ct values to forecast COVID-19 incidences. Our COVID-19 cohort (n=9531), retrieved from a single representative cross-sectional virology test center in Lebanon, revealed that low daily mean Ct values are followed by an increase in the number of national positive COVID-19 cases. A subset of the data was used to develop a deep neural network model, tune its hyperparameters, and optimize the weights for minimal mean square error of prediction. The final model s accuracy is reported by comparing its predictions with an unseen dataset. Our model was the first to capture the interaction of the previously reported Ct values with the upcoming number of COVID-19 cases and any temporal effects that arise from population dynamics. Our model was deployed as a publicly available and easy-to-use estimator to facilitate prospective validation. Our model has potential application in predicting COVID-19 incidences in other countries and in assessing post-vaccination policies. Aside from emphasizing patient responsibility in adopting early testing practices, this study proposed and validated viral load measurement as a rigid input that can enhance outcomes and precision of viral disease predicting models.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.