The existence of widespread COVID-19 infections has prompted worldwide efforts to control and manage the virus, and hopefully curb it completely. One important line of research is the use of
machine learning
(ML) to understand and fight COVID-19. This is currently an active research field. Although there are already many surveys in the literature, there is a need to keep up with the rapidly growing number of publications on COVID-19-related applications of ML. This paper presents a review of recent reports on ML algorithms used in relation to COVID-19. We focus on the potential of ML for two main applications: diagnosis of COVID-19 and prediction of mortality risk and severity, using readily available clinical and laboratory data. Aspects related to algorithm types, training data sets, and feature selection are discussed. As we cover work published between January 2020 and January 2021, a few key points have come to light. The bulk of the machine learning algorithms used in these two applications are supervised learning algorithms. The established models are yet to be used in real-world implementations, and much of the associated research is experimental. The diagnostic and prognostic features discovered by ML models are consistent with results presented in the medical literature. A limitation of the existing applications is the use of imbalanced data sets that are prone to selection bias.
COVID-19 is a disease-causing coronavirus strain that emerged in December 2019 that led to an ongoing global pandemic. The ability to anticipate the pandemic’s path is critical. This is important in order to determine how to combat and track its spread. COVID-19 data is an example of time-series data where several methods can be applied for forecasting. Although various time-series forecasting models are available, it is difficult to draw broad theoretical conclusions regarding their relative merits. This paper presents an empirical evaluation of several time-series models for forecasting COVID-19 cases, recoveries, and deaths in Saudi Arabia. In particular, seven forecasting models were trained using autoregressive integrated moving average, TBATS, exponential smoothing, cubic spline, simple exponential smoothing Holt, and HoltWinters. The models were built using publicly available daily data of COVID-19 during the period of 24 March 2020 to 5 April 2021 reported in Saudi Arabia. The experimental results indicate that the ARIMA model had a smaller prediction error in forecasting confirmed cases, which is consistent with results reported in the literature, while cubic spline showed better predictions for recoveries and deaths. As more data become available, a fluctuation in the forecasting-accuracy metrics was observed, possibly due to abrupt changes in the data.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.