Íris Viana dos Santos Santana scite author profile

Íris Viana dos Santos Santana

4 Publications

10 Citation Statements Received

60 Citation Statements Given

Publications

Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach

Santana, Silveira, Sobrinho, et al. 2021. J Med Internet Res

Background: Controlling the COVID-19 outbreak in Brazil is a challenge due to the population's size and urban density, inefficient maintenance of social distancing and testing strategies, and limited availability of testing resources.

Objective: The purpose of this study is to effectively prioritize symptomatic patients for testing to assist early COVID-19 detection in Brazil, addressing problems related to inefficient testing and control strategies.

Methods: Raw data from 55,676 Brazilians were preprocessed, and the chi-square test was used to confirm the relevance of the following features: gender, health professional, fever, sore throat, dyspnea, olfactory disorders, cough, coryza, taste disorders, and headache. Classification models were implemented using the preprocessed data sets, supervised learning, and the following algorithms: multilayer perceptron (MLP), gradient boosting machine (GBM), decision tree (DT), random forest (RF), extreme gradient boosting (XGBoost), k-nearest neighbors (KNN), support vector machine (SVM), and logistic regression (LR). The models' performances were analyzed using 10-fold cross-validation, classification metrics, and the Friedman and Nemenyi statistical tests. The permutation feature importance method was applied to rank the features used by the highest-performing classification models.

Results: Gender, fever, and dyspnea were among the highest-ranked features used by the classification models. The comparative analysis identified MLP, GBM, DT, RF, XGBoost, and SVM as the highest-performing models, with similar results. KNN and LR were outperformed by the other algorithms. With ease of interpretation as an additional comparison criterion, the DT was considered the most suitable model.

Conclusions: The DT classification model can effectively (with a mean accuracy ≥ 89.12%) assist COVID-19 test prioritization in Brazil. The model can be applied to recommend prioritizing symptomatic patients for COVID-19 testing.
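The chi-square relevance check mentioned in the Methods can be illustrated with a minimal sketch. It assumes a single binary feature (for example, fever present or absent) cross-tabulated against a binary test result. The function name `chi_square_2x2` and the counts in `fever_table` are hypothetical and are not taken from the study's data; the sketch also assumes every row and column total is nonzero.

```python
from math import erfc, sqrt


def chi_square_2x2(table):
    """Pearson chi-square statistic and p-value for a 2x2 contingency table.

    No continuity correction is applied. Assumes nonzero row and column totals.
    """
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = [a + b, c + d]
    col_totals = [a + c, b + d]
    observed = [[a, b], [c, d]]

    stat = 0.0
    for i in range(2):
        for j in range(2):
            # Expected count under independence of feature and outcome.
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed[i][j] - expected) ** 2 / expected

    # With 1 degree of freedom, P(X > x) = erfc(sqrt(x / 2)).
    p_value = erfc(sqrt(stat / 2))
    return stat, p_value


# Hypothetical counts (illustration only, not the study's data).
# Rows: feature present / absent. Columns: test positive / negative.
fever_table = [[300, 200], [150, 350]]
stat, p_value = chi_square_2x2(fever_table)
print(f"chi-square = {stat:.3f}, p = {p_value:.3g}")
```

A small p-value would suggest the feature is associated with the test result and is worth keeping as a model input; the threshold to use is a separate choice that the abstract does not state.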

Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach (Preprint)

Santana, Silveira, Sobrinho, et al. 2021. Preprint

Background: Controlling the COVID-19 outbreak in Brazil is considered a challenge of continental proportions due to the high population and urban density, weak implementation and maintenance of social distancing strategies, and limited testing capabilities.

Objective: To help address this challenge, we present the implementation and evaluation of supervised machine learning (ML) models to assist COVID-19 detection in Brazil based on early-stage symptoms.

Methods: First, we preprocessed a Brazilian dataset composed mainly of early-stage symptoms and applied the chi-squared test to it for statistical analysis. We then implemented ML models using the random forest (RF), support vector machine (SVM), multilayer perceptron (MLP), k-nearest neighbors (KNN), decision tree (DT), gradient boosting machine (GBM), and extreme gradient boosting (XGBoost) algorithms. We evaluated the ML models using precision, accuracy score, recall, the area under the curve, and the Friedman and Nemenyi tests. Based on the comparison, we grouped the top five ML models and measured feature importance.

Results: The MLP model presented the highest mean accuracy score, at more than 97.85%, compared to GBM (> 97.39%), RF (> 97.36%), DT (> 97.07%), XGBoost (> 97.06%), KNN (> 95.14%), and SVM (> 94.27%). Based on the statistical comparison, we grouped MLP, GBM, DT, RF, and XGBoost as the top five ML models because their evaluation results are statistically indistinguishable. The importance the ML models assigned to features during prediction varied across gender, profession, fever, sore throat, dyspnea, olfactory disorder, cough, runny nose, taste disorder, and headache.

Conclusions: Supervised ML models can effectively assist decision making in medical diagnosis and public administration (e.g., testing strategies), based on early-stage symptoms that do not require advanced and expensive exams.
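The Friedman test used to compare the models can be sketched as follows. The sketch ranks the models within each cross-validation fold and computes the standard Friedman chi-square statistic from the mean ranks. The function names and the accuracy values in `fold_scores` are hypothetical, chosen only for illustration; they are not the study's results.

```python
def average_ranks(scores):
    """Rank scores within one fold: rank 1 = highest score, ties share the mean rank."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    ranks = [0.0] * len(scores)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the group while the next score ties with the current one.
        while end + 1 < len(order) and scores[order[end + 1]] == scores[order[pos]]:
            end += 1
        mean_rank = (pos + end) / 2 + 1
        for idx in order[pos:end + 1]:
            ranks[idx] = mean_rank
        pos = end + 1
    return ranks


def friedman_statistic(fold_scores):
    """Friedman chi-square statistic for n folds (rows) and k models (columns)."""
    n = len(fold_scores)
    k = len(fold_scores[0])
    rank_sums = [0.0] * k
    for scores in fold_scores:
        for j, rank in enumerate(average_ranks(scores)):
            rank_sums[j] += rank
    mean_ranks = [s / n for s in rank_sums]
    stat = 12 * n / (k * (k + 1)) * (
        sum(r * r for r in mean_ranks) - k * (k + 1) ** 2 / 4
    )
    return stat, mean_ranks


# Hypothetical per-fold accuracies for three models (illustration only).
fold_scores = [
    [0.97, 0.95, 0.90],
    [0.96, 0.94, 0.91],
    [0.98, 0.95, 0.89],
    [0.97, 0.96, 0.92],
]
stat, mean_ranks = friedman_statistic(fold_scores)
print(f"Friedman statistic = {stat:.2f}, mean ranks = {mean_ranks}")
```

The statistic is compared against a chi-square distribution with k - 1 degrees of freedom. When it indicates a difference, a post hoc test such as Nemenyi compares the mean ranks pairwise; models whose mean ranks differ by less than the critical difference are treated as statistically indistinguishable.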

A Brazilian dataset of symptomatic patients for screening the risk of COVID-19

Santana, Silveira, Sobrinho, et al. 2021

Modelos de Classificação de Aprendizagem de Máquina para Priorização de Teste de COVID-19 no Brasil [Machine Learning Classification Models for COVID-19 Test Prioritization in Brazil]

Santana, Sobrinho. 2021

The objective of this study is to effectively prioritize symptomatic patients for COVID-19 testing in Brazil and to assist early detection, addressing problems related to testing and control strategies. A total of 55,676 records were preprocessed, and the chi-square test was used to confirm the relevance of the following fields: gender, health professional, cough, coryza, headache, fever, sore throat, dyspnea, and olfactory and taste disorders. Classification models were implemented based on the data, supervised learning, and classical algorithms. One of the highest-performing models was the decision tree (mean accuracy ≥ 89.12%); because it is easy to interpret, it was considered the most suitable.
