Cardiovascular diseases (CVDs) are disorders of the heart and blood vessels and are a major cause of disability and premature death worldwide. Individuals at higher risk of developing CVD must be noticed at an early stage to prevent premature deaths. Advances in the field of computational intelligence, together with the vast amount of data produced daily in clinical settings, have made it possible to create recognition systems capable of identifying hidden patterns and useful information. This paper focuses on the application of Data Mining Techniques (DMTs) to clinical data collected during the medical examination in an attempt to predict whether or not an individual has a CVD. To this end, the CRossIndustry Standard Process for Data Mining (CRISP-DM) methodology was followed, in which five classifiers were applied, namely DT, Optimized DT, RI, RF, and DL. The models were mainly developed using the RapidMiner software with the assist of the WEKA tool and were analyzed based on accuracy, precision, sensitivity, and specificity. The results obtained were considered promising on the basis of the research for effective means of diagnosing CVD, with the best model being Optimized DT, which achieved the highest values for all the evaluation metrics, 73.54%, 75.82%, 68.89%, 78.16% and 0.788 for accuracy, precision, sensitivity, specificity, and AUC, respectively.
The development of malign cells that can grow in any part of the stomach, known as gastric cancer, is one of the most common causes of death worldwide. In order to increase the survival rate in patients with this condition, it is essential to improve the decision-making process leading to a better and more efficient selection of treatment strategies. Nowadays, with the large amount of information present in hospital institutions, it is possible to use data mining algorithms to improve the healthcare delivery. Thus, this study, using the CRISP methodology, aims to predict not only the mortality associated with this disease, but also the occurrence of any complication following surgery. A set of classification models were tested and compared in order to improve the prediction accuracy. The study showed that, on one hand, the J48 algorithm using oversampling is the best technique to predict the mortality in gastric cancer patients, with an accuracy of approximately 74%. On the other hand, the rain forest algorithm using oversampling presents the best results when predicting the possible occurrence of complications among gastric cancer patients after their in-hospital stays, with an accuracy of approximately 83%.
Strokes are neurological events that affect a certain area of the brain. Since brain controls fundamental body activities, brain cell deterioration and dead can lead to serious disabilities and poor life quality. This makes strokes the leading cause of disabilities and mortality worldwide. Patients that suffer strokes are hospitalized in order to be submitted to surgery and receive recovery therapies. Thus, it's important to predict the length of stay for these patients, since it can be costly to them and their family, as well as to the medical institutions. The aim of this study is to make a prediction on the number of days of patients' hospital stays based on information available about the neurological event that happened, the patient's health status and surgery details. A neural network was put to test with three attribute subsets with different sizes. The best result was obtained with the subset with fewer features obtaining a RMSE and a MAE of 5.9451 and 4.6354, respectively.
Hospitals generate large amounts of data on a daily basis, but most of the time that data is just an overwhelming amount of information which never transitions to knowledge. Through the application of Data Mining techniques it is possible to find hidden relations or patterns among the data and convert those into knowledge that can further be used to aid in the decision-making of hospital professionals. This study aims to use information about patients with diabetes, which is a chronic (long-term) condition that occurs when the body does not produce enough or any insulin. The main purpose is to help hospitals improve their care with diabetic patients and consequently reduce readmission costs. An hospital readmission is an episode in which a patient discharged from a hospital is admitted again within a specified period of time (usually a 30 day period). This period allows hospitals to verify that their services are being performed correctly and also to verify the costs of these re-admissions. The goal of the study is to predict if a patient who suffers from diabetes will be readmitted, after being discharged, using Machine Leaning algorithms. The final results revealed that the most efficient algorithm was Random Forest with 0.898 of accuracy.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.