Early Detection of Coronary Heart Disease Based on Machine Learning Methods

Yılmaz, Rüstem; Yağın, Fatma Hilal

doi:10.37990/medr.1011924

Cited by 35 publications

(32 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…When comparing the three classification models, we can see that the accuracy of Random Forest algorithm is 95.63% which is higher than the accuracy of the other two classification techniques. In addition, if we have contrasted the results with past research, as was done in part IV of the problem statement portion, we attained the best accuracy using Random Forest, which is greater than [1] and is 95.63%. Yilmaz et al [1] obtained the best accuracy from Random Forest, which is 92.90%.…”

Section: Resultsmentioning

confidence: 70%

“…Due to their objectives are comparable to ours, we used Yilmaz et al [1], Pal et al [2], and Rajdhan et al [8] as our foundation articles in this study. The authors [1], [2] and [8] have used all the attributes present in the dataset and achieve good accuracy from their classification models but they have used small size dataset to compare classification model accuracy as well as did not handle null values present in the dataset. In addition, they did not employ any method of feature selection to identify strongly correlated features that can enhance classification model accuracy.…”

Section: B Problem Statementmentioning

confidence: 97%

“…The identification of heart disease and many other major diseases has been the subject of several studies utilizing various data mining techniques. In order to predict coronary heart disease, Yilmaz et al [1] used a variety of machine learning techniques. In comparison to other models, Random Forest (RF) has provided the greatest accuracy, with a score of 92.90%.…”

Section: Literature Reviewmentioning

confidence: 99%

See 2 more Smart Citations

Prediction of Heart Disease Using Machine Learning Algorithms

Nayeem

Rana

Islam

2022

EJAI

View full text Add to dashboard Cite

Heart disease has become one of the alarming issues of death. It is accountable for fatty plaques in the arteries. If this fatal condition can be identified early, we can preserve many people’s arteries. Different types of supervised machine learning algorithms are applied in our research paper in order to predict heart disease existence in patient body. Besides this, we have focused on an efficient way to improve the performance of our applied classifiers. Imputing mean value technique is applied to handle null values present in our dataset. The features which are unnecessary are removed by using the info-gain feature selection technique. In order to calculate prediction accuracy, K-Nearest Neighbors (KNN), Naive Bayes and Random Forest are applied to the heart disease dataset. Accuracy, precision, recall, F1-score, and ROC are calculated which help us to compare the performance of the classification models. Handling null values on a particular column by imputing mean values of that column and our applied info-gain feature selection technique has aided us in improving the accuracy of our prediction models. Random Forest among all has given the best classification accuracy which is 95.63% with precision, recall, F1-score and ROC are 0.93, 0.92, 0.92 and 0.9, respectively.

show abstract

Section: Resultsmentioning

confidence: 70%

Section: B Problem Statementmentioning

confidence: 97%

Section: Literature Reviewmentioning

confidence: 99%

See 1 more Smart Citation

Prediction of Heart Disease Using Machine Learning Algorithms

Nayeem

Rana

Islam

2022

EJAI

View full text Add to dashboard Cite

show abstract

“…For classification, trees each leaf node is created to contain only members of one class. For regression, trees continue to divide until a small number of units remain in the leaf node (12).…”

Section: Random Forestmentioning

confidence: 99%

Artificial Intelligence-based Colon Cancer Prediction by Identifying Genomic Biomarkers

PAKSOY

Yağın

2022

Medical Records

Self Cite

View full text Add to dashboard Cite

Colon cancer is the third most common type of cancer worldwide. Because of the poor prognosis and unclear preoperative staging, genetic biomarkers have become more important in the diagnosis and treatment of the disease. In this study, we aimed to determine the biomarker candidate genes for colon cancer and to develop a model that can predict colon cancer based on these genes. Material and Methods: In the study, a dataset containing the expression levels of 2000 genes from 62 different samples (22 healthy and 40 tumor tissues) obtained by the Princeton University Gene Expression Project and shared in the figshare database was used. Data were summarized as mean ± standard deviation. Independent Samples T-Test was used for statistical analysis. The SMOTE method was applied before the feature selection to eliminate the class imbalance problem in the dataset. The 13 most important genes that may be associated with colon cancer were selected with the LASSO feature selection method. Random Forest (RF), Decision Tree (DT), and Gaussian Naive Bayes methods were used in the modeling phase. Results: All 13 genes selected by LASSO had a statistically significant difference between normal and tumor samples. In the model created with RF, all the accuracy, specificity, f1-score, sensitivity, negative and positive predictive values were calculated as 1. The RF method offered the highest performance when compared to DT and Gaussian Naive Bayes. Conclusion:In the study, we identified the genomic biomarkers of colon cancer and classified the disease with a high-performance model. According to our results, it can be recommended to use the LASSO+RF approach when modeling high-dimensional microarray data.

show abstract

“…ML methods are one of the technologies that have seen widespread use in disease diagnosis and clinical decision support systems in recent years, and they have a wide range of applications. ML methods are typically used to classify disease prediction 14,15 . ML, which has a wide range of applications in the field of health, is the foundation of applications in the determination of genetic diseases, early detection of cancer diseases, and pattern recognition in medical imaging 16 .…”

Section: Introductionmentioning

confidence: 99%

Classification of colorectal cancer based on gene sequencing data with XGBoost model: An application of public health informatics

Akbulut

Küçükakçali

Çolak

2022

Cukurova Medical Journal

View full text Add to dashboard Cite

Amaç: Bu çalışma, bir makine öğrenmesi yöntemi olan XGBoost yöntemi ile açık erişimli kolorektal kanser gen verilerini sınıflandırmayı ve temel genleri tanımlamayı amaçlamaktadır. Gereç ve Yöntem: Çalışmada açık erişimli kolorektal kanser gen veri seti kullanıldı. Veri seti, sağlıklı kontrollerden 10 mukozanın ve kolorektal kanserli 12 hastanın kolon mukozasının gen dizileme sonuçlarını içeriyordu. Hastalığı sınıflandırmak için makine öğrenmesi yöntemlerinden biri olan XGboost kullanıldı. Model performansı için doğruluk, dengelenmiş doğruluk, duyarlılık, seçicilik, pozitif tahmin değeri ve negatif tahmin değeri performans metrikleri değerlendirildi. Bulgular: Değişken seçim yöntemine göre 17 gen seçilmiş ve bu girdi değişkenleri ile modelleme yapılmıştır. Modelleme sonuçlarından elde edilen doğruluk, dengeli doğruluk, duyarlılık, özgüllük, pozitif tahmin değeri, negatif tahmin değeri ve F1 puanı sırasıyla %95.5, %95.8, %91.7, %1, %1 ve %90.9 ve %95.7 idi. XGboost tekniği sonucundan elde edilen değişken önemliliklerine göre, CYR61, NR4A, FOSB ve NR4A2 genleri kolorektal kanser için biyolojik belirteçler olarak kullanılabilir. Sonuç: Bu araştırma sonucunda kolorektal kanserle bağlantılı olabilecek genlerin yanı sıra hastalığa yönelik genetik biyobelirteçler de belirlendi. Gelecekte, tespit edilen genlerin güvenilirliği doğrulanabilir, bu genlere dayalı olarak terapötik prosedürler oluşturulabilir ve klinik pratikteki yararları belgelenebilir.

show abstract

Early Detection of Coronary Heart Disease Based on Machine Learning Methods

Cited by 35 publications

References 14 publications

Prediction of Heart Disease Using Machine Learning Algorithms

Prediction of Heart Disease Using Machine Learning Algorithms

Artificial Intelligence-based Colon Cancer Prediction by Identifying Genomic Biomarkers

Classification of colorectal cancer based on gene sequencing data with XGBoost model: An application of public health informatics

Contact Info

Product

Resources

About