Breast Cancer Prediction Using Fine Needle Aspiration Features and Upsampling with Supervised Machine Learning

Shafique, Rahman; Rustam, Furqan; Choi, Gyu Sang; Díez, Isabel de la Torre; Mahmood, Arif; Lipari, Vivían; Velasco, Carmen Lili Rodríguez; Ashraf, Imran

doi:10.3390/cancers15030681

Cited by 21 publications

(11 citation statements)

References 40 publications

(46 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In this study, we tested several machine learning algorithms which have significant applications in different domains, such as the health care [ 40 ], Internet of Things (IoT) [ 41 ], machine vision [ 42 ], edge computing [ 43 ], education [ 44 , 45 ], and many others. In order to conduct a fair comparative evaluation of our proposed SSC model for the detection of thyroid disease, we chose the following machine learning classifiers: RF due to its effectiveness, interpretability, non-parametric nature, and high accuracy rate across a range of data types; GBM, which has various benefits including adaptability, robust tolerance to anomalous inputs, and high accuracy; AdaBoost, since it is less susceptible to overfitting; LR, because its training and implementation processes are simple; and Support Vector Classifier (SVC), that has advantages including efficiently handling high dimensional data [ 46 ]. For these algorithms to perform at their maximum, we optimized their hyperparameters.…”

Section: Methodsmentioning

confidence: 99%

SSC: The novel self-stack ensemble model for thyroid disease prediction

2024

PLoS ONE

View full text Add to dashboard Cite

Thyroid disease presents a significant health risk, lowering the quality of life and increasing treatment costs. The diagnosis of thyroid disease can be challenging, especially for inexperienced practitioners. Machine learning has been established as one of the methods for disease diagnosis based on previous studies. This research introduces a novel and more effective technique for predicting thyroid disease by utilizing machine learning methodologies, surpassing the performance of previous studies in this field. This study utilizes the UCI thyroid disease dataset, which consists of 9172 samples and 30 features, and exhibits a highly imbalanced target class distribution. However, machine learning algorithms trained on imbalanced thyroid disease data face challenges in reliably detecting minority data and disease. To address this issue, re-sampling is employed, which modifies the ratio between target classes to balance the data. In this study, the down-sampling approach is utilized to achieve a balanced distribution of target classes. A novel RF-based self-stacking classifier is presented in this research for efficient thyroid disease detection. The proposed approach demonstrates the ability to diagnose primary hypothyroidism, increased binding protein, compensated hypothyroidism, and concurrent non-thyroidal illness with an accuracy of 99.5%. The recommended model exhibits state-of-the-art performance, achieving 100% macro precision, 100% macro recall, and 100% macro F1-score. A thorough comparative assessment is conducted to demonstrate the viability of the proposed approach, including several machine learning classifiers, deep neural networks, and ensemble voting classifiers. The results of K-fold cross-validation provide further support for the efficacy of the proposed self-stacking classifier.

show abstract

Section: Methodsmentioning

confidence: 99%

SSC: The novel self-stack ensemble model for thyroid disease prediction

2024

PLoS ONE

View full text Add to dashboard Cite

show abstract

“…Medical image analysis domains, on the other hand, do not have access to such big datasets. Consequently, depending on the need to expand the amount of data, different augmentation techniques have been used in the existing literature [ 26 , 27 , 28 ]. In this study, the size of the training dataset was increased using these techniques.…”

Section: Methodsmentioning

confidence: 99%

Nerve Root Compression Analysis to Find Lumbar Spine Stenosis on MRI Using CNN

Shahzadi,

Ali,

Majeed

et al. 2023

Diagnostics

Self Cite

View full text Add to dashboard Cite

Lumbar spine stenosis (LSS) is caused by low back pain that exerts pressure on the nerves in the spine. Detecting LSS is a significantly important yet difficult task. It is detected by analyzing the area of the anteroposterior diameter of the patient’s lumbar spine. Currently, the versatility and accuracy of LSS segmentation algorithms are limited. The objective of this research is to use magnetic resonance imaging (MRI) to automatically categorize LSS. This study presents a convolutional neural network (CNN)-based method to detect LSS using MRI images. Radiological grading is performed on a publicly available dataset. Four regions of interest (ROIs) are determined to diagnose LSS with normal, mild, moderate, and severe gradings. The experiments are performed on 1545 axial-view MRI images. Furthermore, two datasets—multi-ROI and single-ROI—are created. For training and testing, an 80:20 ratio of randomly selected labeled datasets is used, with fivefold cross-validation. The results of the proposed model reveal a 97.01% accuracy for multi-ROI and 97.71% accuracy for single-ROI. The proposed computer-aided diagnosis approach can significantly improve diagnostic accuracy in everyday clinical workflows to assist medical experts in decision making. The proposed CNN-based MRI image segmentation approach shows its efficacy on a variety of datasets. Results are compared to existing state-of-the-art studies, indicating the superior performance of the proposed approach.

show abstract

“…Proper encoding ensures that categorical variables are utilized appropriately by the model, leading to an enhancement in its performance as described in study [45]. The study [46]described the data preparation approaches like oversampling, under-sampling, and the development of synthetic samples that can successfully address class imbalance issues. Equilibrating the dataset improves the model's ability to learn from underrepresented classes and forecast accurately across all categories.…”

Section: Dataset Descriptionmentioning

confidence: 99%

Advancing Autonomous Vehicle Safety: Machine Learning to Predict Sensor-Related Accident Severity

Shafique,

Rustam,

Murtala

et al. 2024

IEEE Access

Self Cite

View full text Add to dashboard Cite

Autonomous vehicles (AVs) represent an exciting frontier in transportation, promising increased safety and efficiency on the roads. However, like any technological advancement, they are not immune to accidents. Understanding the severity of accidents involving AVs is crucial for enhancing their reliability and ensuring public trust in this transformative technology. To address this challenge, our study has employed cutting-edge natural language processing techniques combined with machine learning to predict the severity of accidents involving AVs. Our study has contributed significantly by creating a novel dataset derived from post-disengagement accident reports, covering the years 2019-2022. This dataset comprises detailed descriptions of accidents, sensor information, and other critical parameters. Moreover, we have introduced a novel approach called Multi-Distance Synthetic Technique (MDST) to balance the imbalanced nature of our dataset, which included only 334 samples due to the rarity of such accident data. Utilizing MDST for data balancing, we aimed to enhance the robustness of our analysis. Additionally, we employed Recursive Feature Selection (RFS) to extract a valuable feature set that was crucial in predicting accident severity. Leveraging this selected feature set, we trained an ensemble model, which remarkably outperformed expectations, achieving an impressive accuracy score of 0.92.

show abstract

Breast Cancer Prediction Using Fine Needle Aspiration Features and Upsampling with Supervised Machine Learning

Cited by 21 publications

References 40 publications

SSC: The novel self-stack ensemble model for thyroid disease prediction

SSC: The novel self-stack ensemble model for thyroid disease prediction

Nerve Root Compression Analysis to Find Lumbar Spine Stenosis on MRI Using CNN

Advancing Autonomous Vehicle Safety: Machine Learning to Predict Sensor-Related Accident Severity

Contact Info

Product

Resources

About