2021
DOI: 10.11591/ijece.v11i3.pp2631-2639
|View full text |Cite
|
Sign up to set email alerts
|

Expert cancer model using supervised algorithms with a LASSO selection approach

Abstract: One of the most critical issues of the mortality rate in the medical field in current times is breast cancer. Nowadays, a large number of men and women is facing cancer-related deaths due to the lack of early diagnosis systems and proper treatment per year. To tackle the issue, various data mining approaches have been analyzed to build an effective model that helps to identify the different stages of deadly cancers. The study successfully proposes an early cancer disease model based on five different supervise… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
3

Relationship

2
7

Authors

Journals

citations
Cited by 31 publications
(13 citation statements)
references
References 13 publications
0
13
0
Order By: Relevance
“…K-Fold CV divides data into 𝐾 folds. At each iteration, one-fold (𝐾) is used as test data set while training data set is resided folds (K1) in 𝐾 experiments [51], [52]. In this work, the value of 𝐾 = 10 folds, nine data sets for training and one for testing, then repeat this process ten times until all data has been evaluated.…”
Section: Resultsmentioning
confidence: 99%
“…K-Fold CV divides data into 𝐾 folds. At each iteration, one-fold (𝐾) is used as test data set while training data set is resided folds (K1) in 𝐾 experiments [51], [52]. In this work, the value of 𝐾 = 10 folds, nine data sets for training and one for testing, then repeat this process ten times until all data has been evaluated.…”
Section: Resultsmentioning
confidence: 99%
“…The scientific community has agreed on a number of criteria for evaluating the classification system's quality [22]- [24]. The confusion matrix is used to assess the study's success using the following key parameters: true-positive (TP), true-negative (TN), false-positive (FP), and false-negative (FN) Validity metrics such as accuracy, sensitivity/recall, specificity, F1-score, precision/positive predicted value (PPV), negative predicted value (NPV), false-negative rate (FNR), false-positive rate (FPR), false discovery rate (FDR), false omission rate (FOR), and Matthews correlation coefficient (MCC) can be calculated using these parameters [25]- [30].…”
Section: Performance Evaluationmentioning
confidence: 99%
“…As the name suggests, the algorithm is formed in a tree structure with the root node, branches and leaf nodes that indicate attributes, conditions and outcomes respectively [14]. Entropy as denoted in (1) shows the homogeneity as well as the purity of a dataset, and information gain is the change in an input's entropy, which is usually a reduction [15].…”
Section: Data Classification a Decision Treementioning
confidence: 99%
“…In recent years, a significant increase of various liver diseases has been observed around the globe. In India, the mortality rate because of the disease is 2.4% of the population [1]. There are more than 100 types of liver diseases among which cirrhosis is diagnosed when the liver cells are damaged and replaced by non-living scar tissues [2].…”
Section: Introductionmentioning
confidence: 99%