2015
DOI: 10.13187/mai.2015.6.171
|View full text |Cite
|
Sign up to set email alerts
|

Data-Mining Techniques to Classify Microarray Gene Expression Data Using Gene Selection by SVD and Information Gain

Abstract: Microarray data analysis can provide valuable information for cancer prediction and diagnosis. One of the challenges for microarray applications is to select an appropriate number of the most significant genes for data analysis. Besides, it is hard to accomplish a satisfactory classification results by using data mining techniques because of the dimensionality problem and the over-fitting problem. For this reason, it is desirable to select informative genes in order to improve classification accuracy of data m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
2

Year Published

2017
2017
2023
2023

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 23 publications
(13 citation statements)
references
References 11 publications
0
11
2
Order By: Relevance
“…Unlike that for lung cancer and DLBCL, the classification accuracy for colon cancer was relatively low, which was 90.32% for IG-SVM. Nonetheless the classification accuracy achieved in this study for colon cancer was still higher than that reported in another study (83.87%), which used the same number of selected genes by singular value decomposition and IG [18] . Since default settings were used for various tools, the possibility to achieve high accuracy with altered settings and selections can’t be ruled out.…”
Section: Resultscontrasting
confidence: 80%
See 2 more Smart Citations
“…Unlike that for lung cancer and DLBCL, the classification accuracy for colon cancer was relatively low, which was 90.32% for IG-SVM. Nonetheless the classification accuracy achieved in this study for colon cancer was still higher than that reported in another study (83.87%), which used the same number of selected genes by singular value decomposition and IG [18] . Since default settings were used for various tools, the possibility to achieve high accuracy with altered settings and selections can’t be ruled out.…”
Section: Resultscontrasting
confidence: 80%
“…We then applied the filter methods, namely IG, gain ratio, reliefF, and correlation, for gene selection. The required number of genes selected cannot be determined using a common standard, but several hundred of genes are demonstrated to be sufficient to achieve high accuracy [18] . Therefore, different numbers of genes are selected for different filters with the number of genes ranging from 1 to 200.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Colon cancer classification produced by ISIG is better than the ones reported in the literature [9,11,14,29,30]. Even though Alshamlan [5] and Motieghader et al [16] obtained slightly higher accuracy for Colon cancer classification, yet they are of larger gene's subset.…”
Section: Resultsmentioning
confidence: 78%
“…Further, in the classification of colon cancer, a sparse representation based method is proposed in [3] which provide 91.94% accuracy; nevertheless with a very huge gene subset. Moreover, the accuracies for colon cancer dataset and SRBCT are greater than [9,29]. At the same time the performance of SRBCT is greater than that of reported in [30].…”
Section: B Experimental Resultsmentioning
confidence: 86%