2019
DOI: 10.1371/journal.pone.0212333
|View full text |Cite
|
Sign up to set email alerts
|

A hybrid gene selection algorithm based on interaction information for microarray-based cancer classification

Abstract: We address gene selection and machine learning methods for cancer classification using microarray gene expression data. Due to the high dimensionality of microarray data, traditional gene selection algorithms are filter-based, focusing on intrinsic properties of the data such as distance, dependency, and correlation. These methods are fast but select far too many genes to use for the classification task. In this work, we present a new hybrid filter-wrapper gene subset selection algorithm that is an improved mo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
16
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
2
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 33 publications
(17 citation statements)
references
References 33 publications
(31 reference statements)
0
16
0
Order By: Relevance
“…We tested our model on a previously reported colon cancer dataset [ 4 , 16 , 35 , 36 ] ( .) containing the expression of 2000 genes with highest minimal intensity across 62 tissues.…”
Section: Resultsmentioning
confidence: 99%
“…We tested our model on a previously reported colon cancer dataset [ 4 , 16 , 35 , 36 ] ( .) containing the expression of 2000 genes with highest minimal intensity across 62 tissues.…”
Section: Resultsmentioning
confidence: 99%
“…A classification framework applied to cancer gene expression profiles was done by Hijazi and Chan [ 21 ]. A hybrid gene selection algorithm based on interaction information technology was utilized for microarray-based colon cancer classification [ 22 ]. A gene selection methodology based on clustering for classification tasks in colon cancer was done by Garzon and Gonzalez [ 23 ].…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, for selecting 116 a feature, they proposed to use χ 2 statistics by showing that all three terms follow χ 2 117 distribution. Moreover, even though it has few good characteristics, by incorporating 118 the term redundancy in gene expression data, informative genes might be discarded [11]. 119 Another issue with gene selection for cancer classification, in contrast to traditional 120 feature selection methods in machine learning, is that the set of genes selected should be 121 biologically relevant to the disease under study.…”
Section: Introduction 18mentioning
confidence: 99%
“…However, this method selected more genes than the wrapper or hybrid algorithms. To 56 solve the limitations of IGIS, improved Interaction information-Guided Incremental 57 Selection (IGIS+) [11] was proposed where the first gene is selected based on the highest 58 accuracy and utilizes Cohen's d test to add a new gene into the selected gene set. One 59 major limitation is that it uses several handcrafted thresholds for Cohen's d effect.…”
mentioning
confidence: 99%