2006
DOI: 10.1186/1471-2105-7-228
|View full text |Cite
|
Sign up to set email alerts
|

A stable gene selection in microarray data analysis

Abstract: BackgroundMicroarray data analysis is notorious for involving a huge number of genes compared to a relatively small number of samples. Gene selection is to detect the most significantly differentially expressed genes under different conditions, and it has been a central research focus. In general, a better gene selection method can improve the performance of classification significantly. One of the difficulties in gene selection is that the numbers of samples under different conditions vary a lot.ResultsTwo no… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
35
0

Year Published

2006
2006
2022
2022

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 150 publications
(36 citation statements)
references
References 15 publications
(31 reference statements)
0
35
0
Order By: Relevance
“…Data for colon, SRBCT, leukemia and melanoma were used as they were. For other datasets, we adopted the technique suggested by Yang et al [15] and Ramaswamy et al [33]. For prostate dataset, floor value of 100 and a ceiling value of 16000 with a variation of the Max / Min ratio as 5 and Max-Min difference of 50 were used to filter the values.…”
Section: Experimental Setup and Datasetmentioning
confidence: 99%
See 1 more Smart Citation
“…Data for colon, SRBCT, leukemia and melanoma were used as they were. For other datasets, we adopted the technique suggested by Yang et al [15] and Ramaswamy et al [33]. For prostate dataset, floor value of 100 and a ceiling value of 16000 with a variation of the Max / Min ratio as 5 and Max-Min difference of 50 were used to filter the values.…”
Section: Experimental Setup and Datasetmentioning
confidence: 99%
“…Using the filter feature selection approach, many algorithms were proposed such as Relief [11], Relief-F [12], FOCUS, FOCUS-2 [13], Correlation based feature selection (CFS) [14], Fast Correlation based feature selection FCBS [8], FAST [6]. Algorithm by Yang et al [15] removes irrelevant features based on gene ranking methods of GS1 and GS2.…”
Section: Introductionmentioning
confidence: 99%
“…Many recent works have been conducted in identifying the discriminatory subset of genes. For example, Cho et al [24] apply the mean and standard deviation of the distances from each sample to the class center as criteria for classification; Yang et al [25] improve the method in [24] and bring inter-class variations into the algorithm; Cai et al [26] propose the clustered gene selection, which groups the genes via k -means clustering and picks up the top genes in each group that are closest to the centroid locations. These methods are simple and effective in some cases, but their heuristics are designed for continuous gene expression data, and are not directly applicable to discrete, and especially binary point mutation data.…”
Section: Introductionmentioning
confidence: 99%
“…For application purposes, these gene expression data must then be classified into various categories [2]. Together with classification methods, microarray technology has successfully guided clinical management decisions for individual patients, such as oncology [3, 4]. However, the sample size of the genetic dataset is usually much smaller than the number of genes, which extends into thousands or even tens of thousands [5].…”
Section: Introductionmentioning
confidence: 99%