2011
DOI: 10.6026/97320630007142
|View full text |Cite
|
Sign up to set email alerts
|

Random forest for gene selection and microarray data classification

Abstract: A random forest method has been selected to perform both gene selection and classification of the microarray data. In this embedded method, the selection of smallest possible sets of genes with lowest error rates is the key factor in achieving highest classification accuracy. Hence, improved gene selection method using random forest has been proposed to obtain the smallest subset of genes as well as biggest subset of genes prior to classification. The option for biggest subset selection is done to assist resea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
14
0
1

Year Published

2013
2013
2024
2024

Publication Types

Select...
6
4

Relationship

1
9

Authors

Journals

citations
Cited by 47 publications
(16 citation statements)
references
References 16 publications
0
14
0
1
Order By: Relevance
“…However, using RainDance Microdroplet PCR as previously described, this limitation has been overcome [ 14 , 15 ]. In addition, the UroMark assay uses a random forest classifier which analyses the methylation status for each of 150 loci [ 22 ]. The classifier does not rely on single or low number of positive markers, or, a predefined pattern of methylation across a set of markers and a dichotomous output is derived from a cut off generated from the known outcomes of prior samples.…”
Section: Discussionmentioning
confidence: 99%
“…However, using RainDance Microdroplet PCR as previously described, this limitation has been overcome [ 14 , 15 ]. In addition, the UroMark assay uses a random forest classifier which analyses the methylation status for each of 150 loci [ 22 ]. The classifier does not rely on single or low number of positive markers, or, a predefined pattern of methylation across a set of markers and a dichotomous output is derived from a cut off generated from the known outcomes of prior samples.…”
Section: Discussionmentioning
confidence: 99%
“…Due to its implementation simplicity and classification effectiveness, KNN has been widely used in pattern recognition. It is also used as a different feature selection algorithm [ 50 , 51 ] and is integrated into the feature selection framework to evaluate the quality of a candidate feature subset [ 52 54 ].…”
Section: Methods and Techniquesmentioning
confidence: 99%
“…Random Forest (RF) is a type of machine-learning method, which has been experimentally proven to be the best classifier (10). RF has a number of advantages and has already been successfully applied to microarray data classification (11,12) and numerous other disease classifications (13,14). Among the different variable selection methods, variable selection using RF (VSURF) has demonstrated the best predictive performance thus far (15).…”
Section: Introductionmentioning
confidence: 99%