2008
DOI: 10.1038/sj.bjc.6604207
|View full text |Cite
|
Sign up to set email alerts
|

Approaches to working in high-dimensional data spaces: gene expression microarrays

Abstract: This review provides a focused summary of the implications of high-dimensional data spaces produced by gene expression microarrays for building better models of cancer diagnosis, prognosis, and therapeutics. We identify the unique challenges posed by high dimensionality to highlight methodological problems and discuss recent methods in predictive classification, unsupervised subclass discovery, and marker identification.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
50
0

Year Published

2010
2010
2014
2014

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 63 publications
(50 citation statements)
references
References 28 publications
0
50
0
Order By: Relevance
“…Analytic approaches to discover new subclasses of samples based on distinct patterns of gene expression have been extensively used in cancer genomics to identify new categories of tumors types with distinct molecular, prognostic, or drug-response features [7,8,9,29,30]. Clustering approaches share one critical aspect: the analysis is unsupervised.…”
Section: Subclass Discovery Using Microarray Datamentioning
confidence: 99%
“…Analytic approaches to discover new subclasses of samples based on distinct patterns of gene expression have been extensively used in cancer genomics to identify new categories of tumors types with distinct molecular, prognostic, or drug-response features [7,8,9,29,30]. Clustering approaches share one critical aspect: the analysis is unsupervised.…”
Section: Subclass Discovery Using Microarray Datamentioning
confidence: 99%
“…In the past few years, many classical gene selection method based on clustering algorithm have been proposed to find relative genes. [4][5][6] These methods often run clustering method on a small subset, * Authors to whom correspondence should be addressed. and then select significant genes at each gene clustering.…”
Section: Introductionmentioning
confidence: 99%
“…A large genotyping data set may consist of 2 million Single Nucleotide Polymorphism (SNP) probes by 10000 samples. Dimensionality is of concern to most analysis approaches, with the number of variables (genes, SNPs, sequences) vastly outweighing the number of observations [25].…”
Section: Introductionmentioning
confidence: 99%