2009
DOI: 10.2174/138920209789177601
|View full text |Cite
|
Sign up to set email alerts
|

Clustering Algorithms: On Learning, Validation, Performance, and Applications to Genomics

Abstract: The development of microarray technology has enabled scientists to measure the expression of thousands of genes simultaneously, resulting in a surge of interest in several disciplines throughout biology and medicine. While data clustering has been used for decades in image processing and pattern recognition, in recent years it has joined this wave of activity as a popular technique to analyze microarrays. To illustrate its application to genomics, clustering applied to genes from a set of microarray data group… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
43
1
3

Year Published

2011
2011
2022
2022

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 66 publications
(48 citation statements)
references
References 52 publications
1
43
1
3
Order By: Relevance
“…In other words, when re-running the same algorithm on the same data, with the same set of parameters, how sure can we be of obtaining similar results? Here again, different measures of stability can be found in the literature [36][37][38]. A useful approach consists in evaluating the stability of groups while iterating the same method with the same parameter set, on randomly chosen and reshuffled subsamples of the data.…”
Section: Finding the ''Best'' Fitting Results: Group Stabilitymentioning
confidence: 99%
“…In other words, when re-running the same algorithm on the same data, with the same set of parameters, how sure can we be of obtaining similar results? Here again, different measures of stability can be found in the literature [36][37][38]. A useful approach consists in evaluating the stability of groups while iterating the same method with the same parameter set, on randomly chosen and reshuffled subsamples of the data.…”
Section: Finding the ''Best'' Fitting Results: Group Stabilitymentioning
confidence: 99%
“…This is a pair‐wise average‐linked algorithm (Johnson et al. 2003; Dalton et al. 2009) that sorts through all the data to identify pairs of genes that behave most similarly and then progressively adds other genes to initial pairs to form clusters of genes with a correlated expression profiles.…”
Section: Resultsmentioning
confidence: 99%
“…To improve the chances of identifying genes involved in meiosis, we next performed hierarchical clustering. This is a pair-wise average-linked algorithm (Johnson et al 2003;Dalton et al 2009) that sorts through all the data to identify pairs of genes that behave most similarly and then progressively adds other genes to initial pairs to form clusters of genes with a correlated expression profiles. Hierarchical clustering results were then analysed in regard to the repartition of the set of 29 (Table S3a) or 47 (Table S3a and b) meiotic genes.…”
Section: Clusteringmentioning
confidence: 99%
“…A microarray gene expression matrix is a table where rows represent genes, columns represent various samples (or examined conditions), and numbers in each cell denote the expression level of the particular gene in the particular sample. Clustering is a helpful statistical data mining procedure for analyzing such gene expression data; it arranges genes together in groups that have potentially related functions or are co-regulated, thus helping to establish the relationships among them in the form of gene regulatory networks [123, 132, 133]. …”
Section: Clustering: a Statistical Data Mining Procedures For Analyzinmentioning
confidence: 99%