Proceedings of the 2004 SIAM International Conference on Data Mining 2004
DOI: 10.1137/1.9781611972740.11
|View full text |Cite
|
Sign up to set email alerts
|

Minimum Sum-Squared Residue Co-clustering of Gene Expression Data

Abstract: Microarray experiments have been extensively used for simultaneously measuring DNA expression levels of thousands of genes in genome research. A key step in the analysis of gene expression data is the clustering of genes into groups that show similar expression values over a range of conditions. Since only a small subset of the genes participate in any cellular process of interest, by focusing on subsets of genes and conditions, we can lower the noise induced by other genes and conditions -a co-cluster charact… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
229
0

Year Published

2005
2005
2016
2016

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 232 publications
(233 citation statements)
references
References 20 publications
0
229
0
Order By: Relevance
“…Co-clustering has been studied in many different application contexts including text mining [11], gene expression analysis [8,27] and graph mining [5] where these methods have yielded an impressive improvement in performance over traditional clustering techniques. The methods differ primarily by the criterion they optimize, such as minimum loss in mutual information [11], sum-squared distance [8], minimum description length (MDL) [5], Bregman divergence [2] and non-parametric association measures [29,17].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Co-clustering has been studied in many different application contexts including text mining [11], gene expression analysis [8,27] and graph mining [5] where these methods have yielded an impressive improvement in performance over traditional clustering techniques. The methods differ primarily by the criterion they optimize, such as minimum loss in mutual information [11], sum-squared distance [8], minimum description length (MDL) [5], Bregman divergence [2] and non-parametric association measures [29,17].…”
Section: Related Workmentioning
confidence: 99%
“…The methods differ primarily by the criterion they optimize, such as minimum loss in mutual information [11], sum-squared distance [8], minimum description length (MDL) [5], Bregman divergence [2] and non-parametric association measures [29,17]. Among these approaches, only those ones based on MDL and association measure are claimed to be parameter-free [19].…”
Section: Related Workmentioning
confidence: 99%
“…Similar ideas have been explored in for gene expression data in [4]. Here we illustrated in our clustering model.…”
Section: The Optimization Proceduresmentioning
confidence: 88%
“…Various co-clustering algorithms have adopted different error functions, such as minimum mutual information [15], sum-squared distance [9], and code length [6]. A general co-clustering framework based on Bregman divergence [4] has been proposed, which covers the entire exponential family.…”
Section: Definitions and Overviewmentioning
confidence: 99%