2009
DOI: 10.1093/bioinformatics/btp123
|View full text |Cite
|
Sign up to set email alerts
|

A modified hyperplane clustering algorithm allows for efficient and accurate clustering of extremely large datasets

Abstract: Motivation: As the number of publically available microarray experiments increases, the ability to analyze extremely large datasets across multiple experiments becomes critical. There is a requirement to develop algorithms which are fast and can cluster extremely large datasets without affecting the cluster quality. Clustering is an unsupervised exploratory technique applied to microarray data to find similar data structures or expression patterns. Because of the high input/output costs involved and large dist… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2010
2010
2022
2022

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 17 publications
(9 citation statements)
references
References 12 publications
0
9
0
Order By: Relevance
“…1). In order to understand the expression patterns of differentially expressed genes, we performed cluster analysis using HPCluster program (8).…”
Section: Gene Expression Data Analysesmentioning
confidence: 99%
“…1). In order to understand the expression patterns of differentially expressed genes, we performed cluster analysis using HPCluster program (8).…”
Section: Gene Expression Data Analysesmentioning
confidence: 99%
“…Regarding the generalization capability, the optimal number of rules is automatically determined on the basis of learning theory [49]. The best model is chosen by evaluating a cost function based on network complexity and approximation error [50].…”
Section: Honfismentioning
confidence: 99%
“…Bohland et al (2010) used K-means to cluster all left hemisphere brain voxels, a 25, 155 × 271 matrix is used as an input for the algorithm. Sharma et al (2009) used a two-stage hyperplane algorithm applied in a software package called HPCluster. The first stage reduced the data size and the second stage was the conventional K-means.…”
Section: Algorithms Used For Clustering Gene Expression Datamentioning
confidence: 99%