2011
DOI: 10.1093/nar/gkr349
|View full text |Cite
|
Sign up to set email alerts
|

ESPRIT-Tree: hierarchical clustering analysis of millions of 16S rRNA pyrosequences in quasilinear computational time

Abstract: Taxonomy-independent analysis plays an essential role in microbial community analysis. Hierarchical clustering is one of the most widely employed approaches to finding operational taxonomic units, the basis for many downstream analyses. Most existing algorithms have quadratic space and computational complexities, and thus can be used only for small or medium-scale problems. We propose a new online learning-based algorithm that simultaneously addresses the space and computational issues of prior work. The basic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
134
0

Year Published

2013
2013
2020
2020

Publication Types

Select...
7
3

Relationship

1
9

Authors

Journals

citations
Cited by 139 publications
(136 citation statements)
references
References 33 publications
1
134
0
Order By: Relevance
“…Due to the superior performance of MtHc [10] than other four state-of-the-art clustering methods (MSClust [11], ESPRIT-Tree [12], CROP [13] and BEBaC [14]), we select MtHc method to generate the OTUs in this paper.…”
Section: Resultsmentioning
confidence: 99%
“…Due to the superior performance of MtHc [10] than other four state-of-the-art clustering methods (MSClust [11], ESPRIT-Tree [12], CROP [13] and BEBaC [14]), we select MtHc method to generate the OTUs in this paper.…”
Section: Resultsmentioning
confidence: 99%
“…ESPRIT reduces computational complexity by generating only the lower part of a dendrogram. The approach of Mothur and ESPRIT is similar but instead of pairwise global alignment used by ESPRIT, Mothur uses multiple sequence alignment tool such as MUSCLE [14] to compute the pairwise distance matrix. It has been seen that pairwise alignment produces better clustering outcomes than multiple sequence alignments [7], [15].…”
Section: B Agglomerative Hierarchical Clustering (Ahc)mentioning
confidence: 99%
“…We employ ESPIRITTree [8] to cluster sequences into OTUs at various distance levels. To tackle the problem that ESPRIT-Tree uses a fixed distance threshold to define OTUs (operational taxonomy units) [7], which is inconsistent with genomic variations between taxa in real-world.…”
Section: ) Preprocessing Of Icomm Datamentioning
confidence: 99%