2014
DOI: 10.1016/j.ygeno.2014.02.007
|View full text |Cite
|
Sign up to set email alerts
|

MetaCAA: A clustering-aided methodology for efficient assembly of metagenomic datasets

Abstract: A key challenge in analyzing metagenomics data pertains to assembly of sequenced DNA fragments (i.e. reads) originating from various microbes in a given environmental sample. Several existing methodologies can assemble reads originating from a single genome. However, these methodologies cannot be applied for efficient assembly of metagenomic sequence datasets. In this study, we present MetaCAA - a clustering-aided methodology which helps in improving the quality of metagenomic sequence assembly. MetaCAA initia… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 17 publications
(10 citation statements)
references
References 28 publications
0
10
0
Order By: Relevance
“…The retrospective nature of this study allowed us to compare the success of the strategies in retrieving three novel viral genomes from 454 metagenomic data. While different metagenome assembly strategies, especially for very large datasets of short read data, apply k-mer clustering or digital normalization and partitioning prior to assembly (Howe et al, 2014 ; Reddy et al, 2014 ), we here concentrated on strategies to link contigs after assembly in silico . Theoretically, these strategies can also be applied to contigs from metagenomes produced by other sequencing methods.…”
Section: Discussionmentioning
confidence: 99%
“…The retrospective nature of this study allowed us to compare the success of the strategies in retrieving three novel viral genomes from 454 metagenomic data. While different metagenome assembly strategies, especially for very large datasets of short read data, apply k-mer clustering or digital normalization and partitioning prior to assembly (Howe et al, 2014 ; Reddy et al, 2014 ), we here concentrated on strategies to link contigs after assembly in silico . Theoretically, these strategies can also be applied to contigs from metagenomes produced by other sequencing methods.…”
Section: Discussionmentioning
confidence: 99%
“…For these purposes we utilized the program CAP3 (58) that was originally developed for the assembly of large-insert clones sequenced using capillary sequencing, but has also been applied to de novo assembly of short read RNA-seq (77) and metagenomic sequence data (78). For each putative site, we retrieved any insertion-supporting read pairs as reported by RetroSeq and proximal soft-clipped reads, and then performed de novo assemblies with CAP3 ( Figure 1) run with parameters adjusted for joining smaller overlaps that could be expected from 101 bp reads (also see Methods).…”
Section: Precise Assembly Of Full-length Alu Variants Using Read Datamentioning
confidence: 99%
“…While recent efforts in short read assembly have focused on a de brujin graph approach ( 74 76 ), we reasoned a local assembly using an overlap-layout-consensus approach would take full advantage of our data. For these purposes we utilized the program CAP3 ( 58 ) that was originally developed for the assembly of large-insert clones sequenced using capillary sequencing, but has also been applied to de novo assembly of short read RNA-seq ( 77 ) and metagenomic sequence data ( 78 ). For each putative site, we retrieved any insertion-supporting read pairs as reported by RetroSeq and proximal soft-clipped reads, and then performed de novo assemblies with CAP3 (Figure 1 ) run with parameters adjusted for joining smaller overlaps that could be expected from 101 bp reads (also see Materials and Methods).…”
Section: Resultsmentioning
confidence: 99%