2012
DOI: 10.1089/cmb.2012.0021
|View full text |Cite
|
Sign up to set email alerts
|

SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing

Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (mu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

18
15,707
0
45

Year Published

2012
2012
2021
2021

Publication Types

Select...
9
1

Relationship

1
9

Authors

Journals

citations
Cited by 19,740 publications
(15,820 citation statements)
references
References 41 publications
18
15,707
0
45
Order By: Relevance
“…We compared short-read NLR assemblies with long-read assemblies, and confirmed the fragmented reconstruction of the NLR-complement from short-read sequencing data; 41% CLC Assembly Cell (CLC, www.clcbio.com) or 43% SPAdes assembler 16 (Figure 1a-c). We found 327 sequences assembled from long-read data for which no high confidence short-read assembly is present.…”
mentioning
confidence: 66%
“…We compared short-read NLR assemblies with long-read assemblies, and confirmed the fragmented reconstruction of the NLR-complement from short-read sequencing data; 41% CLC Assembly Cell (CLC, www.clcbio.com) or 43% SPAdes assembler 16 (Figure 1a-c). We found 327 sequences assembled from long-read data for which no high confidence short-read assembly is present.…”
mentioning
confidence: 66%
“…We also attempted filtering the reads by alignment to w Ri and assembling with SPAdes 3.0 (Bankevich et al. 2012). The assembly of w Spc is available from GenBank under accession number NTHL00000000 (project number PRJNA401169, Biosample SAMN07599555).…”
Section: Methodsmentioning
confidence: 99%
“…The short read sequences were mapped using SMALT (http://www.sanger.ac.uk/resources/software/smalt/) to the reference emm 1 genome MGAS5005 (Genbank accession number NC_007297.2) and single nucleotide polymorphisms (SNPs) were identified. De novo sequences were generated using SPAdes [15]. Mapping and SNPs over regulatory regions and the hasA promoter were manually checked using Artemis [16].…”
Section: Methodsmentioning
confidence: 99%