2016
DOI: 10.1038/ncomms12065
|View full text |Cite
|
Sign up to set email alerts
|

Long-read sequencing and de novo assembly of a Chinese genome

Abstract: Short-read sequencing has enabled the de novo assembly of several individual human genomes, but with inherent limitations in characterizing repeat elements. Here we sequence a Chinese individual HX1 by single-molecule real-time (SMRT) long-read sequencing, construct a physical map by NanoChannel arrays and generate a de novo assembly of 2.93 Gb (contig N50: 8.3 Mb, scaffold N50: 22.0 Mb, including 39.3 Mb N-bases), together with 206 Mb of alternative haplotypes. The assembly fully or partially fills 274 (28.4%… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

5
322
1

Year Published

2017
2017
2023
2023

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 253 publications
(328 citation statements)
references
References 40 publications
5
322
1
Order By: Relevance
“…Recent studies have highlighted the improvements of SMRT-only assemblies compared to Illumina-only assemblies [Bickhart et al, 2017], [Gordon et al, 2016], [Jiao et al, 2017], [Zhang et al, 2016], [Shi et al, 2016]. Here we found that both the pooled Illumina assembly (with mixed read length) and the PacBio SMRT-only assembly resulted in substantially improved assembly statistics compared to the two published hamster genome assemblies (Table 1), with an order of magnitude fewer scaffolds and 2–4x larger N50 values.…”
Section: Resultsmentioning
confidence: 99%
“…Recent studies have highlighted the improvements of SMRT-only assemblies compared to Illumina-only assemblies [Bickhart et al, 2017], [Gordon et al, 2016], [Jiao et al, 2017], [Zhang et al, 2016], [Shi et al, 2016]. Here we found that both the pooled Illumina assembly (with mixed read length) and the PacBio SMRT-only assembly resulted in substantially improved assembly statistics compared to the two published hamster genome assemblies (Table 1), with an order of magnitude fewer scaffolds and 2–4x larger N50 values.…”
Section: Resultsmentioning
confidence: 99%
“…New reference-quality sequence sources are needed, because generation of finished sequence from clone libraries is in significant decline due to cost and some remaining assembly gaps occur in regions recalcitrant to cloning. A growing collection of human genomes in INSDC databases, a prerequisite for any sequence that will contribute to the reference assembly, that were sequenced and assembled with new technologies are candidates for use in assembly improvement (Earl et al 2011;Vezzi et al 2012;Bradnam et al 2013;English et al 2015;Pendleton et al 2015;Seo et al 2016;Shi et al 2016;Zook et al 2016). However, WGS assembly sequences have historically not been considered reference quality, raising concerns about their use in reference genome assembly curation.…”
Section: De Novo Assembly Evaluationsmentioning
confidence: 99%
“…Alignments were performed and analyzed as described in the Supplementary Methods of Shi et al (2016). However, in contrast to the RefSeq transcripts, we evaluated coverage for the GENCODE data over the full transcript, rather than the CDS, because we did not have the CDS information.…”
Section: Transcript Evaluation Of Assembliesmentioning
confidence: 99%
See 1 more Smart Citation
“…The scoring function of this approach is the likelihood of a pair of label intervals being similar [39]. These assembled contigs and alignments can be used in further hybrid assembly using sequencing reads from other technologies to yield an improved overall final assembly with longer N50( [40,41]). …”
Section: Bionano Optical Mapping Technologymentioning
confidence: 99%