Josh Quick scite author profile

We report the sequencing and assembly of a reference genome for the human GM12878 Utah/Ceph cell line using the MinION (Oxford Nanopore Technologies) nanopore sequencer. 91.2 Gb of sequence data, representing ~30× theoretical coverage, were produced. Reference-based alignment enabled detection of large structural variants and epigenetic modifications. De novo assembly of nanopore reads alone yielded a contiguous assembly (NG50 ~3 Mb). Next, we developed a protocol to generate ultra-long reads (N50 > 100kb, up to 882 kb). Incorporating an additional 5×-coverage of this data type more than doubled the assembly contiguity (NG50 ~6.4 Mb). The final assembled genome was 2,867 million bases in size, covering 85.8% of the reference. Assembly accuracy, after incorporating complementary short-read sequencing data, exceeded 99.8%. Ultra-long reads enabled assembly and phasing of the 4 Mb major histocompatibility complex (MHC) locus in its entirety, measurement of telomere repeat length and closure of gaps in the reference human genome assembly GRCh38.

show abstract

Telomere-to-telomere assembly of a complete human X chromosome

Miga

Koren

Rhie

et al. 2020

Nature

626

660

View full text Add to dashboard Cite

After two decades of improvements, the current human reference genome (GRCh38) is the most accurate and complete vertebrate genome ever produced. However, no single chromosome has been finished end to end, and hundreds of unresolved gaps persist 1,2. Here we present a human genome assembly that surpasses the continuity of GRCh38 2 , along with a gapless, telomere-to-telomere assembly of a human chromosome. This was enabled by high-coverage, ultra-long-read nanopore sequencing of the complete hydatidiform mole CHM13 genome, combined with complementary technologies for quality improvement and validation. Focusing our efforts on the human X chromosome 3 , we reconstructed the centromeric satellite DNA array (approximately 3.1 Mb) and closed the 29 remaining gaps in the current reference, including new sequences from the human pseudoautosomal regions and from cancer-testis ampliconic gene families (CT-X and GAGE). These sequences will be integrated into future human reference genome releases. In addition, the complete chromosome X, combined with the ultra-long nanopore data, allowed us to map methylation patterns across complex tandem repeats and satellite arrays. Our results demonstrate that finishing the entire human genome is now within reach, and the data presented here will facilitate ongoing efforts to complete the other human chromosomes. Complete, telomere-to-telomere reference genome assemblies are necessary to ensure that all genomic variants are discovered and studied. At present, unresolved areas of the human genome are defined by multi-megabase satellite arrays in the pericentromeric regions and the ribosomal DNA arrays on acrocentric short arms, as well as regions enriched in segmental duplications that are greater than hundreds of kilobases in length and that exhibit sequence identity of more than 98% between paralogues. Owing to their absence from the reference, these repeat-rich sequences are often excluded from genetics and genomics studies, which limits the scope of association and functional analyses 4,5. Unresolved repeat sequences also result in unintended consequences; for example, paralogous sequence variants incorrectly being called as allelic variants 6 , and the contamination of bacterial gene databases 7. Completion of the entire human genome is expected to contribute to our understanding of chromosome function 8 , human disease 9 and genomic variation, which will improve technologies in biomedicine that use short-read mapping to a reference genome (for example, RNA sequencing (RNA-seq) 10 , chromatin immunoprecipitation followed by sequencing (ChIP-seq) 11 and assay for transposase-accessible chromatin using sequencing (ATAC-seq) 12). The fundamental challenge of reconstructing a genome from many comparatively short sequencing reads-a process known as genome assembly-is distinguishing the repeated sequences from one another 13. Resolving such repeats relies on sequencing reads that are long enough to span the entire repeat or accurate enough to distinguish each repeat copy on the basis of...

show abstract

Genomic and epidemiological monitoring of yellow fever virus transmission potential

Faria¹,

Kraemer²,

Hill³

et al. 2018

Science

298

331

View full text Add to dashboard Cite

The yellow fever virus (YFV) epidemic in Brazil is the largest in decades. The recent discovery of YFV in Brazilian Aedes species mosquitos highlights a need to monitor the risk of reestablishment of urban YFV transmission in the Americas. We use a suite of epidemiological, spatial, and genomic approaches to characterize YFV transmission. We show that the age and sex distribution of human cases is characteristic of sylvatic transmission. Analysis of YFV cases combined with genomes generated locally reveals an early phase of sylvatic YFV transmission and spatial expansion toward previously YFV-free areas, followed by a rise in viral spillover to humans in late 2016. Our results establish a framework for monitoring YFV transmission in real time that will contribute to a global strategy to eliminate future YFV epidemics.

show abstract

nCoV-2019 sequencing protocol v1

Quick¹

2020

Preprint

340

298

View full text Add to dashboard Cite

Mix the following components in an 0.2mL 8-strip tube; Compone nt Vol ume Compone nt Vol ume 50µM random hexamers 1 µl 1 µl 10mM dNTPs mix (10mM each) 1 µl 1 µl Template RNA 11 µl 11 µl T ota l T ota l 13 µl 13 µl Viral RNA input from a clinical sample should be between Ct 18-35. If Ct is between 12-15, then dilute the sample 100-fold in water, if between 15-18 then dilute 10-fold in water. This will reduce the likelihood of PCRinhibition. A mastermix should be made up in the ma ste rmi x ca bi ne t ma ste rmi x ca bi ne t and aliquoted into PCR strip tubes. Tubes should be wiped down when entering and leaving the mastermix cabinet. 2 Gently mix by pipetting and pulse spin the tube to collect liquid at the bottom of the tube.

show abstract

Genomic Epidemiology of SARS-CoV-2 in Guangdong Province, China

Plessis

Liu

et al. 2020

Cell

261

226

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Josh Quick

Nanopore sequencing and assembly of a human genome with ultra-long reads

Telomere-to-telomere assembly of a complete human X chromosome

Genomic and epidemiological monitoring of yellow fever virus transmission potential

nCoV-2019 sequencing protocol v1

Genomic Epidemiology of SARS-CoV-2 in Guangdong Province, China

Contact Info

Product

Resources

About