2017
DOI: 10.1101/gr.213611.116
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly

Abstract: The human reference genome assembly plays a central role in nearly all aspects of today's basic and clinical research. GRCh38 is the first coordinate-changing assembly update since 2009; it reflects the resolution of roughly 1000 issues and encompasses modifications ranging from thousands of single base changes to megabase-scale path reorganizations, gap closures, and localization of previously orphaned sequences. We developed a new approach to sequence generation for targeted base updates and used data from n… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

10
568
0
2

Year Published

2017
2017
2023
2023

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 760 publications
(580 citation statements)
references
References 77 publications
10
568
0
2
Order By: Relevance
“…Over a number of years, the genome assembly has steadily improved (International Human Genome Sequencing Consortium 2004; Church et al 2011) to the point that the current Genome Reference Consortium (GRC) human genome assembly, GRCh38 (Schneider et al 2017), is arguably the best assembled mammalian genome in existence, with just 875 remaining assembly gaps and fewer than 160 million unspecified "N" nucleotides (as of GRCh38.p8).…”
Section: The Triumph Of the Human Reference Genomementioning
confidence: 99%
See 2 more Smart Citations
“…Over a number of years, the genome assembly has steadily improved (International Human Genome Sequencing Consortium 2004; Church et al 2011) to the point that the current Genome Reference Consortium (GRC) human genome assembly, GRCh38 (Schneider et al 2017), is arguably the best assembled mammalian genome in existence, with just 875 remaining assembly gaps and fewer than 160 million unspecified "N" nucleotides (as of GRCh38.p8).…”
Section: The Triumph Of the Human Reference Genomementioning
confidence: 99%
“…5). These graphs were then used to generate linear reference sequences for the two arrays (Miga et al 2014), using a Markovian traversal, and have subsequently been used to define linear representations of the centromeres that are included in GRCh38 (Schneider et al 2017). …”
Section: The Repeatomementioning
confidence: 99%
See 1 more Smart Citation
“…Scaffolds can contain errors in contig order (a 'translocation') or orientation (an 'inversion'). Examples of such errors can be found in the best available reference genomes for many species (Robert B. Norgren 2013;Shearer et al 2014;Tang et al 2014;Chen et al 2015;Davey et al 2016;Utsunomiya et al 2016;Schneider et al 2017;Korlach et al 2017). Consequently, inexpensive methods for identifying and correcting assembly errors are crucial for the generation of accurate assemblies (Salzberg and Yorke 2005;Phillippy, Schatz, and Pop 2008;Gnerre et al 2009;Tsai, Otto, and Berriman 2010;Salzberg et al 2012;Hunt et al 2013;Gurevich et al 2013;Bradnam et al 2013;Simão et al 2015;Fierst 2015;Muggli et al 2015;Yuan et al 2017;Harewood et al 2017).…”
mentioning
confidence: 99%
“…This issue marks the 38th build of the human reference sequence (Schneider et al 2017), which still contains more than 800 gaps after decades of work and billions of dollars spent. But there is hope on the horizon.…”
mentioning
confidence: 99%