2019
DOI: 10.1038/s41467-018-08260-0
|View full text |Cite
|
Sign up to set email alerts
|

Chromosome-level assembly of the water buffalo genome surpasses human and goat genomes in sequence contiguity

Abstract: Rapid innovation in sequencing technologies and improvement in assembly algorithms have enabled the creation of highly contiguous mammalian genomes. Here we report a chromosome-level assembly of the water buffalo (Bubalus bubalis) genome using single-molecule sequencing and chromatin conformation capture data. PacBio Sequel reads, with a mean length of 11.5 kb, helped to resolve repetitive elements and generate sequence contiguity. All five B. bubalis sub-metacentric chromosomes were correctly scaffolded with … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

10
140
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 133 publications
(153 citation statements)
references
References 40 publications
10
140
0
Order By: Relevance
“…A key challenge to achieving this chromosomal assembly was the presence of extensive genetic diversity in the sequencing data. Although high-quality genome assemblies are becoming more prevalent through the use of long-read sequencing technologies [4,5,54] , a feature of these assemblies is that they are derived from a single, few, or clonal individual(s) in which the genetic diversity is low or absent; most of the genetic diversity present in a single diploid individual can be phased and separated into distinct haplotypes (for example, using software such as Falcon-Unzip [8] and Supernova [7] for PacBio and 10X Genomics data, respectively), and assembled separately. However, even under these scenarios, heterozygosity remains a key technical limitation towards achieving contiguous genome assemblies.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…A key challenge to achieving this chromosomal assembly was the presence of extensive genetic diversity in the sequencing data. Although high-quality genome assemblies are becoming more prevalent through the use of long-read sequencing technologies [4,5,54] , a feature of these assemblies is that they are derived from a single, few, or clonal individual(s) in which the genetic diversity is low or absent; most of the genetic diversity present in a single diploid individual can be phased and separated into distinct haplotypes (for example, using software such as Falcon-Unzip [8] and Supernova [7] for PacBio and 10X Genomics data, respectively), and assembled separately. However, even under these scenarios, heterozygosity remains a key technical limitation towards achieving contiguous genome assemblies.…”
Section: Discussionmentioning
confidence: 99%
“…However, high-throughput sequencing has significantly impacted the rate at which draft genomes are produced and, together with the recent introduction of long-read sequencing from Pacific Biosciences (PacBio) and Oxford Nanopore, the quality and contiguity of assemblies of both large and small genomes has completely changed. Now, the ability to generate highly contiguous and closed prokaryote genomes is becoming routine, even from metagenomic samples containing multiple strains or species [2,3] , and high-quality chromosome-scale genomes for a number of eukaryotic species are rapidly becoming available [4][5][6] . These technologies generally require microgram quantities of high molecular weight DNA, and the success of a genome assembly is highly dependent on the degree of polymorphism present in the DNA that is sequenced.…”
Section: Introductionmentioning
confidence: 99%
“…An additional advantage to single-insect assemblies is that genome assembly for a diploid sample is algorithmically simpler than for a sample of many pooled individuals, each of which may contribute up to two unique haplotypes. Several de novo assembly methods are available for diploid samples [21,22,33], and have been broadly applied taxonomically [5,34]. Recent work indicates that assembly of high-heterozygosity samples is more accurate than for inbred samples when parental data can be used to partition long-read sequence data by haplotype, an approach called trio-binning [22,35].…”
Section: Nilaparvata Lugensmentioning
confidence: 99%
“…The array contains 89 988 SNPs, together with 5799 probes for Quality Control and 1784 probes for gender determination (Iamartino et al ). The candidate SNPs for the array design were selected to be evenly distributed genome‐wide with respect to the bovine UMD 3.1 genome, but have been recently remapped after the release of the current version of the buffalo reference genome (Low et al ). Also, SNP numbers were weighted with respect to the significance of the breeds included in the discovery panel as follows: Mediterranean, 30%; Murrah, 30%; Jafarabadi, 20%; Nili‐Ravi, 20%.…”
Section: Introductionmentioning
confidence: 99%