2019
DOI: 10.1038/s41467-019-09575-2
|View full text |Cite
|
Sign up to set email alerts
|

Platanus-allee is a de novo haplotype assembler enabling a comprehensive access to divergent heterozygous regions

Abstract: The ultimate goal for diploid genome determination is to completely decode homologous chromosomes independently, and several phasing programs from consensus sequences have been developed. These methods work well for lowly heterozygous genomes, but the manifold species have high heterozygosity. Additionally, there are highly divergent regions (HDRs), where the haplotype sequences differ considerably. Because HDRs are likely to direct various interesting biological phenomena, many genomic analysis targets fall w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
77
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 103 publications
(77 citation statements)
references
References 56 publications
(85 reference statements)
0
77
0
Order By: Relevance
“…The accuracy of genome sequencing and assembly of complex genomes is affected by different factors ranging from plant material availability to genome size, GC content, repeat content, ploidy level, sequencing technology, and bioinformatic software [19][20][21][22] . Long-read sequencing technologies, such as those of Pacific Biosciences or Oxford Nanopore Technologies, are currently used for reconstructing Mb-long chromosome sequences or both haplotypes of highly heterozygous genomes [23][24][25][26][27] . Short-read sequencing is less expensive, provides more coverage with the same investment and thus more sequence accuracy, but it leads to more fragmented assemblies.…”
Section: Discussionmentioning
confidence: 99%
“…The accuracy of genome sequencing and assembly of complex genomes is affected by different factors ranging from plant material availability to genome size, GC content, repeat content, ploidy level, sequencing technology, and bioinformatic software [19][20][21][22] . Long-read sequencing technologies, such as those of Pacific Biosciences or Oxford Nanopore Technologies, are currently used for reconstructing Mb-long chromosome sequences or both haplotypes of highly heterozygous genomes [23][24][25][26][27] . Short-read sequencing is less expensive, provides more coverage with the same investment and thus more sequence accuracy, but it leads to more fragmented assemblies.…”
Section: Discussionmentioning
confidence: 99%
“…The average coverage depth for all scaffolds was ×130. The longest scaffold, which was identified as the COTS27 chromosome, was closed by Sanger sequencing, and an alternative assembly was obtained using Platanus-allee v. 2.0.0 [47]. A circular view of the COTS27 chromosome was created using CGView Server [48] and manual processing.…”
Section: Reconstitution Of the Cots27 Chromosome From The Hologenomementioning
confidence: 99%
“…Nevertheless, the reconstruction of these data into accurate haploid (or phased diploid) genome representations still poses a challenge. In particular, current algorithms are not fully robust to differences in heterozygosity between loci and across clades (2). Importantly, the differences in type and structure of heterozygosity are the consequences of unique evolutionary histories.…”
Section: Introductionmentioning
confidence: 99%
“…A particularly challenging aspect of genome assembly is the detection of highly divergent regions (HDRs) (2), which often cannot be determined as allelic within the assembly process and requires supervised analysis (3). HDRs are likely to be biologically important and may indicate regions experiencing diversifying selection, for example the MHC locus in humans (4).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation