2021
DOI: 10.1073/pnas.2016274118

Long-read assembly of a Great Dane genome highlights the contribution of GC-rich sequence and mobile elements to canine genomes

Abstract: Technological advances have allowed improvements in genome reference sequence assemblies. Here, we combined long- and short-read sequence resources to assemble the genome of a female Great Dane dog. This assembly has improved continuity compared to the existing Boxer-derived (CanFam3.1) reference genome. Annotation of the Great Dane assembly identified 22,182 protein-coding gene models and 7,049 long noncoding RNAs, including 49 protein-coding genes not present in the CanFam3.1 reference. The Great Dane assemb…

Citations: cited by 39 publications (60 citation statements)
References: 102 publications
“…Second, we identified 321 intervals encompassing 38.3 Mb of sequence based on excess depth of coverage from Illumina sequencing reads. These measures of duplication content are both less than that found in the Great Dane Zoey or CanFam3.1 assemblies [8], indicating that these duplicated sequences are not correctly resolved in the Dog10K_Boxer_Tasha_1.0 genome assembly.…”
Section: Analysis Of Duplications (mentioning)
confidence: 72%
“…Segmental duplications were defined as segments of four or more consecutive windows with an estimated copy number of at least 2.5. Comparable annotations for the CanFam3.1 assembly were obtained from [8].…”
Section: Detection Of Common Repeats and Segmental Duplications (mentioning)
confidence: 99%
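
The segmental duplication rule quoted above (four or more consecutive windows with an estimated copy number of at least 2.5) can be illustrated with a minimal sketch. The function name `call_segmental_duplications`, the plain list of per-window copy-number estimates, and the index-based output are assumptions made for illustration; the citing paper's actual pipeline, window definitions, and file formats are not given here.

```python
from typing import List, Tuple


def call_segmental_duplications(
    copy_numbers: List[float],
    min_copy_number: float = 2.5,  # threshold quoted in the citation statement
    min_windows: int = 4,          # minimum run length quoted in the statement
) -> List[Tuple[int, int]]:
    """Return (start, end) window-index pairs, end exclusive, for every run of
    at least `min_windows` consecutive windows whose estimated copy number is
    at least `min_copy_number`. Hypothetical helper, not the authors' code."""
    segments: List[Tuple[int, int]] = []
    run_start = None  # index of the first window in the current run, if any

    for i, cn in enumerate(copy_numbers):
        if cn >= min_copy_number:
            if run_start is None:
                run_start = i  # a new run of elevated windows begins here
        else:
            # The run (if any) ended at window i - 1; keep it if long enough.
            if run_start is not None and i - run_start >= min_windows:
                segments.append((run_start, i))
            run_start = None

    # Handle a run that extends to the last window.
    if run_start is not None and len(copy_numbers) - run_start >= min_windows:
        segments.append((run_start, len(copy_numbers)))

    return segments


if __name__ == "__main__":
    # Made-up copy-number estimates: one run of four windows, one run of three.
    example = [2.0, 2.6, 3.0, 2.5, 4.1, 2.0, 2.7, 2.8, 2.9, 1.9]
    print(call_segmental_duplications(example))
```

In this made-up example only the first run (windows 1 to 4) would be reported, because the second run spans just three windows.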
“…In total, 31,911 probes of the mammalian methylation array are aligned to loci that are proximal to 5,021 genes in the recently released long read reference assembly of the Great Dane (CanFam_GreatDane.UMICH_Zoey_3.1.100) [13]. The array has high inter-species conservation; thus, it can be extrapolated to other breeds and other mammalian species [9].…”
Section: Results (mentioning)
confidence: 99%