Long-read based assembly and synteny analysis of a reference Drosophila subobscura genome reveals signatures of structural evolution driven by inversions recombination-suppression effects
Abstract:BackgroundDrosophila subobscura has long been a central model in evolutionary genetics. Presently, its use is hindered by the lack of a reference genome. To bridge this gap, here we used PacBio long-read technology, together with the available wealth of genetic marker information, to assemble and annotate a high-quality nuclear and complete mitochondrial genome for the species. With the obtained assembly, we performed the first synteny analysis of genome structure evolution in the subobscura subgroup.ResultsWe… Show more
“…2). Genome--wide alignments between our D. subobscura assembly and a recently published high-quality genome confirm our inference of pericentromeric DNA (Puerma et al 2018;Karageorgiou et al 2019) and verify the orientation of the chromosomes (Fig. S4).…”
Section: Resultssupporting
confidence: 83%
“…Our annotations identified 12,714 protein--coding genes in D. subobscura, 13,665 in D. athabasca, 14,547 in D. lowei, and 14,334 in D. pseudoobscura. The number of annotated genes in D. subobscura is very similar to the 13,317 protein--coding genes in another assembly of D. subobscura (Karageorgiou et al 2019) and the 13,453 genes of its close relative D. guanche (Puerma et al 2018). Further, the number of annotated genes in D. pseudoobscura and D. lowei (Table S4) are similar to the number in the current version of D. pseudoobscura (14,574 genes in Dpse_3.0).…”
Section: Resultssupporting
confidence: 61%
“…S3). BUSCO (Benchmarking Universal Single--Copy Orthologs) scores reveal that our genomes are highly complete (Table S3) and match or exceed other recently published high quality Drosophila genomes (Mahajan et al 2018;Puerma et al 2018;Miller et al 2018;Karageorgiou et al 2019). Identification of D. melanogaster orthologs in our genome assemblies allowed us to unambiguously assign each chromosome to a Muller element ( Fig.…”
Section: Resultsmentioning
confidence: 62%
“…2). A lower cut--off was used for D. subobscura, as this species has a very small, compact and repeat--poor genome (see also Karageorgiou et al 2019). The sizes of (peri)centromere regions recovered for each chromosome and species vary considerably ( Table 1).…”
Centromeres are the basic unit for chromosome inheritance, but their evolutionary dynamics is poorly understood. We generate high--quality reference genomes for multiple Drosophila obscura group species to reconstruct karyotype evolution. All chromosomes in this lineage were ancestrally telocentric and the creation of metacentric chromosomes in some species was driven by de novo seeding of new centromeres at ancestrally gene--rich regions, independently of chromosomal rearrangements. The emergence of centromeres resulted in a drastic size increase due to repeat accumulation, and dozens of genes previously located in euchromatin are now embedded in pericentromeric heterochromatin. Metacentric chromosomes secondarily became telocentric in the pseudoobscura subgroup through centromere repositioning and a pericentric inversion. The former (peri)centric sequences left behind shrunk dramatically in size after their inactivation, yet contain remnants of their evolutionary past, including increased repeat--content and heterochromatic environment. Centromere movements are accompanied by rapid turnover of the major satellite DNA detected in (peri)centromeric regions.
“…2). Genome--wide alignments between our D. subobscura assembly and a recently published high-quality genome confirm our inference of pericentromeric DNA (Puerma et al 2018;Karageorgiou et al 2019) and verify the orientation of the chromosomes (Fig. S4).…”
Section: Resultssupporting
confidence: 83%
“…Our annotations identified 12,714 protein--coding genes in D. subobscura, 13,665 in D. athabasca, 14,547 in D. lowei, and 14,334 in D. pseudoobscura. The number of annotated genes in D. subobscura is very similar to the 13,317 protein--coding genes in another assembly of D. subobscura (Karageorgiou et al 2019) and the 13,453 genes of its close relative D. guanche (Puerma et al 2018). Further, the number of annotated genes in D. pseudoobscura and D. lowei (Table S4) are similar to the number in the current version of D. pseudoobscura (14,574 genes in Dpse_3.0).…”
Section: Resultssupporting
confidence: 61%
“…S3). BUSCO (Benchmarking Universal Single--Copy Orthologs) scores reveal that our genomes are highly complete (Table S3) and match or exceed other recently published high quality Drosophila genomes (Mahajan et al 2018;Puerma et al 2018;Miller et al 2018;Karageorgiou et al 2019). Identification of D. melanogaster orthologs in our genome assemblies allowed us to unambiguously assign each chromosome to a Muller element ( Fig.…”
Section: Resultsmentioning
confidence: 62%
“…2). A lower cut--off was used for D. subobscura, as this species has a very small, compact and repeat--poor genome (see also Karageorgiou et al 2019). The sizes of (peri)centromere regions recovered for each chromosome and species vary considerably ( Table 1).…”
Centromeres are the basic unit for chromosome inheritance, but their evolutionary dynamics is poorly understood. We generate high--quality reference genomes for multiple Drosophila obscura group species to reconstruct karyotype evolution. All chromosomes in this lineage were ancestrally telocentric and the creation of metacentric chromosomes in some species was driven by de novo seeding of new centromeres at ancestrally gene--rich regions, independently of chromosomal rearrangements. The emergence of centromeres resulted in a drastic size increase due to repeat accumulation, and dozens of genes previously located in euchromatin are now embedded in pericentromeric heterochromatin. Metacentric chromosomes secondarily became telocentric in the pseudoobscura subgroup through centromere repositioning and a pericentric inversion. The former (peri)centric sequences left behind shrunk dramatically in size after their inactivation, yet contain remnants of their evolutionary past, including increased repeat--content and heterochromatic environment. Centromere movements are accompanied by rapid turnover of the major satellite DNA detected in (peri)centromeric regions.
“…Using the recently assembled genome of D. subobscura [41] we have estimated that the hsp70 loci are around 0.8 Mb from the closest O 4 distal breakpoint and likely too far away as to be directly affected by this breakpoint, an inference that is in agreement with the similar RNA amounts found in the two chromosomal arrangements analysed here. Four genes (Pxd, Set 8, CG5225 and Acf) are located near O 4 inversion breakpoints [42], but at present we do not know whether potential position effects affecting these genes play any role in the in the adaptive value of this inversion.…”
Background: Drosophila subobscura exhibits a rich inversion polymorphism, with some adaptive inversions showing repeatable spatiotemporal patterns in frequencies related to temperature. Previous studies reported increased basal HSP70 protein levels in homokaryotypic strains for a warm-climate arrangement compared to a cold-climate one. These findings do not match the similar hsp70 genomic organization between arrangements, where gene expression levels are expected to be similar. In order to test this hypothesis and understand the molecular basis for hsp70 expression, we compared basal hsp70 mRNA levels in males and females, and analysed the 5′ and 3′ regulatory regions of hsp70 genes in warm-and cold-climate isochromosomal O3 + 4 + 7 and O ST lines of D. subobscura. Results: We observed comparable mRNA levels between the two arrangements and a sex-biased hsp70 gene expression. The number of heat-shock elements (HSEs) and GAGA sites on the promoters were identical amongst the O ST and O 3 + 4 + 7 lines analysed. This is also true for 3′ AU-rich elements where most A and B copies of hsp70 have, respectively, two and one element in both arrangements. Beyond the regulatory elements, the only notable difference between both arrangements is the presence in 3′ UTR of a 14 bp additional fragment after the stop codon in the hsp70A copy in five O 3 + 4 + 7 lines, which was not found in any of the six O ST lines.
Conclusions:The equivalent hsp70 mRNA amounts in O ST and O 3 + 4 + 7 arrangements provide the first evidence of a parallelism between gene expression and genetic organization in D. subobscura lines having these arrangements. This is reinforced by the lack of important differential features in the number and structure of regulatory elements between both arrangements, despite the genetic differentiation observed when the complete 5′ and 3′ regulatory regions were considered. Therefore, the basal levels of hsp70 mRNA cannot account, in principle, for the adaptive variation of the two arrangements studied. Consequently, further studies are necessary to understand the intricate molecular mechanisms of hsp70 gene regulation in D. subobscura.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.