The Gasterosteidae fish family hosts several species that are important models for eco-evolutionary, genetic and genomic research. In particular, a wealth of genetic and genomic data has been generated for the three-spined stickleback (Gasterosteus aculeatus), the ‘ecology’s supermodel’, while the genomic resources for the nine-spined stickleback (Pungitius pungitius) have remained relatively scarce. Here, we report a high-quality chromosome-level genome assembly of P. pungitius consisting of 5,303 contigs (N50 = 1.2 Mbp) with a total size of 521 Mbp. These contigs were mapped to 21 linkage groups using a high-density linkage map, yielding a final assembly with 98.5% BUSCO completeness. A total of 25,062 protein-coding genes were annotated, and ca. 23% of the assembly was found to consist of repetitive elements. A comprehensive analysis of repetitive elements uncovered centromeric-specific tandem repeats and provided insights into the evolution of retrotransposons. A multigene phylogenetic analysis inferred a divergence time of about 26 million years (MYA) between nine- and three-spined sticklebacks, which is far older than the commonly assumed estimate of 13 MYA. Compared to the three-spined stickleback, we identified an additional duplication of several genes in the hemoglobin cluster. Sequencing data from populations adapted to different environments indicated potential copy number variations in hemoglobin genes. Furthermore, genome-wide synteny comparisons between three- and nine-spined sticklebacks identified chromosomal rearrangements underlying the karyotypic differences between the two species. The high-quality chromosome-scale assembly of the nine-spined stickleback genome obtained with long-read sequencing technology provides a crucial resource for comparative and population genomic investigations of stickleback fishes and teleosts.
More than 60% of supratentorial ependymomas harbor a ZFTA–RELA (ZRfus) gene fusion (formerly C11orf95–RELA). To study the biology of ZRfus, we developed an autochthonous mouse tumor model using in utero electroporation (IUE) of the embryonic mouse brain. Integrative epigenomic and transcriptomic mapping was performed on IUE-driven ZRfus tumors by CUT&RUN, chromatin immunoprecipitation sequencing, assay for transposase-accessible chromatin sequencing, and RNA sequencing and compared with human ZRfus-driven ependymoma. In addition to direct canonical NFκB pathway activation, ZRfus dictates a neoplastic transcriptional program and binds to thousands of unique sites across the genome that are enriched with PLAGL family transcription factor (TF) motifs. ZRfus activates gene expression programs through recruitment of transcriptional coactivators (Brd4, Ep300, Cbp, Pol2) that are amenable to pharmacologic inhibition. Downstream ZRfus target genes converge on developmental programs marked by PLAGL TF proteins, and activate neoplastic programs enriched in Mapk, focal adhesion, and gene imprinting networks. Significance: Ependymomas are aggressive brain tumors. Although drivers of supratentorial ependymoma (ZFTA- and YAP1-associated gene fusions) have been discovered, their functions remain unclear. Our study investigates the biology of ZFTA–RELA-driven ependymoma, specifically mechanisms of transcriptional deregulation and direct downstream gene networks that may be leveraged for potential therapeutic testing. This article is highlighted in the In This Issue feature, p. 2113
Whole-genome duplication (WGD) has been a major evolutionary driver of increased genomic complexity in vertebrates. One such event occurred in the salmonid family ∼80 Ma (Ss4R) giving rise to a plethora of structural and regulatory duplicate-driven divergence, making salmonids an exemplary system to investigate the evolutionary consequences of WGD. Here, we present a draft genome assembly of European grayling (Thymallus thymallus) and use this in a comparative framework to study evolution of gene regulation following WGD. Among the Ss4R duplicates identified in European grayling and Atlantic salmon (Salmo salar), one-third reflect nonneutral tissue expression evolution, with strong purifying selection, maintained over ∼50 Myr. Of these, the majority reflect conserved tissue regulation under strong selective constraints related to brain and neural-related functions, as well as higher-order protein–protein interactions. A small subset of the duplicates have evolved tissue regulatory expression divergence in a common ancestor, which have been subsequently conserved in both lineages, suggestive of adaptive divergence following WGD. These candidates for adaptive tissue expression divergence have elevated rates of protein coding- and promoter-sequence evolution and are enriched for immune- and lipid metabolism ontology terms. Lastly, lineage-specific duplicate divergence points toward underlying differences in adaptive pressures on expression regulation in the nonanadromous grayling versus the anadromous Atlantic salmon. Our findings enhance our understanding of the role of WGD in genome evolution and highlight cases of regulatory divergence of Ss4R duplicates, possibly related to a niche shift in early salmonid evolution.
Salmonids represent an intriguing taxonomical group for investigating genome evolution in vertebrates due to their relatively recent last common whole genome duplication event, which occurred between 80 and 100 million years ago. Here, we report on the chromosome-level genome assembly of European grayling ( Thymallus thymallus ), which represents one of the earliest diverged salmonid subfamilies. To achieve this, we first generated relatively long genomic scaffolds by using a previously published draft genome assembly along with long-read sequencing data and a linkage map. We then merged those scaffolds by applying synteny evidence from the Atlantic salmon ( Salmo salar ) genome. Comparisons of the European grayling genome assembly to the genomes of Atlantic salmon and Northern pike ( Esox lucius ), the latter used as a nonduplicated outgroup, detailed aspects of the characteristic chromosome evolution process that has taken place in European grayling. While Atlantic salmon and other salmonid genomes are portrayed by the typical occurrence of numerous chromosomal fusions, European grayling chromosomes were confirmed to be fusion-free and were characterized by a relatively large proportion of paracentric and pericentric inversions. We further reported on transposable elements specific to either the European grayling or Atlantic salmon genome, on the male-specific sdY gene in the European grayling chromosome 11A, and on regions under residual tetrasomy in the homeologous European grayling chromosome pairs 9A-9B and 25A-25B. The same chromosome pairs have been observed under residual tetrasomy in Atlantic salmon and in other salmonids, suggesting that this feature has been conserved since the subfamily split.
ZFTA (C11orf95)—a gene of unknown function—partners with a variety of transcriptional coactivators in translocations that drive supratentorial ependymoma, a frequently lethal brain tumor. Understanding the function of ZFTA is key to developing therapies that inhibit these fusion proteins. Here, using a combination of transcriptomics, chromatin immunoprecipitation sequencing, and proteomics, we interrogated a series of deletion-mutant genes to identify a tripartite transformation mechanism of ZFTA-containing fusions, including: spontaneous nuclear translocation, extensive chromatin binding, and SWI/SNF, SAGA, and NuA4/Tip60 HAT chromatin modifier complex recruitment. Thereby, ZFTA tethers fusion proteins across the genome, modifying chromatin to an active state and enabling its partner transcriptional coactivators to promote promiscuous expression of a transforming transcriptome. Using mouse models, we validate further those elements of ZFTA-fusion proteins that are critical for transformation—including ZFTA zinc fingers and partner gene transactivation domains—thereby unmasking vulnerabilities for therapeutic targeting. Significance: Ependymomas are hard-to-treat brain tumors driven by translocations between ZFTA and a variety of transcriptional coactivators. We dissect the transforming mechanism of these fusion proteins and identify protein domains indispensable for tumorigenesis, thereby providing insights into the molecular basis of ependymoma tumorigenesis and vulnerabilities for therapeutic targeting. This article is highlighted in the In This Issue feature, p. 2113
Whole genome duplication (WGD) has been a major evolutionary driver of increased genomic complexity in vertebrates. One such event occurred in the salmonid family ∼80 million years ago (Ss4R) giving rise to a plethora of structural and regulatory duplicate-driven divergence, making salmonids an exemplary system to investigate the evolutionary consequences of WGD. Here, we present a draft genome of European grayling (Thymallus thymallus), and use this in a comparative framework to study evolution of gene regulation following WGD. Among the Ss4R duplicates identified in European grayling and Atlantic salmon, one third reflect non-neutral tissue expression evolution, with strong purifying selection, maintained over ∼50 million years. Of these, 84% reflect conserved tissue regulation under strong selective constraints and are involved in brain and neural-related functions, as well as higher-order protein-protein interactions. In contrast, 16% of the duplicates have evolved regulatory divergence in a common ancestor, suggestive of adaptive divergence following WGD. These candidates for adaptive expression divergence have elevated rates of protein coding-and promoter sequence evolution, and are enriched for immune-and metabolism ontology terms. Lastly, species-specific duplicate divergence points towards underlying differences in adaptive pressures on expression regulation in the non-anadromous grayling and anadromous Atlantic salmon. Our findings enhance our understanding of the role of WGD in genome evolution and highlights cases of functional divergence of Ss4R duplicates, possibly related to a niche shift in early salmonid evolution.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.