The duplication of entire genomes has long been recognized as having great potential for evolutionary novelties, but the mechanisms underlying their resolution through gene loss are poorly understood. Here we show that in the unicellular eukaryote Paramecium tetraurelia, a ciliate, most of the nearly 40,000 genes arose through at least three successive whole-genome duplications. Phylogenetic analysis indicates that the most recent duplication coincides with an explosion of speciation events that gave rise to the P. aurelia complex of 15 sibling species. We observed that gene loss occurs over a long timescale, not as an initial massive event. Genes from the same metabolic pathway or protein complex have common patterns of gene loss, and highly expressed genes are over-retained after all duplications. The conclusion of this analysis is that many genes are maintained after whole-genome duplication not because of functional innovation but because of gene dosage constraints.Ciliates are unique among unicellular organisms in that they separate germline and somatic functions 1 . Each cell harbours two kinds of nucleus, namely silent diploid micronuclei and highly polyploid macronuclei. The latter are unusual in that they contain an extensively rearranged genome streamlined for expression and divide by a non-mitotic process. Only micronuclei undergo meiosis to perpetuate genetic information; the macronuclei are lost at each sexual generation and develop anew from the micronuclear lineage.In Paramecium the exact number of micronuclear chromosomes (more than 50) and the structures of their centromeres and telomeres remain unknown. During macronuclear development, these chromosomes are amplified to about 800 copies and undergo two types of DNA elimination event. Tens of thousand of short, unique copy elements (internal eliminated sequences) are removed by a precise mechanism that leads to the reconstitution of functional genes 2 .Transposable elements and other repeated sequences are removed by an imprecise mechanism leading either to chromosome fragmentation and de novo telomere addition or to variable internal deletions 3 . These rearrangements occur after a few rounds of endoreplication, leading to some heterogeneity in the sequences abutting the imprecisely eliminated regions 3 . The sizes of the resulting, acentric macronuclear chromosomes range from 50-1,000 kilobases (kb) as measured by pulsed-field gel electrophoresis. Because the sexual process of autogamy results in an entirely homozygous genotype 4 , the macronuclear DNA that was sequenced was genetically homogeneous.The Paramecium genome sequence The Paramecium macronuclear genome sequence was established with the use of a whole-genome shotgun and assembly strategy. Paired-end sequencing of plasmid and bacterial artificial chromosome (BAC) clones provided a coverage of 13 genome equivalents (Supplementary Table S1). We assembled the sequence reads with Arachne 5 in 1,907 contigs connected in 697 scaffolds of size greater than 2 kb, giving a total coverage of 72...
Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of ∼45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a ∼10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the genome in which parasitic DNA is not usually tolerated.
Distinct small RNA pathways are involved in the two types of homology-dependent effects described in Paramecium tetraurelia, as shown by a functional analysis of Dicer and Dicer-like genes and by the sequencing of small RNAs. The siRNAs that mediate post-transcriptional gene silencing when cells are fed with double-stranded RNA (dsRNA) were found to comprise two subclasses. DCR1-dependent cleavage of the inducing dsRNA generates ∼23-nt primary siRNAs from both strands, while a different subclass of ∼24-nt RNAs, characterized by a short untemplated poly-A tail, is strictly antisense to the targeted mRNA, suggestive of secondary siRNAs that depend on an RNA-dependent RNA polymerase. An entirely distinct pathway is responsible for homology-dependent regulation of developmental genome rearrangements after sexual reproduction. During early meiosis, the DCL2 and DCL3 genes are required for the production of a highly complex population of ∼25-nt scnRNAs from all types of germline sequences, including both strands of exons, introns, intergenic regions, transposons and Internal Eliminated Sequences. A prominent 5′-UNG signature, and a minor fraction showing the complementary signature at positions 21–23, indicate that scnRNAs are cleaved from dsRNA precursors as duplexes with 2-nt 3′ overhangs at both ends, followed by preferential stabilization of the 5′-UNG strand.
In the ciliate Paramecium tetraurelia, differentiation of the somatic nucleus from the zygotic nucleus is characterized by massive and reproducible deletion of transposable elements and of 45,000 short, dispersed, single-copy sequences. A specific class of small RNAs produced by the germline during meiosis, the scnRNAs, are involved in the epigenetic regulation of DNA deletion but the underlying mechanisms are poorly understood. Here, we show that trimethylation of histone H3 (H3K27me3 and H3K9me3) displays a dynamic nuclear localization that is altered when the endonuclease required for DNA elimination is depleted. We identified the putative histone methyltransferase Ezl1 necessary for H3K27me3 and H3K9me3 establishment and show that it is required for correct genome rearrangements. Genome-wide analyses show that scnRNA-mediated H3 trimethylation is necessary for the elimination of long, repeated germline DNA, while single copy sequences display differential sensitivity to depletion of proteins involved in the scnRNA pathway, Ezl1- a putative histone methyltransferase and Dcl5- a protein required for iesRNA biogenesis. Our study reveals cis-acting determinants, such as DNA length, also contribute to the definition of germline sequences to delete. We further show that precise excision of single copy DNA elements, as short as 26 bp, requires Ezl1, suggesting that development specific H3K27me3 and H3K9me3 ensure specific demarcation of very short germline sequences from the adjacent somatic sequences.
In animals and plants, the H3K9me3 and H3K27me3 chromatin silencing marks are deposited by different protein machineries. H3K9me3 is catalyzed by the SET-domain SU(VAR)3–9 enzymes, while H3K27me3 is catalyzed by the SET-domain Enhancer-of-zeste enzymes, which are the catalytic subunits of Polycomb Repressive Complex 2 (PRC2). Here, we show that the Enhancer-of-zeste-like protein Ezl1 from the unicellular eukaryote Paramecium tetraurelia , which exhibits significant sequence and structural similarities with human EZH2, catalyzes methylation of histone H3 in vitro and in vivo with an apparent specificity toward K9 and K27. We find that H3K9me3 and H3K27me3 co-occur at multiple families of transposable elements in an Ezl1-dependent manner. We demonstrate that loss of these histone marks results in global transcriptional hyperactivation of transposable elements with modest effects on protein-coding gene expression. Our study suggests that although often considered functionally distinct, H3K9me3 and H3K27me3 may share a common evolutionary history as well as a common ancestral role in silencing transposable elements.
BackgroundDNA elimination is developmentally programmed in a wide variety of eukaryotes, including unicellular ciliates, and leads to the generation of distinct germline and somatic genomes. The ciliate Paramecium tetraurelia harbors two types of nuclei with different functions and genome structures. The transcriptionally inactive micronucleus contains the complete germline genome, while the somatic macronucleus contains a reduced genome streamlined for gene expression. During development of the somatic macronucleus, the germline genome undergoes massive and reproducible DNA elimination events. Availability of both the somatic and germline genomes is essential to examine the genome changes that occur during programmed DNA elimination and ultimately decipher the mechanisms underlying the specific removal of germline-limited sequences.ResultsWe developed a novel experimental approach that uses flow cell imaging and flow cytometry to sort subpopulations of nuclei to high purity. We sorted vegetative micronuclei and macronuclei during development of P. tetraurelia. We validated the method by flow cell imaging and by high throughput DNA sequencing. Our work establishes the proof of principle that developing somatic macronuclei can be sorted from a complex biological sample to high purity based on their size, shape and DNA content. This method enabled us to sequence, for the first time, the germline DNA from pure micronuclei and to identify novel transposable elements. Sequencing the germline DNA confirms that the Pgm domesticated transposase is required for the excision of all ~45,000 Internal Eliminated Sequences. Comparison of the germline DNA and unrearranged DNA obtained from PGM-silenced cells reveals that the latter does not provide a faithful representation of the germline genome.ConclusionsWe developed a flow cytometry-based method to purify P. tetraurelia nuclei to high purity and provided quality control with flow cell imaging and high throughput DNA sequencing. We identified 61 germline transposable elements including the first Paramecium retrotransposons. This approach paves the way to sequence the germline genomes of P. aurelia sibling species for future comparative genomic studies.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-017-3713-7) contains supplementary material, which is available to authorized users.
The germ line genome of ciliates is extensively rearranged during development of the somatic macronucleus. Numerous sequences are eliminated, while others are amplified to a high ploidy level. In the Paramecium aurelia group of species, transformation of the maternal macronucleus with transgenes at high copy numbers can induce the deletion of homologous genes in sexual progeny, when a new macronucleus develops from the wild-type germ line. We show that this trans-nuclear effect correlates with homology-dependent silencing of maternal genes before autogamy and with the accumulation of ϳ22-to 23-nucleotide (nt) RNA molecules. The same effects are induced by feeding cells before meiosis with bacteria containing double-stranded RNA, suggesting that small interfering RNA-like molecules can target deletions. Furthermore, experimentally induced macronuclear deletions are spontaneously reproduced in subsequent sexual generations, and reintroduction of the missing gene into the variant macronucleus restores developmental amplification in sexual progeny. We discuss the possible roles of the ϳ22-to 23-nt RNAs in the targeting of deletions and the implications for the RNA-mediated genome-scanning process that is thought to determine developmentally regulated rearrangements in ciliates.
Thousands of single-copy internal eliminated sequences (IESs) are excised from the germ line genome of ciliates during development of the polygenomic somatic macronucleus, following sexual events. Paramecium IESs are short, noncoding elements that frequently interrupt coding sequences. No absolutely conserved sequence element, other than flanking 5-TA-3 direct repeats, has been identified among sequenced IESs; the mechanisms of their specific recognition and precise elimination are unknown. Previous work has revealed the existence of an epigenetic control of excision. It was shown that the presence of one IES in the vegetative macronucleus results in a specific inhibition of the excision of the same element during the development of a new macronucleus, in the following sexual generation. We have assessed the generality and sequence specificity of this transnuclear maternal control by studying the effects of macronuclear transformation with 13 different IESs. We show that at least five of them can be maintained in the new macronuclear genome; sequence specificity is complete both between genes and between different IESs in the same gene. In all cases, the degree of excision inhibition correlates with the copy number of the maternal IES, but each IES shows a characteristic inhibition efficiency. Short internal IES-like segments were found to be excised from two of the IESs when excision between normal boundaries was inhibited. Available data suggest that the sequence specificity of these maternal effects is mediated by pairing interactions between homologous nucleic acids.Ciliates are unicellular eukaryotes in which germ line and somatic functions are assumed by two kinds of nuclei coexisting in the same cytoplasm. The diploid micronuclei are transcriptionally silent during vegetative growth; their main function is to provide gametic nuclei upon meiosis. Vegetative transcription takes place in the polygenomic, somatic macronuclei that divide amitotically and are lost soon after meiosis. Following fertilization, both kinds of nuclei differentiate from copies of the diploid zygotic nucleus. Macronuclear development involves extensive rearrangements of the germ line genome: chromosomes are fragmented into shorter, acentric molecules, and tens of thousands of internally eliminated sequences (IESs) are removed from coding and noncoding sequences, in a reproducible and often highly precise manner. In addition, the genome is amplified to the final ploidy level, ϳ1,000n in Paramecium aurelia complex species (for general reviews, see references 5 and 34; for a review of IESs, see reference 19).All known Paramecium IESs are short (26 to 882 bp), ATrich, single-copy elements flanked by 5Ј-TA-3Ј direct repeats in the germ line sequence. Excision invariably leaves one of the TA repeats in the rearranged macronuclear sequence, suggesting that the same basic mechanism is used for all IESs. Although the excision mechanism is currently unknown, reaction products and intermediates have been identified in other ciliate species. In E...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.