As one of the few cellular traits that can be quantified across the tree of life, DNA-replication fidelity provides an excellent platform for understanding fundamental evolutionary processes. Furthermore, because mutation is the ultimate source of all genetic variation, clarifying why mutation rates vary is crucial for understanding all areas of biology. A potentially revealing hypothesis for mutation-rate evolution is that natural selection primarily operates to improve replication fidelity, with the ultimate limits to what can be achieved set by the power of random genetic drift. This drift-barrier hypothesis is consistent with comparative measures of mutation rates, provides a simple explanation for the existence of error-prone polymerases and yields a formal counter-argument to the view that selection fine-tunes gene-specific mutation rates.
Distinct small RNA pathways are involved in the two types of homology-dependent effects described in Paramecium tetraurelia, as shown by a functional analysis of Dicer and Dicer-like genes and by the sequencing of small RNAs. The siRNAs that mediate post-transcriptional gene silencing when cells are fed with double-stranded RNA (dsRNA) were found to comprise two subclasses. DCR1-dependent cleavage of the inducing dsRNA generates ∼23-nt primary siRNAs from both strands, while a different subclass of ∼24-nt RNAs, characterized by a short untemplated poly-A tail, is strictly antisense to the targeted mRNA, suggestive of secondary siRNAs that depend on an RNA-dependent RNA polymerase. An entirely distinct pathway is responsible for homology-dependent regulation of developmental genome rearrangements after sexual reproduction. During early meiosis, the DCL2 and DCL3 genes are required for the production of a highly complex population of ∼25-nt scnRNAs from all types of germline sequences, including both strands of exons, introns, intergenic regions, transposons and Internal Eliminated Sequences. A prominent 5′-UNG signature, and a minor fraction showing the complementary signature at positions 21–23, indicate that scnRNAs are cleaved from dsRNA precursors as duplexes with 2-nt 3′ overhangs at both ends, followed by preferential stabilization of the 5′-UNG strand.
Most eukaryotic genes are interrupted by non-coding introns that must be accurately removed from pre-messenger RNAs to produce translatable mRNAs. Splicing is guided locally by short conserved sequences, but genes typically contain many potential splice sites, and the mechanisms specifying the correct sites remain poorly understood. In most organisms, short introns recognized by the intron definition mechanism cannot be efficiently predicted solely on the basis of sequence motifs. In multicellular eukaryotes, long introns are recognized through exon definition and most genes produce multiple mRNA variants through alternative splicing. The nonsense-mediated mRNA decay (NMD) pathway may further shape the observed sets of variants by selectively degrading those containing premature termination codons, which are frequently produced in mammals. Here we show that the tiny introns of the ciliate Paramecium tetraurelia are under strong selective pressure to cause premature termination of mRNA translation in the event of intron retention, and that the same bias is observed among the short introns of plants, fungi and animals. By knocking down the two P. tetraurelia genes encoding UPF1, a protein that is crucial in NMD, we show that the intrinsic efficiency of splicing varies widely among introns and that NMD activity can significantly reduce the fraction of unspliced mRNAs. The results suggest that, independently of alternative splicing, species with large intron numbers universally rely on NMD to compensate for suboptimal splicing efficiency and accuracy.
The understanding of selective constraints affecting genes is a major issue in biology. It is well established that gene expression level is a major determinant of the rate of protein evolution, but the reasons for this relationship remain highly debated. Here we demonstrate that gene expression is also a major determinant of the evolution of gene dosage: the rate of gene losses after whole genome duplications in the Paramecium lineage is negatively correlated to the level of gene expression, and this relationship is not a byproduct of other factors known to affect the fate of gene duplicates. This indicates that changes in gene dosage are generally more deleterious for highly expressed genes. This rule also holds for other taxa: in yeast, we find a clear relationship between gene expression level and the fitness impact of reduction in gene dosage. To explain these observations, we propose a model based on the fact that the optimal expression level of a gene corresponds to a trade-off between the benefit and cost of its expression. This COSTEX model predicts that selective pressure against mutations changing gene expression level or affecting the encoded protein should on average be stronger in highly expressed genes and hence that both the frequency of gene loss and the rate of protein evolution should correlate negatively with gene expression. Thus, the COSTEX model provides a simple and common explanation for the general relationship observed between the level of gene expression and the different facets of gene evolution.
Despite the general assumption that site-specific mutation rates are independent of the local sequence context, a growing body of evidence suggests otherwise. To further examine context-dependent patterns of mutation, we amassed 5,645 spontaneous mutations in wild- type (WT) and mismatch-repair deficient (MMR(-)) mutation-accumulation (MA) lines of the gram-positive model organism Bacillus subtilis. We then analyzed>7,500 spontaneous base-substitution mutations across B. subtilis, Escherichia coli, and Mesoplasma florum WT and MMR(-) MA lines, finding a context-dependent mutation pattern that is asymmetric around the origin of replication. Different neighboring nucleotides can alter site-specific mutation rates by as much as 75-fold, with sites neighboring G:C base pairs or dimers involving alternating pyrimidine-purine and purine-pyrimidine nucleotides having significantly elevated mutation rates. The influence of context-dependent mutation on genome architecture is strongest in M. florum, consistent with the reduced efficiency of selection in organisms with low effective population size. If not properly accounted for, the disparities arising from patterns of context-dependent mutation can significantly influence interpretations of positive and purifying selection.
Proteins of the Argonaute family are small RNA carriers that guide regulatory complexes to their targets. The family comprises two major subclades. Members of the Ago subclade, which are present in most eukaryotic phyla, bind different classes of small RNAs and regulate gene expression at both transcriptional and post-transcriptional levels. Piwi subclade members appear to have been lost in plants and fungi and were mostly studied in metazoa, where they bind piRNAs and have essential roles in sexual reproduction. Their presence in ciliates, unicellular organisms harbouring both germline micronuclei and somatic macronuclei, offers an interesting perspective on the evolution of their functions. Here, we report phylogenetic and functional analyses of the 15 Piwi genes from Paramecium tetraurelia. We show that four constitutively expressed proteins are involved in siRNA pathways that mediate gene silencing throughout the life cycle. Two other proteins, specifically expressed during meiosis, are required for accumulation of scnRNAs during sexual reproduction and for programmed genome rearrangements during development of the somatic macronucleus. Our results indicate that Paramecium Piwi proteins have evolved to perform both vegetative and sexual functions through mechanisms ranging from post-transcriptional mRNA cleavage to epigenetic regulation of genome rearrangements.
Whole-genome duplications (WGDs) have contributed to gene-repertoire enrichment in many eukaryotic lineages. However, most duplicated genes are eventually lost and it is still unclear why some duplicated genes are evolutionary successful whereas others quickly turn to pseudogenes. Here, we show that dosage constraints are major factors opposing post-WGD gene loss in several Paramecium species that share a common ancestral WGD. We propose a model where a majority of WGD-derived duplicates preserve their ancestral function and are retained to produce enough of the proteins performing this same ancestral function. Under this model, the expression level of individual duplicated genes can evolve neutrally as long as they maintain a roughly constant summed expression, and this allows random genetic drift toward uneven contributions of the two copies to total expression. Our analysis suggests that once a high level of imbalance is reached, which can require substantial lengths of time, the copy with the lowest expression level contributes a small enough fraction of the total expression that selection no longer opposes its loss. Extension of our analysis to yeast species sharing a common ancestral WGD yields similar results, suggesting that duplicated-gene retention for dosage constraints followed by divergence in expression level and eventual deterministic gene loss might be a universal feature of post-WGD evolution.
This paper provides the first comprehensive analysis of the fidelity of transcription in eukaryotic cells.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.