Only a very small fraction of long noncoding RNAs (lncRNAs) are well characterized. The evolutionary history of lncRNAs can provide insights into their functionality, but the absence of lncRNA annotations in non-model organisms has precluded comparative analyses. Here we present a large-scale evolutionary study of lncRNA repertoires and expression patterns in 11 tetrapod species. We identify approximately 11,000 primate-specific lncRNAs and 2,500 highly conserved lncRNAs, including approximately 400 genes that are likely to have originated more than 300 million years ago. We find that lncRNAs, in particular ancient ones, are in general actively regulated and may function predominantly in embryonic development. Most lncRNAs evolve rapidly in terms of sequence and expression levels, but tissue specificities are often conserved. We compared expression patterns of homologous lncRNA and protein-coding families across tetrapods to reconstruct an evolutionarily conserved co-expression network. This network suggests potential functions for lncRNAs in fundamental processes such as spermatogenesis and synaptic transmission, but also in more specific mechanisms such as placenta development through microRNA production.
New genes contribute substantially to adaptive evolutionary innovation, but the functional evolution of new mammalian genes has been little explored at a broad scale. Previous work established mRNA-derived gene duplicates, known as retrocopies, as models for the study of new gene origination. Here we combine mammalian transcriptomic and epigenomic data to unveil the processes underlying the evolution of stripped-down retrocopies into complex new genes. We show that although some robustly expressed retrocopies are transcribed from preexisting promoters, most evolved new promoters from scratch or recruited proto-promoters in their genomic vicinity. In particular, many retrocopy promoters emerged from ancestral enhancers (or bivalent regulatory elements) or are located in CpG islands not associated with other genes. We detected 88–280 selectively preserved retrocopies per mammalian species, illustrating that these mechanisms facilitated the birth of many functional retrogenes during mammalian evolution. The regulatory evolution of originally monoexonic retrocopies was frequently accompanied by exon gain, which facilitated co-option of distant promoters and allowed expression of alternative isoforms. While young retrogenes are often initially expressed in the testis, increased regulatory and structural complexities allowed retrogenes to functionally diversify and evolve somatic organ functions, sometimes as complex as those of their parents. Thus, some retrogenes evolved the capacity to temporarily substitute for their parents during the process of male meiotic X inactivation, while others rendered parental functions superfluous, allowing for parental gene loss. Overall, our reconstruction of the “life history” of mammalian retrogenes highlights retroposition as a general model for understanding new gene birth and functional evolution.
Gene duplications generate genomic raw material that allows the emergence of novel functions, likely facilitating adaptive evolutionary innovations. However, global assessments of the functional and evolutionary relevance of duplicate genes in mammals were until recently limited by the lack of appropriate comparative data. Here, we report a large-scale study of the expression evolution of DNA-based functional gene duplicates in three major mammalian lineages (placental mammals, marsupials, egg-laying monotremes) and birds, on the basis of RNA sequencing (RNA-seq) data from nine species and eight organs. We observe dynamic changes in tissue expression preference of paralogs with different duplication ages, suggesting differential contribution of paralogs to specific organ functions during vertebrate evolution. Specifically, we show that paralogs that emerged in the common ancestor of bony vertebrates are enriched for genes with brain-specific expression and provide evidence for differential forces underlying the preferential emergence of young testis- and liver-specific expressed genes. Further analyses uncovered that the overall spatial expression profiles of gene families tend to be conserved, with several exceptions of pronounced tissue specificity shifts among lineage-specific gene family expansions. Finally, we trace new lineage-specific genes that may have contributed to the specific biology of mammalian organs, including the little-studied placenta. Overall, our study provides novel and taxonomically broad evidence for the differential contribution of duplicate genes to tissue-specific transcriptomes and for their importance for the phenotypic evolution of vertebrates.
Background: Mammalian microRNAs (miRNAs) are sometimes subject to adenosine-to-inosine RNA editing, which can lead to dramatic changes in miRNA target specificity or expression levels. However, although a few miRNAs are known to be edited at identical positions in human and mouse, the evolution of miRNA editing has not been investigated in detail. In this study, we identify conserved miRNA editing events in a range of mammalian and non-mammalian species. Results: We demonstrate deep conservation of several site-specific miRNA editing events, including two that date back to the common ancestor of mammals and bony fishes some 450 million years ago. We also find evidence of a recent expansion of an edited miRNA family in placental mammals and show that editing of these miRNAs is associated with changes in target mRNA expression during primate development and aging. While global patterns of miRNA editing tend to be conserved across species, we observe substantial variation in editing frequencies depending on tissue, age and disease state: editing is more frequent in neural tissues compared to heart, kidney and testis; in older compared to younger individuals; and in samples from healthy tissues compared to tumors, which together suggests that miRNA editing might be associated with a reduced rate of cell proliferation. Conclusions: Our results show that site-specific miRNA editing is an evolutionarily conserved mechanism, which increases the functional diversity of mammalian miRNA transcriptomes. Furthermore, we find that although miRNA editing is rare compared to editing of long RNAs, miRNAs are greatly overrepresented among conserved editing targets.
Small non-coding RNAs act as critical regulators of gene expression and are essential for male germ cell development and spermatogenesis. Previously, we showed that germ cell-specific inactivation of Dicer1, an endonuclease essential for the biogenesis of micro-RNAs (miRNAs) and endogenous small interfering RNAs (endo-siRNAs), led to complete male infertility due to alterations in meiotic progression, increased spermatocyte apoptosis and defects in the maturation of spermatozoa. To dissect the distinct physiological roles of miRNAs and endo-siRNAs in spermatogenesis, we compared the testicular phenotype of mice with Dicer1 or Dgcr8 depletion in male germ cells. Dgcr8 mutant mice, which have a defective miRNA pathway while retaining an intact endo-siRNA pathway, were also infertile and displayed similar defects, although less severe, to Dicer1 mutant mice. These included cumulative defects in meiotic and haploid phases of spermatogenesis, resulting in oligo-, terato-, and azoospermia. In addition, we found by RNA sequencing of purified spermatocytes that inactivation of Dicer1 and the resulting absence of miRNAs affected the fine tuning of protein-coding gene expression by increasing low level gene expression. Overall, these results emphasize the essential role of miRNAs in the progression of spermatogenesis, but also indicate a role for endo-siRNAs in this process.
Divergence of protein sequences and gene expression patterns are two fundamental mechanisms that generate organismal diversity. Here, we have used genome and transcriptome data from eight mammals and one bird to study the positive correlation of these two processes throughout mammalian evolution. We demonstrate that the correlation is stable over time and most pronounced in neural tissues, which indicates that it is the result of strong negative selection. The correlation is not driven by genes with specific functions and may instead best be viewed as an evolutionary default state, which can nevertheless be evaded by certain gene types. In particular, genes with developmental and neural functions are skewed toward changes in gene expression, consistent with selection against pleiotropic effects associated with changes in protein sequences. Surprisingly, we find that the correlation between expression divergence and protein divergence is not explained by between-gene variation in expression level, tissue specificity, protein connectivity, or other investigated gene characteristics, suggesting that it arises independently of these gene traits. The selective constraints on protein sequences and gene expression patterns also fluctuate in a coordinated manner across phylogenetic branches: we find that gene-specific changes in the rate of protein evolution in a specific mammalian lineage tend to be accompanied by similar changes in the rate of expression evolution. Taken together, our findings highlight many new aspects of the correlation between protein divergence and expression divergence, and attest to its role as a fundamental property of mammalian genome evolution.
Promoters and enhancers—key controllers of gene expression—have long been distinguished from each other based on their function. However, recent work suggested that common architectural and functional features might have facilitated the conversion of one type of element into the other during evolution. Here, based on cross-mammalian analyses of epigenome and transcriptome data, we provide support for this hypothesis by detecting 445 regulatory elements with signatures of activity turnover (termed P/E elements). Most events represent transformations of putative ancestral enhancers into promoters, leading to the emergence of species-specific transcribed loci or 5′ exons. Distinct GC sequence compositions and stabilizing 5′ splicing (U1) regulatory motif patterns may have predisposed P/E elements to regulatory repurposing, and changes in the U1 and polyadenylation signal densities and distributions likely drove the evolutionary activity switches. Our work suggests that regulatory repurposing facilitated regulatory innovation and the origination of new genes and exons during evolution.
Sexual dimorphism depends on sex-biased gene expression, but the contributions of microRNAs (miRNAs) have not been globally assessed. We therefore produced an extensive small RNA sequencing data set to analyze male and female miRNA expression profiles in mouse, opossum, and chicken. Our analyses uncovered numerous cases of somatic sex-biased miRNA expression, with the largest proportion found in the mouse heart and liver. Sex-biased expression is explained by miRNA-specific regulation, including sex-biased chromatin accessibility at promoters, rather than piggybacking of intronic miRNAs on sex-biased protein-coding genes. In mouse, but not opossum and chicken, sex bias is coordinated across tissues such that autosomal testis-biased miRNAs tend to be somatically male-biased, whereas autosomal ovary-biased miRNAs are female-biased, possibly due to broad hormonal control. In chicken, which has a Z/W sex chromosome system, expression output of genes on the Z Chromosome is expected to be male-biased, since there is no global dosage compensation mechanism that restores expression in ZW females after almost all genes on the W Chromosome decayed. Nevertheless, we found that the dominant liver miRNA, miR-122-5p, is Z-linked but expressed in an unbiased manner, due to the unusual retention of a W-linked copy. Another Z-linked miRNA, the male-biased miR-2954-3p, shows conserved preference for dosage-sensitive genes on the Z Chromosome, based on computational and experimental data from chicken and zebra finch, and acts to equalize male-to-female expression ratios of its targets. Unexpectedly, our findings thus establish miRNA regulation as a novel gene-specific dosage compensation mechanism.