In Arabidopsis, tandemly arrayed genes (TAGs) comprise >10% of the genes in the genome. These duplicated genes represent a rich template for genetic innovation, but little is known of the evolutionary forces governing their generation and maintenance. Here we compare the organization and evolution of TAGs between Arabidopsis and rice, two plant genomes that diverged ~150 million years ago. TAGs from the two genomes are similar in a number of respects, including the proportion of genes that are tandemly arrayed, the number of genes within an array, the number of tandem arrays, and the dearth of TAGs relative to single copy genes in centromeric regions. Analysis of recombination rates along rice chromosomes confirms a positive correlation between the occurrence of TAGs and recombination rate, as found in Arabidopsis. TAGs are also biased functionally relative to duplicated, nontandemly arrayed genes. In both genomes, TAGs are enriched for genes that encode membrane proteins and function in “abiotic and biotic stress” but underrepresented for genes involved in transcription and DNA or RNA binding functions. We speculate that these observations reflect an evolutionary trend in which successful tandem duplication involves genes either at the end of biochemical pathways or in flexible steps in a pathway, for which fluctuation in copy number is unlikely to affect downstream genes. Despite differences in the age distribution of tandem arrays, the striking similarities between rice and Arabidopsis indicate similar mechanisms of TAG generation and maintenance.
In eukaryotes, C5-cytosine methylation is a common mechanism associated with a variety of functions such as gene regulation or control of genomic stability. Different subfamilies of eukaryotic methyltransferases (MTases) have been identified, mainly in metazoa, plants, and fungi. In this paper, we used hidden Markov models to detect MTases in completed or almost completed eukaryotic genomes, including different species of Protozoa. A phylogenetic analysis of MTases enabled us to define six subfamilies of MTases, including two new subfamilies. The dnmt1 subfamily that includes all the known MTases with a maintenance activity seems to be absent in the Protozoa. The dnmt2 subfamily seems to be the most widespread, being present even in the nonmethylated Dictyostelium discoideum. We also found two dnmt2 members in the bacterial genus Geobacter, suggesting that horizontal transfers of MTases occurred between eukaryotes and prokaryotes. Even if the direction of transfer cannot be determined, this relationship might be useful for understanding the function of this enigmatic subfamily of MTases. Globally, our analysis reveals a great diversity of MTases in eukaryotes, suggesting the existence of different methylation systems. Our results also suggest acquisitions and losses of different MTases in every eukaryotic lineage studied and that some eukaryotes appear to be devoid of methylation.
http://pbil.univ-lyon1.fr/software/cpgprod.html
In an attempt to understand the origin of CpG islands (CGIs) in mammalian genomes, we have studied their location and structure according to the expression pattern of genes and to the G + C content of isochores in which they are embedded. We show that CGIs located over the transcription start site (named start CGIs) are very different structurally from the others (named no-start CGIs): (1) 61.6% of the no-start CGIs are due to repeated sequences (79 % are due to Alus), whereas only 5.6% of the start CGIs are due to such repeats; (2) start CGIs are longer and display a higher CpGo/e ratio and G + C level than no-start CGIs. The frequency of tissue-specific genes associated to a start CGI varies according to the genomic G + C content, from 25% in G + C-poor isochores to 64% in G + C-rich isochores. Conversely, the frequency of housekeeping genes associated to a start CGI (90%) is independent of the isochore context. Interestingly, the structure of start CGIs is very similar for tissue-specific and housekeeping genes. Moreover, 93% of genes expressed in early embryo are found to exhibit a CpG island over their transcription start point. These observations are consistent with the hypothesis that the occurrence of these CGIs is the consequence of gene expression at this stage, when the methylation pattern is installed.
DNA methylation is involved in the regulation of gene expression and plays an important role in normal developmental processes and diseases, such as cancer. DNA methyltransferases are the enzymes responsible for DNA methylation on the position 5 of cytidine in a CpG context. In order to identify and characterize novel inhibitors of these enzymes, we developed a fluorescence-based throughput screening by using a short DNA duplex immobilized on 96-well plates. We have screened 114 flavones and flavanones for the inhibition of the murine catalytic Dnmt3a/3L complex and found 36 hits with IC(50) values in the lower micromolar and high nanomolar ranges. The assay, together with inhibition tests on two other methyltransferases, structure-activity relationships and docking studies, gave insights on the mechanism of inhibition. Finally, two derivatives effected zebrafish embryo development, and induced a global demethylation of the genome, at doses lower than the control drug, 5-azacytidine.
In mammals DNA methylation occurs at position 5 of cytosine in a CpG context and regulates gene expression. It plays an important role in diseases and inhibitors of DNA methyltransferases (DNMTs)—the enzymes responsible for DNA methylation—are used in clinics for cancer therapy. The most potent inhibitors are 5-azacytidine and 5-azadeoxycytidine. Zebularine (1-(β-D-ribofuranosyl)-2(1H)- pyrimidinone) is another cytidine analog described as a potent inhibitor that acts by forming a covalent complex with DNMT when incorporated into DNA. Here we bring additional experiments to explain its mechanism of action. First, we observe an increase in the DNA binding when zebularine is incorporated into the DNA, compared to deoxycytidine and 5-fluorodeoxycytidine, together with a strong decrease in the dissociation rate. Second, we show by denaturing gel analysis that the intermediate covalent complex between the enzyme and the DNA is reversible, differing thus from 5-fluorodeoxycytidine. Third, no methylation reaction occurs when zebularine is present in the DNA. We confirm that zebularine exerts its demethylation activity by stabilizing the binding of DNMTs to DNA, hindering the methylation and decreasing the dissociation, thereby trapping the enzyme and preventing turnover even at other sites.
The neutral mutation rate is equal to the base substitution rate when the latter is not affected by natural selection. Differences between these rates may reveal that factors such as natural selection, linkage, or a mutator locus are affecting a given sequence. We examined the neutral base substitution rate by measuring the sequence divergence of ∼30,000 pairs of inactive orthologous L1 retrotransposon sequences interspersed throughout the human and chimpanzee genomes. In contrast to other studies, we related ortholog divergence to the time (age) that the L1 sequences resided in the genome prior to the chimpanzee and human speciation. As expected, the younger orthologs contained more hypermutable CpGs than the older ones because of their conversion to TpGs (and CpAs). Consequently, the younger orthologs accumulated more CpG mutations than the older ones during the ∼5 million years since the human and chimpanzee lineages separated. But during this same time, the younger orthologs also accumulated more non-CpG mutations than the older ones. In fact, non-CpG and CpG mutations showed an almost perfect (R 2 = 0.98) correlation for ∼97% of the ortholog pairs. The correlation is independent of G + C content, recombination rate, and chromosomal location. Therefore, it likely reflects an intrinsic effect of CpGs, or mutations thereof, on non-CpG DNA rather than the joint manifestation of the chromosomal environment. The CpG effect is not uniform for all regions of non-CpG DNA. Therefore, the mutation rate of non-CpG DNA is contingent to varying extents on local CpG content. Aside from their implications for mutational mechanisms, these results indicate that a precise determination of a uniform genome-wide neutral mutation rate may not be attainable.
BackgroundAlpha satellite is the major repeated DNA element of primate centromeres. Evolution of these tandemly repeated sequences has led to the existence of numerous families of monomers exhibiting specific organizational patterns. The limited amount of information available in non-human primates is a restriction to the understanding of the evolutionary dynamics of alpha satellite DNA.ResultsWe carried out the targeted high-throughput sequencing of alpha satellite monomers and dimers from the Cercopithecus solatus genome, an Old World monkey from the Cercopithecini tribe. Computational approaches were used to infer the existence of sequence families and to study how these families are organized with respect to each other. While previous studies had suggested that alpha satellites in Old World monkeys were poorly diversified, our analysis provides evidence for the existence of at least four distinct families of sequences within the studied species and of higher order organizational patterns. Fluorescence in situ hybridization using oligonucleotide probes that are able to target each family in a specific way showed that the different families had distinct distributions on chromosomes and were not homogeneously distributed between chromosomes.ConclusionsOur new approach provides an unprecedented and comprehensive view of the diversity and organization of alpha satellites in a species outside the hominoid group. We consider these data with respect to previously known alpha satellite families and to potential mechanisms for satellite DNA evolution. Applying this approach to other species will open new perspectives regarding the integration of satellite DNA into comparative genomic and cytogenetic studies.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-016-3246-5) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.