BackgroundLong terminal repeat (LTR) retrotransposons are a widespread kind of transposable element present in eukaryotic genomes. They are a major factor in genome evolution due to their ability to create large scale mutations and genome rearrangements. Compared to other transposable elements, little attention has been paid to elements belonging to the metazoan BEL/Pao subclass of LTR retrotransposons. No comprehensive characterization of these elements is available so far. The aim of this study was to describe all BEL/Pao elements in a set of 62 sequenced metazoan genomes, and to analyze their phylogenetic relationship.ResultsWe identified a total of 7,861 BEL/Pao elements in 53 of our 62 study genomes. We identified BEL/Pao elements in 20 genomes where such elements had not been found so far. Our analysis shows that BEL/Pao elements are the second-most abundant class of LTR retrotransposons in the genomes we study, more abundant than Ty1/Copia elements, and second only to Ty3/Gypsy elements. They occur in multiple phyla, including basal metazoan phyla, suggesting that BEL/Pao elements arose early in animal evolution. We confirm findings from previous studies that BEL/Pao elements do not occur in mammals. The elements we found can be grouped into more than 1725 families, 1623 of which are new, previously unknown families. These families fall into seven superfamilies, only five of which have been characterized so far. One new superfamily is a major subdivision of the Pao superfamily which we propose to call Dan, because it is restricted to the genome of the zebrafish Danio rerio. The other new superfamily comprises 83 elements and is restricted to lower aquatic eumetazoans. We propose to call this superfamily Flow. BEL/Pao elements do not show any signs of recent horizontal gene transfer between distantly related species.ConclusionsIn sum, our analysis identifies thousands of new BEL/Pao elements and provides new insights into their distribution, abundance, and evolution.
BackgroundInsertions and deletions of DNA segments (indels) are together with substitutions the major mutational processes that generate genetic variation. Here we focus on recent DNA insertions and deletions in protein coding regions of the human genome to investigate selective constraints on indels in protein evolution.ResultsFrequencies of inserted and deleted amino acids differ from background amino acid frequencies in the human proteome. Small amino acids are overrepresented, while hydrophobic, aliphatic and aromatic amino acids are strongly suppressed. Indels are found to be preferentially located in protein regions that do not form important structural domains. Amino acid insertion and deletion rates in genes associated with elementary biochemical reactions (e. g. catalytic activity, ligase activity, electron transport, or catabolic process) are lower compared to those in other genes and are therefore subject to stronger purifying selection.ConclusionOur analysis indicates that indels in human protein coding regions are subject to distinct levels of selective pressure with regard to their structural impact on the amino acid sequence, as well as to general properties of the genes they are located in. These findings confirm that many commonly accepted characteristics of selective constraints for substitutions are also valid for amino acid insertions and deletions.
BackgroundTransposable elements (TEs) are major contributors to genome evolution. One factor that influences their evolutionary dynamics is whether their host reproduces through selfing or through outcrossing. According to the recombinational spreading hypothesis, for instance, TEs can spread more easily in outcrossing species through recombination, and should thus be less abundant in selfing species. We here studied the distribution and evolutionary dynamics of TE families in the predominantly selfing plant Arabidopsis thaliana and its close outcrossing relative Arabidopsis lyrata on a genome-wide scale. We characterized differences in TE abundance between them and asked which, if any, existing hypotheses about TE abundances may explain these differences.ResultsWe identified 1,819 TE families representing all known classes of TEs in both species, and found three times more copies in the outcrossing A. lyrata than in the predominantly selfing A. thaliana, as well as ten times more TE families unique to A. lyrata. On average, elements in A. lyrata are younger than elements in A. thaliana. In particular, A. thaliana shows a marked decrease in element number that occurred during the most recent 10% of the time interval since A. thaliana split from A. lyrata. This most recent period in the evolution of A. thaliana started approximately 500,000 years ago, assuming a splitting time of 5 million years ago, and coincides with the time at which predominant selfing originated.ConclusionsOur results indicate that the mating system may be important for determining TE copy number, and that selfing species are likely to have fewer TEs.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.