DIRS-1 is a retroelement from the slime mold Dictyostelium discoideum. Until recently only two related retrotransposons had been described: PAT from the nematode Panagrellus redivivus and Prt1 from the zygomycete fungus Phycomyces blakesleeanus. Analyses of the reverse transcriptase sequences encoded by these three elements suggested that they were closely related to each other and more distantly related to the Ty3/gypsy Long Terminal Repeat (LTR) retroelements. They have several unusual structural features that distinguish them from typical LTR elements. For instance, they each encode a tyrosine recombinase (YR), but not a DDE-type integrase or an aspartic protease. Although the DIRS-1-related elements are bordered by terminal repeats these differ from typical LTRs in a number of ways. In DIRS-1, for example, the terminal repeats are inverted (complementary), non-identical in sequence, and the outer edges of the terminal sequences are repeated (adjacent to each other) in the internal region. PAT has so-called “split” direct repeats in which the unrelated terminal sequences appear as direct repeats adjacent to each other in the internal region. The only repetition displayed by Prt1 is the presence of short inverted terminal repeats, but the sequenced copy of this element is believed to be a truncated version of an element with a structure resembling DIRS-1. The unusual structure of the terminal repeats of the DIRS1-like elements appears to be related to their replication via free circular intermediates. Site-specific recombination is believed to integrate the circle without creating duplications of the target sites. In recognition of these important distinctions it is proposed that the retrotransposons that encode tyrosine recombinases be called the tyrosine recombinase (or YR) retrotransposons. Recently a large number of additional YR retrotransposons have been described, including elements from fungi (zygomycetes and basidiomycetes), plants (green algae) and a wide range of animals including nematodes, insects, sea urchins, fish and amphibia, while remnants of elements related to DIRS-1 occur in the human genome. The complete set of YR retrotransposons can be divided into two major groups, the DIRS elements and the Ngaro elements, the two groups forming distinct clades on phylogenetic trees based on alignments of RT/RH and recombinase sequences, and also having some structural distinctions. A third group of transposable elements, which we call Cryptons, also carry tyrosine recombinases. These elements do not encode a reverse transcriptase and so are believed to be DNA transposons not retrotransposons. They have been detected in several pathogenic fungi, including the basidiomycete Cryptococcus neoformans, and the ascomycetes Coccidioides posadasii and Histoplasma capsulatum. Sequence comparisons suggest that the Crypton YRs are related to those of the YR retrotransposons. We suggest that the YR retrotransposons arose from the combination of a Crypton-like YR DNA transposon and the RT/RH encoding sequence of a ...
Only three retrotransposons of the DIRS1 group have previously been described: DIRS1 from the slime mold Dictyostelium discoideum, PAT from the nematode Panagrellus redivivus, and Prt1 from the zygomycetous fungus Phycomyces blakesleeanus. Analyses of the reverse transcriptase sequences encoded by these elements suggest that they are related to the long terminal repeat (LTR) retroelements, such as the Ty3/gypsy retrotransposons and the vertebrate retroviruses. The DIRS1-group elements, however, have several unusual structural features which distinguish them from typical LTR elements: (1) they lack the capacity to encode DDE-type integrases or aspartic proteases; (2) they have open reading frames (ORFs) of unknown function; (3) they integrate without creating duplications of their target sites; and (4) although they are bordered by terminal repeats, these sequences differ from typical LTRs in that they are either inverted repeats or "split" direct repeats. Because of the small number of DIRS1-like elements described, and the unusual structures of these elements, little is known about their evolution, distribution, and replication mechanisms. Here, we report the identification of several new DIRS1-like retrotransposons, including elements from nematodes, sea urchins, fish, and amphibia. We also present evidence for the existence of DIRS1-like sequences in the human genome. In addition, we show that the lack of DDE-type integrase genes from elements of the DIRS1 group is explained by the finding that the previously uncharacterized ORFs of these elements encode proteins related to the site-specific recombinase of bacteriophage lambda. The presence of lambda-recombinase-like genes in DIRS1 elements also accounts for the lack of target-site duplications for these elements and may be related to the unusual structures of their terminal repeats.
We have begun a characterization of the long terminal repeat (LTR) retrotransposons in the asexual yeast Candida albicans. A database of assembled C. albicans genomic sequence at Stanford University, which represents 14.9 Mb of the 16-Mb haploid genome, was screened and >350 distinct retrotransposon insertions were identified. The majority of these insertions represent previously unrecognized retrotransposons. The various elements were classified into 34 distinct families, each family being similar, in terms of the range of sequences that it represents, to a typical Ty element family of the related yeast Saccharomyces cerevisiae. These C. albicans retrotransposon families are generally of low copy number and vary widely in coding capacity. For only three families, was a full-length and apparently intact retrotransposon identified. For many families, only solo LTRs and LTR fragments remain. Several families of highly degenerate elements appear to be still capable of transposition, presumably via trans-activation. The overall structure of the retrotransposon population in C. albicans differs considerably from that of S. cerevisiae. In that species, retrotransposon insertions can be assigned to just five families. Most of these families still retain functional examples, and they generally appear at higher copy numbers than the C. albicans families. The possibility that these differences between the two species are attributable to the nonstandard genetic code of C. albicans or the asexual nature of its genome is discussed. A region rich in retrotransposon fragments, that lies adjacent to many of the CARE-2/Rel-2 sub-telomeric repeats, and which appears to have arisen through multiple rounds of duplication and recombination, is also described.
A wide variety of novel tyrosine recombinase (YR)-encoding retrotransposons were identified using data emerging from the various eukaryotic genome sequencing projects. Although many of these elements are clearly members of the previously described DIRS group of YR retrotransposons, a substantial number, including elements from a variety of fungi and animals, belong to a distinct and previously unrecognized group. We refer to these latter elements as the Ngaro group after a representative from zebrafish. Like the members of the DIRS group, Ngaro elements encode proteins bearing reverse transcriptase (RT) and ribonuclease H (RH) domains similar to those of long terminal repeat (LTR) retrotransposons. Phylogenetic analyses based on alignments of RT/RH and YR domains, however, indicate that Ngaro and DIRS are anciently diverged groups. Differences in coding capacity also support the distinction between the two groups. For instance, we found that DIRS elements all encode a protein domain which is similar in sequence to the DNA methyltransferases of certain bacteriophages, whereas this domain is absent from all Ngaro elements. Together, the Ngaro and DIRS groups of YR retrotransposons contain elements with an astonishing diversity in structures, with variations in the nature of the associated repeat sequences and in the arrangement and complement of coding regions. In addition they contain elements with some surprising features, such as spliceosomal introns and long overlapping open reading frames.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.