Abstract:The production of genome sequences has led to another important advance in their annotation, which is closely linked to the exact determination of their content in terms of repeats, among which are transposable elements (TEs). The evolutionary implications and the presence of coding regions in some TEs can confuse gene annotation, and also hinder the process of genome assembly, making particularly crucial to be able to annotate and classify them correctly in genome sequences. This review is intended to provide… Show more
“…Accordingly, the groupings of sequences in the network, represented by large circles, indicate that the sequences are identical ; therefore, these sequences are very recent and have not had sufficient time to diverge. Similar events have been reported for other transposable elements (Yang et Konovalov et al, 2010 ;Lerat, 2010) and MITE-like sequences in different organisms (Jiang et al, 2003 ;Chen et al, 2008;Zhou et al, 2008 ;Han et al, 2010 ;Hikosaka et al, 2011). This recent origin is also supported by the average time of origin of the insertions of each subfamily, 155 000 years (msechBari1) and 127 000 years (msechBari2).…”
SummaryA few occurrences of miniature inverted-repeat transposable elements (MITEs) have been reported in species of the genus Drosophila. Here, we describe msechBari, a MITE-like element in Drosophila sechellia. The element is short, approximately 90 bp in length, AT-rich and occurs in association with, or close to, genes, characteristics that are typical for MITEs. The identification was performed in silico using the sequenced genome of D. sechellia and confirmed in a laboratory strain. This short element is related to the Bari_DM transposon of Drosophila melanogaster, having terminal inverted repeats (TIRs) of a similar length and a high identity with the full-length Bari_DM element. The estimated recent origin of the element and the homogeneity observed between copies found in the genome suggests that msechBari could be active in D. sechellia.
“…Accordingly, the groupings of sequences in the network, represented by large circles, indicate that the sequences are identical ; therefore, these sequences are very recent and have not had sufficient time to diverge. Similar events have been reported for other transposable elements (Yang et Konovalov et al, 2010 ;Lerat, 2010) and MITE-like sequences in different organisms (Jiang et al, 2003 ;Chen et al, 2008;Zhou et al, 2008 ;Han et al, 2010 ;Hikosaka et al, 2011). This recent origin is also supported by the average time of origin of the insertions of each subfamily, 155 000 years (msechBari1) and 127 000 years (msechBari2).…”
SummaryA few occurrences of miniature inverted-repeat transposable elements (MITEs) have been reported in species of the genus Drosophila. Here, we describe msechBari, a MITE-like element in Drosophila sechellia. The element is short, approximately 90 bp in length, AT-rich and occurs in association with, or close to, genes, characteristics that are typical for MITEs. The identification was performed in silico using the sequenced genome of D. sechellia and confirmed in a laboratory strain. This short element is related to the Bari_DM transposon of Drosophila melanogaster, having terminal inverted repeats (TIRs) of a similar length and a high identity with the full-length Bari_DM element. The estimated recent origin of the element and the homogeneity observed between copies found in the genome suggests that msechBari could be active in D. sechellia.
“…This has led to the completion of multiple genome-sequencing projects. Simultaneously, a variety of in silico techniques for identifying and annotating potential ERVs have been developed (20). These entail searches for conserved sequences and/or sequence motifs organized in the same manner as exogenous retroviruses (21).…”
Over 40% of mammalian genomes comprise the products of reverse transcription. Among such retrotransposed sequences are those characterized by the presence of long terminal repeats (LTRs), including the endogenous retroviruses (ERVs), which are inherited genetic elements closely resembling the proviruses formed following exogenous retrovirus infection. Sequences derived from ERVs make up at least 8 to 10% of the human and mouse genomes and range from ancient sequences that predate mammalian divergence to elements that are currently still active. In this chapter we describe the discovery, classification and origins of ERVs in mammals and consider cellular mechanisms that have evolved to control their expression. We also discuss the negative effects of ERVs as agents of genetic disease and cancer and review examples of ERV protein domestication to serve host functions, as in placental development. Finally, we address growing evidence that the gene regulatory potential of ERV LTRs has been exploited multiple times during evolution to regulate genes and gene networks. Thus, although recently endogenized retroviral elements are often pathogenic, those that survive the forces of negative selection become neutral components of the host genome or can be harnessed to serve beneficial roles.
“…A high-quality TE prediction and annotation is essential to prevent mis-annotation of functional genes and to understand the biology of genomes [29]. Different strategies have been developed, including similarity search against databanks of known TE sequences, de novo repeat detection, k-mer-based counting, and structural motif detection (reviewed in [30]). Despite the development of many dedicated bioinformatic tools, precise TE modeling in complex (>1 Gb) genomes, such as in wheat or maize, is a tour de force.…”
Background: The 17 Gb bread wheat genome has massively expanded through the proliferation of transposable elements (TEs) and two recent rounds of polyploidization. The assembly of a 774 Mb reference sequence of wheat chromosome 3B provided us with the opportunity to explore the impact of TEs on the complex wheat genome structure and evolution at a resolution and scale not reached so far. Results: We develop an automated workflow, CLARI-TE, for TE modeling in complex genomes. We delineate precisely 56,488 intact and 196,391 fragmented TEs along the 3B pseudomolecule, accounting for 85% of the sequence, and reconstruct 30,199 nested insertions. TEs have been mostly silent for the last one million years, and the 3B chromosome has been shaped by a succession of bursts that occurred between 1 to 3 million years ago. Accelerated TE elimination in the high-recombination distal regions is a driving force towards chromosome partitioning. CACTAs overrepresented in the high-recombination distal regions are significantly associated with recently duplicated genes. In addition, we identify 140 CACTA-mediated gene capture events with 17 genes potentially created by exon shuffling and show that 19 captured genes are transcribed and under selection pressure, suggesting the important role of CACTAs in the recent wheat adaptation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.