Endogenous retrovirus (ERV) families are derived from their exogenous counterparts by means of a process of germ-line infection and proliferation within the host genome. Several families in the human and mouse genomes now consist of many hundreds of elements and, although several candidates have been proposed, the mechanism behind this proliferation has remained uncertain. To investigate this mechanism, we reconstructed the ratio of nonsynonymous to synonymous changes and the acquisition of stop codons during the evolution of the human ERV family HERV-K(HML2). We show that all genes, including the env gene, which is necessary only for movement between cells, have been under continuous purifying selection. This finding strongly suggests that the proliferation of this family has been almost entirely due to germ-line reinfection, rather than retrotransposition in cis or complementation in trans, and that an infectious pool of endogenous retroviruses has persisted within the primate lineage throughout the past 30 million years. Because many elements within this pool would have been unfixed, it is possible that the HERV-K(HML2) family still contains infectious elements at present, despite their apparent absence in the human genome sequence. Analysis of the env gene of eight other HERV families indicated that reinfection is likely to be the most common mechanism by which endogenous retroviruses proliferate in their hosts.
Human endogenous retroviruses (HERVs) were first identified almost 20 years ago, and since then numerous families have been described. It has, however, been difficult to obtain a good estimate of both the total number of independently derived families and their relationship to each other as well as to other members of the family Retroviridae. In this study, I used sequence data derived from over 150 novel HERVs, obtained from the Human Genome Mapping Project database, and a variety of recently identified nonhuman retroviruses to classify the HERVs into 22 independently acquired families. Of these, 17 families were loosely assigned to the class I HERVs, 3 to the class II HERVs and 2 to the class III HERVs. Many of these families have been identified previously, but six are described here for the first time and another four, for which only partial sequence information was previously available, were further characterized. Members of each of the 10 families are defective, and calculation of their integration dates suggested that most of them are likely to have been present within the human lineage since it diverged from the Old World monkeys more than 25 million years ago.
The published human genome sequence contains many thousands of endogenous retroviruses (HERVs) but all are defective, containing nonsense mutations or major deletions. Only the HERV-K(HML2) family has been active since the divergence of humans and chimpanzees; it contains many members that are human specific, as well as several that are insertionally polymorphic (an inserted element present only in some human individuals). Here we perform a genomewide survey of insertional polymorphism levels in this family by using the published human genome sequence and a diverse sample of 19 humans. We find that there are 113 human-specific HERV-K(HML2) elements in the human genome sequence, 8 of which are insertionally polymorphic (11 if we extrapolate to those within regions of the genome that were not suitable for amplification). The average rate of accumulation since the divergence with chimpanzees is thus approximately 3.8 ؋ 10 ؊4 per haploid genome per generation. Furthermore, we find that the number of polymorphic elements is not significantly different from that predicted by a standard population genetic model that assumes constant activity of the family until the present. This suggests to us that the HERV-K(HML2) family may be active in present-day humans. Active (replication-competent) elements are likely to have inserted very recently and to be present at low allele frequencies, and they may be causing disease in the individuals carrying them. This view of the family from a population perspective rather than a genome perspective will inform the current debate about a possible role of HERV-K(HML2) in human disease.Endogenous retroviruses (ERVs) are derived from their exogenous counterparts via insertion (integration) into the germ line of their host. At insertion, each ERV element (provirus) consists of two identical, nontranslated long terminal repeats (LTRs) flanking an internal region that encodes proteins required for viral replication and assembly (5). ERVs become defective over time due to frameshift or nonsense mutations introduced during host DNA replication or via recombinational deletion of the internal region to leave a solo LTR structure (5, 31). Solo LTRs are approximately tenfold more abundant than their undeleted, full-length counterparts (31).Within the published human genome sequence, there are over 98,000 human endogenous retroviruses (HERVs), but all are defective, containing nonsense mutations or major deletions. No replication-competent HERVs have been identified to date (26,31,33,35), with only one (K113) with open reading frames for all genes (35), and thus their activity and infectivity is thought to have decreased substantially from levels occurring during earlier periods of primate evolution (1,23,34).One possible exception to this trend is the HERV-K(HML2) family, which makes up less than 1% of HERV elements (27). This family has been active and infectious for much of the past 30 million years (2,12,20,28,35). It contains many members that inserted into the genome after the divergence of h...
The lentiviruses are associated with a wide range of chronic diseases in mammals. These include immunodeficiencies (such as HIV/AIDS in humans), malignancies, and lymphatic and neurological disorders in primates, felids, and a variety of wild and domesticated ungulates. Evolutionary analyses of the genomic sequences of modern-day lentiviruses have suggested a relatively recent date for their emergence, but the failure to identify any endogenous, vertically transmitted examples has meant that their longer term evolutionary history and origin remain unknown. Here we report the discovery and characterization of retroviral sequences belonging to a new lentiviral subgroup from the European rabbit (Oryctolagus cuniculus). These viruses, the first endogenous examples described, are >7 million years old and thus provide the first evidence for an ancient origin of the lentiviruses. Despite being ancient, this subgroup contains many of the features found in present-day lentiviruses, such as the presence of tat and rev genes, thus also indicating an ancient origin for the complex regulation of lentivirus gene expression. Although the virus we describe is defective, reconstruction of an infectious progenitor could provide novel insights into lentivirus biology and host interactions.phylogeny ͉ retrovirus ͉ ERV ͉ rabbit
The genomes of the four primate lentiviral groups are complex and contain several regulatory or accessory genes. Two of these genes, vpr and vpx, are found in various combinations within the four groups and encode proteins whose functions have yet to be elucidated. Comparison of the encoded protein sequences suggests that the vpx gene within the HIV‐2 group arose by the duplication of an ancestral vpr gene within this group. Evolutionary distance analysis showed that both genes were well conserved when compared with viral regulatory genes, and indicated that the duplication occurred at approximately the same time as the HIV‐2 group and the other primate lentivirus groups diverged from a common ancestor. Furthermore, although the SIVagm vpx proteins are homologous to the HIV‐2 group vpx proteins, there are insufficient grounds from sequence analysis for classifying them as vpx proteins. Because of their similarity to the vpr proteins of other groups, we suggest reclassifying the SIVagm vpx gene as a vpr gene. This creates a simpler and more uniform picture of the genomic organization of the primate lentiviruses and allows the genomic organization of their common precursor to be defined; it probably contained five accessory genes: tat, rev, vif, nef and vpr.
The retroviral capacity for integration into the host genome can give rise to endogenous retroviruses (ERVs): retroviral sequences that are transmitted vertically as part of the host germ line, within which they may continue to replicate and evolve. ERVs represent both a unique archive of ancient viral sequence information and a dynamic component of host genomes. As such they hold great potential as informative markers for studies of both virus evolution and host genome evolution. Numerous novel ERVs have been described in recent years, particularly as genome sequencing projects have advanced. This review discusses the evolution of ERV lineages, considering the processes by which ERV distribution and diversity is generated. The diversity of ERVs isolated so far is summarised in terms of both their distribution across host taxa, and their relationships to recognised retroviral genera. Finally the relevance of ERVs to studies of genome evolution, host disease and viral ecology is considered, and recent findings discussed.
Retroviruses can leave a "fossil record" in their hosts' genomes in the form of endogenous retroviruses. Foamy viruses, complex retroviruses that infect mammals, have been notably absent from this record. We have found an endogenous foamy virus within the genomes of sloths and show that foamy viruses were infecting mammals more than 100 million years ago and codiverged with their hosts across an entire geological era. Our analysis highlights the role of evolutionary constraint in maintaining viral genome structure and indicates that accessory genes and mammalian mechanisms of innate immunity are the products of macroevolutionary conflict played out over a geological time scale.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.