Structural variants (SVs) are common in the human genome. Because approximately half of the human genome consists of repetitive, transposable DNA sequences, it is plausible that these elements play an important role in generating SVs in humans. Sequencing of the diploid genome of one individual human (HuRef) affords us the opportunity to assess, for the first time, the impact of mobile elements on SVs in an individual in a thorough and unbiased fashion. In this study, we systematically evaluated more than 8000 SVs to identify mobile element-associated SVs as small as 100 bp and specific to the HuRef genome. Combining computational and experimental analyses, we identified and validated 706 mobile element insertion events (including Alu, L1, SVA elements, and nonclassical insertions), which added more than 305 kb of new DNA sequence to the HuRef genome compared with the Human Genome Project (HGP) reference sequence (hg18). We also identified 140 mobile element-associated deletions, which removed ;126 kb of sequence from the HuRef genome. Overall, ;10% of the HuRef-specific indels larger than 100 bp are caused by mobile element-associated events. More than one-third of the insertion/deletion events occurred in genic regions, and new Alu insertions occurred in exons of three human genes. Based on the number of insertions and the estimated time to the most recent common ancestor of HuRef and the HGP reference genome, we estimated the Alu, L1, and SVA retrotransposition rates to be one in 21 births, 212 births, and 916 births, respectively. This study presents the first comprehensive analysis of mobile element-related structural variants in the complete DNA sequence of an individual and demonstrates that mobile elements play an important role in generating inter-individual structural variation.
Mobile elements have been recognized as powerful tools for phylogenetic and population-level analyses. However, issues regarding potential sources of homoplasy and other misleading events have been raised. We have collected available data for all phylogenetic and population level studies of primates utilizing Alu insertion data and examined them for potentially homoplasious and other misleading events. Very low levels of each potential confounding factor in a phylogenetic or population analysis (i.e., lineage sorting, parallel insertions, and precise excision) were found. Although taxa known to be subject to high levels of these types of events may indeed be subject to problems when using SINE analysis, we propose that most taxa will respond as the order Primates has--by the resolution of several long-standing problems observed using sequence-based methods.
To assess the extent to which the Nile River Valley has been a corridor for human migrations between Egypt and sub-Saharan Africa, we analyzed mtDNA variation in 224 individuals from various locations along the river. Sequences of the first hypervariable segment (HV1) of the mtDNA control region and a polymorphic HpaI site at position 3592 allowed us to designate each mtDNA as being of "northern" or "southern" affiliation. Proportions of northern and southern mtDNA differed significantly between Egypt, Nubia, and the southern Sudan. At slowly evolving sites within HV1, northern-mtDNA diversity was highest in Egypt and lowest in the southern Sudan, and southern-mtDNA diversity was highest in the southern Sudan and lowest in Egypt, indicating that migrations had occurred bidirectionally along the Nile River Valley. Egypt and Nubia have low and similar amounts of divergence for both mtDNA types, which is consistent with historical evidence for long-term interactions between Egypt and Nubia. Spatial autocorrelation analysis demonstrates a smooth gradient of decreasing genetic similarity of mtDNA types as geographic distance between sampling localities increases, strongly suggesting gene flow along the Nile, with no evident barriers. We conclude that these migrations probably occurred within the past few hundred to few thousand years and that the migration from north to south was either earlier or lesser in the extent of gene flow than the migration from south to north.
Alu elements have inserted in primate genomes throughout the evolution of the order. One particular Alu lineage (Ye) began amplifying relatively early in hominid evolution and continued propagating at a low level as many of its members are found in a variety of hominid genomes. This study represents the first conclusive application of short interspersed elements, which are considered nearly homoplasy-free, to elucidate the phylogeny of hominids. Phylogenetic analysis of Alu Ye5 elements and elements from several other subfamilies reveals high levels of support for monophyly of Hominidae, tribe Hominini and subtribe Hominina. Here we present the strongest evidence reported to date for a sister relationship between humans and chimpanzees while clearly distinguishing the chimpanzee and human lineages.mobile elements ͉ short interspersed elements ͉ trichotomy ͉ primates
An analysis of Y-chromosomal haplotypes in several European populations reveals an almost monomorphic pattern in the Finns, whereas Y-chromosomal diversity is significantly higher in other populations. Furthermore, analyses of nucleotide positions in the mitochondrial control region that evolve slowly show a decrease in genetic diversity in Finns. Thus, relatively few men and women have contributed the genetic lineages that today survive in the Finnish population. This is likely to have caused the so-called "Finnish disease heritage"-i.e., the occurrence of several genetic diseases in the Finnish population that are rare elsewhere. A preliminary analysis of the mitochondrial mutations that have accumulated subsequent to the bottleneck suggests that it occurred about 4000 years ago, presumably when populations using agriculture and animal husbandry arrived in Finland.
Long and short interspersed elements (LINEs and SINEs) are retroelements that make up almost half of the human genome. L1 and Alu represent the most prolific human LINE and SINE families, respectively. Only a few Alu elements are able to retropose, and the factors determining their retroposition capacity are poorly understood. The data presented in this paper indicate that the length of Alu "A-tails" is one of the principal factors in determining the retropositional capability of an Alu element. The A stretches of the Alu subfamilies analyzed, both old (Alu S and J) and young (Ya5), had a Poisson distribution of A-tail lengths with a mean size of 21 and 26, respectively. In contrast, the A-tails of very recent Alu insertions (disease causing) were all between 40 and 97 bp in length. The L1 elements analyzed displayed a similar tendency, in which the "disease"-associated elements have much longer A-tails (mean of 77) than do the elements even from the young Ta subfamily (mean of 41). Analysis of the draft sequence of the human genome showed that only about 1000 of the over one million Alu elements have tails of 40 or more adenosine residues in length. The presence of these long A stretches shows a strong bias toward the actively amplifying subfamilies, consistent with their playing a major role in the amplification process. Evaluation of the 19 Alu elements retrieved from the draft sequence of the human genome that are identical to the Alu Ya5a2 insert in the NF1 gene showed that only five have tails with 40 or more adenosine residues. Sequence analysis of the loci with the Alu elements containing the longest A-tails (7 of the 19) from the genomes of the NF1 patient and the father revealed that there are at least two loci with A-tails long enough to serve as source elements within our model. Analysis of the A-tail lengths of 12 Ya5a2 elements in diverse human population groups showed substantial variability in both the Alu A-tail length and sequence homogeneity. On the basis of these observations, a model is presented for the role of A-tail length in determining which Alu elements are active.[The sequence data from this study have been submitted to GenBank under accession nos.
A comprehensive analysis of two Alu Y lineage subfamilies was undertaken to assess Alu-associated genomic diversity and identify new Alu insertion polymorphisms for the study of human population genetics. Recently integrated Alu elements (283) from the Yg6 and Yi6 subfamilies were analyzed by polymerase chain reaction (PCR), and 25 of the loci analyzed were polymorphic for insertion presence/absence within the genomes of a diverse array of human populations. These newly identified Alu insertion polymorphisms will be useful tools for the study of human genomic diversity. Our screening of the Alu insertion loci also resulted in the recovery of several "young" Alu elements that resided at orthologous positions in nonhuman primate genomes. Sequence analysis demonstrated these "young" Alu insertions were the products of gene conversion events of older, preexisting Alu elements or independent parallel forward insertions of older Alu elements in the same short genomic region. The level of gene conversion between Alu elements suggests that it may have an influence on the single nucleotide polymorphism within Alu elements in the genome. We have also identified two genomic deletions associated with the retroposition and insertion of Alu Y lineage elements into the human genome. This type of Alu retroposition-mediated genomic deletion is a novel source of lineage-specific evolution within primate genomes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.