Instances of point and length heteroplasmy in the mitochondrial DNA control region were compiled and analyzed from over 5,000 global human population samples. These data represent observations from a large and broad population sample, representing nearly 20 global populations. As expected, length heteroplasmy was frequently observed in the HVI, HVII and HVIII C-stretches. Length heteroplasmy was also observed in the AC dinucleotide repeat region, as well as other locations. Point heteroplasmy was detected in approximately 6% of all samples, and while the vast majority of heteroplasmic samples comprised two molecules differing at a single position, samples exhibiting two and three mixed positions were also observed in this data set. In general, the sites at which heteroplasmy was most commonly observed correlated with reported control region mutational hotspots. However, for some sites, observations of heteroplasmy did not mirror established mutation rate data, suggesting the action of other mechanisms, both selective and neutral. Interestingly, these data indicate that the frequency of heteroplasmy differs between particular populations, perhaps reflecting variable mutation rates among different mtDNA lineages and/or artifacts of particular population groups. The results presented here contribute to our general understanding of mitochondrial DNA control region heteroplasmy and provide additional empirical information on the mechanisms contributing to mtDNA control region mutation and evolution.
Though investigations into the use of massively parallel sequencing technologies for the generation of complete mitochondrial genome (mtGenome) profiles from difficult forensic specimens are well underway in multiple laboratories, the high quality population reference data necessary to support full mtGenome typing in the forensic context are lacking. To address this deficiency, we have developed 588 complete mtGenome haplotypes, spanning three U.S. population groups (African American, Caucasian and Hispanic) from anonymized, randomly-sampled specimens. Data production utilized an 8-amplicon, 135 sequencing reaction Sanger-based protocol, performed in semi-automated fashion on robotic instrumentation. Data review followed an intensive multi-step strategy that included a minimum of three independent reviews of the raw data at two laboratories; repeat screenings of all insertions, deletions, heteroplasmies, transversions and any additional private mutations; and a check for phylogenetic feasibility. For all three populations, nearly complete resolution of the haplotypes was achieved with full mtGenome sequences: 90.3-98.8% of haplotypes were unique per population, an improvement of 7.7-29.2% over control region sequencing alone, and zero haplotypes overlapped between populations. Inferred maternal biogeographic ancestry frequencies for each population and heteroplasmy rates in the control region were generally consistent with published datasets. In the coding region, nearly 90% of individuals exhibited length heteroplasmy in the 12418-12425 adenine homopolymer; and despite a relatively high rate of point heteroplasmy (23.8% of individuals across the entire molecule), coding region point heteroplasmies shared by more than one individual were notably absent, and transversion-type heteroplasmies were extremely rare. The ratio of nonsynonymous to synonymous changes among point heteroplasmies in the protein-coding genes (1:1.3) and average pathogenicity scores in comparison to data reported for complete substitutions in previous studies seem to provide some additional support for the role of purifying selection in the evolution of the human mtGenome. Overall, these thoroughly vetted full mtGenome population reference data can serve as a standard against which the quality and features of future mtGenome datasets (especially those developed via massively parallel sequencing) may be evaluated, and will provide a solid foundation for the generation of complete mtGenome haplotype frequency estimates for forensic applications.
The current U.S. population represents an amalgam of individuals originating mainly from four continental regions (Africa, Europe, Asia and America). To study the genetic ancestry and compare with self-declared ancestry we have analyzed paternally, maternally and bi-parentally inherited DNA markers sensitive for indicating continental genetic ancestry in all four major U.S. American groups. We found that self-declared U.S. Hispanics and U.S. African Americans tend to show variable degrees of continental genetic admixture among the three genetic systems, with evidence for a marked sex-biased admixture history. Moreover, for these two groups we observed significant regional variation across the country in genetic admixture. In contrast, self-declared U.S. European and U.S. Asian Americans were genetically more homogeneous at the continental ancestry level. Two autosomal ancestry-sensitive markers located in skin pigmentation candidate genes showed significant differences in self-declared U.S. African Americans or U.S. European Americans, relative to their assumed parental populations from Africa or Europe. This provides genetic support for the importance of skin color in the complex process of ancestry identification. © 2010 Wiley-Liss, Inc.
Genetic and archaeological data indicate that the initial Paleoindian settlers of South America followed two entry routes separated by the Andes and the Amazon rainforest. The interactions between these paths and their impact on the peopling of South America remain unclear. Analysis of genetic variation in the Peruvian Andes and regions located south of the Amazon River might provide clues on this issue. We analyzed mitochondrial DNA variation at different Andean locations and >360,000 autosomal SNPs from 28 Native American ethnic groups to evaluate different trans-Andean demographic scenarios. Our data reveal that the Peruvian Altiplano was an important enclave for early Paleoindian expansions and point to a genetic continuity in the Andes until recent times, which was only marginally affected by gene flow from the Amazonian lowlands. Genomic variation shows a good fit with the archaeological evidence, indicating that the genetic interactions between the descendants of the settlers that followed the Pacific and Atlantic routes were extremely limited.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.