Relevant for various areas of human genetics, Y-chromosomal short tandem repeats (Y-STRs) are commonly used for testing close paternal relationships among individuals and populations, and for male lineage identification. However, even the widely used 17-loci Yfiler set cannot resolve individuals and populations completely. Here, 52 centers generated quality-controlled data of 13 rapidly mutating (RM) Y-STRs in 14,644 related and unrelated males from 111 worldwide populations. Strikingly, >99% of the 12,272 unrelated males were completely individualized. Haplotype diversity was extremely high (global: 0.9999985, regional: 0.99836–0.9999988). Haplotype sharing between populations was almost absent except for six (0.05%) of the 12,156 haplotypes. Haplotype sharing within populations was generally rare (0.8% nonunique haplotypes), significantly lower in urban (0.9%) than rural (2.1%) and highest in endogamous groups (14.3%). Analysis of molecular variance revealed 99.98% of variation within populations, 0.018% among populations within groups, and 0.002% among groups. Of the 2,372 newly and 156 previously typed male relative pairs, 29% were differentiated including 27% of the 2,378 father–son pairs. Relative to Yfiler, haplotype diversity was increased in 86% of the populations tested and overall male relative differentiation was raised by 23.5%. Our study demonstrates the value of RM Y-STRs in identifying and separating unrelated and related males and provides a reference database.
The autosomal short tandem repeat (STR) kits that are currently used in forensic science have a high discrimination power. However, this discrimination power is sometimes not sufficient for complex kinship analyses or decreases when alleles are missing due to degradation of the DNA. The Investigator HDplex kit contains nine STRs that are additional to the commonly used forensic markers, and we validated this kit to assist human identification. With the increasing number of markers it becomes inevitable that forensic and kinship analyses include two or more STRs present on the same chromosome. To examine whether such markers can be regarded as independent, we evaluated the 30 STRs present in NGM, Identifiler and HDplex. Among these 30 markers, 17 syntenic STR pairs can be formed. Allelic association between these pairs was examined using 335 Dutch reference samples and no linkage disequilibrium was detected, which makes it possible to use the product rule for profile probability calculations in unrelated individuals. Linkage between syntenic STRs was studied by determining the recombination fraction between them in five three-generation CEPH families. The recombination fractions were compared to the physical and genetic distances between the markers. For most types of pedigrees, the kinship analyses can be performed using the product rule, and for those cases that require an alternative calculation method (Gill et al., Forensic Sci Int Genet 6:477-486, 2011), the recombination fractions as determined in this study can be used. Finally, we calculated the (combined) match probabilities, for the supplementary genotyping results of HDplex, NGM and Identifiler.
The genotypes of 36 Y-chromosomal short tandem repeat (Y-STR) marker units were analysed in a Dutch population sample of 2085 males. Profiling results were compared for several partially overlapping kits, i.e. PowerPlex Y, Yfiler, PowerPlex Y23, and two in-house designed multiplexes with rapidly mutating Y-STRs. Nineteen Y-STR marker units, of which two are rapidly mutating, reside in at least two of these multiplexes, and for these markers concordance testing was performed. Two samples showed discordant genotyping results and the probable causative base change was revealed by Sanger sequencing. In addition, we encountered concordant, but aberrant genotyping results including one allele with low peak height and several null alleles. For 12 samples, this involved a null allele in two adjacent loci suggesting a large and recurrent deletion as the samples represent three distinct haplogroups. For each marker unit, the allele counts and frequencies are presented, as are the haplotype counts and haplotype diversities for several combinations of markers.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.