Background Hematopoietic stem-cell transplantation (HSCT) is a potentially lifesaving therapy for several blood cancers and other diseases. For patients without a suitable related HLA-matched donor, unrelated-donor registries of adult volunteers and banked umbilical cord–blood units, such as the Be the Match Registry operated by the National Marrow Donor Program (NMDP), provide potential sources of donors. Our goal in the present study was to measure the likelihood of finding a suitable donor in the U.S. registry. Methods Using human HLA data from the NMDP donor and cord-blood-unit registry, we built population-based genetic models for 21 U.S. racial and ethnic groups to predict the likelihood of identifying a suitable donor (either an adult donor or a cord-blood unit) for patients in each group. The models incorporated the degree of HLA matching, adult-donor availability (i.e., ability to donate), and cord-blood-unit cell dose. Results Our models indicated that most candidates for HSCT will have a suitable (HLA-matched or minimally mismatched) adult donor. However, many patients will not have an optimal adult donor — that is, a donor who is matched at high resolution at HLA-A, HLA-B, HLA-C, and HLA-DRB1. The likelihood of finding an optimal donor varies among racial and ethnic groups, with the highest probability among whites of European descent, at 75%, and the lowest probability among blacks of South or Central American descent, at 16%. Likelihoods for other groups are intermediate. Few patients will have an optimal cord-blood unit — that is, one matched at the antigen level at HLA-A and HLA-B and matched at high resolution at HLA-DRB1. However, cord-blood units mismatched at one or two HLA loci are available for almost all patients younger than 20 years of age and for more than 80% of patients 20 years of age or older, regardless of racial and ethnic background. Conclusions Most patients likely to benefit from HSCT will have a donor. Public investment in donor recruitment and cord-blood banks has expanded access to HSCT. (Funded by the Office of Naval Research, Department of the Navy, and the Health Resources and Services Administration, Department of Health and Human Services.)
Whole genome comparisons identified introgression from archaic to modern humans. Our analysis of highly polymorphic HLA class I, vital immune system components subject to strong balancing selection, shows how modern humans acquired the HLA-B*73 allele in west Asia through admixture with archaic humans called Denisovans, a likely sister group to the Neandertals. Virtual genotyping of Denisovan and Neandertal genomes identified archaic HLA haplotypes carrying functionally distinctive alleles that have introgressed into modern Eurasian and Oceanian populations. These alleles, of which several encode unique or strong ligands for natural killer cell receptors, now represent more than half the HLA alleles of modern Eurasians and also appear to have been later introduced into Africans. Thus, adaptive introgression of archaic alleles has significantly shaped modern human immune systems.
We have calculated six-locus high resolution HLA A∼C∼B∼DRB3/4/5∼DRB1∼DQB1 haplotype frequencies using all Be The Match(®) Registry volunteer donors typed by DNA methods at recruitment. Mixed resolution HLA typing data was inputted to a modified expectation-maximization (EM) algorithm in the form of genotype lists generated by interpretation of primary genomic typing data to the IMGT/HLA v3.4.0 allele list. The full cohort consists of 6.59 million subjects categorized at a broad race level. Overall 25.8% of the individuals were typed at the C locus, and 5.2% typed at the DQB1 locus, while all individuals were typed for A, B, DRB1. We also present a subset of 2.90 million subjects with detailed race/ethnic information mapped to 21 population subgroups, 64.1% of which have primary DNA typing data across at least A, B, and DRB1 loci. Sample sizes at the detailed race level range from 1,242,890 for European Caucasian to 1,376 Alaskan Native or Aleut. Genetic distance measurements show high levels of HLA genetic divergence among the 21 detailed race categories, especially among the eight Asian-American populations. These haplotype frequencies will be used to improve match predictions for donor selection algorithms for hematopoietic stem cell transplantation and improve the accuracy in modeling registry match rates.
Here, we present results for DPA1 and DPB1 four-digit allele-level typing in a large (n = 5,944) sample of unrelated European American stem cell donors previously characterized for other class I and class II loci. Examination of genetic data for both chains of the DP heterodimer in the largest cohort to date, at the amino acid epitope, allele, genotype, and haplotype level, allows new insights into the functional units of selection and association for the DP heterodimer. The data in this study suggest that for the DPA1-DPB1 heterodimer, the unit of selection is the combined amino acid epitope contributed by both the DPA1 and DPB1 genes, rather than the allele, and that patterns of LD are driven primarily by dimer stability and conformation of the P1 pocket. This may help explain the differential pattern of allele frequency distribution observed for this locus relative to the other class II loci. These findings further support the notion that allele-level associations in disease and transplantation may not be the most important unit of analysis, and that they should be considered instead in the molecular context.Electronic supplementary materialThe online version of this article (doi:10.1007/s00251-012-0615-3) contains supplementary material, which is available to authorized users.
Genetic matching for loci in the human leukocyte antigen (HLA) region between a donor and a patient in hematopoietic stem cell transplantation (HSCT) is critical to outcome; however, methods for HLA genotyping of donors in unrelated stem cell registries often yield results with allelic and phase ambiguity and/or do not query all clinically relevant loci. We present and evaluate a statistical method for in silico imputation of HLA alleles and haplotypes in large ambiguous population data from the Be The Match(®) Registry. Our method builds on haplotype frequencies estimated from registry populations and exploits patterns of linkage disequilibrium (LD) across HLA haplotypes to infer high resolution HLA assignments. We performed validation on simulated and real population data from the Registry with non-trivial ambiguity content. While real population datasets caused some predictions to deviate from expectation, validations still showed high percent recall for imputed results with average recall >76% when imputing HLA alleles from registry data. We simulated ambiguity generated by several HLA genotyping methods to evaluate the imputation performance on several levels of typing resolution. On average, imputation percent recall of allele-level HLA haplotypes was >95% for allele-level typing, >92% for intermediate resolution typing and >58% for serology (low-resolution) typing. Thus, allele-level HLA assignments can be imputed through the application of a set of statistical and population genetics inferences and with knowledge of haplotype frequencies and self-identified race and ethnicities.
Knowledge of an individual’s HLA genotype is essential for modern medical genetics, and is crucial for hematopoietic stem cell and solid-organ transplantation. However, the high levels of polymorphism known for the HLA genes make it difficult to generate an HLA genotype that unambiguously identifies the alleles that are present at a given HLA locus in an individual. For the last twenty years, the histocompatibility and immunogenetics community has recorded this HLA genotyping ambiguity using allele codes developed by the National Marrow Donor Program (NMDP). While these allele codes may have been effective for recording an HLA genotyping result when initially developed, their use today results in increased ambiguity in an HLA genotype, and they are no longer suitable in the era of rapid allele discovery and ultra-high allele polymorphism. Here, we present a text string format capable of fully representing HLA genotyping results. This Genotype List (GL) String format is an extension of a proposed standard for reporting KIR genotype data that can be applied to any genetic data that employs a standard nomenclature for identifying variants. The GL String format employs a hierarchical set of operators to describe the relationships between alleles, lists of possible alleles, phased alleles, genotypes, lists of possible genotypes, and multilocus unphased genotypes, without losing typing information or increasing typing ambiguity. When used in concert with appropriate tools to create, exchange, and parse these strings, we anticipate that GL Strings will replace NMDP allele codes for reporting HLA genotypes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.