BackgroundA close match of the HLA alleles between donor and recipient is an important prerequisite for successful unrelated hematopoietic stem cell transplantation. To increase the chances of finding an unrelated donor, registries recruit many hundred thousands of volunteers each year. Many registries with limited resources have had to find a trade-off between cost and resolution and extent of typing for newly recruited donors in the past. Therefore, we have taken advantage of recent improvements in NGS to develop a workflow for low-cost, high-resolution HLA typing.ResultsWe have established a straightforward three-step workflow for high-throughput HLA typing: Exons 2 and 3 of HLA-A, -B, -C, -DRB1, -DQB1 and -DPB1 are amplified by PCR on Fluidigm Access Array microfluidic chips. Illumina sequencing adapters and sample specific tags are directly incorporated during PCR. Upon pooling and cleanup, 384 samples are sequenced in a single Illumina MiSeq run. We developed “neXtype” for streamlined data analysis and HLA allele assignment. The workflow was validated with 1140 samples typed at 6 loci. All neXtype results were concordant with the Sanger sequences, demonstrating error-free typing of more than 6000 HLA loci. Current capacity in routine operation is 12,000 samples per week.ConclusionsThe workflow presented proved to be a cost-efficient alternative to Sanger sequencing for high-throughput HLA typing. Despite the focus on cost efficiency, resolution exceeds the current standards of Sanger typing for donor registration.
A catalog of common, intermediate and well‐documented (CIWD) HLA‐A, ‐B, ‐C, ‐DRB1, ‐DRB3, ‐DRB4, ‐DRB5, ‐DQB1 and ‐DPB1 alleles has been compiled from over 8 million individuals using data from 20 unrelated hematopoietic stem cell volunteer donor registries. Individuals are divided into seven geographic/ancestral/ethnic groups and data are summarized for each group and for the total population. P (two‐field) and G group assignments are divided into one of four frequency categories: common (≥1 in 10 000), intermediate (≥1 in 100 000), well‐documented (≥5 occurrences) or not‐CIWD. Overall 26% of alleles in IPD‐IMGT/HLA version 3.31.0 at P group resolution fall into the three CIWD categories. The two‐field catalog includes 18% (n = 545) common, 17% (n = 513) intermediate, and 65% (n = 1997) well‐documented alleles. Full‐field allele frequency data are provided but are limited in value by the variations in resolution used by the registries. A recommended CIWD list is based on the most frequent category in the total or any of the seven geographic/ancestral/ethnic groups. Data are also provided so users can compile a catalog specific to the population groups that they serve. Comparisons are made to three previous CWD reports representing more limited population groups. This catalog, CIWD version 3.0.0, is a step closer to the collection of global HLA frequencies and to a clearer view of HLA diversity in the human population as a whole.
Although the definition of CWD alleles itself is affected by different parameters, calling for current updates of the list, the EFI CWD catalogue provides new insights into European population genetics and will be a very useful tool for tissue-typing laboratories in and beyond Europe.
Homozygous carriers of CCR5-Δ32, a gene variant of CC-type chemokine receptor 5 (CCR5), are highly resistant to infections with human immunodeficiency virus type 1 (HIV-1) and therefore preferred stem cell donors for HIV-infected patients. We analyzed CCR5 typing data of 1,333,035 potential hematopoietic stem cell donors enlisted with three national DKMS donor centers. Allele and genotype frequencies were determined for 87 countries of origin as self-assessed by the donors. CCR5-Δ32 allele frequencies ranged from 16.4% in the Norwegian sample to 0 in donors from Ethiopia. The highest CCR5-Δ32/Δ32 genotype frequency was found in the sample from the Faroe Islands (2.3%), whereas in 27 samples, predominantly of donors from Africa, Asia and South America, none of the individuals carried this genotype. The characteristic CCR5-Δ32 allele frequency decline from Northern to Southeastern Eurasia supports findings of earlier studies. With available HLA haplotype frequency information for the patient's ethnicity, our data allows upfront estimation of the probability that an HLA-matched donor with CCR5-Δ32/Δ32 genotype can be found for a patient in need of hematopoietic stem cell transplantation.
BackgroundAt the DKMS Life Science Lab, Next Generation Sequencing (NGS) has been used for ultra-high-volume high-resolution genotyping of HLA loci for the last three and a half years. Here, we report on our experiences in genotyping the HLA, CCR5, ABO, RHD and KIR genes using a direct amplicon sequencing approach on Illumina MiSeq and HiSeq 2500 instruments.ResultsBetween January 2013 and June 2016, 2,714,110 samples largely from German, Polish and UK-based potential stem cell donors have been processed. 98.9% of all alleles for the targeted HLA loci (HLA-A, -B, -C, -DRB1, -DQB1 and -DPB1) were typed at high resolution or better. Initially a simple three-step workflow based on nanofluidic chips in conjunction with 4-primer amplicon tagging was used. Over time, we found that this setup results in PCR artefacts such as primer dimers and PCR-mediated recombination, which may necessitate repeat typing. Split workflows for low- and high-DNA-concentration samples helped alleviate these problems and reduced average per-locus repeat rates from 3.1 to 1.3%. Further optimisations of the workflow included the use of phosphorothioate oligos to reduce primer degradation and primer dimer formation, and employing statistical models to predict read yield from initial template DNA concentration to avoid intermediate quantification of PCR products. Finally, despite the populations typed at DKMS Life Science Lab being relatively homogenous genetically, an analysis of 1.4 million donors processed between January 2015 and May 2016 led to the discovery of 1,919 distinct novel HLA alleles.ConclusionsAmplicon-based NGS HLA genotyping workflows have become the workhorse in high-volume tissue typing of registry donors. The optimisation of workflow practices over multiple years has led to insights and solutions that improve the efficiency and robustness of short amplicon based genotyping workflows.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-017-3575-z) contains supplementary material, which is available to authorized users.
The killer-cell immunoglobulin-like receptor (KIR) genes regulate natural killer cell activity, influencing predisposition to immune mediated disease, and affecting hematopoietic stem cell transplantation (HSCT) outcome. Owing to the complexity of the KIR locus, with extensive gene copy number variation (CNV) and allelic diversity, high-resolution characterization of KIR has so far been applied only to relatively small cohorts. Here, we present a comprehensive high-throughput KIR genotyping approach based on next generation sequencing. Through PCR amplification of specific exons, our approach delivers both copy numbers of the individual genes and allelic information for every KIR gene. Ten-fold replicate analysis of a set of 190 samples revealed a precision of 99.9%. Genotyping of an independent set of 360 samples resulted in an accuracy of more than 99% taking into account consistent copy number prediction. We applied the workflow to genotype 1.8 million stem cell donor registry samples. We report on the observed KIR allele diversity and relative abundance of alleles based on a subset of more than 300,000 samples. Furthermore, we identified more than 2,000 previously unreported KIR variants repeatedly in independent samples, underscoring the large diversity of the KIR region that awaits discovery. This cost-efficient high-resolution KIR genotyping approach is now applied to samples of volunteers registering as potential donors for HSCT. This will facilitate the utilization of KIR as additional selection criterion to improve unrelated donor stem cell transplantation outcome. In addition, the approach may serve studies requiring high-resolution KIR genotyping, like population genetics and disease association studies.
Leprosy, a chronic infectious disease caused by Mycobacterium leprae (M. leprae), was very common in Europe till the 16th century. Here, we perform an ancient DNA study on medieval skeletons from Denmark that show lesions specific for lepromatous leprosy (LL). First, we test the remains for M. leprae DNA to confirm the infection status of the individuals and to assess the bacterial diversity. We assemble 10 complete M. leprae genomes that all differ from each other. Second, we evaluate whether the human leukocyte antigen allele DRB1*15:01, a strong LL susceptibility factor in modern populations, also predisposed medieval Europeans to the disease. The comparison of genotype data from 69 M. leprae DNA-positive LL cases with those from contemporary and medieval controls reveals a statistically significant association in both instances. In addition, we observe that DRB1*15:01 co-occurs with DQB1*06:02 on a haplotype that is a strong risk factor for inflammatory diseases today.
BackgroundThe characterization of the ABO blood group status is vital for blood transfusion and solid organ transplantation. Several methods for the molecular characterization of the ABO gene, which encodes the alleles that give rise to the different ABO blood groups, have been described. However, the application of those methods has so far been restricted to selected samples and not been applied to population-scale analysis.ResultsWe describe a cost-effective method for high-throughput genotyping of the ABO system by next generation sequencing. Sample specific barcodes and sequencing adaptors are introduced during PCR, rendering the products suitable for direct sequencing on Illumina MiSeq or HiSeq instruments. Complete sequence coverage of exons 6 and 7 enables molecular discrimination of the ABO subgroups and many alleles. The workflow was applied to ABO genotype more than a million samples. We report the allele group frequencies calculated on a subset of more than 110,000 sampled individuals of German origin. Further we discuss the potential of the workflow for high resolution genotyping taking the observed allele group frequencies into account. Finally, sequence analysis revealed 287 distinct so far not described alleles of which the most abundant one was identified in 174 samples.ConclusionsThe described workflow delivers high resolution ABO genotyping at low cost enabling population-scale molecular ABO characterization.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-016-2687-1) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.