Human leukocyte antigen G (HLA-G) belongs to the family of non-classical HLA class I genes, located within the major histocompatibility complex (MHC). HLA-G has been the target of most recent research regarding the function of class I non-classical genes. The main features that distinguish HLA-G from classical class I genes are (a) limited protein variability, (b) alternative splicing generating several membrane bound and soluble isoforms, (c) short cytoplasmic tail, (d) modulation of immune response (immune tolerance), and (e) restricted expression to certain tissues. In the present work, we describe the HLA-G gene structure and address the HLA-G variability and haplotype diversity among several populations around the world, considering each of its major segments [promoter, coding, and 3′ untranslated region (UTR)]. For this purpose, we developed a pipeline to reevaluate the 1000Genomes data and recover miscalled or missing genotypes and haplotypes. It became clear that the overall structure of the HLA-G molecule has been maintained during the evolutionary process and that most of the variation sites found in the HLA-G coding region are either coding synonymous or intronic mutations. In addition, only a few frequent and divergent extended haplotypes are found when the promoter, coding, and 3′UTRs are evaluated together. The divergence is particularly evident for the regulatory regions. The population comparisons confirmed that most of the HLA-G variability has originated before human dispersion from Africa and that the allele and haplotype frequencies have probably been shaped by strong selective pressures.
The HLA-G molecule presents immunomodulatory properties that might inhibit immune responses when interacting with specific Natural Killer and T cell receptors, such as KIR2DL4, ILT2 and ILT4. Thus, HLA-G might influence the outcome of situations in which fine immune system modulation is required, such as autoimmune diseases, transplants, cancer and pregnancy. The majority of the studies regarding the HLA-G gene variability so far was restricted to a specific gene segment (i.e., promoter, coding or 3' untranslated region), and was performed by using Sanger sequencing and probabilistic models to infer haplotypes. Here we propose a massively parallel sequencing (NGS) with a bioinformatics strategy to evaluate the entire HLA-G regulatory and coding segments, with haplotypes inferred relying more on the straightforward haplotyping capabilities of NGS, and less on probabilistic models. Then, HLA-G variability was surveyed in two admixed population samples of distinct geographical regions and demographic backgrounds, Cyprus and Brazil. Most haplotypes (promoters, coding, 3'UTR and extended ones) were detected both in Brazil and Cyprus and were identical to the ones already described by probabilistic models, indicating that these haplotypes are quite old and may be present worldwide.
The human leukocyte antigen-E (HLA-E) locus is a human major histocompatibility complex (MHC) gene associated with immune-modulation and suppression of the immune response by the interaction with specific natural killer (NK) and T cell receptors (TCRs). It is considered one of the most conserved genes of the human MHC; however, this low nucleotide variability seems to be a consequence of the scarce number of studies focusing on this subject. In this manuscript we assessed the nucleotide variability at the HLA-E coding and 3' untranslated regions (3'UTRs) in Brazil and in the populations from the 1000Genomes Consortium. Twenty-eight variable sites arranged into 33 haplotypes were detected and most of these haplotypes (98.2%) are encoding one of the two HLA-E molecules found worldwide, E*01:01 and E*01:03. Moreover, three worldwide spread haplotypes, associated with the coding alleles E*01:01:01, E*01:03:01 and E*01:03:02, account for 85% of all HLA-E haplotypes, suggesting that they arose early before human speciation. In addition, the low nucleotide diversity found for the HLA-E coding and 3'UTR in worldwide populations suggests that the HLA-E gene is in fact a conserved gene, which might be a consequence of its key role in the modulation of the immune system.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.