Interindividual gene copy-number variation (CNV) of complement component C4 and its associated polymorphisms in gene size (long and short) and protein isotypes (C4A and C4B) probably lead to different susceptibilities to autoimmune disease. We investigated the C4 gene CNV in 1,241 European Americans, including patients with systemic lupus erythematosus (SLE), their first-degree relatives, and unrelated healthy subjects, by definitive genotyping and phenotyping techniques. The gene copy number (GCN) varied from 2 to 6 for total C4, from 0 to 5 for C4A, and from 0 to 4 for C4B. Four copies of total C4, two copies of C4A, and two copies of C4B were the most common GCN counts, but each constituted only between one-half and three-quarters of the study populations. Long C4 genes were strongly correlated with C4A (R=0.695; P<.0001). Short C4 genes were correlated with C4B (R=0.437; P<.0001). In comparison with healthy subjects, patients with SLE clearly had the GCN of total C4 and C4A shifting to the lower side. The risk of SLE disease susceptibility significantly increased among subjects with only two copies of total C4 (patients 9.3%; unrelated controls 1.5%; odds ratio [OR] = 6.514; P=.00002) but decreased in those with > or =5 copies of C4 (patients 5.79%; controls 12%; OR=0.466; P=.016). Both zero copies (OR=5.267; P=.001) and one copy (OR=1.613; P=.022) of C4A were risk factors for SLE, whereas > or =3 copies of C4A appeared to be protective (OR=0.574; P=.012). Family-based association tests suggested that a specific haplotype with a single short C4B in tight linkage disequilibrium with the -308A allele of TNFA was more likely to be transmitted to patients with SLE. This work demonstrates how gene CNV and its related polymorphisms are associated with the susceptibility to a human complex disease.
The complement component C4 genes located in the major histocompatibility complex (MHC) class III region exhibit an unusually complex pattern of variations in gene number, gene size, and nucleotide polymorphism. Duplication or deletion of a C4 gene always concurs with its neighboring genes serine/threonine nuclear protein kinase RP, steroid 21-hydroxylase (CYP21), and tenascin (TNX), which together form a genetic unit termed the RCCX module. A detailed molecular genetic analysis of C4A and C4B and RCCX modular arrangements was correlated with immunochemical studies of C4A and C4B protein polymorphism in 150 normal Caucasians. The results show that bimodular RCCX has a frequency of 69%, whereas monomodular and trimodular RCCX structures account for 17.0 and 14.0%, respectively. Three quarters of C4 genes harbor the endogenous retrovirus HERV-K(C4). Partial deficiencies of C4A and C4B, primarily due to gene deletions and homoexpression of C4A proteins, have a combined frequency of 31.6%. This is probably the most common variation of gene dosage and gene size in human genomes. The seven RCCX physical variants create a great repertoire of haplotypes and diploid combinations, and a heterozygosity frequency of 69.4%. This phenomenon promotes the exchange of genetic information among RCCX constituents that is important in homogenizing the structural and functional diversities of C4A and C4B proteins. However, such length variants may cause unequal, interchromosomal crossovers leading to MHC-associated diseases. An analyses of the RCCX structures in 22 salt-losing, congenital adrenal hyperplasia patients revealed a significant increase in the monomodular structure with a long C4 gene linked to the pseudogene CYP21A, and bimodular structures with two CYP21A, which are likely generated by recombinations between heterozygous RCCX length variants.
Among the genes and proteins of the human immune system, complement component C4 is extraordinary in its frequent germline variation in the size and number of genes. Definitive genotypic and phenotypic analyses were performed on a central European population to determine the C4 polygenic and gene size variations and their relationships with serum C4A and C4B protein concentrations and hemolytic activities. In a study population of 128 healthy subjects, the number of C4 genes present in a diploid genome varied between two to five, and 77.4% of the C4 genes belonged to the long form that contains the endogenous retrovirus HERV-K(C4). Intriguingly, higher C4 serum protein levels and higher C4 hemolytic activities were often detected in subjects with short C4 genes than those with long genes only, suggesting a negative epistatic effect of HERV-K(C4) on the expression of C4 proteins. Also, the body mass index appeared to affect the C4 serum levels, particularly in the individuals with medium or high C4 gene dosages, a phenomenon that was dissimilar in several aspects from the established correlation between body mass index and serum C3. As expected, there were strong, positive correlations between total C4 gene dosage and serum C4 protein concentrations, and between serum C4 protein concentrations and C4 hemolytic activities. There were also good correlations between the number of long genes with serum levels of C4A, and the number of short genes with serum levels of C4B. Thus, the polygenic and gene size variations of C4A and C4B contribute to the quantitative traits of C4 with a wide range of serum protein levels and hemolytic activities, and consequently the power of the innate defense system.
It was observed about 50 years ago that low serum complement activity or low protein concentrations of complement C4 concurred with disease activities of systemic lupus erythematosus (SLE). Complete deficiencies of complement components C4A and C4B, albeit rare in human populations, are among the strongest genetic risk factors for SLE or lupus-like disease, across HLA haplotypes and racial backgrounds. However, whether heterozygous or partial deficiency of C4A (C4AQ0) or C4B (C4BQ0) is a predisposing factor for SLE has been a highly controversial topic. In this review we critically analyzed past epidemiologic studies on deficiency of C4A or C4B in human SLE. Cumulative results from more than 35 different studies revealed that heterozygous and homozygous deficiencies of C4A were present in 40-60% of SLE patients from almost all ethnic groups or races investigated, which included northern and central Europeans, Anglo-Saxons, Caucasians in the US, African Americans, Asian Chinese, Koreans and Japanese. In addition, French SLE and control populations had relatively low frequencies of C4AQ0, but the difference between the patient and control groups was statistically significant. The relative risk of C4AQ0 in SLE varied between 2.3 and 5.3 among different ethnic groups. In Caucasian and African SLE patients, the two major causes for C4AQ0 are (1) the presence of a mono-S RCCX (RP-C4-CYP21-TNX) module with a single, short C4B gene in the major histocompatibility complex; and (2) a 2-bp insertion into the sequence for codon 1213 at exon 29 of the mutant C4A gene. Both mono-S structures and 2-bp insertion in exon 29 are absent or extremely rare in the C4AQ0 of Oriental SLE patients. The highly significant association of C4AQ0 with SLE across multiple HLA haplotypes and ethnic groups, and the presence of different mechanisms leading to a C4A protein deficiency among SLE patients suggested that deficiency or low expression level of C4A protein is a primary risk factor for SLE disease susceptibility per se. On the other hand, Spanish, Mexican, Australian Aborigine SLE patients had increased frequencies of C4B deficiency instead of C4A deficiency. Such observations underscore the importance of both C4A and C4B proteins in the fine control of autoimmunity. Different racial and genetic backgrounds could change the thresholds for the requirement of C4A or C4B protein levels in immune tolerance and immune regulation. Most past epidemiological studies of C4 in human SLE did not consider the polygenic and gene size variations of C4A and C4B. In addition, many studies were overly dependent on phenotypic observations or methods that did not distinguish differential C4A and C4B protein expression caused by unequal gene number or different gene size from the absence of a functional C4A or C4B gene. For further longitudinal studies on clinical manifestations of SLE, it would be informative to stratify the patients with accurately defined C4A and C4B genotypes. Likewise, elucidation of epistatic genetic factors interacting with C4AQ0 wou...
Recent comparative genome hybridization studies revealed that hundreds to thousands of human genomic loci can have interindividual copy number variations (CNVs). One of such CNV loci in the HLA codes for the immune effector protein complement component C4. Sensitive, specific, and accurate assays to interrogate the C4 CNV and its associated polymorphisms by using submicrogram quantities of genomic DNA are needed for high throughput epidemiologic studies of C4 CNVs in autoimmune, infectious, and neurological diseases. Quantitative real-time PCR (qPCR) assays were developed using TaqMan chemistry and based on sequences specific for C4A and C4B genes, structural characteristics corresponding to the long and short forms of C4 genes, and the breakpoint region of RP-C4-CYP21-TNX (RCCX) modular duplication. Assignments for gene copy numbers were achieved by relative standard curve methods using cloned C4 genomic DNA covering 6 logs of DNA concentrations for calibrations. The accuracies of test results were cross-confirmed internally in each sample, as the sum of C4A plus C4B equals to the sum of C4L plus C4S or the total copy number of RCCX modules. These qPCR assays were applied to determine C4 CNVs from samples of 50 consanguineous subjects who were mostly homozygous in HLA genotypes. The results revealed eight HLA haplotypes with single C4 genes in monomodular RCCX that are associated with multiple autoimmune and infectious diseases and 32 bimodular, 4 trimodular, and one quadrimodular RCCX. These C4 qPCR assays are proven to be robust, sensitive, and reliable, as they have contributed to the elucidation of C4 CNVs in >1000 human samples with autoimmune and neurological diseases.
Human populations are endowed with a sophisticated genetic diversity of complement C4 and its flanking genes RP, CYP21, and TNX in the RCCX modules of the major histocompatibility complex class III region. We applied definitive techniques to elucidate (a) the complement C4 polymorphisms in gene sizes, gene numbers, and protein isotypes and (b) their gene orders. Several intriguing features are unraveled, including (1) a trimodular RCCX haplotype with three long C4 genes expressing C4A protein only, (2) two trimodular haplotypes with two long (L) and one short (S) C4 genes organized in LSL configurations, (3) a quadrimodular haplotype with four C4 genes organized in a SLSL configuration, and (4) another quadrimodular structure, with four long C4 genes (LLLL), that has the human leukocyte antigen haplotype that is identical to ancestral haplotype 7.2 in the Japanese population. Long-range PCR and PshAI-RFLP analyses conclusively revealed that the short genes from the LSL and SLSL haplotypes are C4A. In four informative families, an astonishingly complex pattern of genetic diversity for RCCX haplotypes with one, two, three and four C4 genes is demonstrated; each C4 gene may be long or short, encoding a C4A or C4B protein. Such diversity may be related to different intrinsic strengths among humans to defend against infections and susceptibilities to autoimmune diseases.
Inter-individual gene copy-number variations (CNVs) probably afford human populations the flexibility to respond to a variety of environmental challenges, but also lead to differential disease predispositions. We investigated gene CNVs for complement component C4 and steroid 21-hydroxylase from the RP-C4-CYP21-TNX (RCCX) modules located in the major histocompatibility complex among healthy Asian-Indian Americans (AIA) and compared them to European Americans. A combination of definitive techniques that yielded cross-confirmatory results was used. The medium gene copy-numbers for C4 and its isotypes, acidic C4A and basic C4B, were 4, 2 and 2, respectively, but their frequencies were only 53-56%. The distribution patterns for total C4 and C4A are skewed towards the high copy-number side. For example, the frequency of AIA-subjects with three copies of C4A (30.7%) was 3.92-fold of those with a single copy (7.83%). The monomodular-short haplotype with a single C4B gene and the absence of C4A, which is in linkage-disequilibrium with HLA DRB1*0301 in Europeans and a strong risk factor for autoimmune diseases, has a frequency of 0.012 in AIA but 0.106 among healthy European Americans (p=6.6×10 −8 ). The copy-number and the size of C4 genes strongly determine the plasma C4 protein concentrations. Parallel variations in copy-numbers of CYP21A (CYP21A1P) and TNXA with total C4 were also observed. Notably, 13.1% of AIA-subjects had three copies of the functional CYP21B, which were likely generated by recombinations between monomodular and bimodular RCCX haplotypes. The high copy-numbers of C4 and the high frequency of RCCX recombinants offer important insights to the prevalence of autoimmune and genetic diseases.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.