The UK Biobank project is a prospective cohort study with deep genetic and phenotypic data collected on approximately 500,000 individuals from across the United Kingdom, aged between 40 and 69 at recruitment. The open resource is unique in its size and scope. A rich variety of phenotypic and health-related information is available on each participant, including biological measurements, lifestyle indicators, biomarkers in blood and urine, and imaging of the body and brain. Follow-up information is provided by linking health and medical records. Genome-wide genotype data have been collected on all participants, providing many opportunities for the discovery of new genetic associations and the genetic bases of complex traits. Here we describe the centralized analysis of the genetic data, including genotype quality, properties of population structure and relatedness of the genetic data, and efficient phasing and genotype imputation that increases the number of testable variants to around 96 million. Classical allelic variation at 11 human leukocyte antigen genes was imputed, resulting in the recovery of signals with known associations between human leukocyte antigen alleles and many diseases.
The UK Biobank project is a large prospective cohort study of ~500,000 individuals from across the United Kingdom, aged 40-69 at recruitment. A rich variety of phenotypic and health-related information is available on each participant, making the resource unprecedented in its size and scope. Here we describe the genome-wide genotype data (~805,000 markers) collected on all individuals in the cohort and the associated quality control procedures. Genotype data on this scale offer novel opportunities for assessing quality issues, although the wide range of ancestries of the individuals in the cohort also creates particular challenges. We also conducted a set of analyses that reveal properties of the genetic data, such as population structure and relatedness, that can be important for downstream analyses. In addition, we phased and imputed genotypes into the dataset, using computationally efficient methods combined with the Haplotype Reference Consortium (HRC) and UK10K haplotype resource. This increases the number of testable variants by over 100-fold to ~96 million variants. We also imputed classical allelic variation at 11 human leukocyte antigen (HLA) genes, and as a quality control check of this imputation, we replicate signals of known associations between HLA alleles and many common diseases. We describe tools that allow efficient genome-wide association studies (GWAS) of multiple traits and fast phenome-wide association studies (PheWAS), which work together with a new compressed file format that has been used to distribute the dataset. As a further check of the genotyped and imputed datasets, we performed a test-case genome-wide association scan on a well-studied human trait, standing height.
We simultaneously analysed the genetic landscape of ankylosing spondylitis, Crohn's disease, psoriasis, primary sclerosing cholangitis and ulcerative colitis to investigate pleiotropy and the relationship between these clinically related diseases. Using high-density genotype data from more than 86,000 individuals of European ancestry, we identified 244 independent multi-disease signals including 27 novel genome-wide significant susceptibility loci and 3 unreported shared risk loci. Complex pleiotropy was supported when contrasting multi-disease signals with expression data sets from human, rat and mouse, and epigenetic and expressed enhancer profiles. The comorbidities among the five immune diseases were best explained by biological pleiotropy rather than heterogeneity (a subgroup of cases that is genetically identical to another disease, possibly due to diagnostic misclassification, molecular subtypes, or excessive comorbidity). In particular, the strong comorbidity between primary sclerosing cholangitis and inflammatory bowel disease is likely the result of a unique disease, which is genetically distinct from classical inflammatory bowel disease phenotypes.
Summary: The inflammatory bowel diseases (IBD) are chronic gastrointestinal inflammatory disorders that affect millions worldwide. Genome-wide association studies have identified 200 IBD-associated loci, but few have been conclusively resolved to specific functional variants. Here we report fine-mapping of 94 IBD loci using high-density genotyping in 67,852 individuals. We pinpointed 18 associations to a single causal variant with >95% certainty, and an additional 27 associations to a single variant with >50% certainty. These 45 variants are significantly enriched for protein-coding changes (n=13), direct disruption of transcription factor binding sites (n=3) and tissue-specific epigenetic marks (n=10), with the latter category showing enrichment in specific immune cells among associations stronger in CD and in gut mucosa among associations stronger in UC. The results of this study suggest that high-resolution fine-mapping in large samples can convert many GWAS discoveries into statistically convincing causal variants, providing a powerful substrate for experimental elucidation of disease mechanisms.
Ankylosing spondylitis is a common, highly heritable inflammatory arthritis affecting primarily the spine and pelvis. In addition to HLA-B*27 alleles, 12 loci have previously been identified that are associated with ankylosing spondylitis in populations of European ancestry, and 2 associated loci have been identified in Asians. In this study, we used the Illumina Immunochip microarray to perform a case-control association study involving 10,619 individuals with ankylosing spondylitis (cases) and 15,145 controls. We identified 13 new risk loci and 12 additional ankylosing spondylitis–associated haplotypes at 11 loci. Two ankylosing spondylitis–associated regions have now been identified encoding four aminopeptidases that are involved in peptide processing before major histocompatibility complex (MHC) class I presentation. Protective variants at two of these loci are associated both with reduced aminopeptidase function and with MHC class I cell surface expression.
Shared aetiopathogenic factors among immune-mediated diseases have long been suggested by their co-familiality and co-occurrence, and molecular support has been provided by analysis of human leukocyte antigen (HLA) haplotypes and genome-wide association studies. The interrelationships can now be better appreciated following the genotyping of large immune disease sample sets on a shared SNP array: the 'Immunochip'. Here, we systematically analyse loci shared among major immune-mediated diseases. This reveals that several diseases share multiple susceptibility loci, but there are many nuances. The most associated variant at a given locus frequently differs and, even when shared, the same allele often has opposite associations. Interestingly, risk alleles conferring the largest effect sizes are usually disease-specific. These factors help to explain why early evidence of extensive 'sharing' is not always reflected in epidemiological overlap.
Genomewide association studies (GWAS) have proven a powerful hypothesis-free method to identify common disease-associated variants. Even quite large GWAS, however, have only at best identified moderate proportions of the genetic variants contributing to disease heritability. To provide cost-effective genotyping of common and rare variants to map the remaining heritability and to fine-map established loci, the Immunochip Consortium has developed a 200,000 SNP chip that has been produced in very large numbers for a fraction of the cost of GWAS chips. This chip provides a powerful tool for immunogenetics gene mapping.
Adult hematopoietic stem cells (HSCs) with serially transplantable activity comprise two subtypes. One shows a balanced output of mature lymphoid and myeloid cells; the other appears selectively lymphoid deficient. We now show that both of these HSC subtypes are present in the fetal liver (at a 1:10 ratio) with the rarer, lymphoid-deficient HSCs immediately gaining an increased representation in the fetal bone marrow, suggesting that the marrow niche plays a key role in regulating their ensuing preferential amplification. Clonal analysis of HSC expansion posttransplant showed that both subtypes display an extensive but variable self-renewal activity with occasional interconversion. Clonal analysis of their differentiation programs demonstrated functional and molecular as well as quantitative HSC subtype-specific differences in the lymphoid progenitors they generate but an indistinguishable production of multipotent and myeloid-restricted progenitors. These findings establish a level of heterogeneity in HSC differentiation and expansion control that may have relevance to stem cell populations in other hierarchically organized tissues.