The Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) disease (COVID-19) pandemic has caused millions of deaths worldwide. Genome-wide association studies (GWAS) identified the 3p21.31 region as conferring a two-fold increased risk of respiratory failure. Here, using a combined multiomics and machine-learning approach, we identify the gain-of-function risk A allele of a single-nucleotide polymorphism (SNP), rs17713054G>A, as a probable causative variant. We show with chromosome conformation capture and gene expression analysis that the rs17713054-affected enhancer upregulates the interacting gene, Leucine Zipper Transcription Factor Like 1 ( LZTFL1 ). Selective spatial transcriptomic analysis of COVID-19 patient lung biopsies shows the presence of signals associated with epithelial-mesenchymal transition (EMT), a viral response pathway that is regulated by LZTFL1 . We conclude that pulmonary epithelial cells undergoing EMT, rather than immune cells, are likely to be responsible for the 3p21.31 associated risk. As the 3p21.31 effect is conferred by a gain-of-function, LZTFL1 may provide a therapeutic target.
Driven by the necessity to survive environmental pathogens, the human immune system has evolved exceptional diversity and plasticity, to which several factors contribute including inheritable structural polymorphism of the underlying genes. Characterizing this variation is challenging due to the complexity of these loci, which contain extensive regions of paralogy, segmental duplication and high copy-number repeats, but recent progress in long-read sequencing and optical mapping techniques suggests this problem may now be tractable. Here we assess this by using long-read sequencing platforms from PacBio and Oxford Nanopore, supplemented with short-read sequencing and Bionano optical mapping, to sequence DNA extracted from CD14+ monocytes and peripheral blood mononuclear cells from a single European individual identified as HV31. We use this data to build a de novo assembly of eight genomic regions encoding four key components of the immune system, namely the human leukocyte antigen, immunoglobulins, T cell receptors, and killer-cell immunoglobulin-like receptors. Validation of our assembly using k-mer based and alignment approaches suggests that it has high accuracy, with estimated base-level error rates below 1 in 10 kb, although we identify a small number of remaining structural errors. We use the assembly to identify heterozygous and homozygous structural variation in comparison to GRCh38. Despite analyzing only a single individual, we find multiple large structural variants affecting core genes at all three immunoglobulin regions and at two of the three T cell receptor regions. Several of these variants are not accurately callable using current algorithms, implying that further methodological improvements are needed. Our results demonstrate that assessing haplotype variation in these regions is possible given sufficiently accurate long-read and associated data. Continued reductions in the cost of these technologies will enable application of these methods to larger samples and provide a broader catalogue of germline structural variation at these loci, an important step toward making these regions accessible to large-scale genetic association studies.
A functional adaptive immune system must generate enormously diverse antigen receptor (AgR) repertoires from a limited number of AgR genes, using a common mechanism, V(D)J recombination. The AgR loci are among the largest in the genome, and individual genes must overcome huge spatial and temporal challenges to co-localize with optimum variability. Our understanding of the complex mechanisms involved has increased enormously, due in part to new technologies for high resolution mapping of AgR structure and dynamic movement, underpinning mechanisms, and resulting repertoires. This review will examine these advances using the paradigm of the mouse immunoglobulin heavy chain (Igh) locus. We will discuss the key regulatory elements implicated in Igh locus structure. Recent next generation repertoire sequencing methods have shown that local chromatin state at V genes contribute to recombination efficiency. Next on the multidimensional scale, we will describe imaging studies that provided the first picture of the large-scale dynamic looping and contraction the Igh locus undergoes during recombination. We will discuss chromosome conformation capture (3C)-based technologies that have provided higher resolution pictures of Igh locus structure, including the different models that have evolved. We will consider the key transcription factors (PAX5, YY1, E2A, Ikaros), and architectural factors, CTCF and cohesin, that regulate these processes. Lastly, we will discuss a plethora of recent exciting mechanistic findings. These include Rag recombinase scanning for convergent RSS sequences within DNA loops; identification of Igh loop extrusion, and its putative role in Rag scanning; the roles of CTCF, cohesin and cohesin loading factor, WAPL therein; a new phase separation model for Igh locus compartmentalization. We will draw these together and conclude with some horizon-scanning and unresolved questions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.