Enhancer–promoter regulation is a fundamental mechanism underlying differential transcriptional regulation. Spatial chromatin organization brings remote enhancers in contact with target promoters in cis to regulate gene expression. There is considerable evidence for promoter–enhancer interactions (PEIs). In the recent years, genome-wide analyses have identified signatures and mapped novel enhancers; however, being able to precisely identify their target gene(s) requires massive biological and bioinformatics efforts. In this review, we give a short overview of the chromatin landscape and transcriptional regulation. We discuss some key concepts and problems related to chromatin interaction detection technologies, and emerging knowledge from genome-wide chromatin interaction data sets. Then, we critically review different types of bioinformatics analysis methods and tools related to representation and visualization of PEI data, raw data processing and PEI prediction. Lastly, we provide specific examples of how PEIs have been used to elucidate a functional role of non-coding single-nucleotide polymorphisms. The topic is at the forefront of epigenetic research, and by highlighting some future bioinformatics challenges in the field, this review provides a comprehensive background for future PEI studies.
Highlights d Prediction of antibody-antigen binding is a central question in immunology d A motif vocabulary of paratope-epitope interactions governs antibody specificity d Proof of principle that antibody-antigen binding is predictable d Implications for de novo antibody and (neo-)epitope design
Adaptive immune receptor repertoires (AIRR) are key targets for biomedical research as they record past and ongoing adaptive immune responses. The capacity of machine learning (ML) to identify complex discriminative sequence patterns renders it an ideal approach for AIRR-based diagnostic and therapeutic discovery. To date, widespread adoption of AIRR ML has been inhibited by a lack of reproducibility, transparency, and interoperability. immuneML ( immuneml.uio.no ) addresses these concerns by implementing each step of the AIRR ML process in an extensible, open-source software ecosystem that is based on fully specified and shareable workflows. To facilitate widespread user adoption, immuneML is available as a command-line tool and through an intuitive Galaxy web interface, and extensive documentation of workflows is provided. We demonstrate the broad applicability of immuneML by (i) reproducing a large-scale study on immune state prediction, (ii) developing, integrating, and applying a novel method for antigen specificity prediction, and (iii) showcasing streamlined interpretability-focused benchmarking of AIRR ML. 1.
Gluten-specific CD4+ T cells are drivers of celiac disease (CeD). Previous studies of gluten-specific T-cell receptor (TCR) repertoires have found public TCRs shared across multiple individuals, biased usage of particular V-genes and conserved CDR3 motifs. The CDR3 motifs within the gluten-specific TCR repertoire, however, have not been systematically investigated. In the current study, we analyzed the largest TCR database of gluten-specific CD4+ T cells studied so far consisting of TCRs of 3122 clonotypes from 63 CeD patients. We established a TCR database from CD4+ T cells isolated with a mix of HLA-DQ2.5:gluten tetramers representing four immunodominant gluten epitopes. In an unbiased fashion we searched by hierarchical clustering for common CDR3 motifs among 2764 clonotypes. We identified multiple CDR3α, CDR3β, and paired CDR3α:CDR3β motif candidates. Among these, a previously known conserved CDR3β R-motif used by TRAV26-1/TRBV7-2 TCRs specific for the DQ2.5-glia-α2 epitope was the most prominent motif. Furthermore, we identified the epitope specificity of altogether 16 new CDR3α:CDR3β motifs by comparing with TCR sequences of 231 T-cell clones with known specificity and TCR sequences of cells sorted with single HLA-DQ2.5:gluten tetramers. We identified 325 public TCRα and TCRβ sequences of which 145, 102 and 78 belonged to TCRα, TCRβ and paired TCRαβ sequences, respectively. While the number of public sequences was depended on the number of clonotypes in each patient, we found that the proportion of public clonotypes from the gluten-specific TCR repertoire of given CeD patients appeared to be stable (median 37%). Taken together, we here demonstrate that the TCR repertoire of CD4+ T cells specific to immunodominant gluten epitopes in CeD is diverse, yet there is clearly biased V-gene usage, presence of public TCRs and existence of conserved motifs of which R-motif is the most prominent.
Adaptive immune receptor repertoires (AIRR) are key targets for biomedical research as they record past and ongoing adaptive immune responses. The capacity of machine learning (ML) to identify complex discriminative sequence patterns renders it an ideal approach for AIRR-based diagnostic and therapeutic discovery. To date, widespread adoption of AIRR ML has been inhibited by a lack of reproducibility, transparency, and interoperability. immuneML (immuneml.uio.no) addresses these concerns by implementing each step of the AIRR ML process in an extensible, open-source software ecosystem that is based on fully specified and shareable workflows. To facilitate widespread user adoption, immuneML is available as a command-line tool and through an intuitive Galaxy web interface, and extensive documentation of workflows is provided. We demonstrate the broad applicability of immuneML by (i) reproducing a large-scale study on immune state prediction, (ii) developing, integrating, and applying a novel method for antigen specificity prediction, and (iii) showcasing streamlined interpretability-focused benchmarking of AIRR ML.
Background Plasma cells (PCs) are terminally differentiated B-lymphocytes producing antibodies. In coeliac disease (CeD) there is increased density of PCs in the small-intestinal lesion. Many of these PCs produce disease-specific autoantibodies targeting transglutaminase 2 (TG2). Objective The plasmacytosis of CeD motivated us to study the transcriptional programme of PCs from coeliac gut lesions. Methods RNA-seq was performed on the PCs of CeD patients and disease controls, being specific or non-specific for TG2. Results Being antibody-producing cells, 67% of the PCs’ transcript was aligned to immunoglobulin genes. Strikingly, genes encoding ligands and receptors of chemokines and cytokines were abundant. Higher transcript levels of genes associated with cell activation and immune responses were observed in PCs of CeD patients compared to controls. TG2-specific compared to non-TG2 specific PCs expressed increased levels of CXCR3, CXCL10 and interleukin-15; factors that have been implicated in the pathogenesis of CeD yet with production attributed to other cells than PCs. The presence of transcripts of HLA class II and T-cell co-stimulatory molecules suggests that PCs may serve as antigen-presenting cells for CD4 + helper T cells. Conclusions Our findings shed new light on the biology of intestinal PCs, implicating functions that go beyond the production of immunoglobulins.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.