The prediction of epitope recognition by T-cell receptors (TCRs) has seen many advancements in recent years, with several methods now available that can predict recognition for a specific set of epitopes. However, the generic case of evaluating all possible TCR-epitope pairs remains challenging, mainly due to the high diversity of the interacting sequences and the limited amount of currently available training data. In this work, we provide an overview of the current state of this unsolved problem. First, we examine appropriate validation strategies to accurately assess the generalization performance of generic TCR-epitope recognition models when applied to both seen and unseen epitopes. In addition, we present a novel feature representation approach, which we call ImRex (interaction map recognition). This approach is based on the pairwise combination of physicochemical properties of the individual amino acids in the CDR3 and epitope sequences, which provides a convolutional neural network with the combined representation of both sequences. Lastly, we highlight various challenges that are specific to TCR-epitope data and that can adversely affect model performance. These include the issue of selecting negative data, the imbalanced epitope distribution of curated TCR-epitope datasets and the potential exchangeability of TCR alpha and beta chains. Our results indicate that while extrapolation to unseen epitopes remains a difficult challenge, ImRex makes this feasible for a subset of epitopes that are not too dissimilar from the training data. We show that appropriate feature engineering methods and rigorous benchmark standards are required to create and validate TCR-epitope predictive models.
High-throughput T cell receptor (TCR) sequencing allows the characterization of an individual's TCR repertoire and directly queries their immune state. However, it remains a non-trivial task to couple these sequenced TCRs to their antigenic targets. In this paper, we present a novel strategy to annotate full TCR sequence repertoires with their epitope specificities. The strategy is based on a machine learning algorithm to learn the TCR patterns common to the recognition of a specific epitope. These results are then combined with a statistical analysis to evaluate the occurrence of specific epitope-reactive TCR sequences per epitope in repertoire data. In this manner, we can directly study the capacity of full TCR repertoires to target specific epitopes of the relevant vaccines or pathogens. We demonstrate the usability of this approach on three independent datasets related to vaccine monitoring and infectious disease diagnostics by independently identifying the epitopes that are targeted by the TCR repertoire. The developed method is freely available as a web tool for academic use at tcrex.biodatamining.be.
Background After transplantation, cell-free deoxyribonucleic acid (DNA) derived from the donor organ (ddcfDNA) can be detected in the recipient’s circulation. We aimed to investigate the role of plasma ddcfDNA as biomarker for acute kidney rejection. Methods From 107 kidney transplant recipients, plasma samples were collected longitudinally after transplantation (Day 1 to 3 months) within a multicentre set-up. Cell-free DNA from the donor was quantified in plasma as a fraction of the total cell-free DNA by next generation sequencing using a targeted, multiplex polymerase chain reaction-based method for the analysis of single nucleotide polymorphisms. Results Increases of the ddcfDNA% above a threshold value of 0.88% were significantly associated with the occurrence of episodes of acute rejection (P = 0.017), acute tubular necrosis (P = 0.011) and acute pyelonephritis (P = 0.032). A receiver operating characteristic curve analysis revealed an equal area under the curve of the ddcfDNA% and serum creatinine of 0.64 for the diagnosis of acute rejection. Conclusions Although increases in plasma ddcfDNA% are associated with graft injury, plasma ddcfDNA does not outperform the diagnostic capacity of the serum creatinine in the diagnosis of acute rejection.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.