2020
DOI: 10.1093/bioinformatics/btaa158
|View full text |Cite
|
Sign up to set email alerts
|

immuneSIM: tunable multi-feature simulation of B- and T-cell receptor repertoires for immunoinformatics benchmarking

Abstract: Abstract Summary B- and T-cell receptor repertoires of the adaptive immune system have become a key target for diagnostics and therapeutics research. Consequently, there is a rapidly growing number of bioinformatics tools for immune repertoire analysis. Benchmarking of such tools is crucial for ensuring reproducible and generalizable computational analyses. Currently, however, it remains chal… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
53
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3

Relationship

3
5

Authors

Journals

citations
Cited by 57 publications
(56 citation statements)
references
References 12 publications
0
53
0
Order By: Relevance
“…Repertoire modeling was carried out with immuneSIM for V(D)J recombination modeling, and ShaZam for SHM modeling [62,63]. First, V gene frequency was extracted from donor data and used to build V gene distribution table.…”
Section: In Silico Repertoire Modeling and Analysismentioning
confidence: 99%
“…Repertoire modeling was carried out with immuneSIM for V(D)J recombination modeling, and ShaZam for SHM modeling [62,63]. First, V gene frequency was extracted from donor data and used to build V gene distribution table.…”
Section: In Silico Repertoire Modeling and Analysismentioning
confidence: 99%
“…Ground truth datasets are defined by the property that the link between the class of a sequence (class = disease/antigen specificity) or repertoire and the underlying sequence structure is known a priori. Thus, by definition, ground-truth datasets are those that are synthetically generated in silico (Weber et al, 2019a(Weber et al, , 2019b. One of the current bottlenecks for generating nature-like synthetic datasets for predicting, for example, antigenbinding from the immune receptor sequence alone is the lack of knowledge on the complexity of paratope and epitope interaction motifs.…”
Section: Interaction Sequence Motifs Provide Ground Truth For Benchmamentioning
confidence: 99%
“…Our research resolves this important knowledge gap. Specifically, our atlas structural interaction motifs now allows for the faithful simulation of nativelike ground truth immune receptor datasets for the development of immune-receptor based machine learning methods (Weber et al, 2019a). For example, paratope and epitope sequence motifs may be implanted into synthetic 3D antibody-antigen structures (Robert et al, 2018) to subsequently perform benchmarking of different ML architectures with regard to their capacity to (i) predict paratope-epitope binding and (ii) recover the exact sites of antibody-antigen interaction (native-like antigen-specific sequence motifs).…”
Section: Interaction Sequence Motifs Provide Ground Truth For Benchmamentioning
confidence: 99%
“…Furthermore, there exist several methods to construct phylogenetic trees, including distance-based metrics, maximum likelihood (ML), maximum parsimony (MP), and Bayesian inference (Stamatakis, 2006;Gascuel, 2006;Bouckaert et al, 2014). While there exist multiple simulation tools capable of exploring how inference method impacts the resulting phylogenetic trees derived from simulated B cell data (Yermanos et al, 2017;Davidsen and Matsen IV, 2018;Safonova et al, 2015;Weber et al, 2019), the extent of this influence on the evolutionary conclusions on experimental data remains largely unexplored. It remains unknown, for example, the extent of which the phylogenetic inference strategy impacts the biological conclusions pertaining to the evolutionary landscape across various infection cohorts.…”
Section: Introductionmentioning
confidence: 99%