2017
DOI: 10.4049/jimmunol.1700594
|View full text |Cite
|
Sign up to set email alerts
|

Learning the High-Dimensional Immunogenomic Features That Predict Public and Private Antibody Repertoires

Abstract: Recent studies have revealed that immune repertoires contain a substantial fraction of public clones, which may be defined as Ab or TCR clonal sequences shared across individuals. It has remained unclear whether public clones possess predictable sequence features that differentiate them from private clones, which are believed to be generated largely stochastically. This knowledge gap represents a lack of insight into the shaping of immune repertoire diversity. Leveraging a machine learning approach capable of … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
112
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
2
2

Relationship

2
6

Authors

Journals

citations
Cited by 109 publications
(115 citation statements)
references
References 87 publications
(114 reference statements)
2
112
0
Order By: Relevance
“…Adaptive immune receptor repertoires represent a major target area for the application of machine learning in the hope that it may fast-track the in silico discovery and development of immunereceptor based immunotherapies and immunodiagnostics (Brown et al, 2019;Greiff et al, 2012;Mason et al, 2018Mason et al, , 2019Miho et al, 2018). The complexity of sequence dependencies that determine antigen binding (Dash et al, 2017;Glanville et al, 2017), immune receptor publicity (Greiff et al, 2017b) and immune status (immunodiagnostics) (Ostmeyer et al, 2019;Thomas et al, 2014) represent a perfect application ground for machine learning analysis (Arora et al, 2019;Cinelli et al, 2017;Greiff et al, 2017b;Liu et al, 2019;Mason et al, 2019;Sidhom et al, 2018;Sun et al, 2017). As discussed extensively in a recent literature review by us (Brown et al, 2019) the development of ML approaches for immune receptor datasets was and is still hampered by the lack of ground truth datasets.…”
Section: Interaction Sequence Motifs Provide Ground Truth For Benchmamentioning
confidence: 99%
See 1 more Smart Citation
“…Adaptive immune receptor repertoires represent a major target area for the application of machine learning in the hope that it may fast-track the in silico discovery and development of immunereceptor based immunotherapies and immunodiagnostics (Brown et al, 2019;Greiff et al, 2012;Mason et al, 2018Mason et al, , 2019Miho et al, 2018). The complexity of sequence dependencies that determine antigen binding (Dash et al, 2017;Glanville et al, 2017), immune receptor publicity (Greiff et al, 2017b) and immune status (immunodiagnostics) (Ostmeyer et al, 2019;Thomas et al, 2014) represent a perfect application ground for machine learning analysis (Arora et al, 2019;Cinelli et al, 2017;Greiff et al, 2017b;Liu et al, 2019;Mason et al, 2019;Sidhom et al, 2018;Sun et al, 2017). As discussed extensively in a recent literature review by us (Brown et al, 2019) the development of ML approaches for immune receptor datasets was and is still hampered by the lack of ground truth datasets.…”
Section: Interaction Sequence Motifs Provide Ground Truth For Benchmamentioning
confidence: 99%
“…Recent reports have, however, provided preliminary evidence for the potential predictability of antibody-antigen interaction: (i) The antibody repertoire field has now established that antibody sequence diversity underlies predictable rules (Elhanati et al, 2015;Greiff et al, 2017bGreiff et al, , 2017a. (ii) The presence of transferable "specificity units" between distinct antibody molecules was recently suggested by showing that tightly binding functional antibodies may be conceived by designing and improving seemingly unrelated paratopes (Nimrod et al, 2018).…”
Section: Introductionmentioning
confidence: 99%
“…The latter is estimated to be at least 10 8 clonotypes for TCRβ, which is still larger than the diversity observed in the largest repertoire sequencing experiments , while a typical repertoire sequencing experiment samples less than one million cells. Inferring diversity from small samples is challenging, and many different metrics have been proposed for this task (Shannon entropy , RECON , DivE , Renyi entropy (https://arxiv.org/abs/1604.00487), and its transformation, Hill number‐based diversity estimators , Chao2 estimator ). V(D)J recombination and thymic selection result in a very diverse repertoire of naive T‐cells with more or less even distribution of clone sizes.…”
Section: Clone Size Distribution Statisticsmentioning
confidence: 99%
“…Although TCR/BCR nucleotide sequences can be used as unique clone identifiers, the occurrence of highly similar or identical receptor amino acid sequences between individuals is not impossible. Indeed, thousands of such ‘convergent’ immune receptor sequences can be found in bulk receptor repertoires from different individuals . Clonotypes present in more than a single donor are termed ‘public’, whereas clonotypes specific to an individual are considered ‘private’.…”
Section: Clonal Sharingmentioning
confidence: 99%
“…They found that just 0.022% of observed clonotypes were 'public' (seen in everyone). In a comple-D R A F T mentary approach, Greiff et al trained a Support Vector Machine on public and private clonal sequences to identify their high-dimensional features, proving that they have distinct immunogenomic properties (17). Clonotyping can also be used to detect antigen-specific immunoglobulins, through the identification of expanded clones after vaccination, or those present in unusually high proportions in individuals immune to certain diseases.…”
Section: Introductionmentioning
confidence: 99%