2018
DOI: 10.1101/367904
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

OLGA: fast computation of generation probabilities of B- and T-cell receptor amino acid sequences and motifs

Abstract: Motivation: High-throughput sequencing of large immune repertoires has enabled the development of methods to predict the probability of generation by V(D)J recombination of Tand B-cell receptors of any specific nucleotide sequence. These generation probabilities are very non-homogeneous, ranging over 20 orders of magnitude in real repertoires. Since the function of a receptor really depends on its protein sequence, it is important to be able to predict this probability of generation at the amino acid level. Ho… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
73
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
6
3

Relationship

4
5

Authors

Journals

citations
Cited by 33 publications
(75 citation statements)
references
References 62 publications
2
73
0
Order By: Relevance
“…The V (D)J recombination mechanism generates some antigen receptor sequences much more frequently than others. There are efficient algorithms that can calculate the probability of generation for a given antigen receptor sequence [52, 85,86], and these could theoretically estimate the number of donors sharing such sequence [87]. Thus, there is no sharp border between public and private clonotypes, but rather a continuous spectrum of publicness defined by the probability of an antigen receptor being produced by the recombination machinery.…”
Section: Clonal Sharingmentioning
confidence: 99%
“…The V (D)J recombination mechanism generates some antigen receptor sequences much more frequently than others. There are efficient algorithms that can calculate the probability of generation for a given antigen receptor sequence [52, 85,86], and these could theoretically estimate the number of donors sharing such sequence [87]. Thus, there is no sharp border between public and private clonotypes, but rather a continuous spectrum of publicness defined by the probability of an antigen receptor being produced by the recombination machinery.…”
Section: Clonal Sharingmentioning
confidence: 99%
“…The two corresponding clones would be indistinguishable, and form a single clonotype in the repertoire. This effect can be corrected for by using models of recombination [43,44]. These models, which are inferred from data, can predict the distribution of generation probabilities of full receptors, or of single chains, both at the level of nucletoide or amino acid sequences, as shown in Fig.…”
Section: Connecting Models To Datamentioning
confidence: 99%
“…Clones ×10 [44]. Most clonotypes have very low probability and are therefore unlikely to occur in two clones independently.…”
Section: Connecting Models To Datamentioning
confidence: 99%
“…. Entropy estimates for both nucleotide and amino acid sequences are reported in [176]. Human nucletoide TCR β diversity is H 1 ∼ 44 bits (bits refer to ln(2) units) for nucletoides and 30 bits for amino-acids, TCR α divesity is ∼ 30 bits for nucleotides and 25 bits for amino acids.…”
Section: Diversity and The Clone Size Distributionmentioning
confidence: 99%