2021
DOI: 10.1101/2021.07.06.451258
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Unconstrained generation of synthetic antibody-antigen structures to guide machine learning methodology for real-world antibody specificity prediction

Abstract: Machine learning (ML) is a key technology to enable accurate prediction of antibody-antigen binding, a prerequisite for in silico vaccine and antibody design. Two orthogonal problems hinder the current application of ML to antibody-specificity prediction and the benchmarking thereof: (i) The lack of a unified formalized mapping of immunological antibody specificity prediction problems into ML notation and (ii) the unavailability of large-scale training datasets. Here, we developed the Absolut! software suite t… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
66
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6

Relationship

3
3

Authors

Journals

citations
Cited by 17 publications
(67 citation statements)
references
References 155 publications
(359 reference statements)
1
66
0
Order By: Relevance
“…Machine learning is increasingly used for AIRR classification both on the sequence (Greiff et al 2017b;Isacchini et al 2021;Akbar et al 2021a;Robert et al 2021a) and repertoire level (Emerson et al 2017;Shemesh et al 2021;Sidhom et al 2021), as well as for antibody generation (Friedensohn et al 2020;Akbar et al 2021b). Future studies will need to investigate whether differences in RGM also impact repertoire classification (Greiff et al 2020;Rodriguez et al 2020;Kanduri et al 2021).…”
Section: Discussionmentioning
confidence: 99%
“…Machine learning is increasingly used for AIRR classification both on the sequence (Greiff et al 2017b;Isacchini et al 2021;Akbar et al 2021a;Robert et al 2021a) and repertoire level (Emerson et al 2017;Shemesh et al 2021;Sidhom et al 2021), as well as for antibody generation (Friedensohn et al 2020;Akbar et al 2021b). Future studies will need to investigate whether differences in RGM also impact repertoire classification (Greiff et al 2020;Rodriguez et al 2020;Kanduri et al 2021).…”
Section: Discussionmentioning
confidence: 99%
“…So far, rule inference via, for instance, attribution methods, remains a challenge and is poorly standardized. 54 , 89…”
Section: Learnability Of Antibody–antigen Bindingmentioning
confidence: 99%
“…The outer red ring represents the number of antibody sequences in the iReceptor database (the largest publicly available sequence data, 53 the outer purple ring the number of synthetic antibody–antigen binding structures from Absolut! (the largest publicly available synthetic antibody-antigen structural dataset), 54 the outer blue ring displays the number of structures from AbDb (curated antibody–antigen structural data 55 obtained from the protein data bank), 56 and the outer grey ring represents developability information. 52 inner rings illustrate information about antibody-antigen complexes, ig repertoire, therapeutic antibodies, and paratope and epitope data.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations