2021
DOI: 10.1101/2021.07.08.451480
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

In silico proof of principle of machine learning-based antibody design at unconstrained scale

Abstract: Generative machine learning (ML) has been postulated to be a major driver in the computational design of antigen-specific monoclonal antibodies (mAb). However, efforts to confirm this hypothesis have been hindered by the infeasibility of testing arbitrarily large numbers of antibody sequences for their most critical design parameters: paratope, epitope, affinity, and developability. To address this challenge, we leveraged a lattice-based antibody-antigen binding simulation framework, which incorporates a wide … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
35
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
2

Relationship

5
5

Authors

Journals

citations
Cited by 28 publications
(35 citation statements)
references
References 64 publications
0
35
0
Order By: Relevance
“…Machine learning is increasingly used for AIRR classification both on the sequence (Greiff et al 2017b;Isacchini et al 2021;Akbar et al 2021a;Robert et al 2021a) and repertoire level (Emerson et al 2017;Shemesh et al 2021;Sidhom et al 2021), as well as for antibody generation (Friedensohn et al 2020;Akbar et al 2021b). Future studies will need to investigate whether differences in RGM also impact repertoire classification (Greiff et al 2020;Rodriguez et al 2020;Kanduri et al 2021).…”
Section: Discussionmentioning
confidence: 99%
“…Machine learning is increasingly used for AIRR classification both on the sequence (Greiff et al 2017b;Isacchini et al 2021;Akbar et al 2021a;Robert et al 2021a) and repertoire level (Emerson et al 2017;Shemesh et al 2021;Sidhom et al 2021), as well as for antibody generation (Friedensohn et al 2020;Akbar et al 2021b). Future studies will need to investigate whether differences in RGM also impact repertoire classification (Greiff et al 2020;Rodriguez et al 2020;Kanduri et al 2021).…”
Section: Discussionmentioning
confidence: 99%
“…Machine learning models tested included K-nearest neighbor, logistic regression, naive Bayes, support vector machines, and random forests (RF); long-short term memory recurrent neural networks (RNN) were also trained, which are a class of deep learning models that have the ability to learn long-range dependencies in sequential data (38)(39)(40)(41). While all baseline models performed effectively (e.g., accuracy scores between 0.87 -0.94), RF and RNN were selected for further optimization and application since they showed relatively higher performance metrics and could be trained faster (Supplementary Fig.…”
Section: Machine Learning Models Accurately Predict Ace2 Binding and ...mentioning
confidence: 99%
“… 54 We complemented this data with in silico predicted developability parameters to create datasets that encompass the three aforementioned key design parameters: paratope-epitope binding, affinity, and developability. 27 Such efforts have begun to increase the number of datasets to a level where the benchmarking of data-intensive methods, such as deep learning to study antibody-antigen binding at the paratope-epitope level as well as deep learning-based antibody sequence generation, started to become feasible. 27 , 54 More generally, large-scale 3D-atomistic resolution data generation may represent the next major step where abundantly available antibody sequence data will be leveraged to obtain large quantities of antibody-antigen complexes via recent advances in computational structural biology methods such as antibody modeling, 59–63 molecular docking, 64–67 and molecular dynamics.…”
Section: Introductionmentioning
confidence: 99%