2022
DOI: 10.1101/2022.11.20.517210
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

OpenFold: Retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization

Abstract: AlphaFold2 revolutionized structural biology with the ability to predict protein structures with exceptionally high accuracy. Its implementation, however, lacks the code and data required to train new models. These are necessary to (i) tackle new tasks, like protein-ligand complex structure prediction, (ii) investigate the process by which the model learns, which remains poorly understood, and (iii) assess the model's generalization capacity to unseen regions of fold space. Here we report OpenFold, a fast, mem… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
109
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 109 publications
(112 citation statements)
references
References 113 publications
2
109
0
Order By: Relevance
“…The database and web application TMvisDB offers a straightforward search functionality and visualization interface. While several accurate structure prediction methods have been made available over the last year [14][15][16][17], we chose to enhance TMvisDB sequence annotations with AlphaFold2 [12] predictions that have been shown to perform well in structural analysis of transmembrane proteins (TMPs) [52], and have, therefore, been successfully applied as input by resources such as the TmAlphaFold database that collects alpha-helical TMPs [34].…”
Section: Discussionmentioning
confidence: 99%
“…The database and web application TMvisDB offers a straightforward search functionality and visualization interface. While several accurate structure prediction methods have been made available over the last year [14][15][16][17], we chose to enhance TMvisDB sequence annotations with AlphaFold2 [12] predictions that have been shown to perform well in structural analysis of transmembrane proteins (TMPs) [52], and have, therefore, been successfully applied as input by resources such as the TmAlphaFold database that collects alpha-helical TMPs [34].…”
Section: Discussionmentioning
confidence: 99%
“…Sequence-based Protein Structure Prediction Since AlphaFold2 achieve remarkable performance at CASP14, some group are looking to speed up AlphaFold2 pipeline to reduce the computational expense, including model inference stage [2,7] and homologs search stage [25,13]. In nature a protein folds without knowledge of its sequence homologs, predicting protein structure based upon some non-natural conditions such as MSA or template does not reflect very well how a protein actually folds.…”
Section: Msa-based Protein Structure Predictionmentioning
confidence: 99%
“…We make a comparison among AlphaFold2 (Jumper et al 2021), Open-Fold (Ahdritz et al 2021), HelixFold (Wang et al 2022) and UniFold (Li et al 2022), in terms of hardware, time cost of each step, training throughput and total training time as…”
Section: End-to-end Training Training Performancementioning
confidence: 99%