2020
DOI: 10.3390/biom10121636
|View full text |Cite
|
Sign up to set email alerts
|

Comparative Assessment of Intrinsic Disorder Predictions with a Focus on Protein and Nucleic Acid-Binding Proteins

Abstract: With over 60 disorder predictors, users need help navigating the predictor selection task. We review 28 surveys of disorder predictors, showing that only 11 include assessment of predictive performance. We identify and address a few drawbacks of these past surveys. To this end, we release a novel benchmark dataset with reduced similarity to the training sets of the considered predictors. We use this dataset to perform a first-of-its-kind comparative analysis that targets two large functional families of disord… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
24
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
7
1
1

Relationship

3
6

Authors

Journals

citations
Cited by 25 publications
(25 citation statements)
references
References 126 publications
1
24
0
Order By: Relevance
“…This list contextualizes the efforts to develop deep learning predictors in a broader setting of the entire disorder prediction field. We identify the 36 predictors using a wide-ranging list of sources including databases of disorder predictions: MobiDB [122] , D 2 P 2 [123] and DescribePROT [124] ; community assessments and surveys that were published on or after 2013 [33] , [41] , [42] , [43] , [46] , [47] , [49] , [50] , [52] , [58] , [59] , and a manual search of relevant articles from PubMed that we collect using the “(disorder[Title]) AND (prediction[Title]) AND protein” query. Table 1 reveals that 13 out of the 36 recent disorder predictors use deep learning models.…”
Section: Prediction Of Intrinsic Disorder Using Deep Learningmentioning
confidence: 99%
See 1 more Smart Citation
“…This list contextualizes the efforts to develop deep learning predictors in a broader setting of the entire disorder prediction field. We identify the 36 predictors using a wide-ranging list of sources including databases of disorder predictions: MobiDB [122] , D 2 P 2 [123] and DescribePROT [124] ; community assessments and surveys that were published on or after 2013 [33] , [41] , [42] , [43] , [46] , [47] , [49] , [50] , [52] , [58] , [59] , and a manual search of relevant articles from PubMed that we collect using the “(disorder[Title]) AND (prediction[Title]) AND protein” query. Table 1 reveals that 13 out of the 36 recent disorder predictors use deep learning models.…”
Section: Prediction Of Intrinsic Disorder Using Deep Learningmentioning
confidence: 99%
“…Current surveys point to the long history of the disorder prediction area, providing invaluable insights concerning architectures of these methods, their availability, trends in their development efforts and approaches to comparatively evaluate their predictive performance [40] , [41] , [42] , [43] , [44] , [45] , [46] , [47] , [48] . Moreover, users and developers benefit from empirical studies that comparatively assess predictive quality of disorder predictors [33] , [49] , [50] , [51] , [52] , [53] , [54] , [55] , [56] , [57] , [58] , [59] . These comparative studies include several community assessments, such as Critical Assessment of Structure Prediction (CASP) between CASP5 to CASP10 [53] , [54] , [55] , [56] , [57] , [58] and Critical Assessment of Intrinsic Protein Disorder (CAID) [52] .…”
Section: Introductionmentioning
confidence: 99%
“…There are several AI (artificial intelligence)-trained sequence-based disorder predictors [105][106][107] that return residue-wise disorder probability scores, which are trained primarily on evolutionary sequence data (e.g., mutational co-variance matrices). These sequence-based disorder predictors have their known limits in accuracy [108,109], for not explicitly accounting for the actual three-dimensional structural dynamics of the protein(s)/peptide(s), but, can serve as a good first test of the comparative FLCS Spike loopdisorder among its close evolutionary homologs. A representative set of Spike structures (CoV/CoV-2) were culled (resolution ≤ 3 Å), accumulated (see Section 2.1, Materials and Methods), and their UNIPROT sequences (in FASTA format) derived from proteomics data [67], were extracted from corresponding entries in the Protein Data Bank [43].…”
Section: More the Arginines More The Disorder' In The Flcs Spike Acti...mentioning
confidence: 99%
“…There are several AI 6 -trained sequence based disorder predictors [101][102][103] that return residue-wise disorder probability scores, which are trained primarily on evolutionary sequence data (e.g., mutational co-variance matrices). These sequence-based disorder predictors have their known limits in accuracy [104,105], for not explicitly accounting for the actual three-dimensional structural dynamics of the protein(s) / peptide(s), but, can serve as a good first test of the comparative FLCS Spike loop-disorder among its close evolutionary homologs. A representative set of Spike structures (CoV/CoV-2) were culled (resolution ≤ 3 Å), accumulated (see subsection 2.1, Materials and Methods), and their UNIPROT sequences (in FASTA format) derived from proteomics data [63], were extracted from corresponding entries in the Protein Data Bank [39].…”
Section: 'More the Arginines More The Disorder' In The Flcs Spike Activation Loopsmentioning
confidence: 99%