2018
DOI: 10.1073/pnas.1805256116
|View full text |Cite
|
Sign up to set email alerts
|

Data-driven supervised learning of a viral protease specificity landscape from deep sequencing and molecular simulations

Abstract: Biophysical interactions between proteins and peptides are key determinants of molecular recognition specificity landscapes. However, an understanding of how molecular structure and residue-level energetics at protein−peptide interfaces shape these landscapes remains elusive. We combine information from yeast-based library screening, next-generation sequencing, and structure-based modeling in a supervised machine learning approach to report the comprehensive sequence−energetics−function mapping of the specific… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
40
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 32 publications
(40 citation statements)
references
References 67 publications
0
40
0
Order By: Relevance
“…We sought an alternate approach to simple sequencebased metrics. Inspired by the literature on characteristic biochemical properties of intrinsically-disordered proteins, 29 as well as previous machine-learning approaches applied to biochemical interactions, 7,30,31 we hypothesized that the biochemical features of peptide tar-gets could be used to construct a predictive model of of peptide enrichment. For each peptide sequence identified in the phage display dataset, we calculated a set of 57 features covering an array of biochemical properties.…”
Section: Supervised Machine Learning Can Be Used To Train a Predictmentioning
confidence: 99%
See 2 more Smart Citations
“…We sought an alternate approach to simple sequencebased metrics. Inspired by the literature on characteristic biochemical properties of intrinsically-disordered proteins, 29 as well as previous machine-learning approaches applied to biochemical interactions, 7,30,31 we hypothesized that the biochemical features of peptide tar-gets could be used to construct a predictive model of of peptide enrichment. For each peptide sequence identified in the phage display dataset, we calculated a set of 57 features covering an array of biochemical properties.…”
Section: Supervised Machine Learning Can Be Used To Train a Predictmentioning
confidence: 99%
“…One can predict protein targets by searching for matching sequences within the proteome 6 . Some proteins deviate from this highly specific paradigm, requiring more sophisticated approaches 7 . For example, many PDZ binding domains exhibit binding “multi‐specificity”, in which peptide preference must be represented as a handful of binding motifs 8,9 .…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Researchers can derive models of protease-peptide complexes to analyse the binding spectrum [ 29 , 30 ]. One approach is to analyse protease-peptide structures to ascertain amino acid preferences at different positions of a peptide substrate sequence [ 31 ]. These computational analyses usually employ experimental catalytic information stored in repositories such as M-CSA [ 32 ], bringing a clearer perspective of the structural role of the catalytic residues, and other pocket residues, during binding.…”
Section: Introductionmentioning
confidence: 99%
“…Similarly, Pethe et al 36 were able to obtain significantly improved prediction accuracies for proteases (including HIV−1 protease) compared to previous methods (MFPred, sequence tolerance and pepspec) by employing machine learning and a discriminatory score based on geometric features, interface score terms from Rosetta and electrostatic score terms from Amber. In another work, Pethe et al 37 used supervised learning on experimentally obtained deep-sequencing data and information from structure-based models to chart the specificity landscape of 3.2 million substrate variants of the viral protease HCV.…”
Section: Introductionmentioning
confidence: 99%