2019
DOI: 10.1101/677328
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Using machine learning to predict quantitative phenotypes from protein and nucleic acid sequences

Abstract: The link between sequence and phenotype is essential to understanding the molecular mechanisms of evolution, and the design of proteins and genes with specific properties. However, it is difficult to describe the relationship between sequence and protein or organismal phenotypes, due to the complex relationship between sequence, protein folding and activity, and organismal physiology. Here, we use machine learning models trained on individual families of proteins or nucleic acids to predict the originating spe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 78 publications
(167 reference statements)
0
1
0
Order By: Relevance
“…The SVM method (termed ThermoProt) yielded the best performance (MTH, 86.6% accuracy) and was applied to the PETase HMM hits without OGT data to predict the thermophilicity. It is important to note that while this work was ongoing, a dataset of OGT for 21,498 microbes was published 54 , which enabled regression models that directly predict the OGT 91,92 , and the optimal catalytic temperature (T opt ) of an enzyme 92,93 . These new regression methods possibly enable improved prediction of the thermotolerance of enzymes.…”
Section: Prediction Of Thermophilicity With Machine Learning (Thermop...mentioning
confidence: 99%
“…The SVM method (termed ThermoProt) yielded the best performance (MTH, 86.6% accuracy) and was applied to the PETase HMM hits without OGT data to predict the thermophilicity. It is important to note that while this work was ongoing, a dataset of OGT for 21,498 microbes was published 54 , which enabled regression models that directly predict the OGT 91,92 , and the optimal catalytic temperature (T opt ) of an enzyme 92,93 . These new regression methods possibly enable improved prediction of the thermotolerance of enzymes.…”
Section: Prediction Of Thermophilicity With Machine Learning (Thermop...mentioning
confidence: 99%