2020
DOI: 10.1021/acs.jcim.9b00877
|View full text |Cite
|
Sign up to set email alerts
|

Exploration and Evaluation of Machine Learning-Based Models for Predicting Enzymatic Reactions

Abstract: Unannotated gene sequences in databases are increasing due to sequencing advances. Therefore, computational methods to predict functions of unannotated genes are needed. Moreover, novel enzyme discovery for metabolic engineering applications further encourages annotation of sequences. Here, enzyme functions are predicted using two general approaches, each including several machine learning algorithms. First, Enzyme-models (E-models) predict Enzyme Commission (EC) numbers from amino acid sequence information. S… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
19
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(19 citation statements)
references
References 48 publications
0
19
0
Order By: Relevance
“…We acknowledge that different approaches, such as machine learning of structure-activity relationships of enzymes and substrates are a very promising alternative for large datasets, with a number of studies published recently. 59,60 However, for regimes of little data, as presented in this study, we believe that simple heuristic scoring schemes are a more robust and interpretable route toward success, and estimate the performance of EHreact to be satisfactory for use in computer-aided pathway design. As outlook, we plan to utilize EHreact to design multi-step synthesis pathways and enzymatic cascades.…”
Section: Discussionmentioning
confidence: 88%
“…We acknowledge that different approaches, such as machine learning of structure-activity relationships of enzymes and substrates are a very promising alternative for large datasets, with a number of studies published recently. 59,60 However, for regimes of little data, as presented in this study, we believe that simple heuristic scoring schemes are a more robust and interpretable route toward success, and estimate the performance of EHreact to be satisfactory for use in computer-aided pathway design. As outlook, we plan to utilize EHreact to design multi-step synthesis pathways and enzymatic cascades.…”
Section: Discussionmentioning
confidence: 88%
“…We acknowledge that different approaches, such as machine learning of structure-activity relationships of enzymes and substrates are a very promising alternative for large datasets, with a number of studies published recently. 61,62 However, for regimes of little data, as presented in this study, we believe that simple heuristic scoring schemes are a more robust and interpretable route toward success, and estimate the performance of EHreact to be satisfactory for use in computer-aided pathway design. We plan to utilize EHreact to design multi-step synthesis pathways and enzymatic cascades.…”
Section: Discussionmentioning
confidence: 88%
“…We acknowledge that different approaches, such as machine learning of structure− activity relationships of enzymes and substrates, are a very promising alternative for large data sets, with a number of studies published recently. 61,62 However, for regimes of little data, as presented in this study, we believe that simple heuristic scoring schemes are a more robust and interpretable route toward success and estimate the performance of EHreact to be satisfactory for use in computer-aided pathway design. We plan to utilize EHreact to design multistep synthesis pathways and enzymatic cascades.…”
Section: ■ Conclusionmentioning
confidence: 89%