2022
DOI: 10.48550/arxiv.2207.02698
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Binding prediction of multi-domain cellulases with a dual-CNN

Abstract: Cellulases hold great promise for the production of biofuels and biochemicals. However, they are modular enzymes acting on a complex heterogeneous substrate. Because of this complexity, the computational prediction of their catalytic properties remains scarce, which restricts both enzyme discovery and enzyme design. Here, we present a dual-input convolutional neural network to predict the binding of multi-domain enzymes. This regression model outperformed previous molecular dynamics-based methods for binding p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 20 publications
0
1
0
Order By: Relevance
“…A data-driven approach of ML combined with statistics overcame this disadvantage by inferring the numerous and possibly unknown factors which map from sequence to function according to the above data and provided superior predictive accuracy for predicting mutation sites in BGL [ 66 ]. For example, a regression model based on a dual-input convolutional neural network was used to predict the binding affinity of cellulase to the substrate to improve enzyme activity [ 67 ]. Due to the lack of negative sequence examples in the DMS dataset, and the inability to learn directly from the large-scale sequence function DMS dataset with the ML-supervised method, a method was developed to classify the DMS dataset as positive unlabeled data and successfully applied to design thermally stable BGLs [ 68 ].…”
Section: Bgl Engineering Strategiesmentioning
confidence: 99%
“…A data-driven approach of ML combined with statistics overcame this disadvantage by inferring the numerous and possibly unknown factors which map from sequence to function according to the above data and provided superior predictive accuracy for predicting mutation sites in BGL [ 66 ]. For example, a regression model based on a dual-input convolutional neural network was used to predict the binding affinity of cellulase to the substrate to improve enzyme activity [ 67 ]. Due to the lack of negative sequence examples in the DMS dataset, and the inability to learn directly from the large-scale sequence function DMS dataset with the ML-supervised method, a method was developed to classify the DMS dataset as positive unlabeled data and successfully applied to design thermally stable BGLs [ 68 ].…”
Section: Bgl Engineering Strategiesmentioning
confidence: 99%