2023
DOI: 10.1021/acs.jafc.3c01172
|View full text |Cite
|
Sign up to set email alerts
|

Data-Driven Prediction of Molecular Biotransformations in Food Fermentation

Abstract: Fermentation products, together with food components, determine the sense, nutrition, and safety of fermented foods. Traditional methods of fermentation product identification are time-consuming and cumbersome, which cannot meet the increasing need for the identification of the extensive bioactive metabolites produced during food fermentation. Hence, we propose a data-driven integrated platform (FFExplorer, http://www.rxnfinder.org/ffexplorer/) based on machine learning and data on 2,192,862 microbial sequence… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 43 publications
0
5
0
Order By: Relevance
“…Despite successful application in many areas, interpretability remains challenging for deep learning-based methods . PU-EPP uses attention mechanisms to capture the importance and contributions of different input positions (e.g., atoms in substrates or residues in enzymes) to the final prediction, thereby inferring the knowledge the model has learned (Figure A,B). Though embeddings could not provide a perfect way to fully interpret the results, we mapped the attention weights to the substrates and enzymes and compared them with known data to verify whether PU-EPP could learn the catalytic mechanism behind the data.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…Despite successful application in many areas, interpretability remains challenging for deep learning-based methods . PU-EPP uses attention mechanisms to capture the importance and contributions of different input positions (e.g., atoms in substrates or residues in enzymes) to the final prediction, thereby inferring the knowledge the model has learned (Figure A,B). Though embeddings could not provide a perfect way to fully interpret the results, we mapped the attention weights to the substrates and enzymes and compared them with known data to verify whether PU-EPP could learn the catalytic mechanism behind the data.…”
Section: Resultsmentioning
confidence: 99%
“…We acquired 170,179 enzymes, 5837 substrates, and 606,555 corresponding enzyme–substrate pairs. Since negative data are rarely reported, previous studies usually augmented negative samples for training classification models based on activity thresholds or random sampling. , However, low activity is not equivalent to inactivity, and certain randomly sampled enzyme–substrate pairs may actually be functional (positive) because of substrate promiscuity, which would result in the deep learning models acquiring inaccurate results. Therefore, we proposed a strategy that combines weighted random sampling and positive unlabeled (PU) learning to minimize the impact of inaccurate negative samples (see the Methods section for details).…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Albeit effective for simple experimental designs, these one-dimensional methods often hinder the evaluation of nonlinear interactions due to the complexity and multifaceted nature of antioxidant responses, which are affected by several factors such as their mechanism of action, structural properties, and matrix effects. On the contrary, machine learning approaches are particularly well suited to address these complex problems and have been recently applied to make predictions related to taste and fermentation of foods, screen for Nrf2-agonists, identify flours infested by insects, and even discover green insecticides …”
Section: Introductionmentioning
confidence: 99%