2019
DOI: 10.1038/s41598-019-46113-y
|View full text |Cite
|
Sign up to set email alerts
|

Discovery of food identity markers by metabolomics and machine learning technology

Abstract: Verification of food authenticity establishes consumer trust in food ingredients and components of processed food. Next to genetic or protein markers, chemicals are unique identifiers of food components. Non-targeted metabolomics is ideally suited to screen food markers when coupled to efficient data analysis. This study explored feasibility of random forest (RF) machine learning, specifically its inherent feature extraction for non-targeted metabolic marker discovery. The distinction of chia, linseed, and ses… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
30
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 58 publications
(33 citation statements)
references
References 49 publications
0
30
0
Order By: Relevance
“…To select molecular features that were relevant for the metabolic classification of melon accessions, we applied random forest (RF) machine learning technology. RF technology tuned towards metabolic feature selection enables the selection of small sets of molecular features that, if manually supervised, can be relevant for sample classification [29]. For this purpose, we used the set of 605 provisionally annotated molecular features from the combined >80,000 data set (Table S4).…”
Section: Feature Selection By Random Forest Technologymentioning
confidence: 99%
See 1 more Smart Citation
“…To select molecular features that were relevant for the metabolic classification of melon accessions, we applied random forest (RF) machine learning technology. RF technology tuned towards metabolic feature selection enables the selection of small sets of molecular features that, if manually supervised, can be relevant for sample classification [29]. For this purpose, we used the set of 605 provisionally annotated molecular features from the combined >80,000 data set (Table S4).…”
Section: Feature Selection By Random Forest Technologymentioning
confidence: 99%
“…To select annotated metabolic features that are relevant to classifying the C. melo accessions, random forest (RF) analysis was performed using the subset of annotated molecular features [27] from the combined data set (Table S4). Feature selection by RF technology was as described earlier [29] using hyperparameter optimization proposed by [75]. The R-package randomForest v 4.6-14 [76,77] was used for training classification trees.…”
Section: Data Handling and Miningmentioning
confidence: 99%
“…The support vector machine (SVM) is a machine learning classifier and can mitigate the effects of noisy data [50]. Both have been widely been employed for biomarker discovery [51,52]. RF and SVM analyses were separately used to evaluate the importance of the differential plasma metabolites identified by the Mann-Whitney-Wilcoxon rank sum test and to select plasma metabolic biomarkers.…”
Section: Metabolomics Data Analysismentioning
confidence: 99%
“…Metabolite profiling of milkripe seed tissue by gas chromatography–mass spectrometry (GC–MS) was performed essentially as previously described ( Erban et al, 2007 ; Erban et al, 2019 ). To enrich for polar primary metabolites and small secondary products, 60 mg samples were extracted with methanol and chloroform as described above for free methionine determination but with the following changes for GC–MS analysis: extraction volumes were scaled to the larger initial sample mass, 13 C 6 -sorbitol at 17 mg/L was added to the methanol extraction step to allow later correction of analytical variance, and 160 μl, rather than 100 μl, polar phase aliquots were taken for analysis.…”
Section: Methodsmentioning
confidence: 99%