2021
DOI: 10.12688/f1000research.52549.1
|View full text |Cite
|
Sign up to set email alerts
|

Supervised topic modeling for predicting molecular substructure from mass spectrometry

Abstract: Small-molecule metabolites are principal actors in myriad phenomena across biochemistry and serve as an important source of biomarkers and drug candidates. Given a sample of unknown composition, identifying the metabolites present is difficult given the large number of small molecules both known and yet to be discovered. Even for biofluids such as human blood, building reliable ways of identifying biomarkers is challenging. A workhorse method for characterizing individual molecules in such untargeted metabolom… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 30 publications
0
5
0
Order By: Relevance
“…We observed metabolites enriched in the same pathways we observed in the urinary transcriptome, and we further observed that metabolic cell types whose signal we measure with RNA participate in distinct metabolic pathways with measurable metabolites. We are optimistic that the ability to use metabolomics as a functional readout alongside gene expression changes in linking genotype to disease phenotype and enactment will improve with the development of new tools that expand the space of identifiable compounds from untargeted metabolomics data 49,50 and the continued joint measurement of the transcriptome and metabolome, thereby offering increased potential for more precise biomarker panels for disease diagnostics and subtyping. This work also identifies and addresses challenges facing the development of urine liquid biopsies.…”
Section: Discussionmentioning
confidence: 99%
“…We observed metabolites enriched in the same pathways we observed in the urinary transcriptome, and we further observed that metabolic cell types whose signal we measure with RNA participate in distinct metabolic pathways with measurable metabolites. We are optimistic that the ability to use metabolomics as a functional readout alongside gene expression changes in linking genotype to disease phenotype and enactment will improve with the development of new tools that expand the space of identifiable compounds from untargeted metabolomics data 49,50 and the continued joint measurement of the transcriptome and metabolome, thereby offering increased potential for more precise biomarker panels for disease diagnostics and subtyping. This work also identifies and addresses challenges facing the development of urine liquid biopsies.…”
Section: Discussionmentioning
confidence: 99%
“…For MALDI‐TOF MSI, it has been used as a soft clustering technique and, with tandem mass spectrometry (MS/MS), one can use LDA to cluster the fragmentation of the data in substructures, as has been done in previous research. 33 , 34 , 35 , 36 …”
Section: Methodsmentioning
confidence: 99%
“…For MALDI-TOF MSI, it has been used as a soft clustering technique and, with tandem mass spectrometry (MS/MS), one can use LDA to cluster the fragmentation of the data in substructures, as has been done in previous research. [33][34][35][36] Let us define similarly as for PLSA: N as the number of m/z bins (or words) in the data, K as the number of components (or topics), which can be seen as the different cell-types within the tissue, and D as the number of spectra or pixels in the MSI dataset (which corresponds to the number of documents in the setting of text mining). Also, let θ d be the probability of a spectrum (d) belonging to a cell type (c) according to a Dirichlet prior α, and β c be the probability of a certain m/z bin (w) belonging to a cell type (c) according to another Dirichlet prior η.…”
Section: Latent Dirichlet Allocation (Lda)mentioning
confidence: 99%
“…Reder et al. developed Labeled Latent Dirichlet Allocation to map spectrum features to the chemical space of known structures as a supervised topic modeling approach, which allows for interpretable chemical structure prediction given tandem MS profiles [ 41 ]. Gao et al.…”
Section: Data-driven Approaches: Machine and Deep Learningmentioning
confidence: 99%