2021
DOI: 10.48550/arxiv.2111.04824
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

MassFormer: Tandem Mass Spectrum Prediction with Graph Transformers

Abstract: Mass spectrometry is a key tool in the study of small molecules, playing an important role in metabolomics, drug discovery, and environmental chemistry. Tandem mass spectra capture fragmentation patterns that provide key structural information about a molecule and help with its identification. Practitioners often rely on spectral library searches to match unknown spectra with known compounds. However, such search-based methods are limited by availability of reference experimental data. In this work we show tha… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(20 citation statements)
references
References 54 publications
0
20
0
Order By: Relevance
“…It is therefore foreseeable that the efficiency of the interrogation of the 1M-DB will strongly benefit from up-to-date optimization of MatchMS parallelization as part of future version releases. Multiple tools have been developed in recent years to produce hypothetical MS/MS spectra (23)(24)(25)(26)49). A more recent development in this area is that of QCxMS, a computational tool that generate high-quality fragmentation spectra from molecules based on molecular dynamics calculations (26).…”
Section: Discussionmentioning
confidence: 99%
“…It is therefore foreseeable that the efficiency of the interrogation of the 1M-DB will strongly benefit from up-to-date optimization of MatchMS parallelization as part of future version releases. Multiple tools have been developed in recent years to produce hypothetical MS/MS spectra (23)(24)(25)(26)49). A more recent development in this area is that of QCxMS, a computational tool that generate high-quality fragmentation spectra from molecules based on molecular dynamics calculations (26).…”
Section: Discussionmentioning
confidence: 99%
“…In comparison, it takes around 0.03, 0.05, and 24.1 seconds for NEIMS, MassFormer, and CFM-ID 4.0 to predict the MS/MS spectrum of a compound. We compare our model against the existing mass spectra prediction methods, including CFM-ID 4.0 (Wang et al, 2021), NEIMS (Wei et al, 2019) and MassFormer (Young et al, 2021). NEIMS and MassFormer are retrained by using our training set for a direct comparison.…”
Section: Performance Of Predicting Ms/ms Spectra Acquired By Using Ag...mentioning
confidence: 99%
“…The molecular point-sets encode the accurate 3D coordinates and attributes of the atoms, while the chemical bonds are represented in neighboring vectors. When trained and tested on the compounds' MS/MS spectra collected from the Agilent Personal Compound Database and Library (PCDL) and NIST20, 3DMolMS achieved higher accuracy than CFM-ID 4.0 (Wang et al, 2021), NEIMS (Wei et al, 2019) and the existing deep learning model MassFormer (Young et al, 2021). Furthermore, 3DMolMS achieves satisfactory performance when applied to the prediction of the spectra in the public spectral library MassBank of North America (MoNA) with minor fine-tuning, which indicates the 3DMolMS model can be generalized to the prediction of spectra acquired by different labs on different instruments.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Representation learning of spectra such as Spec2Vec [28], MS2DeepScore [29], and sinusoidal embeddings [30] can be used to learn a more meaningful distance between spectra to facilitate molecular networking. Forward models have incorporated feed forward networks and graph neural networks to predict a fragmentation spectrum directly from molecular structure [31,32,33]. In the inverse direction MSGenie [34], Spec2Mol [35], and MetFID [36] directly attempt to generate fingerprints or SMILES strings [37] from mass spectra, but no approach outperforms CSI:FingerID when trained with equivalent data.…”
Section: Mainmentioning
confidence: 99%