Protein-protein interactions govern most cellular pathways and processes, and multiple technologies have emerged to systematically map them. Assessing the error of interaction networks has been a challenge. Crosslinking mass spectrometry is currently widening its scope from structural analyses of purified multi-protein complexes towards systems-wide analyses of protein-protein interactions (PPIs). Using a carefully controlled large-scale analysis of Escherichia coli cell lysate, we demonstrate that false-discovery rates (FDR) for PPIs identified by crosslinking mass spectrometry can be reliably estimated. We present an interaction network comprising 590 PPIs at 1% decoy-based PPI-FDR. The structural information included in this network localises the binding site of the hitherto uncharacterised protein YacL to near the DNA exit tunnel on the RNA polymerase.
Crosslinking mass spectrometry is widening its scope from structural analyzes of purified multi-protein complexes towards systems-wide analyzes of protein-protein interactions. Assessing the error in these large datasets is currently a challenge. Using a controlled large-scale analysis of Escherichia coli cell lysate, we demonstrate a reliable false-discovery rate estimation procedure for protein-protein interactions identified by crosslinking mass spectrometry.Crosslinking mass spectrometry has become a key technology for structural biology by providing distance restraints on purified multi-protein complexes 1 . Proteins can also be crosslinked in complex mixtures. Charting protein-protein interactions (PPIs) in cell lysates, organelles or even whole cells has, therefore, become the next frontier for this technique 2-13 . To avoid reporting large numbers of spurious PPIs, it is important that the false discovery rate (FDR) of the reported interactions is reliably estimated. Doing this correctly is a challenge for the field 12-15 and to date, a consensus has not yet emerged (Table S1).
Crosslinking mass spectrometry has developed into a robust technique that is increasingly used to investigate the interactomes of organelles and cells. However, the incomplete and noisy information in the mass spectra of crosslinked peptides limits the numbers of protein–protein interactions that can be confidently identified. Here, we leverage chromatographic retention time information to aid the identification of crosslinked peptides from mass spectra. Our Siamese machine learning model xiRT achieves highly accurate retention time predictions of crosslinked peptides in a multi-dimensional separation of crosslinked E. coli lysate. Importantly, supplementing the search engine score with retention time features leads to a substantial increase in protein–protein interactions without affecting confidence. This approach is not limited to cell lysates and multi-dimensional separation but also improves considerably the analysis of crosslinked multiprotein complexes with a single chromatographic dimension. Retention times are a powerful complement to mass spectrometric information to increase the sensitivity of crosslinking mass spectrometry analyses.
Crosslinking mass spectrometry (Crosslinking MS) has developed into a robust technique that is increasingly used to investigate the interactomes of organelles and cells. However, the incomplete and noisy information in the spectra limits the numbers of protein-protein interactions (PPIs) that can be confidently identified. Here, we successfully leveraged chromatographic retention time (RT) information to aid the identification of crosslinked peptides from spectra. Our Siamese machine learning model xiRT achieved highly accurate RT predictions of crosslinked peptides in a multi-dimensional separation of crosslinked E. coli lysate. We combined strong cation exchange (SCX), hydrophilic strong anion exchange (hSAX) and reversed-phase (RP) chromatography and reached R^2 0.94 in RP and a margin of error of 1 fraction for hSAX in 94%, and SCX in 85% of the predictions. Importantly, supplementing the search engine score with retention time features led to a 1.4-fold increase in PPIs at a 1% false discovery rate. We also demonstrate the value of this approach for the more routine analysis of a crosslinked multiprotein complexes. An increase of 1.7-fold in heteromeric crosslinked residue-pairs was achieved at 1% residue-pair FDR for Fanconi anaemia monoubiquitin ligase complex, solely using reversed-phase RT. Retention times are a powerful complement to mass spectrometric information to increase the sensitivity of Crosslinking MS analyses.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.