A full description of the human proteome relies on the challenging task of detecting mature and changing forms of protein molecules in the body. Large scale proteome analysis1 has routinely involved digesting intact proteins followed by inferred protein identification using mass spectrometry (MS)2. This “bottom up” process affords a high number of identifications (not always unique to a single gene). However, complications arise from incomplete or ambiguous2 characterization of alternative splice forms, diverse modifications (e.g., acetylation and methylation), and endogenous protein cleavages, especially when combinations of these create complex patterns of intact protein isoforms and species3. “Top down” interrogation of whole proteins can overcome these problems for individual proteins4,5, but has not been achieved on a proteome scale due to the lack of intact protein fractionation methods that are well integrated with tandem MS. Here we show, using a new four dimensional (4D) separation system, identification of 1,043 gene products from human cells that are dispersed into >3,000 protein species created by post-translational modification, RNA splicing, and proteolysis. The overall system produced >20-fold increases in both separation power and proteome coverage, enabling the identification of proteins up to 105 kilodaltons and those with up to 11 transmembrane helices. Many previously undetected isoforms of endogenous human proteins were mapped, including changes in multiply-modified species in response to accelerated cellular aging (senescence) induced by DNA damage. Integrated with the latest version of the Swiss-Prot database6, the data provide precise correlations to individual genes and proof-of-concept for large scale interrogation of whole protein molecules. The technology promises to improve the link between proteomics data and complex phenotypes in basic biology and disease research7.
Like phosphorylation, the addition of O-linked β-N-acetylglucosamine (O-GlcNAcylation) is a ubiquitous, reversible process that modifies serine and threonine residues on nuclear and cytoplasmic proteins. Overexpression of the enzyme that adds O-GlcNAc to target proteins, O-GlcNAc transferase (OGT), perturbs cytokinesis and promotes polyploidy, but the molecular targets of OGT that are important for its cell cycle functions are unknown. Here, we identify 141 previously unknown O-GlcNAc sites on proteins that function in spindle assembly and cytokinesis. Many of these O-GlcNAcylation sites are either identical to known phosphorylation sites or in close proximity to them. Furthermore, we found that O-GlcNAcylation altered the phosphorylation of key proteins associated with the mitotic spindle and midbody. Forced overexpression of OGT increased the inhibitory phosphorylation of cyclin-dependent kinase 1 (CDK1) and reduced the phosphorylation of CDK1 target proteins. The increased phosphorylation of CDK1 is explained by increased activation of its upstream kinase, MYT1, and by a concomitant reduction in the transcript for the CDK1 phosphatase, CDC25C. OGT overexpression also caused a reduction in both messenger RNA expression and protein abundance of Polo-like kinase 1, which is upstream of both MYT1 and CDC25C. The data not only illustrate the crosstalk between O-GlcNAcylation and phosphorylation of proteins that are regulators of crucial signaling pathways, but also uncover a mechanism for the role of O-GlcNAcylation in regulation of cell division.
Native mass spectrometry (MS) is becoming an important integral part of structural proteomics and system biology research. The approach holds great promise for elucidating higher levels of protein structure: from primary to quaternary. This requires the most efficient use of tandem MS, which is the cornerstone of MS-based approaches. In this work, we advance a two-step fragmentation approach, or (pseudo)-MS(3), from native protein complexes to a set of constituent fragment ions. Using an efficient desolvation approach and quadrupole selection in the extended mass-to-charge (m/z) range, we have accomplished sequential dissociation of large protein complexes, such as phosporylase B (194 kDa), pyruvate kinase (232 kDa), and GroEL (801 kDa), to highly charged monomers which were then dissociated to a set of multiply charged fragmentation products. Fragment ion signals were acquired with a high resolution, high mass accuracy Orbitrap instrument that enabled highly confident identifications of the precursor monomer subunits. The developed approach is expected to enable characterization of stoichiometry and composition of endogenous native protein complexes at an unprecedented level of detail.
Top-down proteomics has improved over the last decade despite the significant challenges presented by the analysis of large protein ions. Here, the detection of these high mass species by electrospray-based mass spectrometry (MS) is examined from a theoretical perspective to understand the mass-dependent increases in the number of charge states, isotopic peaks, and interfering species present in typical protein mass spectra. Integrating these effects into a quantitative model captures the reduced ability to detect species over 25 kDa with the speed and sensitivity characteristic of proteomics based on <3 kDa peptide ions. The model quantifies the challenge that top-down proteomics faces with respect to current MS instrumentation and projects that depletion of 13C and 15N isotopes can improve detection at high mass by only <2-fold at 100 kDa whereas the effect is up to 5-fold at 10 kDa. Further, we find that supercharging electrosprayed proteins to the point of producing <5 charge states at high mass would improve detection by more than 20 fold.
A new Orbitrap-based ion analysis procedure is shown to be possible by determining the direct charge for numerous individual protein ions to generate true mass spectra. The deployment of an Orbitrap system for charge detection enables the characterization of highly complicated mixtures of proteoforms and their complexes in both denatured and native modes of operation, revealing information not obtainable by traditional measurement of an ensemble of ions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.