Structural biology studies performed inside cells can capture molecular machines in action within their native context. In this work, we developed an integrative in-cell structural approach using the genome-reduced human pathogen Mycoplasma pneumoniae. We combined whole-cell cross-linking mass spectrometry, cellular cryo–electron tomography, and integrative modeling to determine an in-cell architecture of a transcribing and translating expressome at subnanometer resolution. The expressome comprises RNA polymerase (RNAP), the ribosome, and the transcription elongation factors NusG and NusA. We pinpointed NusA at the interface between a NusG-bound elongating RNAP and the ribosome and propose that it can mediate transcription-translation coupling. Translation inhibition dissociated the expressome, whereas transcription inhibition stalled and rearranged it. Thus, the active expressome architecture requires both translation and transcription elongation within the cell.
Translation is the fundamental process of protein synthesis and is catalysed by the ribosome in all living cells1. Here we use advances in cryo-electron tomography and sub-tomogram analysis2,3 to visualize the structural dynamics of translation inside the bacterium Mycoplasma pneumoniae. To interpret the functional states in detail, we first obtain a high-resolution in-cell average map of all translating ribosomes and build an atomic model for the M. pneumoniae ribosome that reveals distinct extensions of ribosomal proteins. Classification then resolves 13 ribosome states that differ in their conformation and composition. These recapitulate major states that were previously resolved in vitro, and reflect intermediates during active translation. On the basis of these states, we animate translation elongation inside native cells and show how antibiotics reshape the cellular translation landscapes. During translation elongation, ribosomes often assemble in defined three-dimensional arrangements to form polysomes4. By mapping the intracellular organization of translating ribosomes, we show that their association into polysomes involves a local coordination mechanism that is mediated by the ribosomal protein L9. We propose that an extended conformation of L9 within polysomes mitigates collisions to facilitate translation fidelity. Our work thus demonstrates the feasibility of visualizing molecular processes at atomic detail inside cells.
Protein-protein interactions govern most cellular pathways and processes, and multiple technologies have emerged to systematically map them. Assessing the error of interaction networks has been a challenge. Crosslinking mass spectrometry is currently widening its scope from structural analyses of purified multi-protein complexes towards systems-wide analyses of protein-protein interactions (PPIs). Using a carefully controlled large-scale analysis of Escherichia coli cell lysate, we demonstrate that false-discovery rates (FDR) for PPIs identified by crosslinking mass spectrometry can be reliably estimated. We present an interaction network comprising 590 PPIs at 1% decoy-based PPI-FDR. The structural information included in this network localises the binding site of the hitherto uncharacterised protein YacL to near the DNA exit tunnel on the RNA polymerase.
Structural biology performed inside cells can capture molecular machines in action within their native context. Here we develop an integrative in-cell structural approach using the genome-reduced human pathogen Mycoplasma pneumoniae. We combine whole-cell crosslinking mass spectrometry, cellular cryo-electron tomography, and integrative modeling to determine an in-cell architecture of a transcribing and translating expressome at sub-nanometer resolution. The expressome comprises RNA polymerase (RNAP), the ribosome, and the transcription elongation factors NusG and NusA. We pinpoint NusA at the interface between a NusG-bound elongating RNAP and the ribosome, and propose it could mediate transcription-translation coupling. Translation inhibition dissociates the expressome, whereas transcription inhibition stalls and rearranges it, demonstrating that the active expressome architecture requires both translation and transcription elongation within the cell.
The field of structural biology is increasingly focusing on studying proteins in situ, i.e., in their greater biological context. Cross-linking mass spectrometry (CLMS) is contributing to this effort, typically through the use of mass spectrometry (MS)-cleavable cross-linkers. Here, we apply the popular noncleavable cross-linker disuccinimidyl suberate (DSS) to human mitochondria and identify 5518 distance restraints between protein residues. Each distance restraint on proteins or their interactions provides structural information within mitochondria. Comparing these restraints to protein data bank (PDB)-deposited structures and comparative models reveals novel protein conformations. Our data suggest, among others, substrates and protein flexibility of mitochondrial heat shock proteins. Through this study, we bring forward two central points for the progression of CLMS towards large-scale in situ structural biology: First, clustered conflicts of cross-link data reveal in situ protein conformation states in contrast to error-rich individual conflicts. Second, noncleavable cross-linkers are compatible with proteome-wide studies.
Accurately modeling the structures of proteins and their complexes using artificial intelligence is revolutionizing molecular biology. Experimental data enable a candidate‐based approach to systematically model novel protein assemblies. Here, we use a combination of in‐cell crosslinking mass spectrometry and co‐fractionation mass spectrometry (CoFrac‐MS) to identify protein–protein interactions in the model Gram‐positive bacterium Bacillus subtilis . We show that crosslinking interactions prior to cell lysis reveals protein interactions that are often lost upon cell lysis. We predict the structures of these protein interactions and others in the Subti Wiki database with AlphaFold‐Multimer and, after controlling for the false‐positive rate of the predictions, we propose novel structural models of 153 dimeric and 14 trimeric protein assemblies. Crosslinking MS data independently validates the AlphaFold predictions and scoring. We report and validate novel interactors of central cellular machineries that include the ribosome, RNA polymerase, and pyruvate dehydrogenase, assigning function to several uncharacterized proteins. Our approach uncovers protein–protein interactions inside intact cells, provides structural insight into their interaction interfaces, and is applicable to genetically intractable organisms, including pathogenic bacteria.
A significant challenge in our understanding of biological systems is the high number of genes with unknown function in many genomes. The fungal genus Aspergillus contains important pathogens of humans, model organisms, and microbial cell factories. Aspergillus niger is used to produce organic acids, proteins, and is a promising source of new bioactive secondary metabolites. Out of the 14,165 open reading frames predicted in the A. niger genome only 2% have been experimentally verified and over 6,000 are hypothetical. Here, we show that gene co-expression network analysis can be used to overcome this limitation. A meta-analysis of 155 transcriptomics experiments generated co-expression networks for 9,579 genes (∼65%) of the A. niger genome. By populating this dataset with over 1,200 gene functional experiments from the genus Aspergillus and performing gene ontology enrichment, we could infer biological processes for 9,263 of A. niger genes, including 2,970 hypothetical genes. Experimental validation of selected co-expression sub-networks uncovered four transcription factors involved in secondary metabolite synthesis, which were used to activate production of multiple natural products. This study constitutes a significant step towards systems-level understanding of A. niger, and the datasets can be used to fuel discoveries of model systems, fungal pathogens, and biotechnology.
Crosslinking mass spectrometry is widening its scope from structural analyzes of purified multi-protein complexes towards systems-wide analyzes of protein-protein interactions. Assessing the error in these large datasets is currently a challenge. Using a controlled large-scale analysis of Escherichia coli cell lysate, we demonstrate a reliable false-discovery rate estimation procedure for protein-protein interactions identified by crosslinking mass spectrometry.Crosslinking mass spectrometry has become a key technology for structural biology by providing distance restraints on purified multi-protein complexes 1 . Proteins can also be crosslinked in complex mixtures. Charting protein-protein interactions (PPIs) in cell lysates, organelles or even whole cells has, therefore, become the next frontier for this technique 2-13 . To avoid reporting large numbers of spurious PPIs, it is important that the false discovery rate (FDR) of the reported interactions is reliably estimated. Doing this correctly is a challenge for the field 12-15 and to date, a consensus has not yet emerged (Table S1).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.