Investigating proteins 'at work' in a living environment at atomic resolution is a major goal of molecular biology, which has not been achieved even though methods for the three-dimensional (3D) structure determination of purified proteins in single crystals or in solution are widely used. Recent developments in NMR hardware and methodology have enabled the measurement of high-resolution heteronuclear multi-dimensional NMR spectra of macromolecules in living cells (in-cell NMR). Various intracellular events such as conformational changes, dynamics and binding events have been investigated by this method. However, the low sensitivity and the short lifetime of the samples have so far prevented the acquisition of sufficient structural information to determine protein structures by in-cell NMR. Here we show the first, to our knowledge, 3D protein structure calculated exclusively on the basis of information obtained in living cells. The structure of the putative heavy-metal binding protein TTHA1718 from Thermus thermophilus HB8 overexpressed in Escherichia coli cells was solved by in-cell NMR. Rapid measurement of the 3D NMR spectra by nonlinear sampling of the indirectly acquired dimensions was used to overcome problems caused by the instability and low sensitivity of living E. coli samples. Almost all of the expected backbone NMR resonances and most of the side-chain NMR resonances were observed and assigned, enabling high quality (0.96 ångström backbone root mean squared deviation) structures to be calculated that are very similar to the in vitro structure of TTHA1718 determined independently. The in-cell NMR approach can thus provide accurate high-resolution structures of proteins in living environments.
Investigating three-dimensional (3D) structures of proteins in living cells by in-cell nuclear magnetic resonance (NMR) spectroscopy opens an avenue towards understanding the structural basis of their functions and physical properties under physiological conditions inside cells. In-cell NMR provides data at atomic resolution non-invasively, and has been used to detect protein-protein interactions, thermodynamics of protein stability, the behavior of intrinsically disordered proteins, etc. in cells. However, so far only a single de novo 3D protein structure could be determined based on data derived only from in-cell NMR. Here we introduce methods that enable in-cell NMR protein structure determination for a larger number of proteins at concentrations that approach physiological ones. The new methods comprise (1) advances in the processing of non-uniformly sampled NMR data, which reduces the measurement time for the intrinsically short-lived in-cell NMR samples, (2) automatic chemical shift assignment for obtaining an optimal resonance assignment, and (3) structure refinement with Bayesian inference, which makes it possible to calculate accurate 3D protein structures from sparse data sets of conformational restraints. As an example application we determined the structure of the B1 domain of protein G at about 250 μM concentration in living E. coli cells.
The cell is a crowded environment in which proteins interact specifically with other proteins, nucleic acids, cofactors and ligands. Atomic resolution structural explanation of proteins functioning in this environment is a main goal of biochemical research. Recent improvements to nuclear magnetic resonance (NMR) hardware and methodology allow the measurement of high-resolution heteronuclear multidimensional NMR spectra of macromolecules in living cells (in-cell NMR). In this study, we describe a protocol for the stable isotope ((13)C, (15)N and (2)H) labeling and structure determination of proteins overexpressed in Escherichia coli cells exclusively on the basis of information obtained in living cells. The protocol combines the preparation of the protein in E. coli cells, the rapid measurement of the three-dimensional (3D) NMR spectra by nonlinear sampling of the indirectly acquired dimensions, structure calculation and structure refinement. Under favorable circumstances, this in-cell NMR approach can provide high-resolution 3D structures of proteins in living environments. The protocol has been used to solve the first 3D structure of a protein in living cells for the putative heavy metal-binding protein TTHA1718 from Thermus thermophilus HB8 overexpressed in E. coli cells. As no protein purification is necessary, a sample for in-cell NMR measurements can be obtained within 2-3 d. With the nonlinear sampling scheme, the duration of each 3D experiment can be reduced to 2-3 h. Once chemical shift assignments and NOESY peak lists have been prepared, structure calculation with the program CYANA and energy refinement can be completed in less than 1 h on a powerful computer system.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.