A computational method is proposed for inferring protein interactions from genome sequences on the basis of the observation that some pairs of interacting proteins have homologs in another organism fused into a single protein chain. Searching sequences from many genomes revealed 6809 such putative proteinprotein interactions in Escherichia coli and 45,502 in yeast. Many members of these pairs were confirmed as functionally related; computational filtering further enriches for interactions. Some proteins have links to several other proteins; these coupled links appear to represent functional interactions such as complexes or pathways. Experimentally confirmed interacting pairs are documented in a Database of Interacting Proteins.The lives of biological cells are controlled by interacting proteins in metabolic and signaling pathways and in complexes such as the molecular machines that synthesize and use adenosine triphosphate (ATP), replicate and translate genes, or build up the cytoskeletal infrastructure (1). Our knowledge of proteinprotein interactions has been accumulated from biochemical and genetic experiments, including the widely used yeast two-hybrid test (2). Here we ask if protein-protein interactions can be recognized from genome sequences by purely computational means.Some interacting proteins such as the Gyr A and Gyr B subunits of Escherichia coli DNA gyrase are fused into a single chain in another organism, in this case the topoisomerase II of yeast (3). Thus, the sequence similarities of Gyr A (804 amino acid residues) and Gyr B (875 residues) to different segments of the topoisomerase II (1429 residues) might be used to predict that Gyr A and Gyr B interact in E. coli.To find other such putative protein interactions in E. coli, we searched the 4290 protein sequences of the E. coli genome (4) for these patterns of sequence homology (5). We found 6809 pairs of nonhomologous sequences, both members of the pair having significant similarity (6) to a single protein in some other genome that we term a Rosetta Stone sequence because it deciphers the interaction between the protein pairs. The 4290 proteins could form at most (4290) 2 /2 ϭ 9 ϫ 10 6 pair interactions, but we would expect many fewer interactions in a functioning cell; roughly 2 to 10 interactions for each protein does not seem unreasonably many. Each of these 6809 pairs is a candidate for a pair of interacting proteins in E. coli. Five such candidates are shown in Fig. 1. The first three pairs of E. coli proteins were among those easily determined from the biochemical literature in fact to interact. The final two pairs of proteins are not known to interact. They are representatives of many such pairs whose putative interactions at this time must be taken as testable hypotheses.We devised three independent tests of interactions predicted by the method we term domain fusion analysis, each showing that a reasonable fraction may in fact interact. The first method uses the annotation of proteins given in the SWISS-PROT database (7). For cases wh...
Summary Fluorescent proteins have become valuable tools for biomedical research as protein tags, reporters of gene expression, biosensor components, and cell lineage tracers. However, applications of fluorescent proteins for deep tissue imaging in whole mammals have been constrained by the opacity of tissues to excitation light below 600 nm, due to absorbance by hemoglobin. Fluorescent proteins that excite efficiently in the “optical window” above 600 nm are therefore highly desirable. We report here the evolution of far-red fluorescent proteins with peak excitation at 600 nm or above. The brightest one of these, Neptune, performs well in imaging deep tissues in living mice. The crystal structure of Neptune reveals a novel mechanism for red-shifting involving the acquisition of a new hydrogen bond with the acylimine region of the chromophore.
Orange-red fluorescent proteins (FPs) are widely used in biomedical research for multiplexed epifluorescence microscopy with GFP-based probes, but their different excitation requirements make multiplexing with new advanced microscopy methods difficult. Separately, orange-red FPs are useful for deep-tissue imaging in mammals due to the relative tissue transmissibility of orange-red light, but their dependence on illumination limits their sensitivity as reporters in deep tissues. Here we describe CyOFP1, a bright engineered orange-red FP that is excitable by cyan light. We show that CyOFP1 enables single-excitation multiplexed imaging with GFP-based probes in single-photon and two-photon microscopy, including time-lapse imaging in light-sheet systems. CyOFP1 also serves as an efficient acceptor for resonance energy transfer from the highly catalytic blue-emitting luciferase NanoLuc. An optimized fusion of CyOFP1 and NanoLuc, called Antares, functions as a highly sensitive bioluminescent reporter in vivo, producing substantially brighter signals from deep tissues than firefly luciferase and other bioluminescent proteins.
A method for non-invasive visualization of genetically labelled cells in animal disease models with micron-level resolution would greatly facilitate development of cell-based therapies. Imaging of fluorescent proteins (FPs) using red excitation light in the “optical window” above 600 nm is one potential method for visualizing implanted cells. However, previous efforts to engineer FPs with peak excitation beyond 600 nm have resulted in undesirable reductions in brightness. Here we report three new red-excitable monomeric FPs obtained by structure-guided mutagenesis of mNeptune, previously the brightest monomeric FP when excited beyond 600 nm. Two of these, mNeptune2 and mNeptune2.5, demonstrate improved maturation and brighter fluorescence, while the third, mCardinal, has a red-shifted excitation spectrum without reduction in brightness. We show that mCardinal can be used to non-invasively and longitudinally visualize the differentiation of myoblasts and stem cells into myocytes in living mice with high anatomical detail.
Although proteins populate large structural ensembles, X-ray diffraction data are traditionally interpreted using a single model. To search for evidence of alternate conformers, we developed a program, Ringer, which systematically samples electron density around the dihedral angles of protein side chains. In a diverse set of 402 structures, Ringer identified weak, nonrandom electron-density features that suggest of the presence of hidden, lowly populated conformations for >18% of uniquely modeled residues. Although these peaks occur at electron-density levels traditionally regarded as noise, statistically significant (P < 10 25 ) enrichment of peaks at successive rotameric v angles validates the assignment of these features as unmodeled conformations. Weak electron density corresponding to alternate rotamers also was detected in an accurate electron density map free of model bias. Ringer analysis of the high-resolution structures of free and peptide-bound calmodulin identified shifts in ensembles and connected the alternate conformations to ligand recognition. These results show that the signal in high-resolution electron density maps extends below the traditional 1 r cutoff, and crystalline proteins are more polymorphic than current crystallographic models. Ringer provides an objective, systematic method to identify previously undiscovered alternate conformations that can mediate protein folding and function.
Ferritin nanocages synthesize ferric oxide minerals, containing hundreds to thousands of Fe(III) diferric oxo/hydroxo complexes, by reactions of Fe(II) ions with O2 at multiple di-iron catalytic centers. Ferric–oxy multimers, tetramers, and/or larger mineral nuclei form during postcatalytic transit through the protein cage, and mineral accretion occurs in the central cavity. We determined how Fe(II) substrates can access catalytic sites using frog M ferritins, active and inactivated by ligand substitution, crystallized with 2.0 M Mg(II) ± 0.1 M Co(II) for Co(II)-selective sites. Co(II) inhibited Fe(II) oxidation. High-resolution (<1.5 Å) crystal structures show (1) a line of metal ions, 15 Å long, which penetrates the cage and defines ion channels and internal pores to the nanocavity that link external pores to the cage interior, (2) metal ions near negatively charged residues at the channel exits and along the inner cavity surface that model Fe(II) transit to active sites, and (3) alternate side-chain conformations, absent in ferritins with catalysis eliminated by amino acid substitution, which support current models of protein dynamics and explain changes in Fe–Fe distances observed during catalysis. The new structural data identify a ~27-Å path Fe(II) ions can follow through ferritin entry channels between external pores and the central cavity and along the cavity surface to the active sites where mineral synthesis begins. This “bucket brigade” for Fe(II) ion access to the ferritin catalytic sites not only increases understanding of biological nanomineral synthesis but also reveals unexpected design principles for protein cage-based catalysts and nanomaterials.
Serine/threonine protein phosphatases are central mediators of phosphorylation-dependent signals in eukaryotes and a variety of pathogenic bacteria. Here, we report the crystal structure of the intracellular catalytic domain of Mycobacterium tuberculosis PstPpp, a membrane-anchored phosphatase in the PP2C family. Despite sharing the fold and two-metal center of human PP2Calpha, the PstPpp catalytic domain binds a third Mn(2+) in a site created by a large shift in a previously unrecognized flap subdomain adjacent to the active site. Mutations in this site selectively increased the Michaelis constant for Mn(2+) in the reaction of a noncognate, small-molecule substrate, p-nitrophenyl phosphate. The PstP/Ppp structure reveals core functional motifs that advance the framework for understanding the mechanisms of substrate recognition, catalysis, and regulation of PP2C phosphatases.
The DNA dodecamer CATGGGCCCATG in a crystal structure of resolution 1.3 Å has a conformation intermediate between A and B DNA. This trapping of a stable intermediate suggests that the A and B DNA families are not discrete, as previously believed. The structure supports a base-centered rather than a backbone-centered mechanism for the A 7 B transition mediated by guanine tracts. Interconversion between A and B DNA provides another means for regulating protein-DNA recognition.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.