Identification of protein-protein interactions often provides insight into protein function, and many cellular processes are performed by stable protein complexes. We used tandem affinity purification to process 4,562 different tagged proteins of the yeast Saccharomyces cerevisiae. Each preparation was analysed by both matrix-assisted laser desorption/ionization-time of flight mass spectrometry and liquid chromatography tandem mass spectrometry to increase coverage and accuracy. Machine learning was used to integrate the mass spectrometry scores and assign probabilities to the protein-protein interactions. Among 4,087 different proteins identified with high confidence by mass spectrometry from 2,357 successful purifications, our core data set (median precision of 0.69) comprises 7,123 protein-protein interactions involving 2,708 proteins. A Markov clustering algorithm organized these interactions into 547 protein complexes averaging 4.9 subunits per complex, about half of them absent from the MIPS database, as well as 429 additional interactions between pairs of complexes. The data (all of which are available online) will help future studies on individual proteins as well as functional genomics and systems biology.
Defining the functional relationships between proteins is critical for understanding virtually all aspects of cell biology. Large-scale identification of protein complexes has provided one important step towards this goal; however, even knowledge of the stoichiometry, affinity and lifetime of every protein-protein interaction would not reveal the functional relationships between and within such complexes. Genetic interactions can provide functional information that is largely invisible to protein-protein interaction data sets. Here we present an epistatic miniarray profile (E-MAP) consisting of quantitative pairwise measurements of the genetic interactions between 743 Saccharomyces cerevisiae genes involved in various aspects of chromosome biology (including DNA replication/repair, chromatid segregation and transcriptional regulation). This E-MAP reveals that physical interactions fall into two well-represented classes distinguished by whether or not the individual proteins act coherently to carry out a common function. Thus, genetic interaction data make it possible to dissect functionally multi-protein complexes, including Mediator, and to organize distinct protein complexes into pathways. In one pathway defined here, we show that Rtt109 is the founding member of a novel class of histone acetyltransferases responsible for Asf1-dependent acetylation of histone H3 on lysine 56. This modification, in turn, enables a ubiquitin ligase complex containing the cullin Rtt101 to ensure genomic integrity during DNA replication.
Summary The explosion of sequence information in bacteria makes developing high-throughput, cost-effective approaches to matching genes with phenotypes imperative. Using E. coli as proof of principle, we show that combining large-scale chemical genomics with quantitative fitness measurements provides a high-quality data set rich in discovery. Probing growth profiles of a mutant library in hundreds of conditions in parallel yielded > 10,000 phenotypes that allowed us to study gene essentiality, discover leads for gene function and drug action, and understand higher-order organization of the bacterial chromosome. We highlight new information derived from the study, including insights into a gene involved in multiple antibiotic resistance and the synergy between a broadly used combinatory antibiotic therapy, trimethoprim and sulfonamides. This data set, publicly available at http://ecoliwiki.net/tools/chemgen/, is a valuable resource for both the microbiological and bioinformatic communities, as it provides high-confidence associations between hundreds of annotated and uncharacterized genes as well as inferences about the mode of action of several poorly understood drugs.
Human immunodeficiency virus (HIV) has a small genome and therefore relies heavily on the host cellular machinery to replicate. Identifying which host proteins and complexes come into physical contact with the viral proteins is crucial for a comprehensive understanding of how HIV rewires the host’s cellular machinery during the course of infection. Here we report the use of affinity tagging and purification mass spectrometry1-3 to determine systematically the physical interactions of all 18 HIV-1 proteins and polyproteins with host proteins in two different human cell lines (HEK293 and Jurkat). Using a quantitative scoring system that we call MiST, we identified with high confidence 497 HIV–human protein–protein interactions involving 435 individual human proteins, with ~40% of the interactions being identified in both cell types. We found that the host proteins hijacked by HIV, especially those found interacting in both cell types, are highly conserved across primates. We uncovered a number of host complexes targeted by viral proteins, including the finding that HIV protease cleaves eIF3d, a subunit of eukaryotic translation initiation factor 3. This host protein is one of eleven identified in this analysis that act to inhibit HIV replication. This data set facilitates a more comprehensive and detailed understanding of how the host machinery is manipulated during the course of HIV infection.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.