HEK293 Flp-In T-Rex were authenticated by STR analysis with The Center for Applied Genomics Genetic Analysis Facility (Sick Kids Hospital, Toronto). HeLa cells and primary fibroblasts were not independently authenticated Mycoplasma contaminationCell lines were routinely monitored for mycoplasma contamination as assessed by a commercial kit (MycoAlert, Lonza). Commonly misidentified lines (See ICLAC register)No commonly misidentified cell lines were used in this study.
Although we now routinely sequence human genomes, we can confidently identify only a fraction of the sequence variants that have a functional impact. Here, we developed a deep mutational scanning framework that produces exhaustive maps for human missense variants by combining random codon mutagenesis and multiplexed functional variation assays with computational imputation and refinement. We applied this framework to four proteins corresponding to six human genes: UBE2I (encoding SUMO E2 conjugase), SUMO1 (small ubiquitin‐like modifier), TPK1 (thiamin pyrophosphokinase), and CALM1/2/3 (three genes encoding the protein calmodulin). The resulting maps recapitulate known protein features and confidently identify pathogenic variation. Assays potentially amenable to deep mutational scanning are already available for 57% of human disease genes, suggesting that DMS could ultimately map functional variation for all human disease genes.
Nano-flow liquid chromatography tandem mass spectrometry (nano-flow LC-MS/MS) is the mainstay in proteome research because of its excellent sensitivity but often comes at the expense of robustness. Here we show that micro-flow LC-MS/MS using a 1 × 150 mm column shows excellent reproducibility of chromatographic retention time (<0.3% coefficient of variation, CV) and protein quantification (<7.5% CV) using data from >2000 samples of human cell lines, tissues and body fluids. Deep proteome analysis identifies >9000 proteins and >120,000 peptides in 16 h and sample multiplexing using tandem mass tags increases throughput to 11 proteomes in 16 h. The system identifies >30,000 phosphopeptides in 12 h and protein-protein or protein-drug interaction experiments can be analyzed in 20 min per sample. We show that the same column can be used to analyze >7500 samples without apparent loss of performance. This study demonstrates that micro-flow LC-MS/MS is suitable for a broad range of proteomic applications.
High‐throughput binary protein interaction mapping is continuing to extend our understanding of cellular function and disease mechanisms. However, we remain one or two orders of magnitude away from a complete interaction map for humans and other major model organisms. Completion will require screening at substantially larger scales with many complementary assays, requiring further efficiency gains in proteome‐scale interaction mapping. Here, we report Barcode Fusion Genetics‐Yeast Two‐Hybrid (BFG‐Y2H), by which a full matrix of protein pairs can be screened in a single multiplexed strain pool. BFG‐Y2H uses Cre recombination to fuse DNA barcodes from distinct plasmids, generating chimeric protein‐pair barcodes that can be quantified via next‐generation sequencing. We applied BFG‐Y2H to four different matrices ranging in scale from ~25 K to 2.5 M protein pairs. The results show that BFG‐Y2H increases the efficiency of protein matrix screening, with quality that is on par with state‐of‐the‐art Y2H methods.
Viral replication is dependent on interactions between viral polypeptides and host proteins. Identifying virus-host protein interactions can thus uncover unique opportunities for interfering with the virus life cycle via novel drug compounds or drug repurposing. Importantly, many viral-host protein interactions take place at intracellular membranes and poorly soluble organelles, which are difficult to profile using classical biochemical purification approaches. Applying proximity-dependent biotinylation (BioID) with the fast-acting miniTurbo enzyme to 27 SARS-CoV-2 proteins in a lung adenocarcinoma cell line (A549), we detected 7810 proximity interactions (7382 of which are new for SARS-CoV-2) with 2242 host proteins (results available at covid19interactome.org). These results complement and dramatically expand upon recent affinity purification-based studies identifying stable host-virus protein complexes, and offer an unparalleled view of membrane-associated processes critical for viral production. Host cell organellar markers were also subjected to BioID in parallel, allowing us to propose modes of action for several viral proteins in the context of host proteome remodelling. In summary, our dataset identifies numerous high confidence proximity partners for SARS-CoV-2 viral proteins, and describes potential mechanisms for their effects on specific host cell functions.
INTRODUCTIONCompartmentalization is an essential characteristic of eukaryotic cells, ensuring that cellular processes are partitioned to defined subcellular locations. High throughput microscopy 1 and biochemical fractionation coupled with mass spectrometry 2-6 have helped to define the proteomes of multiple organelles and macromolecular structures. However, many compartments have remained refractory to such methods, partly due to lysis and purification artefacts and poor subcompartment resolution. Recently developed proximity-dependent biotinylation approaches such as BioID and APEX provide an alternative avenue for defining the composition of cellular compartments in living cells (e.g. 7-10 ). Here we report an extensive BioID-based proximity map of a human cell, comprising 192 markers from 32 different compartments that identifies 35,902 unique high confidence proximity interactions and localizes 4,145 proteins expressed in HEK293 cells. The recall of our localization predictions is on par with or better than previous large-scale mass spectrometry and microscopy approaches, but with higher localization specificity. In addition to assigning compartment and subcompartment localization for many previously unlocalized proteins, our data contain finegrained localization information that, for example, allowed us to identify proteins with novel roles in mitochondrial dynamics. As a community resource, we have created humancellmap.org, a website that allows exploration of our data in detail, and aids with the analysis of BioID experiments. BODYProximity-dependent labelling approaches have rapidly grown in popularity, as they provide a robust way to label the environment in which a protein resides in living cells 7,8 . In the most widely used of these techniques, BioID, a mutant E. coli biotin ligase -BirA* (R118G) -is fused in-frame with the coding sequence of a bait polypeptide of interest, and the resulting fusion protein expressed in cultured cells. While BirA* can activate biotin to biotinoyl-AMP, the abortive mutant enzyme exhibits a reduced affinity for the activated molecule. A reactive intermediate is thus released into the local environment that can react with free epsilon amine groups on nearby lysine residues 7 . This ability for BirA* to label a local environment has led to BioID being employed by multiple laboratories to define the composition, and in some cases the overall organization, of both membrane-bound and membraneless organelles (e.g. 7-10 ).Here, we set out to map a human cell by profiling markers (consisting of full-length proteins or targeting sequences) from 32 cellular compartments. These compartments include the cytosolic face of all membrane-bound organelles, the ER lumen, subcompartments of the nucleus and mitochondria, major membraneless organelles such as the centrosome and the nucleolus, and the main cytoskeletal structures (actin, microtubules and intermediate filaments). Several proteins were also queried throughout the endomembrane system to identify components enriched at locales a...
The de novo synthesis of fatty acids has emerged as a therapeutic target for various diseases including cancer. Since cancer cells are intrinsically buffered to combat metabolic stress, it is important to understand how cells may adapt to loss of de novo fatty acid biosynthesis. Here we use pooled genome-wide CRISPR screens to systematically map genetic interactions (GIs) in human HAP1 cells carrying a loss-of-function mutation in FASN , whose product catalyzes the formation of long-chain fatty acids. FASN mutant cells show a strong dependence on lipid uptake that is reflected in negative GIs with genes involved in the LDL receptor pathway, vesicle trafficking, and protein glycosylation. Further support for these functional relationships is derived from additional GI screens in query cell lines deficient for other genes involved in lipid metabolism, including LDLR , SREBF1 , SREBF2 , ACACA . Our GI profiles also identify a potential role for the previously uncharacterized gene LUR1 / C12orf49 in exogenous lipid uptake regulation through modulation of SREBF2 signalling in response to lipid starvation. Overall, our data highlight the genetic determinants underlying the cellular adaptation associated with loss of de novo fatty acid synthesis and demonstrate the power of systematic GI mapping for uncovering metabolic buffering mechanisms in human cells.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.