Proteins such as FUS phase separate to form liquid-like condensates that can harden into less dynamic structures. However, how these properties emerge from the collective interactions of many amino acids remains largely unknown. Here, we use extensive mutagenesis to identify a sequence-encoded molecular grammar underlying the driving forces of phase separation of proteins in the FUS family and test aspects of this grammar in cells. Phase separation is primarily governed by multivalent interactions among tyrosine residues from prion-like domains and arginine residues from RNA-binding domains, which are modulated by negatively charged residues. Glycine residues enhance the fluidity, whereas glutamine and serine residues promote hardening. We develop a model to show that the measured saturation concentrations of phase separation are inversely proportional to the product of the numbers of arginine and tyrosine residues. These results suggest it is possible to predict phase-separation properties based on amino acid sequences.
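The scaling stated in this abstract, with saturation concentration inversely proportional to the product of the arginine and tyrosine counts, can be illustrated with a minimal sketch. The function names, the proportionality constant `k`, and the toy sequences below are hypothetical and are not taken from the paper; the code shows only the relative trend, not fitted values.

```python
def count_residues(sequence: str, residue: str) -> int:
    """Count occurrences of a one-letter amino acid code in a sequence."""
    return sequence.upper().count(residue.upper())


def relative_csat(sequence: str, k: float = 1.0) -> float:
    """Return k / (N_Arg * N_Tyr) as an illustrative saturation-concentration score.

    k is an assumed, arbitrary proportionality constant (not a fitted parameter).
    """
    n_arg = count_residues(sequence, "R")
    n_tyr = count_residues(sequence, "Y")
    if n_arg == 0 or n_tyr == 0:
        raise ValueError("sequence needs at least one Arg and one Tyr")
    return k / (n_arg * n_tyr)


# Toy sequences (hypothetical, not real FUS-family sequences).
base = "GYGSRGYGQRG"         # 2 Tyr, 2 Arg
more_tyr = "GYGSRGYGQRGYYG"  # 4 Tyr, 2 Arg

if __name__ == "__main__":
    print(relative_csat(base))
    print(relative_csat(more_tyr))
```

Under this assumed form, doubling the number of tyrosines at a fixed arginine count halves the score, which is the qualitative behaviour the abstract describes.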
Many biomolecular condensates appear to form via spontaneous or driven processes that have the hallmarks of intracellular phase transitions. This suggests that a common underlying physical framework might govern the formation of functionally and compositionally unrelated biomolecular condensates. In this review, we summarize recent work that leverages a stickers-and-spacers framework adapted from the field of associative polymers for understanding how multivalent protein and RNA molecules drive phase transitions that give rise to biomolecular condensates. We discuss how the valence of stickers impacts the driving forces for condensate formation and elaborate on how stickers can be distinguished from spacers in different contexts. We touch on the impact of sticker- and spacer-mediated interactions on the rheological properties of condensates and show how the model can be mapped to known drivers of different types of biomolecular condensates.
Stressed cells shut down translation, release mRNA molecules from polysomes, and form stress granules (SGs) via a network of interactions that involve G3BP. Here we focus on the mechanistic underpinnings of SG assembly. We show that, under non-stress conditions, G3BP adopts a compact auto-inhibited state stabilized by electrostatic intramolecular interactions between the intrinsically disordered acidic tracts and the positively charged arginine-rich region. Upon release from polysomes, unfolded mRNAs outcompete G3BP auto-inhibitory interactions, engendering a conformational transition that facilitates clustering of G3BP through protein-RNA interactions. Subsequent physical crosslinking of G3BP clusters drives RNA molecules into networked RNA/protein condensates. We show that G3BP condensates impede RNA entanglement and recruit additional client proteins that promote SG maturation or induce a liquid-to-solid transition that may underlie disease. We propose that condensation coupled to conformational rearrangements and heterotypic multivalent interactions may be a general principle underlying RNP granule assembly.
The SARS-CoV-2 nucleocapsid (N) protein is an abundant RNA-binding protein critical for viral genome packaging, yet the molecular details that underlie this process are poorly understood. Here we combine single-molecule spectroscopy with all-atom simulations to uncover the molecular details that contribute to N protein function. N protein contains three dynamic disordered regions that house putative transiently-helical binding motifs. The two folded domains interact minimally such that full-length N protein is a flexible and multivalent RNA-binding protein. N protein also undergoes liquid-liquid phase separation when mixed with RNA, and polymer theory predicts that the same multivalent interactions that drive phase separation also engender RNA compaction. We offer a simple symmetry-breaking model that provides a plausible route through which single-genome condensation preferentially occurs over phase separation, suggesting that phase separation offers a convenient macroscopic readout of a key nanoscopic interaction.
Nuclear speckles are self-assembled organelles composed of RNAs and proteins. They are proposed to act as structural domains that control distinct steps in gene expression, including transcription, splicing and mRNA export. Earlier studies identified differential localization of a few components within the speckles. It was speculated that the spatial organization of speckle components might contribute directly to the order of operations that coordinate distinct processes. Here, by performing multi-color structured illumination microscopy, we characterized the multilayer organization of speckles at a higher resolution. We found that SON and SC35 (also known as SRSF2) localize to the central region of the speckle, whereas MALAT1 and small nuclear (sn)RNAs are enriched at the speckle periphery. Coarse-grained simulations indicate that the non-random organization arises due to the interplay between favorable sequence-encoded intermolecular interactions of speckle-resident proteins and RNAs. Finally, we observe a positive correlation between the total amount of RNA present within a speckle and the speckle size. These results imply that speckle size may be regulated to accommodate RNA accumulation and processing. Accumulation of RNA from various actively transcribed speckle-associated genes could contribute to the observed speckle size variations within a single cell.
The SARS-CoV-2 nucleocapsid (N) protein is an abundant RNA binding protein that plays a variety of roles in the viral life cycle including replication, transcription, and genome packaging. Despite its critical and multifunctional nature, the molecular details that underlie how N protein mediates these functions are poorly understood. Here we combine single-molecule spectroscopy with all-atom simulations to uncover the molecular details that contribute to the function of SARS-CoV-2 N protein. N protein contains three intrinsically disordered regions and two folded domains. All three disordered regions are highly dynamic and contain regions of transient helicity that appear to act as local binding interfaces for protein-protein or protein-RNA interactions. The two folded domains do not significantly interact with one another, such that full-length N protein is a flexible and multivalent RNA binding protein. As observed for other proteins with similar molecular features, we found that N protein undergoes liquid-liquid phase separation when mixed with RNA. Polymer models predict that the same multivalent interactions that drive phase separation also engender RNA compaction. We propose a simple model in which symmetry breaking through specific binding sites promotes the formation of metastable single-RNA condensates, as opposed to large multi-RNA phase-separated droplets. We speculate that RNA compaction to form dynamic single-genome condensates may underlie the early stages of genome packaging. As such, assays that measure how compounds modulate phase separation could provide a convenient tool for identifying drugs that disrupt viral packaging.
Transcriptional activation domains are essential for gene regulation, but their intrinsic disorder and low primary sequence conservation have made it difficult to identify the amino acid composition features that underlie their activity. Here, we describe a rational mutagenesis scheme that deconvolves the function of four activation domain sequence features (acidity, hydrophobicity, intrinsic disorder, and short linear motifs) by quantifying the activity of thousands of variants in vivo and simulating their conformational ensembles using an all-atom Monte Carlo approach. Our results with a canonical activation domain from the Saccharomyces cerevisiae transcription factor Gcn4 reconcile existing observations into a unified model of its function: the intrinsic disorder and acidic residues keep two hydrophobic motifs from driving collapse. Instead, the most-active variants keep their aromatic residues exposed to the solvent. Our results illustrate how the function of intrinsically disordered proteins can be revealed by high-throughput rational mutagenesis.
Collective phase transitions, including phase separation and gelation of multivalent protein and RNA molecules, appear to underlie the biogenesis of biomolecular condensates such as membraneless organelles. In vivo, these condensates encompass hundreds of distinct types of molecules that are often organized into multi-layered structures supporting the differential partitioning of molecules into distinct regions with distinct material properties. The interplay between driven (active) versus spontaneous (passive) processes that are required for enabling the formation of condensates with coexisting layers of distinct material properties remains unclear. Here, we investigate the role of spontaneous driving forces as determinants of protein-RNA condensates with complex morphologies and distinct material properties. Through the use of systematic in vitro experiments and simulations based on coarse-grained models, we find that the collective interactions among the simplest, biologically relevant proteins and archetypal RNA molecules are sufficient for driving the spontaneous emergence of multi-layered condensates with distinct material properties. Our results demonstrate that key properties of protein-RNA condensates, such as their overall morphologies, internal dynamics, and the selective partitioning of substrates, are governed by specific amino acid chemistries as well as RNA sequence and secondary structure. Our findings yield a clear set of heuristics regarding homo- and heterotypic interactions that are likely to be relevant for understanding the interplay between active and passive processes that control the formation of functional biomolecular condensates.