A mechanistic understanding of the SARS-CoV-2 viral replication cycle is essential to develop new therapies for the COVID-19 global health crisis. In this study, we show that the SARS-CoV-2 nucleocapsid protein (N-protein) undergoes liquid-liquid phase separation (LLPS) with the viral genome, and propose a model of viral packaging through LLPS. N-protein condenses with specific RNA sequences in the first 1000 nts (5'-End) under physiological conditions and is enhanced at human upper airway temperatures. N-protein condensates exclude non-packaged RNA sequences. We comprehensively map sites bound by N-protein in the 5'-End and find preferences for single-stranded RNA flanked by stable structured elements. Liquid-like N-protein condensates form in mammalian cells in a concentration-dependent manner and can be altered by small molecules. Condensation of N-protein is sequence and structure specific, sensitive to human body temperature, and manipulatable with small molecules thus presenting screenable processes for identifying antiviral compounds effective against SARS-CoV-2.
IntroductionThe outbreak of COVID-19, caused by the severe acute respiratory syndrome-related coronavirus SARS-CoV-2, is a global public health crisis. Coronaviruses, including SARS-CoV-2, are RNA viruses with ~30 kb genomes that are replicated and packaged in host cells. Packaging is thought to be highly specific for the complete viral genome (gRNA), and excludes host RNA and abundant virus-produced subgenomic RNAs (1). Viral replication and gRNA packaging depends on the nucleocapsid protein (N-protein) (2, 3). The N-protein has two RNA-binding domains, forms multimers (4) and is predicted to contain intrinsically disordered regions ( Figure 1A). N-protein thus has hallmarks of proteins that undergo liquid-liquid phase separation (LLPS), a process which may provide selectivity and efficiency to viral replication and packaging.
N-protein phase separates with viral RNA in a length, sequence and concentration dependent mannerWe reconstituted purified N-protein under physiological buffer conditions with viral RNA segments and observed that N-protein produced in mammalian cells (post-translationally modified) or bacteria (unmodified) phase separated with viral RNA segments. However, unmodified protein yielded larger and more abundant droplets ( Figure S1A). Since N-protein in SARS-CoV1 virions is hypophoshorylated (5) and packaging (initiated by binding of N-protein to gRNA) first occurs in the cytoplasm of coronaviruses (6,7), where N-protein is thought to be in its unphosphorylated state (8), we used unmodified protein for subsequent experiments.Pure N-protein demixed into droplets on its own and phase separation was enhanced by fulllength genomic SARS-CoV-2 RNA ( Figure 1B). To determine if certain segments of SARS-CoV-2 genome had preferential ability to drive phase separation, we identified regions of the gRNA under synonymous codon constraints. We hypothesized that LLPS occurs specifically with gRNA carrying a viral packaging signal(s), whose ex...