The COVID-19 pandemic has led to accelerated efforts to develop therapeutics and vaccines. A key target of these efforts is the spike (S) protein, which is metastable and difficult to produce recombinantly. Here, we characterized 100 structure-guided spike designs and identified 26 individual substitutions that increased protein yields and stability. Testing combinations of beneficial substitutions resulted in the identification of HexaPro, a variant with six beneficial proline substitutions exhibiting ~10-fold higher expression than its parental construct and the ability to withstand heat stress, storage at room temperature, and three freeze-thaw cycles. A 3.2 Å-resolution cryo-EM structure of HexaPro confirmed that it retains the prefusion spike conformation. High-yield production of a stabilized prefusion spike protein will accelerate the development of vaccines and serological diagnostics for SARS-CoV-2.
1The COVID-19 pandemic caused by the novel coronavirus SARS-CoV-2 has led to accelerated 2 efforts to develop therapeutics, diagnostics, and vaccines to mitigate this public health 3 emergency. A key target of these efforts is the spike (S) protein, a large trimeric class I fusion 4 protein that is metastable and difficult to produce recombinantly in large quantities. Here, we 5 designed and expressed over 100 structure-guided spike variants based upon a previously 6 determined cryo-EM structure of the prefusion SARS-CoV-2 spike. Biochemical, biophysical 7 and structural characterization of these variants identified numerous individual substitutions that 8 increased protein yields and stability. The best variant, HexaPro, has six beneficial proline 9 substitutions leading to ~10-fold higher expression than its parental construct and is able to 10 withstand heat stress, storage at room temperature, and multiple freeze-thaws. A 3.2 Å-resolution 11 cryo-EM structure of HexaPro confirmed that it retains the prefusion spike conformation. High-12 yield production of a stabilized prefusion spike protein will accelerate the development of 13 vaccines and serological diagnostics for SARS-CoV-2. 14 3 INTRODUCTION 15 Coronaviruses are enveloped viruses containing positive-sense RNA genomes. Four human 16 coronaviruses generally cause mild respiratory illness and circulate annually. However, SARS-17 CoV and MERS-CoV were acquired by humans via zoonotic transmission and caused outbreaks 18 of severe respiratory infections with high case-fatality rates in 2002 and 2012, respectively 1,2 . 19 SARS-CoV-2 is a novel betacoronavirus that emerged in Wuhan, China in December 2019 and 20 is the causative agent of the ongoing COVID-19 pandemic 3,4 . As of May 26, 2020, the WHO has 21 reported over 5 million cases and 350,000 deaths worldwide. Effective vaccines, therapeutic 22 antibodies and small-molecule inhibitors are urgently needed, and the development of these 23 interventions is proceeding rapidly. 24 Coronavirus virions are decorated with a spike (S) glycoprotein that binds to host-cell 25 receptors and mediates cell entry via fusion of the host and viral membranes 5 . S proteins are 26 trimeric class I fusion proteins that are expressed as a single polypeptide that is subsequently 27cleaved into S1 and S2 subunits by cellular proteases 6,7 . The S1 subunit contains the receptor-28 binding domain (RBD), which, in the case of SARS-CoV-2, recognizes the angiotensin-29 converting enzyme 2 (ACE2) receptor on the host-cell surface [8][9][10] . The S2 subunit mediates 30 membrane fusion and contains an additional protease cleavage site, referred to as S2′, that is 31 adjacent to a hydrophobic fusion peptide. Binding of the RBD to ACE2 triggers S1 dissociation, 32 allowing for a large rearrangement of S2 as it transitions from a metastable prefusion 33 conformation to a highly stable postfusion conformation 6,11 . During this rearrangement, the 34 fusion peptide is inserted into the host-cell membrane after cleavage at S2′, and two h...
The molecular composition and binding epitopes of the immunoglobulin G (IgG) antibodies that circulate in blood plasma after severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection are unknown. Proteomic deconvolution of the IgG repertoire to the spike glycoprotein in convalescent subjects revealed that the response is directed predominantly (>80%) against epitopes residing outside the receptor binding domain (RBD). In one subject, just four IgG lineages accounted for 93.5% of the response, including an amino (N)-terminal domain (NTD)–directed antibody that was protective against lethal viral challenge. Genetic, structural, and functional characterization of a multidonor class of “public” antibodies revealed an NTD epitope that is recurrently mutated among emerging SARS-CoV-2 variants of concern. These data show that “public” NTD-directed and other non-RBD plasma antibodies are prevalent and have implications for SARS-CoV-2 protection and antibody escape.
We sequenced the genomes of 5,085 SARS-CoV-2 strains causing two COVID-19 disease waves in metropolitan Houston, Texas, an ethnically diverse region with seven million residents. The genomes were from viruses recovered in the earliest recognized phase of the pandemic in Houston, and an ongoing massive second wave of infections. The virus was originally introduced into Houston many times independently. Virtually all strains in the second wave have a Gly614 amino acid replacement in the spike protein, a polymorphism that has been linked to increased transmission and infectivity. Patients infected with the Gly614 variant strains had significantly higher virus loads in the nasopharynx on initial diagnosis. We found little evidence of a significant relationship between virus genotypes and altered virulence, stressing the linkage between disease severity, underlying medical conditions, and host genetics. Some regions of the spike protein - the primary target of global vaccine efforts - are replete with amino acid replacements, perhaps indicating the action of selection. We exploited the genomic data to generate defined single amino acid replacements in the receptor binding domain of spike protein that, importantly, produced decreased recognition by the neutralizing monoclonal antibody CR30022. Our study is the first analysis of the molecular architecture of SARS-CoV-2 in two infection waves in a major metropolitan region. The findings will help us to understand the origin, composition, and trajectory of future infection waves, and the potential effect of the host immune response and therapeutic maneuvers on SARS-CoV-2 evolution.
Fragile X syndrome (FXS) is caused by silencing of the FMR1 gene, which encodes a protein with a critical role in synaptic plasticity. The molecular abnormality underlying FMR1 silencing, CGG repeat expansion, is well characterized; however, delineation of the pathway from DNA to RNA to protein using biosamples from well characterized patients with FXS is limited. Since FXS is a common and prototypical genetic disorder associated with intellectual disability (ID) and autism spectrum disorder (ASD), a comprehensive assessment of the FMR1 DNA-RNA-protein pathway and its correlations with the neurobehavioral phenotype is a priority. We applied nine sensitive and quantitative assays evaluating FMR1 DNA, RNA, and FMRP parameters to a reference set of cell lines representing the range of FMR1 expansions. We then used the most informative of these assays on blood and buccal specimens from cohorts of patients with different FMR1 expansions, with emphasis on those with FXS (N = 42 total, N = 31 with FMRP measurements). The group with FMRP data was also evaluated comprehensively in terms of its neurobehavioral profile, which allowed molecular–neurobehavioral correlations. FMR1 CGG repeat expansions, methylation levels, and FMRP levels, in both cell lines and blood samples, were consistent with findings of previous FMR1 genomic and protein studies. They also demonstrated a high level of agreement between blood and buccal specimens. These assays further corroborated previous reports of the relatively high prevalence of methylation mosaicism (slightly over 50% of the samples). Molecular-neurobehavioral correlations confirmed the inverse relationship between overall severity of the FXS phenotype and decrease in FMRP levels (N = 26 males, mean 4.2 ± 3.3 pg FMRP/ng genomic DNA). Other intriguing findings included a significant relationship between the diagnosis of FXS with ASD and two-fold lower levels of FMRP (mean 2.8 ± 1.3 pg FMRP/ng genomic DNA, p = 0.04), in particular observed in younger age- and IQ-adjusted males (mean age 6.9 ± 0.9 years with mean 3.2 ± 1.2 pg FMRP/ng genomic DNA, 57% with severe ASD), compared to FXS without ASD. Those with severe ID had even lower FMRP levels independent of ASD status in the male-only subset. The results underscore the link between FMR1 expansion, gene methylation, and FMRP deficit. The association between FMRP deficiency and overall severity of the neurobehavioral phenotype invites follow up studies in larger patient cohorts. They would be valuable to confirm and potentially extend our initial findings of the relationship between ASD and other neurobehavioral features and the magnitude of FMRP deficit. Molecular profiling of individuals with FXS may have important implications in research and clinical practice.
We sequenced the genomes of 5,085 severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) strains causing two coronavirus disease 2019 (COVID-19) disease waves in metropolitan Houston, TX, an ethnically diverse region with 7 million residents. The genomes were from viruses recovered in the earliest recognized phase of the pandemic in Houston and from viruses recovered in an ongoing massive second wave of infections. The virus was originally introduced into Houston many times independently. Virtually all strains in the second wave have a Gly614 amino acid replacement in the spike protein, a polymorphism that has been linked to increased transmission and infectivity. Patients infected with the Gly614 variant strains had significantly higher virus loads in the nasopharynx on initial diagnosis. We found little evidence of a significant relationship between virus genotype and altered virulence, stressing the linkage between disease severity, underlying medical conditions, and host genetics. Some regions of the spike protein—the primary target of global vaccine efforts—are replete with amino acid replacements, perhaps indicating the action of selection. We exploited the genomic data to generate defined single amino acid replacements in the receptor binding domain of spike protein that, importantly, produced decreased recognition by the neutralizing monoclonal antibody CR3022. Our report represents the first analysis of the molecular architecture of SARS-CoV-2 in two infection waves in a major metropolitan region. The findings will help us to understand the origin, composition, and trajectory of future infection waves and the potential effect of the host immune response and therapeutic maneuvers on SARS-CoV-2 evolution. IMPORTANCE There is concern about second and subsequent waves of COVID-19 caused by the SARS-CoV-2 coronavirus occurring in communities globally that had an initial disease wave. Metropolitan Houston, TX, with a population of 7 million, is experiencing a massive second disease wave that began in late May 2020. To understand SARS-CoV-2 molecular population genomic architecture and evolution and the relationship between virus genotypes and patient features, we sequenced the genomes of 5,085 SARS-CoV-2 strains from these two waves. Our report provides the first molecular characterization of SARS-CoV-2 strains causing two distinct COVID-19 disease waves.
CRISPR-Cas systems protect bacteria and archaea from phages and other mobile genetic elements, which use small anti-CRISPR (Acr) proteins to overcome CRISPR-Cas immunity. Because Acrs are challenging to identify, their natural diversity and impact on microbial ecosystems are underappreciated. To overcome this discovery bottleneck, we developed a high-throughput functional selection to isolate ten DNA fragments from human oral and fecal metagenomes that inhibit Streptococcus pyogenes Cas9 (SpyCas9) in Escherichia coli. The most potent Acr from this set, AcrIIA11, was recovered from a Lachnospiraceae phage. We found that AcrIIA11 inhibits SpyCas9 in bacteria and in human cells. AcrIIA11 homologs are distributed across diverse bacteria; many distantly-related homologs inhibit both SpyCas9 and a divergent Cas9 from Treponema denticola. We find that AcrIIA11 antagonizes SpyCas9 using a different mechanism than other previously characterized Type II-A Acrs. Our study highlights the power of functional selection to uncover widespread Cas9 inhibitors within diverse microbiomes.
The severe acute respiratory syndrome coronavirus 2 spike protein is a critical component of coronavirus disease 2019 vaccines and diagnostics and is also a therapeutic target. However, the spike protein is difficult to produce recombinantly because it is a large trimeric class I fusion membrane protein that is metastable and heavily glycosylated. We recently developed a prefusion-stabilized spike variant, termed HexaPro for six stabilizing proline substitutions, that can be expressed with a yield of >30 mg/L in ExpiCHO cells. This protocol describes an optimized workflow for expressing and biophysically characterizing rationally engineered spike proteins in Freestyle 293 and ExpiCHO cell lines. Although we focus on HexaPro, this protocol has been used to purify over a hundred different spike variants in our laboratories. We also provide guidance on expression quality control, long-term storage, and uses in enzyme-linked immunosorbent assays. The entire protocol, from transfection to biophysical characterization, can be completed in 7 d by researchers with basic tissue cell culture and protein purification expertise.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.