Rafael de Cesaris Araujo Tavares scite author profile

SARS-CoV-2 is the positive-sense RNA virus that causes COVID-19, a disease that has triggered a major human health and economic crisis. The genome of SARS-CoV-2 is unique among viral RNAs in its vast potential to form stable RNA structures and yet, as much as 97% of its 30 kilobases have not been structurally explored in the context of a viral infection. Our limited knowledge of SARS-CoV-2 genomic architecture is a fundamental limitation to both our mechanistic understanding of coronavirus life cycle and the development of COVID-19 RNA-based therapeutics. Here, we apply a novel long amplicon strategy to determine for the first time the secondary structure of the SARS-CoV-2 RNA genome probed in infected cells. In addition to the conserved structural motifs at the viral termini, we report new structural features like a conformationally flexible programmed ribosomal frameshifting pseudoknot, and a host of novel RNA structures, each of which highlights the importance of studying viral structures in their native genomic context. Our in-depth structural analysis reveals extensive networks of well-folded RNA structures throughout Orf1ab and reveals new aspects of SARS-CoV-2 genome architecture that distinguish it from other single-stranded, positive-sense RNA viruses. Evolutionary analysis of RNA structures in SARS-CoV-2 shows that several features of its genomic structure are conserved across beta coronaviruses and we pinpoint individual regions of well-folded RNA structure that merit downstream functional analysis. The native, complete secondary structure of SAR-CoV-2 presented here is a roadmap that will facilitate focused studies on mechanisms of replication, translation and packaging, and guide the identification of new RNA drug targets against COVID-19.

show abstract

The Global and Local Distribution of RNA Structure throughout the SARS-CoV-2 Genome

Tavares

Mahadeshwar

Wan

et al. 2021

J Virol

View full text Add to dashboard Cite

SARS-CoV-2 is the causative viral agent of COVID-19, the disease at the center of the current global pandemic. While knowledge of highly structured regions is integral for mechanistic insights into the viral infection cycle, very little is known about the location and folding stability of functional elements within the massive, ∼30kb SARS-CoV-2 RNA genome. In this study, we analyze the folding stability of this RNA genome relative to the structural landscape of other well-known viral RNAs. We present an in-silico pipeline to predict regions of high base pair content across long genomes and to pinpoint hotspots of well-defined RNA structures, a method that allows for direct comparisons of RNA structural complexity within the several domains in SARS-CoV-2 genome. We report that the SARS-CoV-2 genomic propensity for stable RNA folding is exceptional among RNA viruses, superseding even that of HCV, one of the most structured viral RNAs in nature. Furthermore, our analysis suggests varying levels of RNA structure across genomic functional regions, with accessory and structural ORFs containing the highest structural density in the viral genome. Finally, we take a step further to examine how individual RNA structures formed by these ORFs are affected by the differences in genomic and subgenomic contexts, which given the technical difficulty of experimentally separating cellular mixtures of sgRNA from gRNA, is a unique advantage of our in-silico pipeline. The resulting findings provide a useful roadmap for planning focused empirical studies of SARS-CoV-2 RNA biology, and a preliminary guide for exploring potential SARS-CoV-2 RNA drug targets. Importance The RNA genome of SARS-CoV-2 is among the largest and most complex viral genomes, and yet its RNA structural features remain relatively unexplored. Since RNA elements guide function in most RNA viruses, and they represent potential drug targets, it is essential to chart the architectural features of SARS-CoV-2 and pinpoint regions that merit focused study. Here we show that RNA folding stability of SARS-CoV-2 genome is exceptional among viral genomes and we develop a method to directly compare levels of predicted secondary structure across SARS-CoV-2 domains. Remarkably, we find that coding regions display the highest structural propensity in the genome, forming motifs that differ between the genomic and subgenomic contexts. Our approach provides an attractive strategy to rapidly screen for candidate structured regions based on base pairing potential and provides a readily interpretable roadmap to guide functional studies of RNA viruses and other pharmacologically relevant RNA transcripts.

show abstract

The global and local distribution of RNA structure throughout the SARS-CoV-2 genome

Tavares

Mahadeshwar

Pyle

2020

Preprint

View full text Add to dashboard Cite

AbstractSARS-CoV-2 is the causative viral agent of COVID-19, the disease at the center of the current global pandemic. While knowledge of highly structured regions is integral for mechanistic insights into the viral infection cycle, very little is known about the location and folding stability of functional elements within the massive, ~30kb SARS-CoV-2 RNA genome. In this study, we analyze the folding stability of this RNA genome relative to the structural landscape of other well-known viral RNAs. We present an in-silico pipeline to locate regions of high base pair content across this long genome and also identify well-defined RNA structures, a method that allows for direct comparisons of RNA structural complexity within the several domains in SARS-CoV-2 genome. We report that the SARS-CoV-2 genomic propensity to stable RNA folding is exceptional among RNA viruses, superseding even that of HCV, one of the most highly structured viral RNAs in nature. Furthermore, our analysis reveals varying levels of RNA structure across genomic functional regions, with accessory and structural ORFs containing the highest structural density in the viral genome. Finally, we take a step further to examine how individual RNA structures formed by these ORFs are affected by the differences in genomic and subgenomic contexts. The conclusions reported in this study provide a foundation for structure-function hypotheses in SARS-CoV-2 biology, and in turn, may guide the 3D structural characterization of potential RNA drug targets for COVID-19 therapeutics.

show abstract

Discovery of a Well-Folded Protein Interaction Hub Within the Human Long Non-Coding RNANORAD

Kumar

Wan

Perry

et al. 2023

Preprint

View full text Add to dashboard Cite

The long non-coding RNA NORAD functions in maintaining genomic stability in humans via sequestering Pumilio proteins from the cytoplasm, and thereby modulating the gene expression of mRNA targets of Pumilio proteins. Despite its role in fundamental cellular pathways including chromosome segregation and DNA damage response, there have been limited structural and biophysical descriptions of NORAD. Here, using an integrative approach combining chemical probing coupled to high throughput sequencing, and RNA-pull downs coupled with mass spectrometry, we discovered a well-folded and structured protein interaction hub within the functional core of NORAD. Our in vitro biochemical reconstitutions using purified recombinant proteins and a NORAD repeat unit region within this hub reveal the assembly of a higher-order multimeric RNA-protein complex.

show abstract

RSCanner: rapid assessment and visualization of RNA structure content

Mahadeshwar

Tavares

Wan

et al. 2023

View full text Add to dashboard Cite

Motivation The increasing availability of RNA structural information that spans many kilobases of transcript sequence imposes a need for tools that can rapidly screen, identify and prioritize structural modules of interest. Results We describe RSCanner, an automated tool that scans RNA transcripts for regions that contain high levels of secondary structure, and then classifies each region for its relative propensity to adopt stable or dynamic structures. RSCanner then generates an intuitive heatmap enabling users to rapidly pinpoint regions likely to contain a high or low density of discrete RNA structures, thereby informing downstream functional or structural investigation. Availability RSCanner is freely available as both R script and R Markdown files, along with full documentation and test data (https://github.com/pylelab/RSCanner). Supplementary information Supplementary data are available at Bioinformatics online.

show abstract

MRT-ModSeq – Rapid detection of RNA modifications with MarathonRT

Tavares

Mahadeshwar

Wan

et al. 2023

Preprint

View full text Add to dashboard Cite

Chemical modifications are essential regulatory elements that modulate the behavior and function of cellular RNAs. Despite recent advances in sequencing-based RNA modification mapping, methods combining accuracy and speed are still lacking. Here, we introduce MRT-ModSeq for rapid, simultaneous detection of multiple RNA modifications using MarathonRT. MRT-ModSeq employs distinct divalent cofactors to generate 2-D mutational profiles that are highly dependent on nucleotide identity and modification type. As a proof of concept, we use the MRT fingerprints of well-studied rRNAs to implement a general workflow for detecting RNA modifications. MRT-ModSeq rapidly detects positions of diverse modifications across a RNA transcript, enabling assignment of m1acp3Y, m1A, m3U, m7G and 2'-OMe locations through mutation-rate filtering and machine learning. m1A sites in sparsely modified targets, such as MALAT1 and PRUNE1 could also be detected. MRT-ModSeq can be trained on natural and synthetic transcripts to expedite detection of diverse RNA modification subtypes across targets of interest.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.