RNA-binding proteins are key regulators of gene expression, yet only a small fraction have been functionally characterized. Here we report a systematic analysis of the RNA motifs recognized by RNA-binding proteins, encompassing 205 distinct genes from 24 diverse eukaryotes. The sequence specificities of RNA-binding proteins display deep evolutionary conservation, and the recognition preferences for a large fraction of metazoan RNA-binding proteins can thus be inferred from their RNA-binding domain sequence. The motifs that we identify in vitro correlate well with in vivo RNA-binding data. Moreover, we can associate them with distinct functional roles in diverse types of post-transcriptional regulation, enabling new insights into the functions of RNA-binding proteins both in normal physiology and in human disease. These data provide an unprecedented overview of RNA-binding proteins and their targets, and constitute an invaluable resource for determining post-transcriptional regulatory mechanisms in eukaryotes.
The RNA-Binding Protein DataBase (RBPDB) is a collection of experimental observations of RNA-binding sites, both in vitro and in vivo, manually curated from primary literature. To build RBPDB, we performed a literature search for experimental binding data for all RNA-binding proteins (RBPs) with known RNA-binding domains in four metazoan species (human, mouse, fly and worm). In total, RPBDB contains binding data on 272 RBPs, including 71 that have motifs in position weight matrix format, and 36 sets of sequences of in vivo-bound transcripts from immunoprecipitation experiments. The database is accessible by a web interface which allows browsing by domain or by organism, searching and export of records, and bulk data downloads. Users can also use RBPDB to scan sequences for RBP-binding sites. RBPDB is freely available, without registration at http://rbpdb.ccbr.utoronto.ca/.
Metazoan genomes encode hundreds of RNA-binding proteins (RBPs) but RNA-binding preferences for relatively few RBPs have been well defined. Current techniques for determining RNA targets, including in vitro selection and RNA co-immunoprecipitation, require significant time and labor investment. Here we introduce RNAcompete, a method for the systematic analysis of RNA binding specificities that uses a single binding reaction to determine the relative preferences of RBPs for short RNAs that contain a complete range of k-mers in structured and unstructured RNA contexts. We tested RNAcompete by analyzing nine diverse RBPs (HuR, Vts1, FUSIP1, PTB, U1A, SF2/ASF, SLM2, RBM4 and YB1). RNAcompete identified expected and previously unknown RNA binding preferences. Using in vitro and in vivo binding data, we demonstrate that preferences for individual 7-mers identified by RNAcompete are a more accurate representation of binding activity than are conventional motif models. We anticipate that RNAcompete will be a valuable tool for the study of RNA-protein interactions.
SUMMARY LIN28 is a conserved RNA binding protein implicated in pluripotency, reprogramming and oncogenesis. Previously shown to act primarily by blocking let-7 microRNA (miRNA) biogenesis, here we elucidate distinct roles of LIN28 regulation via its direct messenger RNA (mRNA) targets. Through cross-linking and immunoprecipitation coupled with high-throughput sequencing (CLIP-seq) in human embryonic stem cells and somatic cells expressing exogenous LIN28, we have defined discrete LIN28 binding sites in a quarter of human transcripts. These sites revealed that LIN28 binds to GGAGA sequences enriched within loop structures in mRNAs, reminiscent of its interaction with let-7 miRNA precursors. Among LIN28 mRNA targets, we found evidence for LIN28 autoregulation and also direct but differing effects on the protein abundance of splicing regulators in somatic and pluripotent stem cells. Splicing-sensitive microarrays demonstrated that exogenous LIN28 expression causes widespread downstream alternative splicing changes. These findings identify important regulatory functions of LIN28 via direct mRNA interactions.
Metazoan genomes encode hundreds of RNA-binding proteins (RBPs). These proteins regulate post-transcriptional gene expression and have critical roles in numerous cellular processes including mRNA splicing, export, stability and translation. Despite their ubiquity and importance, the binding preferences for most RBPs are not well characterized. In vitro and in vivo studies, using affinity selection-based approaches, have successfully identified RNA sequence associated with specific RBPs; however, it is difficult to infer RBP sequence and structural preferences without specifically designed motif finding methods. In this study, we introduce a new motif-finding method, RNAcontext, designed to elucidate RBP-specific sequence and structural preferences with greater accuracy than existing approaches. We evaluated RNAcontext on recently published in vitro and in vivo RNA affinity selected data and demonstrate that RNAcontext identifies known binding preferences for several control proteins including HuR, PTB, and Vts1p and predicts new RNA structure preferences for SF2/ASF, RBM4, FUSIP1 and SLM2. The predicted preferences for SF2/ASF are consistent with its recently reported in vivo binding sites. RNAcontext is an accurate and efficient motif finding method ideally suited for using large-scale RNA-binding affinity datasets to determine the relative binding preferences of RBPs for a wide range of RNA sequences and structures.
The mechanisms instructing genesis of neuronal subtypes from mammalian neural precursors are not well understood. To address this issue, we have characterized the transcriptional landscape of radial glial precursors (RPs) in the embryonic murine cortex. We show that individual RPs express mRNA, but not protein, for transcriptional specifiers of both deep and superficial layer cortical neurons. Some of these mRNAs, including the superficial versus deep layer neuron transcriptional regulators Brn1 and Tle4, are translationally repressed by their association with the RNA-binding protein Pumilio2 (Pum2) and the 4E-T protein. Disruption of these repressive complexes in RPs mid-neurogenesis by knocking down 4E-T or Pum2 causes aberrant co-expression of deep layer neuron specification proteins in newborn superficial layer neurons. Thus, cortical RPs are transcriptionally primed to generate diverse types of neurons, and a Pum2/4E-T complex represses translation of some of these neuronal identity mRNAs to ensure appropriate temporal specification of daughter neurons.
A hallmark of inflammatory diseases is the excessive recruitment and influx of monocytes to sites of tissue damage and their ensuing differentiation into macrophages. Numerous stimuli are known to induce transcriptional changes associated with macrophage phenotype, but posttranscriptional control of human macrophage differentiation is less well understood. Here we show that expression levels of the RNA-binding protein Quaking (QKI) are low in monocytes and early human atherosclerotic lesions, but are abundant in macrophages of advanced plaques. Depletion of QKI protein impairs monocyte adhesion, migration, differentiation into macrophages and foam cell formation in vitro and in vivo. RNA-seq and microarray analysis of human monocyte and macrophage transcriptomes, including those of a unique QKI haploinsufficient patient, reveal striking changes in QKI-dependent messenger RNA levels and splicing of RNA transcripts. The biological importance of these transcripts and requirement for QKI during differentiation illustrates a central role for QKI in posttranscriptionally guiding macrophage identity and function.
Combination antibiotic therapies are being increasingly used in the clinic to enhance potency and counter drug resistance. However, the large search space of candidate drugs and dosage regimes makes the identification of effective combinations highly challenging. Here, we present a computational approach called INDIGO, which uses chemogenomics data to predict antibiotic combinations that interact synergistically or antagonistically in inhibiting bacterial growth. INDIGO quantifies the influence of individual chemical–genetic interactions on synergy and antagonism and significantly outperforms existing approaches based on experimental evaluation of novel predictions in Escherichia coli. Our analysis revealed a core set of genes and pathways (e.g. central metabolism) that are predictive of antibiotic interactions. By identifying the interactions that are associated with orthologous genes, we successfully estimated drug‐interaction outcomes in the bacterial pathogens Mycobacterium tuberculosis and Staphylococcus aureus, using the E. coli INDIGO model. INDIGO thus enables the discovery of effective combination therapies in less‐studied pathogens by leveraging chemogenomics data in model organisms.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.