Nucleotide variants can cause functional changes by altering protein–RNA binding in various ways that are not easy to predict. This can affect processes such as splicing, nuclear shuttling, and stability of the transcript. Therefore, correct modeling of protein–RNA binding is critical when predicting the effects of sequence variations. Many RNA-binding proteins recognize a diverse set of motifs and binding is typically also dependent on the genomic context, making this task particularly challenging. Here, we present DeepCLIP, the first method for context-aware modeling and predicting protein binding to RNA nucleic acids using exclusively sequence data as input. We show that DeepCLIP outperforms existing methods for modeling RNA-protein binding. Importantly, we demonstrate that DeepCLIP predictions correlate with the functional outcomes of nucleotide variants in independent wet lab experiments. Furthermore, we show how DeepCLIP binding profiles can be used in the design of therapeutically relevant antisense oligonucleotides, and to uncover possible position-dependent regulation in a tissue-specific manner. DeepCLIP is freely available as a stand-alone application and as a webtool at http://deepclip.compbio.sdu.dk.
It is now widely accepted that aberrant splicing of constitutive exons is often caused by mutations affecting cis‐acting splicing regulatory elements, but there is a misconception that all exons have an equal dependency on splicing regulatory elements and thus a similar susceptibility to aberrant splicing. We investigated exonic mutations in ACADM exon 5 to experimentally examine their effect on splicing and found that 7 out of 11 tested mutations affected exon inclusion, demonstrating that this constitutive exon is particularly vulnerable to exonic splicing mutations. Employing ACADM exon 5 and 6 as models, we demonstrate that the balance between splicing enhancers and silencers, flanking intron length, and flanking splice site strength are important factors that determine exon definition and splicing efficiency of the exon in question. Our study shows that two constitutive exons in ACADM have different inherent vulnerabilities to exonic splicing mutations. This suggests that in silico prediction of potential pathogenic effects on splicing from exonic mutations may be improved by also considering the inherent vulnerability of the exon. Moreover, we show that single nucleotide polymorphism that affect either of two different exonic splicing silencers, located far apart in exon 5, all protect against both immediately flanking and more distant exonic splicing mutations.
Accuracy of pre‐messenger RNA (pre‐mRNA) splicing is crucial for normal gene expression. Complex regulation supports the spliceosomal distinction between authentic exons and the many seemingly functional splice sites delimiting pseudoexons. Pseudoexons are nonfunctional intronic sequences that can be activated for aberrant inclusion in mRNA, which may cause disease. Pseudoexon activation is very challenging to predict, in particular when activation occurs by sequence variants that alter the splicing regulatory environment without directly affecting splice sites. As pseudoexon inclusion often evades detection due to activation of nonsense‐mediated mRNA decay, and because conventional diagnostic procedures miss deep intronic sequence variation, pseudoexon activation is a heavily underreported disease mechanism. Pseudoexon characteristics have mainly been studied based on in silico predicted sequences. Moreover, because recognition of sequence variants that create or strengthen splice sites is possible by comparison with well‐established consensus sequences, this type of pseudoexon activation is by far the most frequently reported. Here we review all known human disease‐associated pseudoexons that carry functional splice sites and are activated by deep intronic sequence variants located outside splice site sequences. We delineate common characteristics that make this type of wild type pseudoexons distinct high‐risk sites in the human genome.
Costello syndrome (CS) may be caused by activating mutations in codon 12/13 of the HRAS proto-oncogene. HRAS p.Gly12Val mutations have the highest transforming activity, are very frequent in cancers, but very rare in CS, where they are reported to cause a severe, early lethal, phenotype. We identified an unusual, new germline p.Gly12Val mutation, c.35_36GC>TG, in a 12-year-old boy with attenuated CS. Analysis of his HRAS cDNA showed high levels of exon 2 skipping. Using wild type and mutant HRAS minigenes, we confirmed that c.35_36GC>TG results in exon 2 skipping by simultaneously disrupting the function of a critical Exonic Splicing Enhancer (ESE) and creation of an Exonic Splicing Silencer (ESS). We show that this vulnerability of HRAS exon 2 is caused by a weak 3’ splice site, which makes exon 2 inclusion dependent on binding of splicing stimulatory proteins, like SRSF2, to the critical ESE. Because the majority of cancer- and CS- causing mutations are located here, they affect splicing differently. Therefore, our results also demonstrate that the phenotype in CS and somatic cancers is not only determined by the different transforming potentials of mutant HRAS proteins, but also by the efficiency of exon 2 inclusion resulting from the different HRAS mutations. Finally, we show that a splice switching oligonucleotide (SSO) that blocks access to the critical ESE causes exon 2 skipping and halts proliferation of cancer cells. This unravels a potential for development of new anti-cancer therapies based on SSO-mediated HRAS exon 2 skipping.
Familial dysautonomia (FD) is a severe genetic disorder causing sensory and autonomic dysfunction. It is predominantly caused by a c.2204+6T>C mutation in the IKBKAP gene. This mutation decreases the 5′ splice site strength of IKBKAP exon 20 leading to exon 20 skipping and decreased amounts of full-length IKAP protein. We identified a binding site for the splicing regulatory protein hnRNP A1 downstream of the IKBKAP exon 20 5′-splice site. We show that hnRNP A1 binds to this splicing regulatory element (SRE) and that two previously described inhibitory SREs inside IKBKAP exon 20 are also bound by hnRNP A1. Knockdown of hnRNP A1 in FD patient fibroblasts increases IKBKAP exon 20 inclusion demonstrating that hnRNP A1 is a negative regulator of IKBKAP exon 20 splicing. Furthermore, by mutating the SREs in an IKBKAP minigene we show that all three SREs cause hnRNP A1-mediated exon repression. We designed splice switching oligonucleotides (SSO) that blocks the intronic hnRNP A1 binding site, and demonstrate that this completely rescues splicing of IKBKAP exon 20 in FD patient fibroblasts and increases the amounts of IKAP protein. We propose that this may be developed into a potential new specific treatment of FD.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.