One of the most amazing findings in molecular biology was the discovery that eukaryotic genes are discontinuous, interrupted by stretches of non-coding sequence. The subsequent realization that the intervening regions are removed from pre-mRNA transcripts via the activity of a common set of small nuclear RNAs (snRNAs), which assemble together with associated proteins into a spliceosome, was equally surprising. How do cells orchestrate the assembly of this molecular machine? And how does the spliceosome accurately recognize exons and introns to carry out the splicing reaction? Insights into these questions have been gained by studying the life cycle of spliceosomal snRNAs from their transcription, nuclear export and reimport, all the way through to their dynamic assembly into the spliceosome. This assembly process can also affect the regulation of alternative splicing and has implications for human disease.
Alternative splicing of pre-mRNAs is a major contributor to both proteomic diversity and control of gene expression levels. Splicing is tightly regulated in different tissues and developmental stages, and its disruption can lead to a wide range of human diseases. An important long-term goal in the splicing field is to determine a set of rules or ''code'' for splicing that will enable prediction of the splicing pattern of any primary transcript from its sequence. Outside of the core splice site motifs, the bulk of the information required for splicing is thought to be contained in exonic and intronic cis-regulatory elements that function by recruitment of sequence-specific RNA-binding protein factors that either activate or repress the use of adjacent splice sites. Here, we summarize the current state of knowledge of splicing cis-regulatory elements and their context-dependent effects on splicing, emphasizing recent global/genome-wide studies and open questions.Keywords: context dependence; pre-mRNA splicing; splicing code; splicing factor; splicing regulation PRE-MRNA SPLICING Because human genes typically contain multiple introns, the process of pre-mRNA splicing is an essential step in the expression of most genes. A majority of human genes undergo alternative splicing (AS), generating multiple splicing isoforms containing different combinations of exons (Johnson et al. 2003). The effects of AS on protein products can be dramatic, e.g., producing soluble versus membranebound forms of the Fas receptor that have opposing effects on apoptosis (Cascino et al. 1995), or producing isoforms of the Drosophila fruitless protein that act to specify sexual orientation (Demir and Dickson 2005). The major forms of AS are summarized in Figure 1A. Splicing regulation has been comprehensively described in several recent reviews (Black 2003;Konarska and Query 2005;Matlin et al. 2005;Blencowe 2006;House and Lynch 2008). Here, we briefly summarize some aspects of splicing specificity and regulation before turning to our main topic of splicing regulatory elements and the rules governing their activity.The sequential phosphodiester transfer reactions involved in splicing are catalyzed by large ribonucleoprotein complexes known as spliceosomes. Containing more than 100 core proteins and five small nuclear RNAs (snRNAs U1, U2, U4, U5, and U6), spliceosomes may be the most complex machines in the cell (Zhou et al. 2002;Jurica and Moore 2003;Nilsen 2003). In addition to these core factors, additional regulatory proteins participate in the splicing of particular pre-mRNAs. Splicing of most introns is thought to occur cotranscriptionally with fairly extensive interactions between splicing factors and the core transcription machinery ( CORE SPLICING SIGNALSThree sites, the 59 splice site (59ss), the 39 splice site (39ss), and the branch point sequence (BPS), participate in the splicing reaction and are present in every intron, and thus are known as the core splicing signals. These signals are recognized multiple times during spl...
Human genome-wide association studies have linked single nucleotide polymorphisms (SNPs) on chromosome 9p21.3 near the INK4/ARF (CDKN2a/b) locus with susceptibility to atherosclerotic vascular disease (ASVD). Although this locus encodes three well-characterized tumor suppressors, p16INK4a, p15INK4b, and ARF, the SNPs most strongly associated with ASVD are ∼120 kb from the nearest coding gene within a long non-coding RNA (ncRNA) known as ANRIL (CDKN2BAS). While individuals homozygous for the atherosclerotic risk allele show decreased expression of ANRIL and the coding INK4/ARF transcripts, the mechanism by which such distant genetic variants influence INK4/ARF expression is unknown. Here, using rapid amplification of cDNA ends (RACE) and analysis of next-generation RNA sequencing datasets, we determined the structure and abundance of multiple ANRIL species. Each of these species was present at very low copy numbers in primary and cultured cells; however, only the expression of ANRIL isoforms containing exons proximal to the INK4/ARF locus correlated with the ASVD risk alleles. Surprisingly, RACE also identified transcripts containing non-colinear ANRIL exonic sequences, whose expression also correlated with genotype and INK4/ARF expression. These non-polyadenylated RNAs resisted RNAse R digestion and could be PCR amplified using outward-facing primers, suggesting they represent circular RNA structures that could arise from by-products of mRNA splicing. Next-generation DNA sequencing and splice prediction algorithms identified polymorphisms within the ASVD risk interval that may regulate ANRIL splicing and circular ANRIL (cANRIL) production. These results identify novel circular RNA products emanating from the ANRIL locus and suggest causal variants at 9p21.3 regulate INK4/ARF expression and ASVD risk by modulating ANRIL expression and/or structure.
Exonic splicing silencers (ESSs) are cis-regulatory elements that inhibit the use of adjacent splice sites, often contributing to alternative splicing (AS). To systematically identify ESSs, an in vivo splicing reporter system was developed to screen a library of random decanucleotides. The screen yielded 141 ESS decamers, 133 of which were unique. The silencer activity of over a dozen of these sequences was also confirmed in a heterologous exon/intron context and in a second cell type. Of the unique ESS decamers, most could be clustered into groups to yield seven putative ESS motifs, some resembling known motifs bound by hnRNPs H and A1. Potential roles of ESSs in constitutive splicing were explored using an algorithm, ExonScan, which simulates splicing based on known or putative splicing-related motifs. ExonScan and related bioinformatic analyses suggest that these ESS motifs play important roles in suppression of pseudoexons, in splice site definition, and in AS.
Correct splice site recognition is critical in pre-mRNA splicing. We find that almost all of a diverse panel of exonic splicing silencer (ESS) elements alter splice site choice when placed between competing sites, consistently inhibiting use of intron-proximal 5' and 3' splice sites. Supporting a general role for ESSs in splice site definition, we found that ESSs are both abundant and highly conserved between alternative splice site pairs and that mutation of ESSs located between natural alternative splice site pairs consistently shifted splicing toward the intron-proximal site. Some exonic splicing enhancers (ESEs) promoted use of intron-proximal 5' splice sites, and tethering of hnRNP A1 and SF2/ASF proteins between competing splice sites mimicked the effects of ESS and ESE elements, respectively. Further, we observed that specific subsets of ESSs had distinct effects on a multifunctional intron retention reporter and that one of these subsets is likely preferred for regulation of endogenous intron retention events. Together, our findings provide a comprehensive picture of the functions of ESSs in the control of diverse types of splicing decisions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.