Coilin is a marker protein for subnuclear organelles known as Cajal bodies, which are sites of various RNA metabolic processes including the biogenesis of spliceosomal small nuclear ribonucleoprotein particles. Through self-associations and interactions with other proteins and RNA, coilin provides a structural scaffold for Cajal body formation. However, despite a conspicuous presence in Cajal bodies, most coilin is dispersed in the nucleoplasm and expressed in cell types that lack these organelles. The molecular function of coilin, particularly of the substantial nucleoplasmic fraction, remains uncertain. We identified coilin loss-of-function mutations in a genetic screen for mutants showing either reduced or enhanced expression of an alternatively spliced GFP reporter gene in Arabidopsis thaliana. The coilin mutants feature enhanced GFP fluorescence and diminished Cajal bodies compared with wild-type plants. The amount of GFP protein is several-fold higher in the coilin mutants owing to elevated GFP transcript levels and more efficient splicing to produce a translatable GFP mRNA. Genome-wide RNA-sequencing data from two distinct coilin mutants revealed a small, shared subset of differentially expressed genes, many encoding stress-related proteins, and, unexpectedly, a trend toward increased splicing efficiency. These results suggest that coilin attenuates splicing and modulates transcription of a select group of genes. The transcriptional and splicing changes observed in coilin mutants are not accompanied by gross phenotypic abnormalities or dramatically altered stress responses, supporting a role for coilin in fine tuning gene expression. Our GFP reporter gene provides a sensitive monitor of coilin activity that will facilitate further investigations into the functions of this enigmatic protein.
In a genetic screen for mutants showing modified splicing of an alternatively spliced GFP reporter gene in Arabidopsis thaliana, we identified mutations in genes encoding the putative U1 small nuclear ribonucleoprotein (snRNP) factors RBM25 and PRP39a. The latter has not yet been studied for its role in pre-messenger RNA (pre-mRNA) splicing in plants. Both proteins contain predicted RNA-binding domains and have been implicated in 5′ splice site selection in yeast and metazoan cells. In rbm25 mutants, splicing efficiency of GFP pre-mRNA was reduced and GFP protein levels lowered relative to wild-type plants. By contrast, prp39a mutants exhibited preferential splicing of a U2-type AT-AC intron in GFP pre-mRNA and elevated levels of GFP protein. These opposing findings indicate that impaired function of either RBM25 or PRP39a can differentially affect the same pre-mRNA substrate. Given a prior genome-wide analysis of alternative splicing in rbm25 mutants, we focused on examining the alternative splicing landscape in prp39a mutants. RNA-seq experiments performed using two independent prp39a alleles revealed hundreds of common genes undergoing changes in alternative splicing, including PRP39a itself, a second putative U1 snRNP component PRP40b, and genes encoding a number of general transcription-related proteins. The prp39a mutants displayed somewhat delayed flowering, shorter stature, and reduced seed set but no other obvious common defects under normal conditions. Mutations in PRP39b, the paralog of PRP39a, did not visibly alter GFP expression, indicating the paralogs are not functionally equivalent in this system. Our study provides new information on the contribution of PRP39a to alternative splicing and expands knowledge of plant splicing factors.
To identify regulators of pre-mRNA splicing in plants, we developed a forward genetic screen based on an alternatively spliced GFP reporter gene in Arabidopsis thaliana. In wild-type plants, three major splice variants issue from the GFP gene but only one represents a translatable GFP mRNA. Compared to wild-type seedlings, which exhibit an intermediate level of GFP expression, mutants identified in the screen feature either a "GFP-weak" or "Hyper-GFP" phenotype depending on the ratio of the three splice variants. GFP-weak mutants, including previously identified prp8 and rtf2, contain a higher proportion of unspliced transcript or canonically spliced transcript, neither of which is translatable into GFP protein. In contrast, the coilindeficient hyper-gfp1 (hgf1) mutant displays a higher proportion of translatable GFP mRNA, which arises from enhanced splicing of a U2-type intron with noncanonical AT-AC splice sites. Here we report three new hgf mutants that are defective, respectively, in spliceosome-associated proteins SMU1, SmF, and CWC16, an Yju2/CCDC130-related protein that has not yet been described in plants. The smu1 and cwc16 mutants have substantially increased levels of translatable GFP transcript owing to preferential splicing of the U2-type AT-AC intron, suggesting that SMU1 and CWC16 influence splice site selection in GFP pre-mRNA. Genome-wide analyses of splicing in smu1 and cwc16 mutants revealed a number of introns that were variably spliced from endogenous pre-mRNAs. These results indicate that SMU1 and CWC16, which are predicted to act directly prior to and during the first catalytic step of splicing, respectively, function more generally to modulate splicing patterns in plants.
Green fluorescent protein (GFP) and related fluorescent proteins are widely used in biological research to monitor gene expression and protein localization in living cells. The GFP chromophore is generated spontaneously in the presence of oxygen by a multi-step reaction involving cyclization of the internal tripeptide Ser65 (or Thr65)-Tyr66-Gly67, which is embedded in the center of an 11-stranded β-barrel structure. Random and site-specific mutagenesis has been used to optimize GFP fluorescence and create derivatives with novel properties. However, loss-of-function mutations that would aid in understanding GFP protein folding and chromophore formation have not been fully cataloged. Here we report a collection of ethyl methansulfonate–induced GFP loss-of-function mutations in the model plant Arabidopsis thaliana. Mutations that alter residues important for chromophore maturation, such as Arg96 and Ser205, greatly reduce or extinguish fluorescence without dramatically altering GFP protein accumulation. By contrast, other loss-of-fluorescence mutations substantially diminish the amount of GFP protein, suggesting that they compromise protein stability. Many mutations in this category generate substitutions of highly conserved glycine residues, including the following: Gly67 in the chromogenic tripeptide; Gly31, Gly33, and Gly35 in the second β-strand; and Gly20, Gly91, and Gly127 in the lids of the β-barrel scaffold. Our genetic analysis supports conclusions from structural and biochemical studies and demonstrates a critical role for multiple, highly conserved glycine residues in GFP protein stability.
Coconut palm (Cocos nucifera) is a symbol of the tropics and a source of numerous edible and nonedible products of economic value. Despite its nutritional and industrial significance, coconut remains under-represented in public repositories for genomic and transcriptomic data. We report de novo transcript assembly from RNA-seq data and analysis of gene expression in seed tissues (embryo and endosperm) and leaves of a dwarf coconut variety. Assembly of 10 GB sequencing data for each tissue resulted in 58,211 total unigenes in embryo, 61,152 in endosperm, and 33,446 in leaf. Within each unigene pool, 24,857 could be annotated in embryo, 29,731 could be annotated in endosperm, and 26,064 could be annotated in leaf. A KEGG analysis identified 138, 138, and 139 pathways, respectively, in transcriptomes of embryo, endosperm, and leaf tissues. Given the extraordinarily large size of coconut seeds and the importance of small RNA-mediated epigenetic regulation during seed development in model plants, we used homology searches to identify putative homologs of factors required for RNA-directed DNA methylation in coconut. The findings suggest that RNA-directed DNA methylation is important during coconut seed development, particularly in maturing endosperm. This dataset will expand the genomics resources available for coconut and provide a foundation for more detailed analyses that may assist molecular breeding strategies aimed at improving this major tropical crop.
SUMMARYShort interfering RNAs (siRNAs) homologous to transcriptional regulatory regions can induce RNA-directed DNA methylation (RdDM) and transcriptional gene silencing (TGS) of target genes. In our system, siRNAs are produced by transcribing an inverted DNA repeat (IR) of enhancer sequences, yielding a hairpin RNA that is processed by several Dicer activities into siRNAs of 21-24 nt. Primarily 24-nt siRNAs trigger RdDM of the target enhancer in trans and TGS of a downstream GFP reporter gene. We analyzed siRNA accumulation from two different structural forms of a trans-silencer locus in which tandem repeats are embedded in the enhancer IR and distinguished distinct RNA polymerase II (Pol II)-and Pol IV-dependent pathways of siRNA biogenesis. At the original silencer locus, Pol-II transcription of the IR from a 35S promoter produces a hairpin RNA that is diced into abundant siRNAs of 21-24 nt. A silencer variant lacking the 35S promoter revealed a normally masked Pol IV-dependent pathway that produces low levels of 24-nt siRNAs from the tandem repeats. Both pathways operate concurrently at the original silencer locus. siRNAs accrue only from specific regions of the enhancer and embedded tandem repeat. Analysis of these sequences and endogenous tandem repeats producing siRNAs revealed the preferential accumulation of siRNAs at GC-rich regions containing methylated CG dinucleotides. In addition to supporting a correlation between base composition, DNA methylation and siRNA accumulation, our results highlight the complexity of siRNA biogenesis at repetitive loci and show that Pol II and Pol IV use different promoters to transcribe the same template.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.