The genome of the flowering plant Arabidopsis thaliana has five chromosomes. Here we report the sequence of the largest, chromosome 1, in two contigs of around 14.2 and 14.6 megabases. The contigs extend from the telomeres to the centromeric borders, regions rich in transposons, retrotransposons and repetitive elements such as the 180-base-pair repeat. The chromosome represents 25% of the genome and contains about 6,850 open reading frames, 236 transfer RNAs (tRNAs) and 12 small nuclear RNAs. There are two clusters of tRNA genes at different places on the chromosome. One consists of 27 tRNA(Pro) genes and the other contains 27 tandem repeats of tRNA(Tyr)-tRNA(Tyr)-tRNA(Ser) genes. Chromosome 1 contains about 300 gene families with clustered duplications. There are also many repeat elements, representing 8% of the sequence.
Pre-mRNA alternative splicing is a conserved mechanism for eukaryotic cells to leverage existing genetic resources to create a diverse pool of protein products. It is regulated in coordination with other events in RNA metabolism such as transcription, polyadenylation, RNA transport, and nonsense-mediated decay via protein networks. SERINE/ARGININE-RICH 45 (SR45) is thought to be a neutral splicing regulator. It is orthologous to a component of the apoptosis and splicing-associated protein (ASAP) complex, which functions to regulate RNA metabolism at multiple levels. Within this context, we sought to understand why the sr45-1 mutant Arabidopsis has malformed flowers, delayed flowering time, and increased disease resistance. Prior studies revealed increased expression of some disease resistance genes and the flowering suppressor Flowering Locus C (FLC) in sr45-1 mutants and a physical association between SR45 and reproductive process-related RNAs. Here, we used Tandem Mass Tag-based quantitative mass spectrometry to compare protein abundance in inflorescences of Arabidopsis wild-type (Col-0) and sr45-1 mutant plants. A total of 7,206 proteins were quantified, of which 227 proteins exhibited significantly different accumulation. Only a small percentage of these proteins overlapped with the dataset of RNAs with altered expression. The proteomics results revealed that the sr45-1 mutant had increased amounts of enzymes for glucosinolate biosynthesis, which are important for disease resistance. Furthermore, the mutant inflorescence had a drastically reduced amount of the Sin3-associated protein 18 (SAP18), a second ASAP complex component, despite no significant reduction in SAP18 RNA. The third ASAP component protein, ACINUS, also had lower abundance without significant RNA changes in the sr45-1 mutant. To test the effect of SR45 on SAP18, a SAP18-GFP fusion protein was overproduced in transgenic Arabidopsis Col-0 and sr45-1 plants. Without SR45, SAP18-GFP accumulated less in the nucleus, the site of activity for the ASAP complex. Furthermore, transgenic sr45-1 mutants overproducing SAP18-GFP expressed even more FLC and had a more severe flowering delay than non-transgenic sr45-1 mutants. These results suggest that SR45 is required to maintain the wild-type level of SAP18 protein accumulation in the nucleus and that FLC-regulated flowering time depends on the correct expression and localization of the ASAP complex.
Arabidopsis thaliana is an important model system for plant biologists. In 1996 an international collaboration (the Arabidopsis Genome Initiative) was formed to sequence the whole genome of Arabidopsis and in 1999 the sequence of the first two chromosomes was reported. The sequence of the last three chromosomes and an analysis of the whole genome are reported in this issue. Here we present the sequence of chromosome 3, organized into four sequence segments (contigs). The two largest (13.5 and 9.2 Mb) correspond to the top (long) and the bottom (short) arms of chromosome 3, and the two small contigs are located in the genetically defined centromere. This chromosome encodes 5,220 of the roughly 25,500 predicted protein-coding genes in the genome. About 20% of the predicted proteins have significant homology to proteins in eukaryotic genomes for which the complete sequence is available, pointing to important conserved cellular functions among eukaryotes.