Characterization of the chloroplast proteome is needed to understand the essential contribution of the chloroplast to plant growth and development. Here we present a large-scale analysis by nanoLC-Q-TOF and nanoLC-LTQ-Orbitrap mass spectrometry (MS) of ten independent chloroplast preparations from Arabidopsis thaliana which unambiguously identified 1325 proteins. Novel proteins include various kinases and putative nucleotide-binding proteins. Based on repeated and independent MS-based protein identifications requiring multiple matched peptide sequences, as well as literature, 916 nuclear-encoded proteins were assigned with high confidence to the plastid, of which 86% had a predicted chloroplast transit peptide (cTP). The protein abundance of soluble stromal proteins was calculated from normalized spectral counts from LTQ-Orbitrap analysis and was found to cover four orders of magnitude. Comparison to gel-based quantification demonstrates that ‘spectral counting’ can provide large-scale protein quantification for Arabidopsis. This quantitative information was used to determine possible biases for protein targeting prediction by TargetP and also to understand the significance of protein contaminants. The abundance data for 550 stromal proteins were used to understand abundance of metabolic pathways and chloroplast processes. We highlight the abundance of 48 stromal proteins involved in post-translational proteome homeostasis (including aminopeptidases, proteases, deformylases, chaperones, protein sorting components) and discuss the biological implications. N-terminal modifications were identified for a subset of nuclear- and chloroplast-encoded proteins and a novel N-terminal acetylation motif was discovered. Analysis of cTPs and their cleavage sites of Arabidopsis chloroplast proteins, as well as their predicted rice homologues, identified new species-dependent features, which will facilitate improved subcellular localization prediction.
No evidence was found for suggested targeting via the secretory system. This study provides the most comprehensive chloroplast proteome analysis to date and an expanded Plant Proteome Database (PPDB) in which all MS data are projected on identified gene models.
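The abstract above states that stromal protein abundance was calculated from normalized spectral counts, but it does not give the normalization formula. As an illustration only, the sketch below assumes a length-normalized scheme (in the style of a normalized spectral abundance factor): each protein's spectral count is divided by its length, then by the sum of those ratios over all proteins. The function name, protein identifiers, counts, and lengths are all hypothetical and are not taken from the study.

```python
def normalized_spectral_abundance(spectral_counts, lengths):
    """Length-normalized spectral abundance (an assumed scheme, not the study's).

    spectral_counts: dict mapping protein id -> number of matched MS/MS spectra
    lengths:         dict mapping protein id -> protein length in residues
    Returns a dict mapping protein id -> relative abundance (values sum to 1).
    """
    # Divide each count by protein length so long proteins are not over-weighted.
    per_length = {p: spectral_counts[p] / lengths[p] for p in spectral_counts}
    total = sum(per_length.values())
    if total == 0:
        raise ValueError("no spectra were counted for any protein")
    # Rescale so that the values are comparable across runs.
    return {p: value / total for p, value in per_length.items()}


# Hypothetical input values, chosen only to show the calculation.
counts = {"protA": 200, "protB": 20, "protC": 2}
lengths = {"protA": 400, "protB": 200, "protC": 100}
nsaf = normalized_spectral_abundance(counts, lengths)

for protein, value in sorted(nsaf.items(), key=lambda kv: -kv[1]):
    print(f"{protein}\t{value:.4f}")
```

Dividing by length before rescaling is one common design choice; other normalizations (for example, by the number of theoretical tryptic peptides) would give different relative values.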
Experimental proteome analysis was combined with a genome-wide prediction screen to characterize the protein content of the thylakoid lumen of Arabidopsis chloroplasts. Soluble thylakoid proteins were separated by two-dimensional electrophoresis and identified by mass spectrometry. The identities of 81 proteins were established, and N termini were sequenced to validate localization prediction. Gene annotation of the identified proteins was corrected by experimental data, and an interesting case of alternative splicing was discovered. Expression of a surprising number of paralogs was detected. Expression of five isomerases of different classes suggests strong (un)folding activity in the thylakoid lumen. These isomerases possibly are connected to a network of peripheral and lumenal proteins involved in antioxidative response, including peroxiredoxins, m-type thioredoxins, and a lumenal ascorbate peroxidase. Characteristics of the experimentally identified lumenal proteins and their orthologs were used for a genome-wide prediction of the lumenal proteome. Lumenal proteins with a typical twin-arginine translocation motif were predicted with good accuracy and sensitivity and included additional isomerases and proteases. Thus, prime functions of the lumenal proteome include assistance in the folding and proteolysis of thylakoid proteins as well as protection against oxidative stress. Many of the predicted lumenal proteins must be present at concentrations at least 10,000-fold lower than proteins of the photosynthetic apparatus.
An extensive analysis of the Arabidopsis thaliana peripheral and integral thylakoid membrane proteome was performed by sequential extractions with salt, detergent, and organic solvents, followed by multidimensional protein separation steps (reverse-phase HPLC and one- and two-dimensional electrophoresis gels), different enzymatic and nonenzymatic protein cleavage techniques, mass spectrometry, and bioinformatics. Altogether, 154 proteins were identified, of which 76 (49%) were α-helical integral membrane proteins. Twenty-seven new proteins without known function but with predicted chloroplast transit peptides were identified, of which 17 (63%) are integral membrane proteins. These new proteins, likely important in thylakoid biogenesis, include two rubredoxins, a potential metallochaperone, and a new DnaJ-like protein. The data were integrated with our analysis of the lumenal-enriched proteome. We identified 83 out of 100 known proteins of the thylakoid localized photosynthetic apparatus, including several new paralogues and some 20 proteins involved in protein insertion, assembly, folding, or proteolysis. An additional 16 proteins are involved in translation, demonstrating that the thylakoid membrane surface is an important site for protein synthesis. The high coverage of the photosynthetic apparatus and the identification of known hydrophobic proteins with low expression levels, such as cpSecE, Ohp1, and Ohp2, indicate an excellent dynamic resolution of the analysis. The sequential extraction process proved very helpful to validate transmembrane prediction. Our data also were cross-correlated to chloroplast subproteome analyses by other laboratories. All data are deposited in a new curated plastid proteome database (PPDB) with multiple search functions (http://cbsusrv01.tc.cornell.edu/users/ppdb/). This PPDB will serve as an expandable resource for the plant community.
Tetradecameric Clp protease core complexes in nonphotosynthetic plastids of roots, flower petals, and in chloroplasts of leaves of Arabidopsis thaliana were purified based on native mass and isoelectric point and identified by mass spectrometry. The stoichiometry between the subunits was determined. The protease complex consisted of one to three copies of five different serine-type protease Clp proteins (ClpP1,3-6) and four non-proteolytic ClpR proteins (ClpR1-4). Three-dimensional homology modeling showed that the ClpP/R proteins fit well together in a tetradecameric complex and also indicated unique contributions for each protein. Lateral exit gates for proteolysis products are proposed. In addition, ClpS1,2, unique to land plants, tightly interacted with this core complex, with one copy of each per complex. The three-dimensional modeling showed that they fit well on the axial sites of the ClpPR cores. In contrast to plastids, plant mitochondria contained a single ~320-kDa homo-tetradecameric ClpP2 complex, without association of ClpR or ClpS proteins. It is surprising that the Clp core composition appears identical in all three plastid types, despite the remarkable differences in plastid proteome composition. This suggests that regulation of plastid proteolysis by the Clp machinery is not through differential regulation of ClpP/R/S gene expression, but rather through substrate recognition mechanisms and regulated interaction of chaperone-like molecules (ClpS1,2 and others) to the ClpP/R core.
Plastids are essential organelles of prokaryotic origin that are present in every plant cell and differentiate from proplastids into non-photosynthetic plastids in roots and flowers and photosynthetic plastids in leaves and stems.
Plastids are responsible for synthesis of key molecules required for the architecture and functions of plant cells. To maintain a correct stoichiometry between different proteins and pathways, to remove and recycle damaged or misfolded proteins, and to control gene expression by proteolysis of transcription or translation factors, different proteolytic systems are present in the plastid. Members of at least five protease families are present in plastids, but their structures, functions, substrates, and biological importance are poorly understood (2). A very prominent group of proteases in plants is the Clp protease family. Our latest analysis of the Arabidopsis thaliana nuclear genome indicates the presence of at least 26 Clp-related genes, with 15 genes encoding plastid-localized proteins (3) (Fig.
This study presents an analysis of the stromal proteome in its oligomeric state extracted from highly purified chloroplasts of Arabidopsis thaliana. 241 proteins (88% with predicted cTP), mostly assembled in oligomeric complexes, were identified by mass spectrometry with emphasis on distinguishing between paralogues. This is critical because different paralogues in a gene family often have different subcellular localizations and/or different expression patterns and functions. The native protein masses were determined for all identified proteins. Comparison with the few well characterized stromal complexes from A. thaliana confirmed the accuracy of the native mass determination, and by extension, the usefulness of the native mass data for future in-depth protein interaction studies. Resolved protein interactions are discussed and compared with an extensive collection of native mass data of orthologues in other plants and bacteria. Relative protein expression levels were estimated from spot intensities and also provided estimates of relative concentrations of individual proteins. No such quantification has been reported so far. Surprisingly, proteins dedicated to chloroplast protein synthesis, biogenesis, and fate represented nearly 10% of the total stroma protein mass. Oxidative pentose phosphate pathway, glycolysis, and Calvin cycle together represented about 75%, nitrogen assimilation represented 5-7%, and all other pathways such as biosynthesis of e.g. fatty acids, amino acids, nucleotides, tetrapyrroles, and vitamins B1 and B2 each represented less than 1% of total protein mass. Several proteins with diverse functions outside primary carbon metabolism, such as the isomerase ROC4, lipoxygenase 2 involved in jasmonic acid biosynthesis, and a carbonic anhydrase (CA1), were surprisingly abundant in the range of 0.75-1