Plant metabolism underpins many traits of ecological and agronomic importance. Plants produce numerous compounds to cope with their environments but the biosynthetic pathways for most of these compounds have not yet been elucidated. To engineer and improve metabolic traits, we need comprehensive and accurate knowledge of the organization and regulation of plant metabolism at the genome scale. Here, we present a computational pipeline to identify metabolic enzymes, pathways, and gene clusters from a sequenced genome. Using this pipeline, we generated metabolic pathway databases for 22 species and identified metabolic gene clusters from 18 species. This unified resource can be used to conduct a wide array of comparative studies of plant metabolism. Using the resource, we discovered a widespread occurrence of metabolic gene clusters in plants: 11,969 clusters from 18 species. The prevalence of metabolic gene clusters offers an intriguing possibility of an untapped source for uncovering new metabolite biosynthesis pathways. For example, more than 1,700 clusters contain enzymes that could generate a specialized metabolite scaffold (signature enzymes) and enzymes that modify the scaffold (tailoring enzymes). In four species with sufficient gene expression data, we identified 43 highly coexpressed clusters that contain signature and tailoring enzymes, of which eight were characterized previously to be functional pathways. Finally, we identified patterns of genome organization that implicate local gene duplication and, to a lesser extent, single gene transposition as having played roles in the evolution of plant metabolic gene clusters.
Summary Protein phosphorylation and acetylation are the two most abundant post‐translational modifications (PTMs) that regulate protein functions in eukaryotes. In plants, these PTMs have been investigated individually; however, their co‐occurrence and dynamics on proteins is currently unknown. Using Arabidopsis thaliana, we quantified changes in protein phosphorylation, acetylation and protein abundance in leaf rosettes, roots, flowers, siliques and seedlings at the end of day (ED) and at the end of night (EN). This identified 2549 phosphorylated and 909 acetylated proteins, of which 1724 phosphorylated and 536 acetylated proteins were also quantified for changes in PTM abundance between ED and EN. Using a sequential dual‐PTM workflow, we identified significant PTM changes and intersections in these organs and plant developmental stages. In particular, cellular process‐, pathway‐ and protein‐level analyses reveal that the phosphoproteome and acetylome predominantly intersect at the pathway‐ and cellular process‐level at ED versus EN. We found 134 proteins involved in core plant cell processes, such as light harvesting and photosynthesis, translation, metabolism and cellular transport, that were both phosphorylated and acetylated. Our results establish connections between PTM motifs, PTM catalyzing enzymes and putative substrate networks. We also identified PTM motifs for further characterization of the regulatory mechanisms that control cellular processes during the diurnal cycle in different Arabidopsis organs and seedlings. The sequential dual‐PTM analysis expands our understanding of diurnal plant cell regulation by PTMs and provides a useful resource for future analyses, while emphasizing the importance of analyzing multiple PTMs simultaneously to elucidate when, where and how they are involved in plant cell regulation.
Background Cassava is an important food crop in tropical and sub-tropical regions worldwide. In Africa, cassava production is widely affected by cassava mosaic disease (CMD), which is caused by the African cassava mosaic geminivirus that is transmitted by whiteflies. Cassava breeders often use a single locus, CMD2, for introducing CMD resistance into susceptible cultivars. The CMD2 locus has been genetically mapped to a 10-Mbp region, but its organization and genes as well as their functions are unknown. Results We report haplotype-resolved de novo assemblies and annotations of the genomes for the African cassava cultivar TME (tropical Manihot esculenta), which is the origin of CMD2, and the CMD-susceptible cultivar 60444. The assemblies provide phased haplotype information for over 80% of the genomes. Haplotype comparison identified novel features previously hidden in collapsed and fragmented cassava genomes, including thousands of allelic variants, inter-haplotype diversity in coding regions, and patterns of diversification through allele-specific expression. Reconstruction of the CMD2 locus revealed a highly complex region with nearly identical gene sets but limited microsynteny between the two cultivars. Conclusions The genome maps of the CMD2 locus in both 60444 and TME3, together with the newly annotated genes, will help the identification of the causal genetic basis of CMD2 resistance to geminiviruses. Our de novo cassava genome assemblies will also facilitate genetic mapping approaches to narrow the large CMD2 region to a few candidate genes for better informed strategies to develop robust geminivirus resistance in susceptible cassava cultivars.
Background Cassava (Manihot esculenta) is an important clonally propagated food crop in tropical and subtropical regions worldwide. Genetic gain by molecular breeding has been limited, partially because cassava is a highly heterozygous crop with a repetitive and difficult-to-assemble genome. Findings Here we demonstrate that Pacific Biosciences high-fidelity (HiFi) sequencing reads, in combination with the assembler hifiasm, produced genome assemblies at near complete haplotype resolution with higher continuity and accuracy compared to conventional long sequencing reads. We present 2 chromosome-scale haploid genomes phased with Hi-C technology for the diploid African cassava variety TME204. With consensus accuracy >QV46, contig N50 >18 Mb, BUSCO completeness of 99%, and 35k phased gene loci, it is the most accurate, continuous, complete, and haplotype-resolved cassava genome assembly so far. Ab initio gene prediction with RNA-seq data and Iso-Seq transcripts identified abundant novel gene loci, with enriched functionality related to chromatin organization, meristem development, and cell responses. During tissue development, differentially expressed transcripts of different haplotype origins were enriched for different functionality. In each tissue, 20–30% of transcripts showed allele-specific expression (ASE) differences. ASE bias was often tissue specific and inconsistent across different tissues. Direction-shifting was observed in <2% of the ASE transcripts. Despite high gene synteny, the HiFi genome assembly revealed extensive chromosome rearrangements and abundant intra-genomic and inter-genomic divergent sequences, with large structural variations mostly related to LTR retrotransposons. We use the reference-quality assemblies to build a cassava pan-genome and demonstrate its importance in representing the genetic diversity of cassava for downstream reference-guided omics analysis and breeding. Conclusions The phased and annotated chromosome pairs allow a systematic view of the heterozygous diploid genome organization in cassava with improved accuracy, completeness, and haplotype resolution. They will be a valuable resource for cassava breeding and research. Our study may also provide insights into developing cost-effective and efficient strategies for resolving complex genomes with high resolution, accuracy, and continuity.
Phosphorus absorbed in the form of phosphate (H 2 PO 4 À) is an essential but limiting macronutrient for plant growth and agricultural productivity. A comprehensive understanding of how plants respond to phosphate starvation is essential for the development of more phosphate-efficient crops. Here we employed label-free proteomics and phosphoproteomics to quantify protein-level responses to 48 h of phosphate versus phosphite (H 2 PO 3 À) resupply to phosphate-deprived Arabidopsis thaliana suspension cells. Phosphite is similarly sensed, taken up and transported by plant cells as phosphate, but cannot be metabolized or used as a nutrient. Phosphite is thus a useful tool for differentiating between non-specific processes related to phosphate sensing and transport and specific responses to phosphorus nutrition. We found that responses to phosphate versus phosphite resupply occurred mainly at the level of protein phosphorylation, complemented by limited changes in protein abundance, primarily in protein translation, phosphate transport and scavenging, and central metabolism proteins. Altered phosphorylation of proteins involved in core processes such as translation, RNA splicing and kinase signaling was especially important. We also found differential phosphorylation in response to phosphate and phosphite in 69 proteins, including splicing factors, translation factors, the PHT1;4 phosphate transporter and the HAT1 histone acetyltransferasepotential phosphoswitches signaling changes in phosphorus nutrition. Our study illuminates several new aspects of the phosphate starvation response and identifies important targets for further investigation and potential crop improvement.
Plant growth depends on the diurnal regulation of cellular processes, but it is not well understood if and how transcriptional regulation controls diurnal fluctuations at the protein level. Here, we report a high‐resolution Arabidopsis thaliana (Arabidopsis) leaf rosette proteome acquired over a 12 hr light:12 hr dark diurnal cycle and the phosphoproteome immediately before and after the light‐to‐dark and dark‐to‐light transitions. We quantified nearly 5,000 proteins and 800 phosphoproteins, of which 288 fluctuated in their abundance and 226 fluctuated in their phosphorylation status. Of the phosphoproteins, 60% were quantified for changes in protein abundance. This revealed six proteins involved in nitrogen and hormone metabolism that had concurrent changes in both protein abundance and phosphorylation status. The diurnal proteome and phosphoproteome changes involve proteins in key cellular processes, including protein translation, light perception, photosynthesis, metabolism and transport. The phosphoproteome at the light–dark transitions revealed the dynamics at phosphorylation sites in either anticipation of or response to a change in light regime. Phosphorylation site motif analyses implicate casein kinase II and calcium/calmodulin‐dependent kinases among the primary light–dark transition kinases. The comparative analysis of the diurnal proteome and diurnal and circadian transcriptome established how mRNA and protein accumulation intersect in leaves during the diurnal cycle of the plant.
Summary Cassava (Manihot esculenta Crantz) is one of the important staple foods in Sub‐Saharan Africa. It produces starchy storage roots that provide food and income for several hundred million people, mainly in tropical agriculture zones. Increasing cassava storage root and starch yield is one of the major breeding targets with respect to securing the future food supply for the growing population of Sub‐Saharan Africa. The Cassava Source–Sink (CASS) project aims to increase cassava storage root and starch yield by strategically integrating approaches from different disciplines. We present our perspective and progress on cassava as an applied research organism and provide insight into the CASS strategy, which can serve as a blueprint for the improvement of other root and tuber crops. Extensive profiling of different field‐grown cassava genotypes generates information for leaf, phloem, and root metabolic and physiological processes that are relevant for biotechnological improvements. A multi‐national pipeline for genetic engineering of cassava plants covers all steps from gene discovery, cloning, transformation, molecular and biochemical characterization, confined field trials, and phenotyping of the seasonal dynamics of shoot traits under field conditions. Together, the CASS project generates comprehensive data to facilitate conventional breeding strategies for high‐yielding cassava genotypes. It also builds the foundation for genome‐scale metabolic modelling aiming to predict targets and bottlenecks in metabolic pathways. This information is used to engineer cassava genotypes with improved source–sink relations and increased yield potential.
BackgroundReversible protein acetylation occurring on Lys-Ne has emerged as a key regulatory post-translational modification in eukaryotes. It is mediated by two groups of enzymes: lysine acetyltransferases (KATs) and lysine deacetylases (KDACs) that catalyze the addition and removal of acetyl groups from target proteins. Estimates indicate that protein acetylation is second to protein phosphorylation in abundance, with thousands of acetylated sites now identified in different subcellular compartments. Considering the important regulatory role of protein phosphorylation, elucidating the diversity of KATs and KDACs across photosynthetic eukaryotes is essential in furthering our understanding of the impact of reversible protein acetylation on plant cell processes.ResultsWe report a genome-scale analysis of lysine acetyltransferase (KAT)- and lysine deacetylase (KDAC)-families from 53 photosynthetic eukaryotes. KAT and KDAC orthologs were identified in sequenced genomes ranging from glaucophytes and algae to land plants and then analyzed for evolutionary relationships. Based on consensus molecular phylogenetic and subcellular localization data we found new sub-classes of enzymes in established KAT- and KDAC-families. Specifically, we identified a non-photosynthetic origin of the HD-tuin family KDACs, a new monocot-specific Class I HDA-family sub-class, and a phylogenetically distinct Class II algal/heterokont sub-class which maintains an ankyrin domain not conserved in land plant Class II KDACs. Protein structure analysis showed that HDA- and SRT-KDACs exist as bare catalytic subunits with highly conserved median protein length, while all KATs maintained auxiliary domains, with CBP- and TAFII250-KATs displaying protein domain gain and loss over the course of photosynthetic eukaryote evolution in addition to variable protein length. Lastly, promoter element enrichment analyses across species revealed conserved cis-regulatory sequences that support KAT and KDAC involvement in the regulation of plant development, cold/drought stress response, as well as cellular processes such as the circadian clock.ConclusionsOur results reveal new evolutionary, structural, and biological insights into the KAT- and KDAC-families of photosynthetic eukaryotes, including evolutionary parallels to protein kinases and protein phosphatases. Further, we provide a comprehensive annotation framework through our extensive phylogenetic analysis, from which future research investigating aspects of protein acetylation in plants can use to position new findings in a broader context.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-017-3894-0) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.