The analysis of gene expression data has shown that transcriptionally coordinated (co-expressed) genes are often functionally related, enabling scientists to use expression data in gene function prediction. This Focused Review discusses our original paper (Large-scale co-expression approach to dissect secondary cell wall formation across plant species, Frontiers in Plant Science 2:23). In this paper we applied cross-species analysis to co-expression networks of genes involved in cellulose biosynthesis. We showed that the co-expression networks from different species are highly similar, indicating that whole biological pathways are conserved across species. This finding has two important implications. First, the analysis can transfer gene function annotation from well-studied plants, such as Arabidopsis, to other, uncharacterized plant species. As the analysis finds genes that have similar sequence and similar expression pattern across different organisms, functionally equivalent genes can be identified. Second, since co-expression analyses are often noisy, a comparative analysis should have higher performance, as parts of co-expression networks that are conserved are more likely to be functionally relevant. In this Focused Review, we outline the comparative analysis done in the original paper and comment on the recent advances and approaches that allow comparative analyses of co-function networks. We hypothesize that in comparison to simple co-expression analysis, comparative analysis would yield more accurate gene function predictions. Finally, by combining comparative analysis with genomic information of green plants, we propose a possible composition of cellulose biosynthesis machinery during earlier stages of plant evolution.
While Brachypodium distachyon (Brachypodium) is an emerging model for grasses, no expression atlas or gene coexpression network is available. Such tools are of high importance to provide insights into the function of Brachypodium genes. We present a detailed Brachypodium expression atlas, capturing gene expression in its major organs at different developmental stages. The data were integrated into a large-scale coexpression database ( www.gene2function.de), enabling identification of duplicated pathways and conserved processes across 10 plant species, thus allowing genome-wide inference of gene function. We highlight the importance of the atlas and the platform through the identification of duplicated cell wall modules, and show that a lignin biosynthesis module is conserved across angiosperms. We identified and functionally characterised a putative ferulate 5-hydroxylase gene through overexpression of it in Brachypodium, which resulted in an increase in lignin syringyl units and reduced lignin content of mature stems, and led to improved saccharification of the stem biomass. Our Brachypodium expression atlas thus provides a powerful resource to reveal functionally related genes, which may advance our understanding of important biological processes in grasses.
Selaginella moellendorffii is a representative of the lycophyte lineage that is studied to understand the evolution of land plant traits such as the vasculature, leaves, stems, roots, and secondary metabolism. However, only a few studies have investigated the expression and transcriptional coordination of Selaginella genes, precluding us from understanding the evolution of the transcriptional programs behind these traits. We present a gene expression atlas comprising all major organs, tissue types, and the diurnal gene expression profiles for S. moellendorffii. We show that the transcriptional gene module responsible for the biosynthesis of lignocellulose evolved in the ancestor of vascular plants and pinpoint the duplication and subfunctionalization events that generated multiple gene modules involved in the biosynthesis of various cell wall types. We demonstrate how secondary metabolism is transcriptionally coordinated and integrated with other cellular pathways. Finally, we identify root-specific genes and show that the evolution of roots did not coincide with an increased appearance of gene families, suggesting that the development of new organs does not coincide with increased fixation of new gene functions. Our updated database at conekt.plant.tools represents a valuable resource for studying the evolution of genes, gene families, transcriptomes, and functional gene modules in the Archaeplastida kingdom.
CK2, a serine/threonine (Ser/Thr) kinase present in eukaryotic cells, is known to have a vast number of substrates. We have recently shown that it localizes to nuclei and at pores between hyphal compartments in Magnaporthe oryzae. We performed a pulldown proteomics analysis of M. oryzae CK2 catalytic subunit MoCKa to detect interacting proteins. The MoCKa pulldown was enriched for septum and nucleolus proteins and intrinsically disordered proteins (IDPs) containing a CK2 phosphorylation motif that is proposed to destabilize and unfold α-helices. This points to a function for CK2 phosphorylation and corresponding phosphatase dephosphorylation in the formation of functional protein-protein aggregates and protein-RNA/DNA binding. To test this as widely as possible, we used secondary data downloaded from databases from a large range of M. oryzae experiments, as well as data for a relatively closely related plant-pathogenic fungus, Fusarium graminearum. We found that CKa expression was strongly positively correlated with Ser/Thr phosphatases, as well as with disaggregases (HSP104, YDJ1, and SSA1) and an autophagy-indicating protein (ATG8). The latter points to increased protein aggregate formation at high levels of CKa expression. Our results suggest a general role for CK2 in chaperoning aggregation and disaggregation of IDPs and their binding to proteins, DNA, and RNA. IMPORTANCE CK2 is a eukaryotic conserved kinase enzyme complex that phosphorylates proteins. CK2 is known to phosphorylate a large number of proteins and is constitutively active, and thus a “normal” role for a kinase in a signaling cascade might not be the case for CK2. Previous results on localization and indications from the literature point to a function for CK2 phosphorylation in shaping and folding of proteins, especially intrinsically disordered proteins, which constitute about 30% of eukaryotic proteins. We used pulldown of interacting proteins and data downloaded from a large range of transcriptomic experiments in M. oryzae and complemented these with data downloaded from a large range of transcriptomic experiments in Fusarium graminearum. We found support for a general role for CK2 in aggregation and disaggregation of IDPs and their binding to proteins, DNA, and RNA—interactions that could explain the importance of CK2 in eukaryotic cell function and disease.
Recent advances in gene function prediction rely on ensemble approaches that integrate results from multiple inference methods to produce superior predictions. Yet, these developments remain largely unexplored in plants. We have explored and compared two methods to integrate 10 gene co-function networks for Arabidopsis thaliana and demonstrate how the integration of these networks produces more accurate gene function predictions for a larger fraction of genes with unknown function. These predictions were used to identify genes involved in mitochondrial complex I formation, and for five of them, we confirmed the predictions experimentally. The ensemble predictions are provided as a user-friendly online database, EnsembleNet. The methods presented here demonstrate that ensemble gene function prediction is a powerful method to boost prediction performance, whereas the EnsembleNet database provides a cutting-edge community tool to guide experimentalists.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.