SUMMARY Soybean (Glycine max [L.] Merr.) is a major crop in animal feed and human nutrition, mainly for its rich protein and oil contents. The remarkable rise in soybean transcriptome studies over the past 5 years generated an enormous amount of RNA‐seq data, encompassing various tissues, developmental conditions and genotypes. In this study, we have collected data from 1298 publicly available soybean transcriptome samples, processed the raw sequencing reads and mapped them to the soybean reference genome in a systematic fashion. We found that 94% of the annotated genes (52 737/56 044) had detectable expression in at least one sample. Unsupervised clustering revealed three major groups, comprising samples from aerial, underground and seed/seed‐related parts. We found 452 genes with uniform and constant expression levels, supporting their roles as housekeeping genes. On the other hand, 1349 genes showed heavily biased expression patterns towards particular tissues. A transcript‐level analysis revealed that 95% (70 963 of 74 490) of the assembled transcripts have intron chains exactly matching those from known transcripts, whereas 3256 assembled transcripts represent potentially novel splicing isoforms. The dataset compiled here constitute a new resource for the community, which can be downloaded or accessed through a user‐friendly web interface at http://venanciogroup.uenf.br/resources/. This comprehensive transcriptome atlas will likely accelerate research on soybean genetics and genomics.
Currently, standard network analysis workflows rely on many different packages, often requiring users to have a solid statistics and programming background. Here, we present BioNERO, an R package that aims to integrate all aspects of network analysis workflows, including expression data preprocessing, gene coexpression and regulatory network inference, functional analyses, and intraspecies and interspecies network comparisons. The state-of-the-art methods implemented in BioNERO ensure that users can perform all analyses with a single package in a simple pipeline, without needing to learn a myriad of packagespecific syntaxes. BioNERO offers a user-friendly framework that can be easily incorporated in systems biology pipelines.
The COBRA-like (COBL) gene family has been associated with the regulation of cell wall expansion and cellulose deposition. COBL mutants result in reduced levels and disorganized deposition of cellulose causing defects in the cell wall and inhibiting plant development. In this study, we report the identification of 24 COBL genes (GmCOBL) in the soybean genome. Phylogenetic analysis revealed that the COBL proteins are divided into two groups, which differ by about 170 amino acids in the N-terminal region. The GmCOBL genes were heterogeneously distributed in 14 of the 20 soybean chromosomes. This study showed that segmental duplication has contributed significantly to the expansion of the COBL family in soybean during all Glycine-specific whole-genome duplication events. The expression profile revealed that the expression of the paralogous genes is highly variable between organs and tissues of the plant. Only 20% of the paralogous gene pairs showed similar expression patterns. The high expression levels of some GmCOBLs suggest they are likely essential for regulating cell expansion during the whole soybean life cycle. Our comprehensive overview of the COBL gene family in soybean provides useful information for further understanding the evolution and diversification of COBL genes in soybean.
Soybean is one of the most important legume crops worldwide. However, soybean yield is dramatically affected by fungal diseases, leading to economic losses of billions of dollars yearly. Here, we integrated publicly available genome-wide association studies and transcriptomic data to prioritize candidate genes associated with resistance to Cadophora gregata, Fusarium graminearum, Fusarium virguliforme, Macrophomina phaseolina, and Phakopsora pachyrhizi. We identified 188, 56, 11, 8, and 3 high-confidence candidates for resistance to F. virguliforme, F. graminearum, C. gregata, M. phaseolina and P. pachyrhizi, respectively. The prioritized candidate genes are highly conserved in the pangenome of cultivated soybeans and are heavily biased towards fungal species-specific defense responses. The vast majority of the prioritized candidate resistance genes are related to plant immunity processes, such as recognition, signaling, oxidative stress, systemic acquired resistance, and physical defense. Based on the number of resistance alleles, we selected the five most resistant accessions against each fungal species in the soybean USDA germplasm. Interestingly, the most resistant accessions do not reach the maximum theoretical resistance potential. Hence, they can be further improved to increase resistance in breeding programs or through genetic engineering. Finally, the coexpression network generated here is available in a user-friendly web application (https://soyfungigcn.venanciogroup.uenf.br/) and an R/Shiny package (https://github.com/almeidasilvaf/SoyFungiGCN) that serve as a public resource to explore soybean-pathogenic fungi interactions at the transcriptional level.
Soybean (Glycine max (L.) Merr.) is one of the most important crops worldwide, constituting a major source of protein and edible oil. Gene co-expression networks (GCN) have been extensively used to study transcriptional regulation and evolution of genes and genomes. Here, we report a soybean GCN using 1,284 publicly available RNA-Seq samples from 15 distinct tissues. We found modules that are differentially regulated in specific tissues, comprising processes such as photosynthesis, gluconeogenesis, lignin metabolism, and response to biotic stress. We identified transcription factors among intramodular hubs, which probably integrate different pathways and shape the transcriptional landscape in different conditions. The top hubs for each module tend to encode proteins with critical roles, such as succinate dehydrogenase and RNA polymerase subunits. Importantly, gene essentiality was strongly correlated with degree centrality and essential hubs enriched in genes involved in nucleic acids metabolism and regulation of cell replication. By using a using a guilt-by-association approach, we predicted functions for 93 of 106 hubs without functional description in soybean. Most of the duplicated genes had different transcriptional profiles, supporting their functional divergence, although paralogs originating from whole-genome duplications (WGD) are more often preserved in the same module than those from other mechanisms. Together, our results highlight the importance of GCN analysis in unraveling key functional aspects of the soybean genome, in particular those associated with hub genes and WGD events.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.