SUMMARY Soybean (Glycine max [L.] Merr.) is a major crop in animal feed and human nutrition, mainly for its rich protein and oil contents. The remarkable rise in soybean transcriptome studies over the past 5 years generated an enormous amount of RNA‐seq data, encompassing various tissues, developmental conditions and genotypes. In this study, we have collected data from 1298 publicly available soybean transcriptome samples, processed the raw sequencing reads and mapped them to the soybean reference genome in a systematic fashion. We found that 94% of the annotated genes (52 737/56 044) had detectable expression in at least one sample. Unsupervised clustering revealed three major groups, comprising samples from aerial, underground and seed/seed‐related parts. We found 452 genes with uniform and constant expression levels, supporting their roles as housekeeping genes. On the other hand, 1349 genes showed heavily biased expression patterns towards particular tissues. A transcript‐level analysis revealed that 95% (70 963 of 74 490) of the assembled transcripts have intron chains exactly matching those from known transcripts, whereas 3256 assembled transcripts represent potentially novel splicing isoforms. The dataset compiled here constitute a new resource for the community, which can be downloaded or accessed through a user‐friendly web interface at http://venanciogroup.uenf.br/resources/. This comprehensive transcriptome atlas will likely accelerate research on soybean genetics and genomics.
Xq22 deletions that encompass PLP1 (Xq22‐PLP1‐DEL) are notable for variable expressivity of neurological disease traits in females ranging from a mild late‐onset form of spastic paraplegia type 2 (MIM# 312920), sometimes associated with skewed X‐inactivation, to an early‐onset neurological disease trait (EONDT) of severe developmental delay, intellectual disability, and behavioral abnormalities. Size and gene content of Xq22‐PLP1‐DEL vary and were proposed as potential molecular etiologies underlying variable expressivity in carrier females where two smallest regions of overlap (SROs) were suggested to influence disease. We ascertained a cohort of eight unrelated patients harboring Xq22‐PLP1‐DEL and performed high‐density array comparative genomic hybridization and breakpoint‐junction sequencing. Molecular characterization of Xq22‐PLP1‐DEL from 17 cases (eight herein and nine published) revealed an overrepresentation of breakpoints that reside within repeats (11/17, ~65%) and the clustering of ~47% of proximal breakpoints in a genomic instability hotspot with characteristic non‐B DNA density. These findings implicate a potential role for genomic architecture in stimulating the formation of Xq22‐PLP1‐DEL. The correlation of Xq22‐PLP1‐DEL gene content with neurological disease trait in female cases enabled refinement of the associated SROs to a single genomic interval containing six genes. Our data support the hypothesis that genes contiguous to PLP1 contribute to EONDT.
The COBRA-like (COBL) gene family has been associated with the regulation of cell wall expansion and cellulose deposition. COBL mutants result in reduced levels and disorganized deposition of cellulose causing defects in the cell wall and inhibiting plant development. In this study, we report the identification of 24 COBL genes (GmCOBL) in the soybean genome. Phylogenetic analysis revealed that the COBL proteins are divided into two groups, which differ by about 170 amino acids in the N-terminal region. The GmCOBL genes were heterogeneously distributed in 14 of the 20 soybean chromosomes. This study showed that segmental duplication has contributed significantly to the expansion of the COBL family in soybean during all Glycine-specific whole-genome duplication events. The expression profile revealed that the expression of the paralogous genes is highly variable between organs and tissues of the plant. Only 20% of the paralogous gene pairs showed similar expression patterns. The high expression levels of some GmCOBLs suggest they are likely essential for regulating cell expansion during the whole soybean life cycle. Our comprehensive overview of the COBL gene family in soybean provides useful information for further understanding the evolution and diversification of COBL genes in soybean.
20Soybean (Glycine max [L.] Merr.) is a major crop in animal feed and human nutrition, 21 mainly for its rich protein and oil contents. The remarkable rise in soybean transcriptome 22 studies over the past five years generated an enormous amount of RNA-seq data, 23 encompassing various tissues, developmental conditions, and genotypes. In this study, 24we have collected data from 1,298 publicly available soybean transcriptome samples, 25processed the raw sequencing reads, and mapped them to the soybean reference 26 genome in a systematic fashion. We found that 94% of the annotated genes 27(52,737/56,044) had detectable expression in at least one sample. Unsupervised 28 clustering revealed three major groups, comprising samples from aerial, underground, 29and seed/seed-related parts. We found 452 genes with uniform and constant expression 30 levels, supporting their roles as housekeeping genes. On the other hand, 1,349 genes 31 showed heavily biased expression patterns towards particular tissues. A transcript-level 32 analysis revealed that 95% (70,963/74,490) of the known transcripts overlap with those 33 reported here, whereas 3,256 assembled transcripts represent potentially novel splicing 34isoforms. The dataset compiled here constitute a new resource for the community, which 35can be downloaded or accessed through a user-friendly web interface at 36
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.