Recent high-throughput techniques have generated a flood of biological data. Transforming and visualizing multi-dimensional numerical gene or protein expression data in a single heatmap can provide a concise but comprehensive presentation of molecular dynamics under different conditions. In this work, we developed an easy-to-use tool named HemI (Heat map Illustrator), which can visualize either gene or protein expression data in heatmaps. The heatmaps can be recolored, rescaled or rotated in a customized manner, and HemI provides multiple clustering strategies for analyzing the data. Publication-quality figures can be exported directly. We propose that HemI can be a useful toolkit for conveniently visualizing and manipulating heatmaps. The stand-alone packages of HemI were implemented in Java and can be accessed at http://hemi.biocuckoo.org/down.php.
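The rescaling and recoloring steps mentioned above can be illustrated with a minimal Python sketch. It is not HemI's implementation (HemI is written in Java). The function names `zscore_rows`, `to_rgb` and `transpose`, the row-wise z-score, and the blue-white-red color scale are all assumptions chosen for illustration, and a transpose is used here as one simple reading of "rotated".

```python
from statistics import mean, pstdev


def zscore_rows(matrix):
    """Rescale each row (e.g. one gene) to mean 0 and unit variance.

    Rows with no variation are mapped to all zeros to avoid dividing by zero.
    """
    scaled = []
    for row in matrix:
        mu = mean(row)
        sigma = pstdev(row)  # population standard deviation
        if sigma == 0:
            scaled.append([0.0 for _ in row])
        else:
            scaled.append([(v - mu) / sigma for v in row])
    return scaled


def to_rgb(value, vmin, vmax):
    """Map a value linearly onto a blue-white-red scale (hypothetical palette)."""
    if vmax == vmin:
        return (255, 255, 255)
    t = (value - vmin) / (vmax - vmin)
    t = min(1.0, max(0.0, t))  # clamp values outside [vmin, vmax]
    if t < 0.5:
        s = t / 0.5  # 0 -> blue, 1 -> white
        return (round(255 * s), round(255 * s), 255)
    s = (t - 0.5) / 0.5  # 0 -> white, 1 -> red
    return (255, round(255 * (1 - s)), round(255 * (1 - s)))


def transpose(matrix):
    """Swap rows and columns, e.g. to show samples as rows instead of genes."""
    return [list(col) for col in zip(*matrix)]


# Toy expression matrix: 2 genes x 3 conditions (made-up values).
expression = [[1.0, 2.0, 3.0], [10.0, 10.0, 40.0]]
scaled = zscore_rows(expression)
colors = [[to_rgb(v, -2.0, 2.0) for v in row] for row in scaled]
```

Scaling per row keeps a highly expressed gene from dominating the color range, which is why many heatmap tools offer it as an option; whether row-wise or column-wise scaling is appropriate depends on the question being asked.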
In this work, we developed UUCD (http://uucd.biocuckoo.org), a family-based database for ubiquitin and ubiquitin-like conjugation. This conjugation is one of the most important post-translational modifications and regulates a variety of cellular processes through a similar E1 (ubiquitin-activating enzyme)–E2 (ubiquitin-conjugating enzyme)–E3 (ubiquitin-protein ligase) enzyme thioester cascade. Although extensive experimental efforts have been made, an integrative data resource is still not available. From the scientific literature, 26 E1s, 105 E2s, 1003 E3s and 148 deubiquitination enzymes (DUBs) were collected and classified into 1, 3, 19 and 7 families, respectively. To computationally characterize potential enzymes in eukaryotes, we constructed 1, 1, 15 and 6 hidden Markov model (HMM) profiles for E1s, E2s, E3s and DUBs at the family level, respectively. Moreover, ortholog searches were conducted for E3 and DUB families without HMM profiles. The UUCD database was then developed with 738 E1s, 2937 E2s, 46 631 E3s and 6647 DUBs from 70 eukaryotic species. Detailed annotations and classifications are also provided. The online service of UUCD was implemented in PHP + MySQL + JavaScript + Perl.
As an important protein acylation modification, lysine succinylation (Ksucc) is involved in diverse biological processes and participates in human tumorigenesis. Here, we collected 26,243 non-redundant known Ksucc sites from 13 species as the benchmark data set, combined 10 types of informative features, and implemented a hybrid-learning architecture by integrating deep-learning and conventional machine-learning algorithms into a single framework. We constructed a new tool named HybridSucc, which achieved area under the curve (AUC) values of 0.885 and 0.952 for general and human-specific prediction of Ksucc sites, respectively. In comparison, the accuracy of HybridSucc was 17.84%–50.62% higher than that of other existing tools. Using HybridSucc, we conducted a proteome-wide prediction and prioritized 370 cancer mutations that change Ksucc states of 218 important proteins, including PKM2, SHMT2, and IDH2. We not only developed a high-profile tool for predicting Ksucc sites, but also generated useful candidates for further experimental consideration. The online service of HybridSucc can be freely accessed for academic research at http://hybridsucc.biocuckoo.org/.
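For readers unfamiliar with the AUC metric reported above, the following Python sketch shows one standard way to compute it: as the fraction of (positive, negative) pairs in which the positive example receives the higher score, with ties counted as half. The function name `roc_auc` and the toy labels and scores are assumptions for illustration; this is not HybridSucc's evaluation code, and the data are not from the paper.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney form).

    labels: 1 for a known modified site, 0 for a negative site.
    scores: predicted scores, higher meaning more likely to be modified.
    """
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # tie counts as half
    return wins / (len(pos) * len(neg))


# Toy example with made-up scores: three of the four pairs are ordered correctly.
example_auc = roc_auc([1, 0, 1, 0], [0.9, 0.8, 0.3, 0.1])
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation. The quadratic pairwise loop is fine for small examples; for benchmark-sized data a rank-based or library implementation would be preferable.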
Background: Dehydration responsive element-binding (DREB) transcription factors play a crucial role in plant growth, development and stress responses. Although DREB genes have been characterized in many plant species, genome-wide identification of the DREB gene family has not yet been reported in pineapple (Ananas comosus (L.) Merr.). Results: Using comprehensive genome-wide screening, we identified 20 AcoDREB genes on 14 chromosomes. These were categorized into five subgroups. AcoDREBs within a group had similar gene structures and domain compositions. Using gene structure analysis, we showed that most AcoDREB genes (75%) lacked introns, and that the promoter regions of all 20 AcoDREB genes had at least one stress response-related cis-element. We identified four genes with high expression levels and six genes with low expression levels in all analyzed tissues. We detected expression changes under abiotic stress for eight selected AcoDREB genes. Conclusions: This report presents the first genome-wide analysis of the DREB transcription factor family in pineapple. Our results provide preliminary data for future functional analysis of AcoDREB genes in pineapple, and useful information for developing new pineapple varieties with key agronomic traits such as stress tolerance.
Here, we report the compendium of protein lysine modifications (CPLM 4.0, http://cplm.biocuckoo.cn/), a data resource for various post-translational modifications (PTMs) that specifically occur at the side-chain amino group of lysine residues in proteins. From the literature and public databases, we collected 450 378 protein lysine modification (PLM) events and combined them with the existing data of our previously developed protein lysine modification database (PLMD 3.0). In total, CPLM 4.0 contains 592 606 experimentally identified modification events on 463 156 unique lysine residues of 105 673 proteins for up to 29 types of PLMs across 219 species. Furthermore, we carefully annotated the data using knowledge from 102 additional resources covering 13 aspects: variation and mutation, disease-associated information, protein-protein interaction, protein functional annotation, DNA & RNA element, protein structure, chemical-target relation, mRNA expression, protein expression/proteomics, subcellular localization, biological pathway annotation, functional domain annotation, and physicochemical property. Compared to PLMD 3.0 and other existing resources, CPLM 4.0 achieved a >2-fold increase in the collection of PLM events, with a data volume of ∼45 GB. We anticipate that CPLM 4.0 can serve as a more useful database for further study of PLMs.
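The distinction between modification events, unique lysine residues and proteins in the counts above can be made concrete with a small Python sketch. It assumes, purely for illustration, that each event is a `(protein_id, position, plm_type)` tuple and that two sources are combined by dropping exact duplicates; the record layout, the function names `merge_events` and `summarize`, and the toy identifiers are hypothetical and do not describe CPLM's actual schema or curation pipeline.

```python
def merge_events(*sources):
    """Combine event lists from several sources, dropping exact duplicates."""
    merged = set()
    for source in sources:
        merged.update(source)
    return merged


def summarize(events):
    """Count events, unique modified residues, proteins and modification types."""
    return {
        "events": len(events),
        "sites": len({(protein, pos) for protein, pos, _ in events}),
        "proteins": len({protein for protein, _, _ in events}),
        "types": len({plm for _, _, plm in events}),
    }


# Toy records (made-up identifiers): residue 10 of P1 carries two different
# modification types, so it contributes two events but only one unique site.
newly_collected = [
    ("P1", 10, "acetylation"),
    ("P1", 10, "succinylation"),
    ("P2", 5, "ubiquitination"),
]
previous_release = [
    ("P1", 10, "acetylation"),  # duplicate of a newly collected event
    ("P3", 7, "acetylation"),
]
summary = summarize(merge_events(newly_collected, previous_release))
```

Because one lysine can carry several modification types, the number of events can exceed the number of unique residues, consistent with the 592 606 events on 463 156 residues reported above.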
The cellulose synthase (Ces) and cellulose synthase-like (Csl) gene families, which belong to the cellulose synthase gene superfamily, are responsible for the biosynthesis of cellulose and hemicellulose in the plant cell wall and play critical roles in plant development, growth and evolution. However, the Ces/Csl gene family remains to be characterized in pineapple, a highly valued and delicious tropical fruit. Here, we carried out a genome-wide study and identified a total of seven Ces genes and 25 Csl genes in pineapple. We analyzed the genomic features and phylogeny of the Ces/Csl genes, including the phylogenetic tree, chromosomal locations, gene structures, and conserved motifs. In addition, using syntenic and phylogenetic approaches, we identified 32 pineapple AcoCes/Csl genes and 31 Arabidopsis AtCes/Csl genes as orthologs. Furthermore, an RNA-seq investigation revealed the expression profiles of several AcoCes/Csl genes in various tissues and at multiple developmental stages. Collectively, we provide comprehensive information on the evolution and function of the pineapple Ces/Csl gene superfamily, which should be useful for screening and characterizing putative genes responsible for tissue development in pineapple. The present study lays the foundation for future functional characterization of Ces/Csl genes in pineapple.