Recent high-throughput techniques have generated a flood of biological data in all aspects. The transformation and visualization of multi-dimensional and numerical gene or protein expression data in a single heatmap can provide a concise but comprehensive presentation of molecular dynamics under different conditions. In this work, we developed an easy-to-use tool named HemI (Heat map Illustrator), which can visualize either gene or protein expression data in heatmaps. Additionally, the heatmaps can be recolored, rescaled or rotated in a customized manner. In addition, HemI provides multiple clustering strategies for analyzing the data. Publication-quality figures can be exported directly. We propose that HemI can be a useful toolkit for conveniently visualizing and manipulating heatmaps. The stand-alone packages of HemI were implemented in Java and can be accessed at http://hemi.biocuckoo.org/down.php.
In this work, we developed a family-based database of UUCD (http://uucd.biocuckoo.org) for ubiquitin and ubiquitin-like conjugation, which is one of the most important post-translational modifications responsible for regulating a variety of cellular processes, through a similar E1 (ubiquitin-activating enzyme)–E2 (ubiquitin-conjugating enzyme)–E3 (ubiquitin-protein ligase) enzyme thioester cascade. Although extensive experimental efforts have been taken, an integrative data resource is still not available. From the scientific literature, 26 E1s, 105 E2s, 1003 E3s and 148 deubiquitination enzymes (DUBs) were collected and classified into 1, 3, 19 and 7 families, respectively. To computationally characterize potential enzymes in eukaryotes, we constructed 1, 1, 15 and 6 hidden Markov model (HMM) profiles for E1s, E2s, E3s and DUBs at the family level, separately. Moreover, the ortholog searches were conducted for E3 and DUB families without HMM profiles. Then the UUCD database was developed with 738 E1s, 2937 E2s, 46 631 E3s and 6647 DUBs of 70 eukaryotic species. The detailed annotations and classifications were also provided. The online service of UUCD was implemented in PHP + MySQL + JavaScript + Perl.
As an important protein acylation modification, lysine succinylation (Ksucc) is involved in diverse biological processes, and participates in human tumorigenesis. Here, we collected 26,243 non-redundant known Ksucc sites from 13 species as the benchmark data set, combined 10 types of informative features, and implemented a hybrid-learning architecture by integrating deep-learning and conventional machine-learning algorithms into a single framework. We constructed a new tool named HybridSucc, which achieved area under curve (AUC) values of 0.885 and 0.952 for general and human-specific prediction of Ksucc sites, respectively. In comparison, the accuracy of HybridSucc was 17.84%–50.62% better than that of other existing tools. Using HybridSucc, we conducted a proteome-wide prediction and prioritized 370 cancer mutations that change Ksucc states of 218 important proteins, including PKM2, SHMT2, and IDH2. We not only developed a high-profile tool for predicting Ksucc sites, but also generated useful candidates for further experimental consideration. The online service of HybridSucc can be freely accessed for academic research at http://hybridsucc.biocuckoo.org/.
Here, we reported the compendium of protein lysine modifications (CPLM 4.0, http://cplm.biocuckoo.cn/), a data resource for various post-translational modifications (PTMs) specifically occurred at the side-chain amino group of lysine residues in proteins. From the literature and public databases, we collected 450 378 protein lysine modification (PLM) events, and combined them with the existing data of our previously developed protein lysine modification database (PLMD 3.0). In total, CPLM 4.0 contained 592 606 experimentally identified modification events on 463 156 unique lysine residues of 105 673 proteins for up to 29 types of PLMs across 219 species. Furthermore, we carefully annotated the data using the knowledge from 102 additional resources that covered 13 aspects, including variation and mutation, disease-associated information, protein-protein interaction, protein functional annotation, DNA & RNA element, protein structure, chemical-target relation, mRNA expression, protein expression/proteomics, subcellular localization, biological pathway annotation, functional domain annotation, and physicochemical property. Compared to PLMD 3.0 and other existing resources, CPLM 4.0 achieved a >2-fold increase in collection of PLM events, with a data volume of ∼45GB. We anticipate that CPLM 4.0 can serve as a more useful database for further study of PLMs.
Background Dehydration responsive element-binding (DREB) transcription factors play a crucial role in plant growth, development and stress responses. Although DREB genes have been characterized in many plant species, genome-wide identification of the DREB gene family has not yet been reported in pineapple (Ananas comosus (L.) Merr.). Results Using comprehensive genome-wide screening, we identified 20 AcoDREB genes on 14 chromosomes. These were categorized into five subgroups. AcoDREBs within a group had similar gene structures and domain compositions. Using gene structure analysis, we showed that most AcoDREB genes (75%) lacked introns, and that the promoter regions of all 20 AcoDREB genes had at least one stress response-related cis-element. We identified four genes with high expression levels and six genes with low expression levels in all analyzed tissues. We detected expression changes under abiotic stress for eight selected AcoDREB genes. Conclusions This report presents the first genome-wide analysis of the DREB transcription factor family in pineapple. Our results provide preliminary data for future functional analysis of AcoDREB genes in pineapple, and useful information for developing new pineapple varieties with key agronomic traits such as stress tolerance.
Ubiquitination, an essential post-transcriptional modification (PTM), plays a vital role in nearly every biological process, including development and growth. Despite its functions in plant reproductive development, its targets in rice panicles remain unclear. In this study, we used proteome-wide profiling of lysine ubiquitination in rice ( O. sativa ssp. indica ) young panicles. We created the largest ubiquitinome dataset in rice to date, identifying 1638 lysine ubiquitination sites on 916 unique proteins. We detected three conserved ubiquitination motifs, noting that acidic glutamic acid (E) and aspartic acid (D) were most frequently present around ubiquitinated lysine. Enrichment analysis of Gene Ontology (GO) annotations and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways of these ubiquitinated proteins revealed that ubiquitination plays an important role in fundamental cellular processes in rice young panicles. Interestingly, enrichment analysis of protein domains indicated that ubiquitination was enriched on a variety of receptor-like kinases and cytoplasmic tyrosine and serine-threonine kinases. Furthermore, we analyzed the crosstalk between ubiquitination, acetylation, and succinylation, and constructed a potential protein interaction network within our rice ubiquitinome . Moreover, we identified ubiquitinated proteins related to pollen and grain development , indicating that ubiquitination may play a critical role in the physiological functions in young panicles. Taken together, we reported the most comprehensive lysine ubiquitinome in rice so far, and used it to reveal the functional role of lysine ubiquitination in rice young panicles.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.