Understanding early development offers a striking opportunity to investigate genetic disease, stem cell and assisted reproductive technology. Recent advances in high-throughput sequencing technology have led to the rising influx of omics data, which have rapidly boosted our understanding of mammalian developmental mechanisms. Here, we review the database EmExplorer (a database for exploring time activation of gene expression in mammalian embryos), which systematically organizes the genes from development-related pathways, and which we have already established and continue to update it. The current version of EmExplorer incorporates over 26 000 genes obtained from 306 functional pathways in five species. The function annotations of development-related genes were also integrated into EmExplorer. To facilitate data extraction, the database also contains the following information. (i) The dynamic expression values for each development stage are matched to the corresponding genes. (ii) A two-layer search tool which supports multi-option searching, such as by official symbol, pathway name and function annotation. The returned entries can directly link to the analysis results for the corresponding gene or pathway in the analysis module. (iii) The analysis module provides different gene comparisons at the multi-species level and functional pathway level, which shows the species specificity and stage specificity at the gene or pathway level. (iv) The analysis based on the hypergeometric distribution test reveals the enrichment of gene functions at a particular stage of one organism's pathway. (v) The browser is designed for users with ambiguous searching goals and greatly helps new users to get a general idea of the contents of the database. (vi) The experimentally validated pathways are manually curated and shown on the home page. EmExplorer will be helpful for elucidating early developmental mechanisms and exploring time activation genes. EmExplorer is freely available at http://bioinfor.imu.edu.cn/emexplorer .
Large-scale tumor genome sequencing projects have revealed a complex landscape of genomic mutations in multiple cancer types. A major goal of these projects is to characterize somatic mutations and discover cancer drivers, thereby providing important clues to uncover diagnostic or therapeutic targets for clinical treatment. However, distinguishing only a few somatic mutations from the majority of passenger mutations is still a major challenge facing the biological community. Fortunately, combining other functional features with mutations to predict cancer driver genes is an effective approach to solve the above problem. Protein lysine modifications are an important functional feature that regulates the development of cancer. Therefore, in this work, we have systematically analyzed somatic mutations on seven protein lysine modifications and identified several important drivers that are responsible for tumorigenesis. From published literature, we first collected more than 100,000 lysine modification sites for analysis. Another 1 million non-synonymous single nucleotide variants (SNVs) were then downloaded from TCGA and mapped to our collected lysine modification sites. To identify driver proteins that significantly altered lysine modifications, we further developed a hierarchical Bayesian model and applied the Markov Chain Monte Carlo (MCMC) method for testing. Strikingly, the coding sequences of 473 proteins were found to carry a higher mutation rate in lysine modification sites compared to other background regions. Hypergeometric tests also revealed that these gene products were enriched in known cancer drivers. Functional analysis suggested that mutations within the lysine modification regions possessed higher evolutionary conservation and deleteriousness. Furthermore, pathway enrichment showed that mutations on lysine modification sites mainly affected cancer related processes, such as cell cycle and RNA transport. Moreover, clinical studies also suggested that the driver proteins were significantly associated with patient survival, implying an opportunity to use lysine modifications as molecular markers in cancer diagnosis or treatment. By searching within protein-protein interaction networks using a random walk with restart (RWR) algorithm, we further identified a series of potential treatment agents and therapeutic targets for cancer related to lysine modifications. Collectively, this study reveals the functional importance of lysine modifications in cancer development and may benefit the discovery of novel mechanisms for cancer treatment.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.