Protein phosphorylation is a major regulator of protein function and biological outcomes. This was first recognized through functional biochemical experiments, and in the past decade, major technological advances in mass spectrometry have enabled the study of protein phosphorylation on a global scale. This rapidly growing field of phosphoproteomics has revealed that more than 100,000 distinct phosphorylation events occur in human cells, which likely affect the function of every protein. Phosphoproteomics has improved the understanding of the function of even the most well-characterized protein kinases by revealing new downstream substrates and biology. However, current biochemical and bioinformatic approaches have only identified kinases for less than 5% of the phosphoproteome, and functional assignments of phosphosites are almost negligible. Notably, our understanding of the relationship between kinases and their substrates follows a power law distribution, with almost 90% of phosphorylation sites currently assigned to the top 20% of kinases. In addition, more than 150 kinases do not have a single known substrate. Despite a small group of kinases dominating biomedical research, the number of substrates assigned to a kinase does not correlate with disease relevance as determined by pathogenic human mutation prevalence and mouse model phenotypes. Improving our understanding of the substrates targeted by all kinases and functionally annotating the phosphoproteome will be broadly beneficial. Advances in phosphoproteomics technologies, combined with functional screening approaches, should make it feasible to illuminate the connectivity and functionality of the entire phosphoproteome, providing enormous opportunities for discovering new biology, therapeutic targets, and possibly diagnostics.
Transcription factors (TFs) play a fundamental role in coordinating biological processes in response to stimuli. Consequently, we often seek to determine the key TFs and their regulated target genes (TGs) amidst gene expression data. This requires a knowledge-base of TF-TG interactions, which would enable us to determine the topology of the transcriptional network and predict novel regulatory interactions. To address this, we generated an Open-access Repository of Transcriptional Interactions, ORTI, by integrating available TF-TG interaction databases. These databases rely on different types of experimental evidence, including low-throughput assays, high-throughput screens, and bioinformatics predictions. We have subsequently categorised TF-TG interactions in ORTI according to the quality of this evidence. To demonstrate its capabilities, we applied ORTI to gene expression data and identified modulated TFs using an enrichment analysis. Combining this with pairwise TF-TG interactions enabled us to visualise temporal regulation of a transcriptional network. Additionally, ORTI enables the prediction of novel TF-TG interactions, based on how well candidate genes co-express with known TGs of the target TF. By filtering out known TF-TG interactions that are unlikely to occur within the experimental context, this analysis predicts context-specific TF-TG interactions. We show that this can be applied to experimental designs of varying complexities. In conclusion, ORTI is a rich and publicly available database of experimentally validated mammalian transcriptional interactions which is accompanied with tools that can identify and predict transcriptional interactions, serving as a useful resource for unravelling the topology of transcriptional networks.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.