Protein phosphorylation is a key post-translational modification regulating protein function in almost all cellular processes. Although tens of thousands of phosphorylation sites have been identified in human cells, approaches to determine the functional importance of each phosphosite are lacking. Here, we manually curated 112 datasets of phospho-enriched proteins generated from 104 different human cell types or tissues. We reanalyzed the 6,801 proteomics experiments that passed our quality control criteria, creating a reference phosphoproteome containing 119,809 human phosphosites. To prioritize functional sites, we used machine learning to identify 59 features indicative of proteomic, structural, regulatory or evolutionary relevance and integrate them into a single functional score. Our approach identifies regulatory phosphosites across different molecular mechanisms, processes and diseases, and reveals genetic susceptibilities at a genomic scale. Several novel regulatory phosphosites were experimentally validated, including a Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:
In miRNA biogenesis, pri-miRNA transcripts are converted into pre-miRNA hairpins. The in vivo properties of this process remain enigmatic. Here, we determine in vivo transcriptome-wide pri-miRNA processing using next-generation sequencing of chromatin-associated pri-miRNAs. We identify a distinctive Microprocessor signature in the transcriptome profile from which efficiency of the endogenous processing event can be accurately quantified. This analysis reveals differential susceptibility to Microprocessor cleavage as a key regulatory step in miRNA biogenesis. Processing is highly variable among pri-miRNAs and a better predictor of miRNA abundance than primary transcription itself. Processing is also largely stable across three cell lines, suggesting a major contribution of sequence determinants. On the basis of differential processing efficiencies, we define functionality for short sequence features adjacent to the pre-miRNA hairpin. In conclusion, we identify Microprocessor as the main hub for diversified miRNA output and suggest a role for uncoupling miRNA biogenesis from host gene expression.
Protein phosphorylation is a key post-translational modification regulating protein function in almost all cellular processes. While tens of thousands of phosphorylation sites have been identified in human cells to date, the extent and functional importance of the phosphoproteome remains largely unknown. Here, we have analyzed 6,801 publicly available phospho-enriched mass spectrometry proteomics experiments, creating a state-of-the-art phosphoproteome containing 119,809 human phosphosites. To prioritize functional sites, 59 features indicative of proteomic, structural, regulatory or evolutionary relevance were integrated into a single functional score using machine learning. We demonstrate how this prioritization identifies regulatory phosphosites across different molecular mechanisms and pinpoint genetic susceptibilities at a genomic scale. Several novel regulatory phosphosites were experimentally validated including a role in neuronal differentiation for phosphosites present in the SWI/SNF SMARCC2 complex member. The scored reference phosphoproteome and its annotations identify the most relevant phosphorylations for a given process or disease addressing a major bottleneck in cell signaling studies.
Nuclear Argonaute proteins, guided by their bound small RNAs to nascent target transcripts, mediate cotranscriptional silencing of transposons and repetitive genomic loci through heterochromatin formation. The molecular mechanisms involved in this process are incompletely understood. Here, we show that the SFiNX complex, a silencing mediator downstream from nuclear Piwi-piRNA complexes in Drosophila, facilitates cotranscriptional silencing as a homodimer. The dynein light chain protein Cut up/LC8 mediates SFiNX dimerization, and its function can be bypassed by a heterologous dimerization domain, arguing for a constitutive SFiNX dimer. Dimeric, but not monomeric SFiNX, is capable of forming molecular condensates in a nucleic acid-stimulated manner. Mutations that prevent SFiNX dimerization result in loss of condensate formation in vitro and the inability of Piwi to initiate heterochromatin formation and silence transposons in vivo. We propose that multivalent SFiNX-nucleic acid interactions are critical for heterochromatin establishment at piRNA target loci in a cotranscriptional manner.
The systematic mutation of histone 3 (H3) genes in model organisms has proven to be a valuable tool to distinguish the functional role of histone residues. No system exists in mammalian cells to directly manipulate canonical histone H3 due to a large number of clustered and multi-loci histone genes. Over the years, oncogenic histone mutations in a subset of H3 have been identified in humans, and have advanced our understanding of the function of histone residues in health and disease. The oncogenic mutations are often found in one allele of the histone variant H3.3 genes, but they prompt severe changes in the epigenetic landscape of cells, and contribute to cancer development. Therefore, mutation approaches using H3.3 genes could be relevant to the determination of the functional role of histone residues in mammalian development without the replacement of canonical H3 genes. In this review, we describe the key findings from the H3 mutation studies in model organisms wherein the genetic replacement of canonical H3 is possible. We then turn our attention to H3.3 mutations in human cancers, and discuss H3.3 substitutions in the N-terminus, which were generated in order to explore the specific residue or associated post-translational modification.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.