The annotation of the mammalian protein coding genome is incomplete. Arbitrary open reading frame (ORF) size restriction and the absolute requirement for a methionine codon as the sole initiator of translation, have constrained identification of potentially important transcripts with non-canonical protein coding potential1,2. Using unbiased transcriptomic approaches in macrophages responding to bacterial infection, we show widespread ribosome association with a large number of RNAs that were previously annotated as “non-protein coding”. Although the ability of such non-canonical ORFs to encode functional protein is controversial3,4, we identify a plethora of novel short and non-ATG initiated ORFs with the ability to generate stable and spatially distinct proteins. Importantly, we show that the translation of a novel ORF ‘hidden’ within the long non-coding RNA Aw112010 is essential for the orchestration of mucosal immunity during both bacterial infection and colitis. Together this work expands our interpretation of the protein coding genome and demonstrates the critical nature of proteinaceous products generated from non-canonical ORFs to the immune response in vivo. We therefore propose that the misannotation of non-canonical ORF-containing genes as non-coding RNAs may obscure the essential role of a multitude of previously undiscovered protein coding genes in immunity and disease.
Proteogenomic identification of translated small open reading frames in humans has revealed thousands of microproteins, or polypeptides of fewer than 100 amino acids, that were previously invisible to geneticists. Hundreds of microproteins have been shown to be essential for cell growth and proliferation, and many regulate macromolecular complexes. One such regulatory microprotein is NBDY, a 68-amino acid component of the human cytoplasmic RNA decapping complex. Heterologously expressed NBDY was previously reported to regulate cytoplasmic ribonucleoprotein granules known as P-bodies and reporter gene stability, but the global effect of endogenous NBDY on the cellular transcriptome remained undefined. In this work, we demonstrate that endogenous NBDY directly interacts with the human RNA decapping complex through EDC4 and DCP1A and localizes to P-bodies. Global profiling of RNA stability changes in NBDY knockout (KO) cells reveals dysregulated stability of more than 1400 transcripts. DCP2 substrate transcript half-lives are both increased and decreased in NBDY KO cells, which correlates with 5′ UTR length. NBDY deletion additionally alters the stability of non-DCP2 target transcripts, possibly as a result of downregulated expression of nonsense-mediated decay factors in NBDY KO cells. We present a comprehensive model of the regulation of RNA stability by NBDY.
The precise regulation of gene expression is fundamental to neurodevelopment, plasticity, and cognitive function. While several studies have deeply profiled mRNA dynamics in the developing human brain, there is a fundamental gap in our understanding of accompanying translational regulation. We perform ribosome profiling from more than 70 human prenatal and adult cortex samples across ontogeny and into adulthood, mapping translation events at nucleotide resolution. In addition to characterizing the translational regulation of annotated open reading frames (ORFs), we identify thousands of previously unknown translation events, including small open reading frames (sORFs) that give rise to human-and/or brain-specific microproteins, many of which we independently verify using size-selected proteomics.Ribosome profiling in stem cell-derived human neuronal cultures further corroborates these findings and shows that several neuronal activity-induced long non-coding RNAs (lncRNAs), including LINC00473, a primate-specific lncRNA implicated in depression, encode previously undescribed microproteins. Physicochemical analysis of these brain microproteinss identifies a large class harboring arginine-glycine-glycine (RGG) repeats as strong candidates for regulating RNA metabolism. Moreover, we find that, collectively, these previously unknown human brain sORFs are enriched for variants associated with schizophrenia. In addition to significantly expanding the translational landscape of the developing brain, this atlas will serve as a rich resource for the annotation and functional interrogation of thousands of previously unknown brain-specific protein products. MAINThe human brain leverages extraordinary protein diversity to execute developmental programs, organize neural circuits, and perform complex cognitive tasks 1 . Proteomic diversity is
Ribosome profiling and mass spectrometry have revealed thousands of small and alternative open reading frames (sm/alt-ORFs) that are translated into polypeptides variously termed as microproteins and alt-proteins in mammalian cells. Some micro-/alt-proteins exhibit stress-, cell-type-, and/or tissue-specific expression; understanding this regulated expression will be critical to elucidating their functions. While differential translation has been inferred by ribosome profiling, quantitative mass spectrometry-based proteomics is needed for direct comparison of microprotein and alt-protein expression between samples and conditions. However, while label-free quantitative proteomics has been applied to detect stress-dependent expression of bacterial microproteins, this approach has not yet been demonstrated for analysis of differential expression of unannotated ORFs in the more complex human proteome. Here, we present global micro-/alt-protein quantitation in two human leukemia cell lines, K562 and MOLT4. We identify 12 unannotated proteins that are differentially expressed in these cell lines. The expression of six micro/alt-proteins from cDNA was validated biochemically, and two were found to localize to the nucleus. Thus, we demonstrate that label-free comparative proteomics enables quantitation of micro-/alt-protein expression between human cell lines. We anticipate that this workflow will enable the discovery of regulated sm/alt-ORF products across many biological conditions in human cells.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.