OpenProt (www.openprot.org) is the first proteogenomic resource supporting a polycistronic annotation model for eukaryotic genomes. It provides a deeper annotation of open reading frames (ORFs) while mining experimental data for supporting evidence using cutting-edge algorithms. This update presents the major improvements since the initial release of OpenProt. All species support recent NCBI RefSeq and Ensembl annotations, with changes in annotations being reported in OpenProt. Using the 131 ribosome profiling datasets re-analysed by OpenProt to date, non-AUG initiation starts are reported alongside a confidence score of the initiating codon. From the 177 mass spectrometry datasets re-analysed by OpenProt to date, the unicity of the detected peptides is controlled at each implementation. Furthermore, to guide the users, detectability statistics and protein relationships (isoforms) are now reported for each protein. Finally, to foster access to deeper ORF annotation independently of one’s bioinformatics skills or computational resources, OpenProt now offers a data analysis platform. Users can submit their dataset for analysis and receive the results from the analysis by OpenProt. All data on OpenProt are freely available and downloadable for each species, the release-based format ensuring a continuous access to the data. Thus, OpenProt enables a more comprehensive annotation of eukaryotic genomes and fosters functional proteomic discoveries.
Proteomic diversity in biological samples can be characterized by mass spectrometry (MS)-based proteomics using customized protein databases generated from sets of transcripts previously detected by RNA-seq. This diversity has only been increased by the recent discovery that many translated alternative open reading frames rest unannotated at unsuspected locations of mRNAs and ncRNAs. These novel protein products, termed alternative proteins, have been left out of all previous custom database generation tools. Consequently, genetic variations that impact alternative open reading frames and variant peptides from their translated proteins are not detectable with current computational workflows. To fill this gap, we present OpenCustomDB, a bioinformatics tool that uses sample-specific RNaseq data to identify genomic variants in canonical and alternative open reading frames, allowing for more than one coding region per transcript. In a test reanalysis of a cohort of 16 patients with acute myeloid leukemia, 5666 peptides from alternative proteins were detected, including 201 variant peptides. We also observed that a significant fraction of peptide-spectrum matches previously assigned to peptides from canonical proteins got better scores when reassigned to peptides from alternative proteins. Custom protein libraries that include sample-specific sequence variations of all possible open reading frames are promising contributions to the development of proteomics and precision medicine. The raw and processed proteomics data presented in this study can be found in PRIDE repository with accession number PXD029240.
During aging, changes in gene expression are associated with a decline in physical and cognitive abilities. Here, we investigate the connection between changes in mRNA and protein expression in the brain by comparing the transcriptome and proteome of the mouse cortex during aging. Our transcriptomic analysis revealed that aging mainly triggers gene activation in the cortex. We showed that an increase in mRNA expression correlates with protein expression, specifically in the anterior cingulate cortex, where we also observed an increase in cortical thickness during aging. Genes exhibiting an aging-dependent increase of mRNA and protein levels are involved in sensory perception and immune functions. Our proteomic analysis also identified changes in protein abundance in the aging cortex and highlighted a subset of proteins that were differentially enriched but exhibited stable mRNA levels during aging, implying the contribution of aging-related post- transcriptional and post-translational mechanisms. These specific genes were associated with general biological processes such as translation, ribosome assembly and protein degradation, and also important brain functions related to neuroplasticity. By decoupling mRNA and protein expression, we have thus characterized distinct subsets of genes that differentially adjust to cellular aging in the cerebral cortex.
Alternative splicing (AS) constitutes a mechanism by which protein-coding genes and long non-coding RNA (lncRNA) genes produce more than a single mature transcript. From plants to humans, AS is a powerful process that increases transcriptome complexity. Importantly, splice variants produced from AS can potentially encode for distinct protein isoforms which can lose or gain specific domains and, hence, differ in their functional properties. Advances in proteomics have shown that the proteome is indeed diverse due to the presence of numerous protein isoforms. For the past decades, with the help of advanced high-throughput technologies, numerous alternatively spliced transcripts have been identified. However, the low detection rate of protein isoforms in proteomic studies raised debatable questions on whether AS contributes to proteomic diversity and on how many AS events are really functional. We propose here to assess and discuss the impact of AS on proteomic complexity in the light of the technological progress, updated genome annotation, and current scientific knowledge.
During aging, changes in gene expression are associated with decline in physical and cognitive abilities. Here, we investigated the connection between changes of mRNA and protein expression in the brain by comparing the transcriptome and proteome of the mouse cortex during aging. Our transcriptomic analysis revealed that aging mainly triggers gene activation in the cortex. We showed that increase of mRNA expression correlates with protein expression, specifically in the anterior cingulate cortex where we also observed an increase of cortical thickness during aging. Genes exhibiting an aging-dependent increase of mRNA and protein levels are involved in sensory perception and immune functions. Our proteomic analysis also identified changes in protein abundance in the aging cortex and highlighted a subset of proteins that were differentially enriched but exhibited stable mRNA levels during aging, implying the contribution of aging-related post transcriptional mechanisms. These specific genes were associated with general biological processes such as translation, ribosome assembly and protein degradation, but also important brain functions related to neuroplasticity. By decoupling mRNA and protein expression, we have thus characterized distinct subsets of genes that differentially adjust to cellular aging in the cerebral cortex.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.