The availability of human genome sequence has transformed biomedical research over the past decade. However, an equivalent map for the human proteome with direct measurements of proteins and peptides does not exist yet. Here, we present a draft map of the human proteome using high resolution Fourier transform mass spectrometry. In-depth proteomic profiling of 30 histologically normal human samples including 17 adult tissues, 7 fetal tissues and 6 purified primary hematopoietic cells resulted in identification of proteins encoded by 17,294 genes accounting for ~84% of the total annotated protein-coding genes in humans. A unique and comprehensive strategy for proteogenomic analysis enabled us to discover a number of novel protein-coding regions, which includes translated pseudogenes, non-coding RNAs and upstream ORFs. This large human proteome catalog (available as an interactive web-based resource at http://www.humanproteomemap.org) will complement available human genome and transcriptome data to accelerate biomedical research in health and disease.
Human Protein Reference Database (HPRD—http://www.hprd.org/), initially described in 2003, is a database of curated proteomic information pertaining to human proteins. We have recently added a number of new features in HPRD. These include PhosphoMotif Finder, which allows users to find the presence of over 320 experimentally verified phosphorylation motifs in proteins of interest. Another new feature is a protein distributed annotation system—Human Proteinpedia (http://www.humanproteinpedia.org/)—through which laboratories can submit their data, which is mapped onto protein entries in HPRD. Over 75 laboratories involved in proteomics research have already participated in this effort by submitting data for over 15 000 human proteins. The submitted data includes mass spectrometry and protein microarray-derived data, among other data types. Finally, HPRD is also linked to a compendium of human signaling pathways developed by our group, NetPath (http://www.netpath.org/), which currently contains annotations for several cancer and immune signaling pathways. Since the last update, more than 5500 new protein sequences have been added, making HPRD a comprehensive resource for studying the human proteome.
Sulforaphane (SFN), an isothiocyanate found in cruciferous vegetables, is a potent inhibitor of experimental mammary carcinogenesis and may be an effective, safe chemopreventive agent for use in humans. SFN acts in part on the Keap1/Nrf2 pathway to regulate a battery of cytoprotective genes. In this study transcriptomic and proteomic changes in the estrogen receptor negative, non tumorigenic human breast epithelial MCF10A cell line were analyzed following SFN treatment or KEAP1 knockdown with siRNA using microarray and stable isotopic labeling with amino acids in culture (SILAC), respectively. Changes in selected transcripts and proteins were confirmed by PCR and Western blot in MCF10A and MCF12A cells. There was strong correlation between the transcriptomic and proteomic responses in both the SFN treatment (R=0.679, P<0.05) and KEAP1 knockdown (R=0.853, P<0.05) experiments. Common pathways for SFN treatment and KEAP1 knockdown were xenobiotic metabolism and antioxidants, glutathione metabolism, carbohydrate metabolism and NADH/NADPH regeneration. Moreover, these pathways were most prominent in both the transcriptomic and proteomic analyses. The aldo-keto reductase family members, AKR1B10, AKR1C1, AKR1C2 and AKR1C3, as well as NQO1 and ALDH3A1, were highly upregulated at both the transcriptomic and proteomic level. Collectively, these studies served to identify potential biomarkers that can be used in clinical trials to investigate the initial pharmacodynamic action of SFN in the breast.
The study of the human urinary proteome has the potential to offer significant insights into normal physiology as well as disease pathology. The information obtained from such studies could be applied to the diagnosis of various diseases. The high sensitivity, resolution, and mass accuracy of the latest generation of mass spectrometers provides an opportunity to accurately catalog the proteins present in human urine, including those present at low levels. To this end, we carried out a comprehensive analysis of human urinary proteome from healthy individuals using high-resolution Fourier transform mass spectrometry. Importantly, we used the Orbitrap for detecting ions in both MS (resolution 60 000) and MS/MS (resolution 15 000) modes. To increase the depth of our analysis, we characterized both unfractionated as well as lectin-enriched proteins in our experiments. In all, we identified 1823 proteins with less than 1% false discovery rate, of which 671 proteins have not previously been reported as constituents of human urine. This data set should serve as a comprehensive reference list for future studies aimed at identification and characterization of urinary biomarkers for various diseases.
Cancer biomarkers are currently the subject of intense research because of their potential utility for diagnosis, prognosis, and targeted therapy. In theory, the gene products resulting from somatic mutations are the ultimate protein biomarkers, being not simply associated with tumors but actually responsible for tumorigenesis. We show here that the altered protein products resulting from somatic mutations can be identified directly and quantified by mass spectrometry. The peptides expressed from normal and mutant alleles were detected by selected reaction monitoring (SRM) of their product ions using a triple-quadrupole mass spectrometer. As a prototypical example of this approach, we demonstrated that it is possible to quantify the number and fraction of mutant Ras protein present in cancer cell lines. There were an average of 1.3 million molecules of Ras protein per cell, and the ratio of mutant to normal Ras proteins ranged from 0.49 to 5.6. Similarly, we found that mutant Ras proteins could be detected and quantified in clinical specimens such as colorectal and pancreatic tumor tissues as well as in premalignant pancreatic cyst fluids. In addition to answering basic questions about the relative levels of genetically abnormal proteins in tumors, this approach could prove useful for diagnostic applications.genetics | proteomics | pancreatic cancer | genetic diagnosis | companion diagnostics
In order to complement the recent genomic sequencing of Chinese hamster ovary (CHO) cells, proteomic analysis was performed on CHO including the cellular proteome, secretome, and glycoproteome using tandem mass spectrometry (MS/MS) of multiple fractions obtained from gel electrophoresis, multi-dimensional liquid chromatography, and solid phase extraction of glycopeptides (SPEG). From the 120 different mass spectrometry analyses generating 682,097 MS/MS spectra, 93,548 unique peptide sequences were identified with at most a 0.02 false discovery rate (FDR). A total of 6164 grouped proteins were identified from both glycoproteome and proteome analysis, representing an 8-fold increase in the number of proteins currently identified in the CHO proteome. Furthermore, this is the first proteomic study done using CHO genome exclusively which provides for more accurate identification of proteins. From this analysis, the CHO codon frequency was determined and found to be distinct from humans, which will facilitate expression of human proteins in CHO cells. Analysis of the combined proteomic and mRNA data sets indicated the enrichment of a number of pathways including protein processing and apoptosis but depletion of proteins involved in steroid hormone and glycosphingolipid metabolism. 504 of the detected proteins included N-acetylation modifications and 1292 different proteins were observed to be N-glycosylated. This first large-scale proteomic analysis will enhance the knowledge base about CHO capabilities for recombinant expression and provide information useful in cell engineering efforts aimed at modifying CHO cellular functions.
Keratoconus is a thinning corneal dystrophy that begins in the early teenage years and ultimately requires cornea transplantation to restore vision. Here we conducted a highly sensitive mass spectrometric analysis of the epithelium and the stroma from keratoconus and normal donor corneas. We identified a total of 932 and 1,157 proteins in the consolidated data of the epithelium and stroma, respectively. Technical replicates showed strong correlations (≥ 0.88) in levels of all common proteins, indicating very low technical variations in the data. Analysis of the most increased (≥ 1.5 fold) and decreased (≤ 0.8 fold) proteins in the keratoconus corneal epithelial protein extracts identified proteins related to dermal diseases, inflammation, epithelial stratification and mesenchymal changes. Increased proteins included keratins 6A, 16 and vimentin, while the iron transporter lactotransferrin was decreased. The keratoconus stromal proteome suggest endoplasmic reticular stress, oxidative stress and widespread decreases in many extracellular matrix proteoglycan core proteins, lumican and keratocan, collagen types I, III, V and XII. Marked increase in apoptosis and endocytosis-related proteins suggest degenerative changes in keratocytes, the resident cells of the stroma. This is the most comprehensive proteome analysis of the cornea that highlights similarities of keratoconus with other neurodegenerative diseases.
Plasma is one of the best studied compartments in the human body and serves as an ideal body fluid for the diagnosis of diseases. This report provides a detailed functional annotation of all the plasma proteins identified to date. In all, gene products encoded by 3778 distinct genes were annotated based on proteins previously published in the literature as plasma proteins and the identification of multiple peptides from proteins under HUPO's Plasma Proteome Project. Our analysis revealed that 51% of these genes encoded more than one protein isoform. All single nucleotide polymorphisms involving protein-coding regions were mapped onto the protein sequences. We found a number of examples of isoform-specific subcellular localization as well as tissue expression. This database is an attempt at comprehensive annotation of a complex subproteome and is available on the web at http://www.plasmaproteomedatabase.org.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.