This work discusses bioinformatics and experimental approaches to explore the human proteome, a constellation of proteins expressed in different tissues and organs. As the human proteome is not a static entity, it seems necessary to estimate the number of different protein species (proteoforms) and measure the number of copies of the same protein in a specific tissue. Here, meta-analysis of neXtProt knowledge base is proposed for theoretical prediction of the number of different proteoforms that arise from alternative splicing (AS), single amino acid polymorphisms (SAPs), and posttranslational modifications (PTMs). Three possible cases are considered: (1) PTMs and SAPs appear exclusively in the canonical sequences of proteins, but not in splice variants; (2) PTMs and SAPs can occur in both proteins encoded by canonical sequences and in splice variants; (3) all modification types (AS, SAP, and PTM) occur as independent events. Experimental validation of proteoforms is limited by the analytical sensitivity of proteomic technology. A bell-shaped distribution histogram was generated for proteins encoded by a single chromosome, with the estimation of copy numbers in plasma, liver, and HepG2 cell line. The proposed metabioinformatics approaches can be used for estimation of the number of different proteoforms for any group of protein-coding genes.
Future development of proteomics may be hindered by limitations in the concentration sensitivity of widespread technological approaches. The concentration sensitivity limit (CSL) of currently used approaches, like 2-DE/LC separation coupled with MS detection, etc., varies from 10 29 to 10 212 M. Therefore, proteomic technologies enable detection of up to 20% of the protein species present in the plasma. New technologies, like atomic force microscopy (AFM molecular detector), enable the counting of single molecules, whereas biospecific fishing can be used to capture these molecules from the biomaterial. At the same time, fishing also has thermodynamic limitations due to the reversibility of the binding. In cases where the fishing becomes irreversible, its combination with an AFM detector enables the registration of single protein molecules, and that opens up a way to lower the CSL down to the reverse Avogadro number.
In the absence of an analog of PCR for proteins, the concentration detection limit (DL) becomes a real challenge. The problem may be solved by means of a combination of biospecific irreversible fishing with atomic force microscopy (AFM). AFM offers the ability to register individual molecules and their complexes, while biospecific fishing takes advantage of an affine interaction between analyte molecules spread over a large volume of biomaterial and ligand molecules immobilized on the chip surface. Fishing may be conducted in Kd-dependent reversible mode and in Kd-independent irreversible mode. In this study, the DLs of two previously applied proteomic approaches were determined and compared to the DL of a newly developed analytical method. The first approach, based on MS analysis of biomaterial after 2-DE or LC separation of proteins, attained a DL at the level of 10(-8)-10(-10) M. The second approach, based on the optical biosensor analysis of molecular interactions in the format of proteomic microarrays, had a DL of 10(-9)-10(-10) M. Our proposed method which combines biospecific fishing with AFM allowed us to attain DL values of 10(-11) M under reversible binding conditions and 10(-16) M under irreversible binding conditions.
The final goal of the Russian part of the Chromosome-centric Human Proteome Project (C-HPP) was established as the analysis of the chromosome 18 (Chr 18) protein complement in plasma, liver tissue and HepG2 cells with the sensitivity of 10(-18) M. Using SRM, we have recently targeted 277 Chr 18 proteins in plasma, liver, and HepG2 cells. On the basis of the results of the survey, the SRM assays were drafted for 250 proteins: 41 proteins were found only in the liver tissue, 82 proteins were specifically detected in depleted plasma, and 127 proteins were mapped in both samples. The targeted analysis of HepG2 cells was carried out for 49 proteins; 41 of them were successfully registered using ordinary SRM and 5 additional proteins were registered using a combination of irreversible binding of proteins on CN-Br Sepharose 4B with SRM. Transcriptome profiling of HepG2 cells performed by RNAseq and RT-PCR has shown a significant correlation (r = 0.78) for 42 gene transcripts. A pilot affinity-based interactome analysis was performed for cytochrome b5 using analytical and preparative optical biosensor fishing followed by MS analysis of the fished proteins. All of the data on the proteome complement of the Chr 18 have been integrated into our gene-centric knowledgebase ( www.kb18.ru ).
Isatin (indole-2,3-dione) is an endogenous indole that has a distinct and discontinuous distribution in the brain and in other mammalian tissues and body fluids. Its output is increased under conditions of stress and anxiety. Isatin itself and its analogues exhibit a wide range of pharmacological activities but its specific biological targets still are not well characterized. Affinity chromatography of Triton X-100 lysates of soluble and particulate fractions of mouse and rat whole brain homogenates on 5-aminocaproyl-isatin-Sepharose followed by subsequent proteomic analysis resulted in identification of 65 and 64 individual proteins, respectively. Isatin-binding capacity of some of the identified proteins has been validated in an optical biosensor study using a Biacore 3000 optical biosensor, 5-aminocarproyl-isatin, and 5-aminoisatin as the affinity ligands. The K(d) values (of 0.1-20 microM) obtained during the optical biosensor experiments were consistent with the range of K(d) values recently reported for [(3)H]isatin binding to brain sections. Although the number of isatin-binding proteins identified in the mouse and rat brain was similar, only 21 proteins (about one-third) were identical in the two species. This may be one reason for the differences in isatin effects in rats and mice reported in the literature.
We report the results obtained in 2012-2013 by the Russian Consortium for the Chromosome-centric Human Proteome Project (C-HPP). The main scope of this work was the transcriptome profiling of genes on human chromosome 18 (Chr 18), as well as their encoded proteome, from three types of biomaterials: liver tissue, the hepatocellular carcinoma-derived cell line HepG2, and blood plasma. The transcriptome profiling for liver tissue was independently performed using two RNaseq platforms (SOLiD and Illumina) and also by droplet digital PCR (ddPCR) and quantitative RT-PCR. The proteome profiling of Chr 18 was accomplished by quantitatively measuring protein copy numbers in the three types of biomaterial (the lowest protein concentration measured was 10(-13) M) using selected reaction monitoring (SRM). In total, protein copy numbers were estimated for 228 master proteins, including quantitative data on 164 proteins in plasma, 171 in the HepG2 cell line, and 186 in liver tissue. Most proteins were present in plasma at 10(8) copies/μL, while the median abundance was 10(4) and 10(5) protein copies per cell in HepG2 cells and liver tissue, respectively. In summary, for liver tissue and HepG2 cells a "transcriptoproteome" was produced that reflects the relationship between transcript and protein copy numbers of the genes on Chr 18. The quantitative data acquired by RNaseq, PCR, and SRM were uploaded into the "Update_2013" data set of our knowledgebase (www.kb18.ru) and investigated for linear correlations.
In this study, to reduce the number of major plasma components, we examined thermostable plasma fractions to search for a biomarker of ovarian cancer. An apparent cancer biomarker of 11.7 kDa was detected in these fractions using ProteinChip SELDI-TOF mass spectrometry system. This peak invariably appeared with another close peak of about 11.5 kDa, suggesting that it is a derivative of a larger mass molecule. Of 27 cancer plasma specimens, 15 (55.6%) demonstrated this peak pair, whereas only 2 of 34 controls specimens (5.8%) were shown to express it with low intensity. Using a method involving cysteine modification by 4-vinylpyridine (4-VP), 2-DE and HPLC, these peaks were identified by mass spectrometry as serum amyloid A1 (11.68 kDa) and its N-terminal arginine-truncated form (11.52 kDa).
Adenosine-to-inosine RNA editing is one of the most common types of RNA editing, a posttranscriptional modification made by special enzymes. We present a proteomic study on this phenomenon for Drosophila melanogaster. Three proteome .
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.