An approach to sequence-dependent retention time prediction of peptides based on the concept of liquid chromatography at critical conditions (LCCC) is presented. Within the LCCC approach applied to biopolymers (BioLCCC), the specific retention time corresponds to a particular sequence. In combination with mass spectrometry, this approach provides an efficient tool to solve problems wherein the protein sequencing is essential. In this work, we present a theoretical background of the BioLCCC concept and demonstrate experimentally its feasibility for sequence-dependent LC retention time prediction for peptides. BioLCCC model is based on three notions: (a) a random walk model for a macromolecule chain; (b) an entropy and energy compensation for the macromolecules within the adsorbent pore; and (c) a set of phenomenological parameters for the effective interaction energies of interactions between the amino acid residues and the adsorbent surface. In this work, the phenomenological parameters have been obtained for C18 reversed-phase HPLC. Note, that contrary to alternative additive models for retention time prediction based on summation of the so-called "retention coefficients", the BioLCCC approach takes into account the location of amino acids within the primary structure of a peptide and, thus, allows the identification of the peptides having the same composition of amino acids but differing by their arrangement. As a result, this new approach allows prediction of retention time for any possible amino acid sequence in particular HPLC experiments. In addition, the BioLCCC model lacks of main drawbacks of additive approaches that predict retention time for sequences of limited chain lengths and provide information about amino acid composition only. The proposed BioLCCC approach was characterized experimentally using LTQ FT LC-MS and LC-MS/MS data obtained earlier for Escherichia coli. The HPLC system calibration was performed using peptide retention standards. The results received show a linear correlation between predicted and experimental retention times, with a correlation coefficient, R2, of 0.97 for a peptide standard mixture and 0.9 for E. coli data, respectively, with the standard error below 1 min. The work presents the first description of a BioLCCC approach for high-throughput peptide characterization and preliminary results of its feasibility tests.
Proteome characterization relies heavily on tandem mass spectrometry (MS/MS) and is thus associated with instrumentation complexity, lengthy analysis time, and limited duty-cycle. It was always tempting to implement approaches which do not require MS/MS, yet, they were constantly failing in achieving meaningful depth of quantitative proteome coverage within short experimental times, which is particular important for clinical or biomarker discovery applications. Here, we report on the first successful attempt to develop a truly MS/MS-free and label-free method for bottom-up proteomics. We demonstrate identification of 1000 protein groups for a standard HeLa cell line digest using 5-minute LC gradients. The amount of loaded sample was varied in a range from 1 ng to 500 ng, and the method demonstrated 10-fold higher sensitivity compared with the standard MS/MS-based approach. Due to significantly higher sequence coverage obtained by the developed method, it outperforms all popular MS/MSbased label-free quantitation approaches. Advances in mass-spectrometry-based proteomic technologies resulted in dramatically increased depth, throughput, and sensitivity of proteome coverage. Up to 10,000 proteins can be identified in an 100 minute analysis of human cell proteomes using state-of-the-art high-resolution Orbitrap mass spectrometry 1. Recently, the notable trend in LC-MS technology developments has been toward increasing the throughput of the proteome-wide analysis, while preserving the quantitation accuracy 2,3. However, these achievements rely heavily on the use of tandem mass spectrometry (MS/MS), which includes sequential isolation of eluting peptides followed by their fragmentation. While being a crucial and seemingly the only source of sequence-specific information about the peptides, MS/MS brings a number of well-known challenges. Due to the limited both the speed of the mass analyzer (which is
We present an open-source, extensible search engine for shotgun proteomics. Implemented in Python programming language, IdentiPy shows competitive processing speed and sensitivity compared with the state-of-the-art search engines. It is equipped with a user-friendly web interface, IdentiPy Server, enabling the use of a single server installation accessed from multiple workstations. Using a simplified version of X!Tandem scoring algorithm and its novel "autotune" feature, IdentiPy outperforms the popular alternatives on high-resolution data sets. Autotune adjusts the search parameters for the particular data set, resulting in improved search efficiency and simplifying the user experience. IdentiPy with the autotune feature shows higher sensitivity compared with the evaluated search engines. IdentiPy Server has built-in postprocessing and protein inference procedures and provides graphic visualization of the statistical properties of the data set and the search results. It is open-source and can be freely extended to use third-party scoring functions or processing algorithms and allows customization of the search workflow for specialized applications.
In the last couple of decades, considerable effort has been focused on developing methods for quantitative and qualitative proteome characterization. The method of choice in this characterization is mass spectrometry used in combination with sample separation. One of the most widely used separation techniques at the front end of a mass spectrometer is high performance liquid chromatography (HPLC). A unique feature of HPLC is its specificity to the amino acid sequence of separated peptides and proteins. This specificity may provide additional information about the peptides or proteins under study which is complementary to the mass spectrometry data. The value of this information for proteomics has been recognized in the past few decades, which has stimulated significant effort in the development and implementation of computational and theoretical models for the prediction of peptide retention time for a given sequence. Here we review the advances in this area and the utility of predicted retention times for proteomic applications.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.