Nowadays, mass spectrometry‐based proteomics is carried out primarily in a bottom‐up fashion, with peptides obtained after proteolytic digest of a whole proteome lysate as the primary analytes instead of the proteins themselves. This experimental setup crucially relies on a protease to digest an abundant and complex protein mixture into a far more complex peptide mixture. Full knowledge of the working mechanism and specificity of the used proteases is therefore crucial, both for the digestion step itself as well as for the downstream identification and quantification of the (fragmentation) mass spectra acquired for the peptides in the mixture. Targeted protein analysis through selected reaction monitoring, a relative newcomer in the specific field of mass spectrometry‐based proteomics, even requires a priori understanding of protease behavior for the proteins of interest. Because of the rapidly increasing popularity of proteomics as an analytical tool in the life sciences, there is now a renewed demand for detailed knowledge on trypsin, the workhorse protease in proteomics. This review addresses this need and provides an overview on the structure and working mechanism of trypsin, followed by a critical analysis of its cleavage behavior, typically simply accepted to occur exclusively yet consistently after Arg and Lys, unless they are followed by a Pro. In this context, shortcomings in our ability to understand and predict the behavior of trypsin will be highlighted, along with the downstream implications. Furthermore, an analysis is carried out on the inherent shortcomings of trypsin with regard to whole proteome analysis, and alternative approaches will be presented that can alleviate these issues. Finally, some reflections on the future of trypsin as the workhorse protease in mass spectrometry‐based proteomics will be provided. © 2013 Wiley Periodicals, Inc. Mass Spec Rev 32:453–465, 2013.
GH 11 (glycoside hydrolase family 11) xylanases are predominant enzymes in the hydrolysis of heteroxylan, an abundant structural polysaccharide in the plant cell wall. To gain more insight into the protein-ligand interactions of the glycone as well as the aglycone subsites of these enzymes, catalytically incompetent mutants of the Bacillus subtilis and Aspergillus niger xylanases were crystallized, soaked with xylo-oligosaccharides and subjected to X-ray analysis. For both xylanases, there was clear density for xylose residues in the -1 and -2 subsites. In addition, for the B. subtilis xylanase, there was also density for xylose residues in the -3 and +1 subsite showing the spanning of the -1/+1 subsites. These results, together with the observation that some residues in the aglycone subsites clearly adopt a different conformation upon substrate binding, allowed us to identify the residues important for substrate binding in the aglycone subsites. In addition to substrate binding in the active site of the enzymes, the existence of an unproductive second ligand-binding site located on the surface of both the B. subtilis and A. niger xylanases was observed. This extra binding site may have a function similar to the separate carbohydrate-binding modules of other glycoside hydrolase families.
Trypsin is the workhorse protease in mass spectrometry-based proteomics experiments and is used to digest proteins into more readily analyzable peptides. To identify these peptides after mass spectrometric analysis, the actual digestion has to be mimicked as faithfully as possible in silico. In this paper we introduce CP-DT (Cleavage Prediction with Decision Trees), an algorithm based on a decision tree ensemble that was learned on publicly available peptide identification data from the PRIDE repository. We demonstrate that CP-DT is able to accurately predict tryptic cleavage: tests on three independent data sets show that CP-DT significantly outperforms the Keil rules that are currently used to predict tryptic cleavage. Moreover, the trees generated by CP-DT can make predictions efficiently and are interpretable by domain experts.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.