To improve proteome coverage and protein C-terminal identification, we characterized the Methanosarcina acetivorans thermophilic proteinase LysargiNase, which cleaves before lysine and arginine up to 55 °C. Unlike trypsin, LysargiNase-generated peptides had N-terminal lysine or arginine residues and fragmented with b ion-dominated spectra. This improved protein C terminal-peptide identification and several arginine-rich phosphosite assignments. Notably, cleavage also occurred at methylated or dimethylated lysine and arginine, facilitating detection of these epigenetic modifications.
As proteome-wide C-terminal sequence analysis has been largely intractable, we developed a polymer-based enrichment approach to profile protein C-terminal peptides by mass spectrometry and identified hundreds of C-terminal peptides in the Escherichia coli proteome. We isotopically labeled GluC protease-digested and undigested samples and identified GluC substrates and their cleavage sites by quantification of neo-C-terminal peptides. Our method thus enables global annotation of protein C-terminal posttranslational modifications, including proteolytic truncations.
Secreted and membrane tethered matrix metalloproteinases (MMPs) are key homeostatic proteases regulating the extracellular signaling and structural matrix environment of cells and tissues. For drug targeting of proteases, selectivity for individual molecules is highly desired and can be met by high yield active site specificity profiling. Using the high throughput Proteomic Identification of protease Cleavage Sites (PICS) method to simultaneously profile both the prime and non-prime sides of the cleavage sites of nine human MMPs, we identified more than 4300 cleavages from P6 to P6' in biologically diverse human peptide libraries. MMP specificity and kinetic efficiency were mainly guided by aliphatic and aromatic residues in P1' (with a ~32-93% preference for leucine depending on the MMP), and basic and small residues in P2' and P3', respectively. A wide differential preference for the hallmark P3 proline was found between MMPs ranging from 15 to 46%, yet when combined in the same peptide with the universally preferred P1' leucine, an unexpected negative cooperativity emerged. This was not observed in previous studies, probably due to the paucity of approaches that profile both the prime and non-prime sides together, and the masking of subsite cooperativity effects by global heat maps and iceLogos. These caveats make it critical to check for these biologically highly important effects by fixing all 20 amino acids one-by-one in the respective subsites and thorough assessing of the inferred specificity logo changes. Indeed an analysis of bona fide MEROPS physiological substrate cleavage data revealed that of the 37 natural substrates with either a P3-Pro or a P1'-Leu only 5 shared both features, confirming the PICS data. Upon probing with several new quenched-fluorescent peptides, rationally designed on our specificity data, the negative cooperativity was explained by reduced non-prime side flexibility constraining accommodation of the rigidifying P3 proline with leucine locked in S1'. Similar negative cooperativity between P3 proline and the novel preference for asparagine in P1 cements our conclusion that non-prime side flexibility greatly impacts MMP binding affinity and cleavage efficiency. Thus, unexpected sequence cooperativity consequences were revealed by PICS that uniquely encompasses both the non-prime and prime sides flanking the proteomic-pinpointed scissile bond.
A goal
of the Chromosome-centric Human Proteome Project is to identify
all human protein species. With 3844 proteins annotated as “missing”,
this is challenging. Moreover, proteolytic processing generates new
protein species with characteristic neo-N termini that are frequently
accompanied by altered half-lives, function, interactions, and location.
Enucleated and largely void of internal membranes and organelles,
erythrocytes are simple yet proteomically challenging cells due to
the high hemoglobin content and wide dynamic range of protein concentrations
that impedes protein identification. Using the N-terminomics procedure
TAILS, we identified 1369 human erythrocyte natural and neo-N-termini
and 1234 proteins. Multiple semitryptic N-terminal peptides exhibited
improved mass spectrometric identification properties versus the intact
tryptic peptide enabling identification of 281 novel erythrocyte proteins
and six missing proteins identified for the first time in the human
proteome. With an improved bioinformatics workflow, we developed a
new classification system and the Terminus Cluster Score. Thereby
we described a new stabilizing N-end rule for processed protein termini,
which discriminates novel protein species from degradation remnants,
and identified protein domain hot spots susceptible to cleavage. Strikingly,
68% of the N-termini were within genome-encoded protein sequences,
revealing alternative translation initiation sites, pervasive endoproteolytic
processing, and stabilization of protein fragments in vivo. The mass
spectrometry proteomics data have been deposited to ProteomeXchange
with the data set identifier
Glyoxysomes are a subclass of peroxisomes involved in lipid mobilization. Two distinct peroxisomal targeting signals (PTSs), the C-terminal PTS1 and the N-terminal PTS2, are defined. Processing of the PTS2 on protein import is conserved in higher eukaryotes. The cleavage site typically contains a Cys at P1 or P2. We purified the glyoxysomal processing protease (GPP) from the fat-storing cotyledons of watermelon (Citrullus vulgaris) by column chromatography, preparative native isoelectric focusing, and 2D PAGE. The GPP appears in two forms, a 72-kDa monomer and a 144-kDa dimer, which are in equilibrium with one another. The equilibrium is shifted on Ca 2؉ removal toward the monomer and on Ca 2؉ addition toward the dimer. The monomer is a general degrading protease and is activated by denatured proteins. The dimer constitutes the processing protease because the substrate specificity proven for the monomer (⌽-Arg/Lys2) is different from the processing substrate specificity (Cys-Xxx2/Xxx-Cys2) found with the mixture of monomer and dimer. The Arabidopsis genome analysis disclosed three proteases predicted to be in peroxisomes, a Deg-protease, a pitrilysin-like metallopeptidase, and a Lonprotease. Specific antibodies against the peroxisomal Degprotease from Arabidopsis (Deg15) identify the watermelon GPP as a Deg15. A knockout mutation in the DEG15 gene of Arabidopsis (At1g28320) prevents processing of the glyoxysomal malate dehydrogenase precursor to the mature form. Thus, the GPP/Deg15 belongs to a group of trypsin-like serine proteases with Escherichia coli DegP as a prototype. Nevertheless, the GPP/Deg15 possesses specific characteristics and is therefore a new subgroup within the Deg proteases.Arabidopsis thaliana ͉ Ca 2ϩ signal ͉ Citrullus vulgaris ͉ monomer/dimer equilibrium N umerous matrix enzymes have to be imported from the cytosol into peroxisomes, in plants especially into glyoxysomes for seed storage oil mobilization or into leaf peroxisomes for photorespiration. The majority of these enzymes are imported in their mature form and targeted by a C-terminal SKL designated peroxisomal targeting signal 1 (PTS1) (1). In a few matrix enzymes, the peroxisomal targeting signal 2 (PTS2) with the consensus RL-X5-HL is located in the N-terminal 30 to 50 amino acids of the protein (2). In plants, these are four enzymes of the glyoxylate cycle and -oxidation of fatty acids: glyoxysomal malate dehydrogenase (gMDH), glyoxysomal citrate synthase (gCS), acyl-CoA oxidase, and thiolase. In mammals, three enzymes with a PTS2 have been identified: thiolase, alkyl-DHAP synthase, and phytanoyl-CoA hydroxylase. In higher eukaryotes, such as plants and mammals, the PTS2 is removed on import; a Cys is consistently found near the cleavage site [supporting information (SI) Table 2]. In lower eukaryotes, such as yeasts, a PTS2 is present in the N terminus of the mature subunit of thiolase and amine oxidase (2) (SI Table 2). The Cys in position P2 is required for processing the presequences of gCS and gMDH in pumpkin; deletion of the...
To link cleaved substrates in complex systems with a specific protease, the protease active site specificity is required. Proteomic identification of cleavage sites (PICS) simultaneously determines both the prime- and non-prime-side specificities of individual proteases through identification of hundreds of individual cleavage sequences from biologically relevant, proteome-derived peptide libraries. PICS also identifies subsite cooperativity. To generate PICS peptide libraries, cellular proteomes are digested with a specific protease such as trypsin. Following protease inactivation, primary amines are protected. After incubation with a test protease, each prime-side cleavage fragment has a free newly formed N-terminus, which is biotinylated for affinity isolation and identification by liquid chromatography-tandem mass spectrometry. The corresponding non-prime sequences are derived bioinformatically. The step-by-step protocol also presents a web service for PICS data analysis, as well as introducing and validating PICS peptide libraries made from Escherichia coli.
Highlights • Single-pot workflow for manual or automated enrichment of N-terminal peptides. • Sensitive enrichment of protein N termini from 10,000 cells or 2 g crude proteome. • Data independent acquisition improves precision of peptide level quantification. • First degradomic analyses of sorted immune cells, single seedlings, and mitochondria from patient cells.
Two distinct peroxisomal targeting signals (PTSs), the C-terminal PTS1 and the N-terminal PTS2, are defined. Processing of the PTS2 on protein import is conserved in higher eukaryotes. Recently, candidates for the responsible processing protease were identified from plants (DEG15) and mammals (TYSND1). We demonstrate that plants lacking DEG15 show an expressed phenotype potentially linked to reduced b-oxidation, indicating the impact of protein processing on peroxisomal functions in higher eukaryotes. Mutational analysis of Arabidopsis (Arabidopsis thaliana) DEG15 revealed that conserved histidine, aspartic acid, and serine residues are essential for the proteolytic activity of this enzyme in vitro. This indicates that DEG15 and related enzymes are trypsin-like serine endopeptidases. Deletion of a plant-specific stretch present in the protease domain diminished, but did not abolish, the proteolytic activity of DEG15 against the PTS2-containing glyoxysomal malate dehydrogenase. Fluorescence microscopy showed that a DEG15-green fluorescent protein fusion construct is targeted to peroxisomes in planta. In vivo studies with isolated homozygous deg15 knockout mutants and complemented mutant lines suggest that this enzyme mediates general processing of PTS2-containing proteins.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.