Statistical analysis of 12 unstable and 32 stable proteins revealed that there are certain dipeptides, the occurrence of which is significantly different in the unstable proteins compared with those in the stable ones. Based on the impact of these dipeptides on the unstable proteins over the stable ones, a weight value of instability is assigned to each of the dipeptides. For a given protein the summation of these weight values normalized to the length of its sequence helps to distinguish between unstable and stable proteins. Results suggest that the in vivo instability of proteins is possibly determined by the order of certain amino acids in its sequence. An attempt is made to correlate metabolic stability of proteins with features of their primary sequence where weight values of instability for a protein of known sequence could thus be used as an index for predicting its stability characteristics.
The pregnancy-associated glycoproteins (PAGs) are secretory products synthesized by the outer epithelial cell layer (chorion) of the placentas of various ungulate species. The amino acid sequences of eight PAGs have been inferred from cloned cDNA of cattle and sheep, as well as of the non-ruminant pig and horse. We compare the PAG sequences and present results of the three-dimensional models of boPAG-1 and ovPAG-1 that were constructed on the basis of the crystal structures of homologous porcine pepsin and bovine chymosin using a rule-based comparative modelling approach. Further, we compare peptide binding subsites defined by interactions with pepstatin and a decapeptide inhibitor (CH-66) modelled on the basis of crystal structures of other aspartic proteinases. We have extended our analysis of the peptide binding subsites to the other PAG molecules of known sequence by aligning the PAG sequences to the structural template derived from the pepsin family and by making use of the three-dimensional models of the boPAG-1 and ovPAG-1. The residues that are likely to affect peptide binding in the boPAG-1, ovPAG-1 and other PAG molecules have been identified. Sequence comparisons reveal that all PAG molecules may have evolved from a pepsin-like progenitor molecule with the equine PAG most closely related to the pepsins. The presence of substitutions at the S1 and other subsites relative to pepsin make it unlikely that either bovine, ovine or the porcine PAG-1 have catalytic activity. Only two of the eight PAGs examined (porcine PAG-2 and equine PAG-1) retain features of active aspartic proteinases with pepsin-like activity. Our results indicate that in the PAGs so far characterized the peptide binding specificities differ significantly from each other and from pepsin, despite their high sequence identities. Analysis of the various peptide binding subsites demonstrates why both bovine and ovine PAG-1 are capable of binding pepstatin. The strong negative charge in the binding cleft of boPAG-1 and ovPAG-1 indicates a preference for lysine- or arginine-rich peptides. PAGs represent a family where the possible peptide binding function may be retained through their binding specificities, but where the catalytic activity may be lost in some cases, such as the boPAG-1, ovPAG-1 and the poPAG-1.
The family of aspartic proteinases includes several human enzymes that may play roles in both physiological and pathophysiological processes. The human lysosomal aspartic proteinase cathepsin D is thought to function in the normal degradation of intracellular and endocytosed proteins but has also emerged as a prognostic indicator of breast tumor invasiveness. Presented here are results from a continuing effort to elucidate the factors that contribute to specificity of ligand binding at individual subsites within the cathepsin D active site. The synthetic peptide Lys-Pro-Ile-Glu-Phe*Nph-Arg-Leu has proven to be an excellent chromogenic substrate for cathepsin D yielding a value of kcat/Km = 0.92 x 10(-6) s-1 M-1 for enzyme isolated from human placenta. In contrast, the peptide Lys-Pro-Ala-Lys-Phe*Nph-Arg-Leu and all derivatives with Ala-Lys in the P3-P2 positions are either not cleaved at all or cleaved with extremely poor efficiency. To explore the binding requirements of the S3 and S2 subsites of cathepsin D, a series of synthetic peptides was prepared with systematic replacements at the P2 position fixing either Ile or Ala in P3. Kinetic parameters were determined using both human placenta cathepsin D and recombinant human fibroblast cathepsin D expressed in Escherichia coli. A rule-based structural model of human cathepsin D, constructed on the basis of known three-dimensional structures of other aspartic proteinases, was utilized in an effort to rationalize the observed substrate selectivity.
A model of the barley-grain aspartic proteinase (HvAP; Hordeum vulgare aspartic proteinase) has been constructed using the rule-based comparative modelling approach encoded in the COMPOSER suite of computer programs. The model was based on the high resolution crystal structures of six highly homologous aspartic proteinases. Results suggest that the overall three-dimensional structure of HvAP (excluding the plant-specific insert; 104 residues in HvAP) is closer to human cathepsin D than other aspartic proteinases of known three-dimensional structure.
We have identified four novel repeats and two domains in cell surface proteins encoded by the Methanosarcina acetivorans genome and in some archaeal and bacterial genomes. The repeats correspond to a certain number of amino acid residues present in tandem in a protein sequence and each repeat is characterized by conserved sequence motifs. These correspond to: (a) a 42 amino acid (aa) residue RIVW repeat; (b) a 45 aa residue LGxL repeat; (c) a 42 aa residue LVIVD repeat; and (d) a 54 aa residue LGFP repeat. The domains correspond to a certain number of aa residues in a protein sequence that do not comprise internal repeats. These correspond to: (a) a 200 aa residue DNRLRE domain; and (b) a 70 aa residue PEGA domain. We discuss the occurrence of these repeats and domains in the different proteins and genomes analysed in this work.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.