Large Hydrophobic Residues (LHR) such as phenylalanine, isoleucine, leucine, methionine and valine play an important role in protein structure and activity. We describe the role of LHR in complete set of protein sequences in 15 different species. That is the distribution of LHR in different proteins of different species is reported. It is observed that the proteins prefer to have 27% of large hydrophobic residues in total and all along the sequence. It is also observed that proteins accumulate more LHR in its active sites. A window analysis on these protein sequences shows that the 27% of LHR is more frequent at window length of 45 amino acids. The influenza virus and P. falciparum show a random distribution of LHR in its proteins compared to other model organisms.
The tetrapeptides Li504 and Li520, differing in the modification of the 4trans-hydroxylation of proline, are novel conopeptides derived from the venom duct transcriptome of the marine cone snail Conus lividus. These predicted mature peptides are homologous to the active site motif of oxidoreductases that catalyze the oxidation, reduction, and rearrangement of disulfide bonds in peptides and proteins. The estimated reduction potential of the disulfide of Li504 and Li520 is within the range of disulfide reduction potentials of oxidoreductases, indicating that they may catalyze the oxidative folding of conotoxins. Conformational features of Li504 and Li520 include the trans configuration of the Cys1−Pro2/Hyp2 peptide bond with a type 1 turn that is similar to the active site motif of glutaredoxin that regulates the oxidation of cysteine thiols to disulfides. Li504-and Li520-assisted oxidative folding of α-conotoxin ImI confirms that Li520 improves the yield of the natively folded peptide by concomitantly decreasing the yield of the non-native disulfide isomer and thus acts as a miniature disulfide isomerase. The geometry of the Cys1−Hyp2 peptide bond of Li520 shifts between the trans and cis configurations in the disulfide form and thiol/thiolate form, which regulates the deprotonation of the N-terminal cysteine residue. Hydrogen bonding of the hydroxyl group of 4-trans-hydroxyproline with the interpeptide chain unit in the mixed disulfide form may play a vital role in shifting the geometry of the Cys1−Hyp2 peptide bond from cis to trans configuration. The Li520 conopeptide together with similar peptides derived from other species may constitute a new family of "redox-active" conopeptides that are integral components of the oxidative folding machinery of conotoxins.
The post-translational modification of N-terminal glutamine (Q) to a pyroglutamyl (Z) residue is observed in the conotoxins produced by marine cone snails. This conversion requires the action of the enzyme glutaminyl cyclase (QC). Four complete QC sequences from the species C. araneosus, C. frigidus, C. litteratus, and C. monile and two partial sequences from C. amadis and C. miles have been obtained by analysis of transcriptomic data. Comparisons with mammalian enzyme sequences establish a high level of identity and complete conservation of functional active site residues, including a cluster of hydrogen-bonded acidic side chains. Mass spectrometric analysis of crude venom samples coupled to conotoxin precursor protein sequences obtained from transcriptomic data establishes the presence of pyroglutamyl conotoxins in the venom of C. frigidus and C. amadis. The C. frigidus peptide belongs to the M superfamily, with cysteine framework III, whereas the C. amadis peptide belongs to the divergent superfamily with cysteine framework VI/VII. Additionally, gamma carboxylation of glutamic acid and hydroxylation of proline are observed in the C. frigidus peptide. Mass spectral data are available via ProteomeXchange with identifier PXD009006.
Sequence stretches in proteins that do not fold into a form are referred as disordered regions. Databases like Disport describe disordered regions in proteins and web servers like PrDOS and DisEMBL, facilitate the prediction of disordered regions. These studies are often based on residue level features. Here, we describe proteins with disordered regions using carbon content and distributions. The distribution pattern for proteins with disordered regions is different from those that do not show disordered regions.
Abstract:There are lots of works gone into proteins to understand the nature of proteins. Hydrophobic interaction is the dominant force that drives the proteins to carry out the biochemical reactions in all living system. Carbon is the only element that contributes towards this hydrophobic interaction. Studies find that globular proteins prefer to have 31.45% of carbon for its stability. Taking this as standard, a carbon analysis program has been developed to study the carbon distribution profile of protein sequences. This carbon analysis program has been made available online. This can be accessed at www.rajasekaran.net.in/tools/carbana.html. This new program is hoped to help in identification and development of active sites, study of protein stability, evolutionary understating of proteins, gene identification, ligand binding site identification, and to solve the long-standing problem of protein-protein and protein-DNA interactions.Keywords: carbon distribution; CARBANA analysis; hydrophobicity; carbon profile; hydropathy plot; Background:There is lot of work gone into proteins to understand the ultimate truth of real information [1-3]. Hydrophobic interaction is the dominant force that comes from presence of carbon. Recent studies reveal that proteins prefer to have 31.45% of carbon in its structure and in sequence [2]. To understand the buried information further in proteins this work has been taken up.
The occurrence of contryphans, a class of single-disulfide-bond-containing peptides, is demonstrated by the analysis of the venom of nine species of cone snails. Ten full gene sequences and two partial gene sequences coding for contryphan precursor proteins have been identified by next-generation sequencing and compared with available sequences. The occurrence of mature peptides in isolated venom has been demonstrated by LC-ESI-MS/MS analysis. De novo sequencing of reduced, alkylated contryphans from C. frigidus and C. araneosus provides evidence of sequence variation and post-translational modification, notably gamma carboxylation of glutamic acid. The characterization of Fr965 (C. frigidus) provides a rare example of a sequence lacking Pro at position 5 in the disulfide loop. The widespread occurrence of contryphan genes and mature peptides in the venom of diverse cone snails is suggestive of their potential biological significance.
There is lot of work gone into proteins to understand the nature of proteins. Hydrophobic interaction is the dominant force that drives the proteins to carry out the biochemical reactions in all living being. Carbon is the only element that contributes towards this hydrophobic interaction. In this study the carbon distribution along the protein sequence has been computed by representing the protein sequence as a series of atoms instead of amino acids. Given any length, there is a maximum frequency occurs at 31.44% of carbon. That is any globular proteins prefer to have 31.44% of carbon. This is not only in global but also in local. This newly identified carbon distribution profile is hoped to help in identification and development of active sites, study of protein stability, evolutionary understating of proteins, gene identification and to solve the long-standing problem of protein-protein and protein-DNA specific and non-specific interactions. This can also distinguish poisonous, viral and diseased proteins from the normal one.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.