Abstract:BackgroundHeme binding proteins (HBPs) are metalloproteins that contain a heme ligand (an iron-porphyrin complex) as the prosthetic group. Several computational methods have been proposed to predict heme binding residues and thereby to understand the interactions between heme and its host proteins. However, few in silico methods for identifying HBPs have been proposed.ResultsThis work proposes a scoring card method (SCM) based method (named SCMHBP) for predicting and analyzing HBPs from sequences. A balanced d… Show more
“…Compared to earlier SCM-based methods [10, 13, 14], SCMBYK is more strictly formulated for the purpose of characterization of BY-kinases, as long as it relies on a carefully selected dataset of 26 different bacterial phyla. With the advent of next-generation sequencing, the rate at which protein databases grow is very fast.…”
Section: Discussionmentioning
confidence: 99%
“…The original SCM algorithm was first proposed by Huang et al [10] and was consequently applied to discriminate and analyze proteins with various functions [8–10, 13, 14] based on their sequence information. To train the classifier, two FASTA files are expected as the input: one for the positive training data and one for the negative training data.…”
Section: Methodsmentioning
confidence: 99%
“…SCM-PCP is a PCP mining method used to identify the important physicochemical properties (PCPs) based on the propensity scores of 20 amino acids [13]. To find a set of PCPs possibly correlated with a considered protein function, we examined the 544 indices representing different PCPs available from the AA-index database.…”
BackgroundBacterial tyrosine-kinases (BY-kinases), which play an important role in numerous cellular processes, are characterized as a separate class of enzymes and share no structural similarity with their eukaryotic counterparts. However, in silico methods for predicting BY-kinases have not been developed yet. Since these enzymes are involved in key regulatory processes, and are promising targets for anti-bacterial drug design, it is desirable to develop a simple and easily interpretable predictor to gain new insights into bacterial tyrosine phosphorylation. This study proposes a novel SCMBYK method for predicting and characterizing BY-kinases.ResultsA dataset consisting of 797 BY-kinases and 783 non-BY-kinases was established to design the SCMBYK predictor, which achieved training and test accuracies of 97.55 and 96.73%, respectively. Furthermore, the leave-one-phylum-out method was used to predict specific bacterial phyla hosts of target sequences, gaining 97.39% average test accuracy. After analyzing SCMBYK-derived propensity scores, four characteristics of BY-kinases were determined: 1) BY-kinases tend to be composed of α-helices; 2) the amino-acid content of extracellular regions of BY-kinases is expected to be dominated by residues such as Val, Ile, Phe and Tyr; 3) BY-kinases structurally resemble nuclear proteins; 4) different domains play different roles in triggering BY-kinase activity.ConclusionsThe SCMBYK predictor is an effective method for identification of possible BY-kinases. Furthermore, it can be used as a part of a novel drug repurposing method, which recognizes putative BY-kinases and matches them to approved drugs. Among other results, our analysis revealed that azathioprine could suppress the virulence of M. tuberculosis, and thus be considered as a potential antibiotic for tuberculosis treatment.Electronic supplementary materialThe online version of this article (doi:10.1186/s12859-016-1371-4) contains supplementary material, which is available to authorized users.
“…Compared to earlier SCM-based methods [10, 13, 14], SCMBYK is more strictly formulated for the purpose of characterization of BY-kinases, as long as it relies on a carefully selected dataset of 26 different bacterial phyla. With the advent of next-generation sequencing, the rate at which protein databases grow is very fast.…”
Section: Discussionmentioning
confidence: 99%
“…The original SCM algorithm was first proposed by Huang et al [10] and was consequently applied to discriminate and analyze proteins with various functions [8–10, 13, 14] based on their sequence information. To train the classifier, two FASTA files are expected as the input: one for the positive training data and one for the negative training data.…”
Section: Methodsmentioning
confidence: 99%
“…SCM-PCP is a PCP mining method used to identify the important physicochemical properties (PCPs) based on the propensity scores of 20 amino acids [13]. To find a set of PCPs possibly correlated with a considered protein function, we examined the 544 indices representing different PCPs available from the AA-index database.…”
BackgroundBacterial tyrosine-kinases (BY-kinases), which play an important role in numerous cellular processes, are characterized as a separate class of enzymes and share no structural similarity with their eukaryotic counterparts. However, in silico methods for predicting BY-kinases have not been developed yet. Since these enzymes are involved in key regulatory processes, and are promising targets for anti-bacterial drug design, it is desirable to develop a simple and easily interpretable predictor to gain new insights into bacterial tyrosine phosphorylation. This study proposes a novel SCMBYK method for predicting and characterizing BY-kinases.ResultsA dataset consisting of 797 BY-kinases and 783 non-BY-kinases was established to design the SCMBYK predictor, which achieved training and test accuracies of 97.55 and 96.73%, respectively. Furthermore, the leave-one-phylum-out method was used to predict specific bacterial phyla hosts of target sequences, gaining 97.39% average test accuracy. After analyzing SCMBYK-derived propensity scores, four characteristics of BY-kinases were determined: 1) BY-kinases tend to be composed of α-helices; 2) the amino-acid content of extracellular regions of BY-kinases is expected to be dominated by residues such as Val, Ile, Phe and Tyr; 3) BY-kinases structurally resemble nuclear proteins; 4) different domains play different roles in triggering BY-kinase activity.ConclusionsThe SCMBYK predictor is an effective method for identification of possible BY-kinases. Furthermore, it can be used as a part of a novel drug repurposing method, which recognizes putative BY-kinases and matches them to approved drugs. Among other results, our analysis revealed that azathioprine could suppress the virulence of M. tuberculosis, and thus be considered as a potential antibiotic for tuberculosis treatment.Electronic supplementary materialThe online version of this article (doi:10.1186/s12859-016-1371-4) contains supplementary material, which is available to authorized users.
“…Derived from a scoring card method (SCM), the latest prediction method for heme binding to proteins "SCMHBP" benefits from an evaluation of heme-binding tendencies of 400 dipeptides and 20 amino acids, which is transferred onto protein sequences. Consequently, two non-redundant training datasets were designed with 747 heme-binding proteins and 747 non-heme-binding proteins, and two already existing datasets were taken into account for testing the SCMHBP, resulting in a mean accuracy of 85.9% [9,10,14]. In another approach, Zhang et al clustered 4003 X-ray structures of heme-binding proteins via Blastclust [15] with a sequence identity of less than 30% and selected 260 representatives for testing [16].…”
Background: The notion of heme as a regulator of many physiological processes via transient binding to proteins is one that is recently being acknowledged. The broad spectrum of the effects of heme makes it important to identify further heme-regulated proteins to understand physiological and pathological processes. Moreover, several proteins were shown to be functionally regulated by interaction with heme, yet, for some of them the hemebinding site(s) remain unknown. The presented application HeMoQuest enables identification and qualitative evaluation of such heme-binding motifs from protein sequences. Results: We present HeMoQuest, an online interface (http://bit.ly/hemoquest) to algorithms that provide the user with two distinct qualitative benefits. First, our implementation rapidly detects transient heme binding to nonapeptide motifs from protein sequences provided as input. Additionally, the potential of each predicted motif to bind heme is qualitatively gauged by assigning binding affinities predicted by an ensemble learning implementation, trained on experimentally determined binding affinity data. Extensive testing of our implementation on both existing and new manually curated datasets reveal that our method produces an unprecedented level of accuracy (92%) in identifying those residues assigned "heme binding" in all of the datasets used. Next, the machine learning implementation for the prediction and qualitative assignment of binding affinities to the predicted motifs achieved 71% accuracy on our data. Conclusions: Heme plays a crucial role as a regulatory molecule exerting functional consequences via transient binding to surfaces of target proteins. HeMoQuest is designed to address this imperative need for a computational approach that enables rapid detection of heme-binding motifs from protein datasets. While most existing implementations attempt to predict sites of permanent heme binding, this application is to the best of our knowledge, the first of its kind to address the significance of predicting transient heme binding to proteins.
“…However, the use of a HasA hemophore incorporating an iron electron-transfer system for the asymmetric oxidation (with oxygen) of secondary alcohols in organic synthesis has not yet been examined as a heterogeneous enzyme catalysis reaction [3]. However, it is generally accepted that the system of hemophore HasA secreted by host ABC transporters [4] enables heme uptake across the cell outer membrane [5] and spontaneously transforms it into the HasR receptor at the heme-binding site [6].…”
Abstract:This study aims to demonstrate the coordination of oxygen regarding the hemophore HasApf expressed by Escherichia coli cells, which appears to create an unlikely oxygen-activating system in HasA due to the already-coordinated iron.In the asymmetric oxidation of rac-1-(6-methoxynaphthalen-2-yl)ethanol (rac-1) using dissolved oxygen, the signals at g-values of 2.8, 2.22, and 1.72 in the electron spin resonance (ESR) spectra disappeared in conjunction with the promotion of oxoferric (Fe III´O -O´) species in the distal site. These results suggest that the iron of porphyrin/Fe may be oxidized in water, leading to exhibition of greater asymmetric oxidation activity in the promotion of oxoferryl (Fe IV =O) species. A ketone (~50% chemical yield) produced from (R)-(´)-sec-alcohol can be desymmetrized by NaBH 4 in aqueous medium at 40˝C (>99% enantiomer excess, ee, >90% chemical yield) in the absence of NAD(P). Therefore, HasA can be regenerated via successive asymmetric catalytic events through an incorporated iron electron-transfer system in the presence of oxygen: Fe II + O 2 Ñ Fe III´O -O´Ñ Fe IV =O (oxidizing rac-1) Ñ Fe II + H 2 O. This process is similar to a Fenton reaction. The use of a HasA-catalytic system with an incorporated redox cofactor for asymmetric oxidation overcomes the apparent difficulties in working with pure dehydrogenase enzyme/redox cofactor systems for biotransformations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.