SCMHBP: prediction and analysis of heme binding proteins using propensity scores of dipeptides

Liou, Yi Fan; Charoenkwan, Phasit; Sathipati, Srinivasulu Yerukala; Vasylenko, Tamara; Lai, Shih Chung; Lee, Hua Chin; Chen, Yi Hsiung; Huang, Hui Ling; Ho, Shinn-Ying

doi:10.1186/1471-2105-15-s16-s4

Cited by 29 publications

(35 citation statements)

References 63 publications

(104 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Compared to earlier SCM-based methods [10, 13, 14], SCMBYK is more strictly formulated for the purpose of characterization of BY-kinases, as long as it relies on a carefully selected dataset of 26 different bacterial phyla. With the advent of next-generation sequencing, the rate at which protein databases grow is very fast.…”

Section: Discussionmentioning

confidence: 99%

“…The original SCM algorithm was first proposed by Huang et al [10] and was consequently applied to discriminate and analyze proteins with various functions [8–10, 13, 14] based on their sequence information. To train the classifier, two FASTA files are expected as the input: one for the positive training data and one for the negative training data.…”

Section: Methodsmentioning

confidence: 99%

“…SCM-PCP is a PCP mining method used to identify the important physicochemical properties (PCPs) based on the propensity scores of 20 amino acids [13]. To find a set of PCPs possibly correlated with a considered protein function, we examined the 544 indices representing different PCPs available from the AA-index database.…”

Section: Methodsmentioning

confidence: 99%

See 2 more Smart Citations

SCMBYK: prediction and characterization of bacterial tyrosine-kinases based on propensity scores of dipeptides

et al. 2016

Self Cite

View full text Add to dashboard Cite

BackgroundBacterial tyrosine-kinases (BY-kinases), which play an important role in numerous cellular processes, are characterized as a separate class of enzymes and share no structural similarity with their eukaryotic counterparts. However, in silico methods for predicting BY-kinases have not been developed yet. Since these enzymes are involved in key regulatory processes, and are promising targets for anti-bacterial drug design, it is desirable to develop a simple and easily interpretable predictor to gain new insights into bacterial tyrosine phosphorylation. This study proposes a novel SCMBYK method for predicting and characterizing BY-kinases.ResultsA dataset consisting of 797 BY-kinases and 783 non-BY-kinases was established to design the SCMBYK predictor, which achieved training and test accuracies of 97.55 and 96.73%, respectively. Furthermore, the leave-one-phylum-out method was used to predict specific bacterial phyla hosts of target sequences, gaining 97.39% average test accuracy. After analyzing SCMBYK-derived propensity scores, four characteristics of BY-kinases were determined: 1) BY-kinases tend to be composed of α-helices; 2) the amino-acid content of extracellular regions of BY-kinases is expected to be dominated by residues such as Val, Ile, Phe and Tyr; 3) BY-kinases structurally resemble nuclear proteins; 4) different domains play different roles in triggering BY-kinase activity.ConclusionsThe SCMBYK predictor is an effective method for identification of possible BY-kinases. Furthermore, it can be used as a part of a novel drug repurposing method, which recognizes putative BY-kinases and matches them to approved drugs. Among other results, our analysis revealed that azathioprine could suppress the virulence of M. tuberculosis, and thus be considered as a potential antibiotic for tuberculosis treatment.Electronic supplementary materialThe online version of this article (doi:10.1186/s12859-016-1371-4) contains supplementary material, which is available to authorized users.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Methodsmentioning

confidence: 99%

See 1 more Smart Citation

SCMBYK: prediction and characterization of bacterial tyrosine-kinases based on propensity scores of dipeptides

et al. 2016

Self Cite

View full text Add to dashboard Cite

show abstract

“…Derived from a scoring card method (SCM), the latest prediction method for heme binding to proteins "SCMHBP" benefits from an evaluation of heme-binding tendencies of 400 dipeptides and 20 amino acids, which is transferred onto protein sequences. Consequently, two non-redundant training datasets were designed with 747 heme-binding proteins and 747 non-heme-binding proteins, and two already existing datasets were taken into account for testing the SCMHBP, resulting in a mean accuracy of 85.9% [9,10,14]. In another approach, Zhang et al clustered 4003 X-ray structures of heme-binding proteins via Blastclust [15] with a sequence identity of less than 30% and selected 260 representatives for testing [16].…”

Section: Introductionmentioning

confidence: 99%

HeMoQuest: a webserver for qualitative prediction of transient heme binding to protein motifs

et al. 2020

View full text Add to dashboard Cite

Background: The notion of heme as a regulator of many physiological processes via transient binding to proteins is one that is recently being acknowledged. The broad spectrum of the effects of heme makes it important to identify further heme-regulated proteins to understand physiological and pathological processes. Moreover, several proteins were shown to be functionally regulated by interaction with heme, yet, for some of them the hemebinding site(s) remain unknown. The presented application HeMoQuest enables identification and qualitative evaluation of such heme-binding motifs from protein sequences. Results: We present HeMoQuest, an online interface (http://bit.ly/hemoquest) to algorithms that provide the user with two distinct qualitative benefits. First, our implementation rapidly detects transient heme binding to nonapeptide motifs from protein sequences provided as input. Additionally, the potential of each predicted motif to bind heme is qualitatively gauged by assigning binding affinities predicted by an ensemble learning implementation, trained on experimentally determined binding affinity data. Extensive testing of our implementation on both existing and new manually curated datasets reveal that our method produces an unprecedented level of accuracy (92%) in identifying those residues assigned "heme binding" in all of the datasets used. Next, the machine learning implementation for the prediction and qualitative assignment of binding affinities to the predicted motifs achieved 71% accuracy on our data. Conclusions: Heme plays a crucial role as a regulatory molecule exerting functional consequences via transient binding to surfaces of target proteins. HeMoQuest is designed to address this imperative need for a computational approach that enables rapid detection of heme-binding motifs from protein datasets. While most existing implementations attempt to predict sites of permanent heme binding, this application is to the best of our knowledge, the first of its kind to address the significance of predicting transient heme binding to proteins.

show abstract

“…However, the use of a HasA hemophore incorporating an iron electron-transfer system for the asymmetric oxidation (with oxygen) of secondary alcohols in organic synthesis has not yet been examined as a heterogeneous enzyme catalysis reaction [3]. However, it is generally accepted that the system of hemophore HasA secreted by host ABC transporters [4] enables heme uptake across the cell outer membrane [5] and spontaneously transforms it into the HasR receptor at the heme-binding site [6].…”

Section: Introductionmentioning

confidence: 99%

Heterogeneous Asymmetric Oxidation Catalysis Using Hemophore HasApf. Application in the Chemoenzymatic Deracemization of sec-Alcohols with Sodium Borohydride

Nagaoka

2016

Catalysts

View full text Add to dashboard Cite

Abstract:This study aims to demonstrate the coordination of oxygen regarding the hemophore HasApf expressed by Escherichia coli cells, which appears to create an unlikely oxygen-activating system in HasA due to the already-coordinated iron.In the asymmetric oxidation of rac-1-(6-methoxynaphthalen-2-yl)ethanol (rac-1) using dissolved oxygen, the signals at g-values of 2.8, 2.22, and 1.72 in the electron spin resonance (ESR) spectra disappeared in conjunction with the promotion of oxoferric (Fe III´O -O´) species in the distal site. These results suggest that the iron of porphyrin/Fe may be oxidized in water, leading to exhibition of greater asymmetric oxidation activity in the promotion of oxoferryl (Fe IV =O) species. A ketone (~50% chemical yield) produced from (R)-(´)-sec-alcohol can be desymmetrized by NaBH 4 in aqueous medium at 40˝C (>99% enantiomer excess, ee, >90% chemical yield) in the absence of NAD(P). Therefore, HasA can be regenerated via successive asymmetric catalytic events through an incorporated iron electron-transfer system in the presence of oxygen: Fe II + O 2 Ñ Fe III´O -O´Ñ Fe IV =O (oxidizing rac-1) Ñ Fe II + H 2 O. This process is similar to a Fenton reaction. The use of a HasA-catalytic system with an incorporated redox cofactor for asymmetric oxidation overcomes the apparent difficulties in working with pure dehydrogenase enzyme/redox cofactor systems for biotransformations.

show abstract

SCMHBP: prediction and analysis of heme binding proteins using propensity scores of dipeptides

Cited by 29 publications

References 63 publications

SCMBYK: prediction and characterization of bacterial tyrosine-kinases based on propensity scores of dipeptides

SCMBYK: prediction and characterization of bacterial tyrosine-kinases based on propensity scores of dipeptides

HeMoQuest: a webserver for qualitative prediction of transient heme binding to protein motifs

Heterogeneous Asymmetric Oxidation Catalysis Using Hemophore HasApf. Application in the Chemoenzymatic Deracemization of sec-Alcohols with Sodium Borohydride

Contact Info

Product

Resources

About