2013
DOI: 10.1093/protein/gzt042
|View full text |Cite
|
Sign up to set email alerts
|

Using ensemble SVM to identify human GPCRs N-linked glycosylation sites based on the general form of Chou's PseAAC

Abstract: As the most frequent drug target, G-protein coupled receptors (GPCRs) are a large family of seven transmembrane receptors that sense molecules outside the cell and activate inside signal transduction pathways. Glycosylation is one of the most complex post-translational modifications (PTMs) of proteins in eukaryotic cells. It plays important roles in a variety of cellular functions, including protein folding, protein trafficking and localization, cell-cell interactions and epitope recognition. Therefore, invest… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 91 publications
(24 citation statements)
references
References 50 publications
0
24
0
Order By: Relevance
“…To avoid completely losing the sequence-order information for proteins, the pseudo amino acid composition [96,97] or Chou’s PseAAC [98] was proposed. Ever since the concept of PseAAC was proposed in 2001 [96], it has penetrated into almost all the areas of computational proteomics, such as predicting anticancer peptides [99], predicting protein subcellular location [100106], predicting membrane protein types [107,108], predicting protein submitochondria locations [109112], predicting GABA(A) receptor proteins [113], predicting enzyme subfamily classes [114], predicting antibacterial peptides [115], predicting supersecondary structure [116], predicting bacterial virulent proteins [117], predicting protein structural class [118], predicting the cofactors of oxidoreductases [119], predicting metalloproteinase family [120], identifying cysteine S -nitrosylation sites in proteins [66], identifying bacterial secreted proteins [121], identifying antibacterial peptides [115], identifying allergenic proteins [122], identifying protein quaternary structural attributes [123,124], identifying risk type of human papillomaviruses [125], identifying cyclin proteins [126], identifying GPCRs and their types [15,16], discriminating outer membrane proteins [127], classifying amino acids [128], detecting remote homologous proteins [129], among many others (see a long list of papers cited in the References section of [60]). Moreover, the concept of PseAAC was further extended to represent the feature vectors of nucleotides [65], as well as other biological samples (see, e.g., [130132]).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…To avoid completely losing the sequence-order information for proteins, the pseudo amino acid composition [96,97] or Chou’s PseAAC [98] was proposed. Ever since the concept of PseAAC was proposed in 2001 [96], it has penetrated into almost all the areas of computational proteomics, such as predicting anticancer peptides [99], predicting protein subcellular location [100106], predicting membrane protein types [107,108], predicting protein submitochondria locations [109112], predicting GABA(A) receptor proteins [113], predicting enzyme subfamily classes [114], predicting antibacterial peptides [115], predicting supersecondary structure [116], predicting bacterial virulent proteins [117], predicting protein structural class [118], predicting the cofactors of oxidoreductases [119], predicting metalloproteinase family [120], identifying cysteine S -nitrosylation sites in proteins [66], identifying bacterial secreted proteins [121], identifying antibacterial peptides [115], identifying allergenic proteins [122], identifying protein quaternary structural attributes [123,124], identifying risk type of human papillomaviruses [125], identifying cyclin proteins [126], identifying GPCRs and their types [15,16], discriminating outer membrane proteins [127], classifying amino acids [128], detecting remote homologous proteins [129], among many others (see a long list of papers cited in the References section of [60]). Moreover, the concept of PseAAC was further extended to represent the feature vectors of nucleotides [65], as well as other biological samples (see, e.g., [130132]).…”
Section: Resultsmentioning
confidence: 99%
“…Only the jackknife test is the least arbitrary that can always yield a unique result for a given benchmark dataset [73,74,156,166168]. Therefore, the jackknife test has been widely recognized and increasingly utilized by investigators to examine the quality of various predictors (see, e.g., [14,15,68,99,106,107,124,169,170]). Accordingly, in this study the jackknife test was also adopted to evaluate the accuracy of the current predictor.…”
Section: Methodsmentioning
confidence: 99%
“…Since the concept of PseAAC was proposed in 2001 [29], it has been penetrating into almost all the fields of protein attribute predictions (see, e.g., [3173]). Because it has been widely used, recently two types of open access software, called “PseAAC-Builder” [51] and “propy” [74], were established for generating various modes of PseAAC.…”
Section: Methodsmentioning
confidence: 99%
“…It is considered to be the most rigorous test for evaluation of performance (Chou and Zhang, 1995;Kumar and Raghava, 2009). Although it is time consuming but gives better results than other cross-validation methods (Feng et al, 2013;Xie et al, 2013;Xu et al, 2013;Zhou and Assa-Munt, 2001). During the LOOCV, each protein in the dataset was, in turn, used for testing by the classifier trained with the remaining proteins.…”
Section: Cross Validation and Performance Evaluationmentioning
confidence: 99%