2018
DOI: 10.3389/fmicb.2018.00476
|View full text |Cite
|
Sign up to set email alerts
|

PVP-SVM: Sequence-Based Prediction of Phage Virion Proteins Using a Support Vector Machine

Abstract: Accurately identifying bacteriophage virion proteins from uncharacterized sequences is important to understand interactions between the phage and its host bacteria in order to develop new antibacterial drugs. However, identification of such proteins using experimental techniques is expensive and often time consuming; hence, development of an efficient computational algorithm for the prediction of phage virion proteins (PVPs) prior to in vitro experimentation is needed. Here, we describe a support vector machin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
148
1
1

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 166 publications
(150 citation statements)
references
References 57 publications
0
148
1
1
Order By: Relevance
“…A five-step guideline has increasingly been endorsed (Manavalan, Tae Hwan Shin, et al 2018) in a series of recent publications, to develop a sequence-based predictor for a biological system that can easily be used, which goes as follow:…”
Section: Methodsmentioning
confidence: 99%
“…A five-step guideline has increasingly been endorsed (Manavalan, Tae Hwan Shin, et al 2018) in a series of recent publications, to develop a sequence-based predictor for a biological system that can easily be used, which goes as follow:…”
Section: Methodsmentioning
confidence: 99%
“…Alternatively, computational prediction methods for discovering gene-anatomical entity associations can be employed because of their higher speed and low resource consumption. Sequence similarity-based function prediction is such an example, which is widely used to predict the molecular functions of proteins [10, 11]. However, using it to predict anatomical associations of genes is questionable, because anatomical entities develop from a combination of several biological pathways that include proteins with diverse molecular functions and sequences [12].…”
Section: Introductionmentioning
confidence: 99%
“…Such small sample sets are challenging to model using traditional statistical methods, often709 requiring feature selection or regularization, as shown in(Wu et al, 2009). For example, in a 710 2018 study(Manavalan et al, 2018) this problem was approached using random forests to reduce 711 the total number of variables assigned to each tree, and resulted in an improved 87% accuracy,712 using leave-one-out cross-validation. Regularization is a core technique in machine learning used 713 to mitigate overfitting.…”
mentioning
confidence: 99%