2016
DOI: 10.1080/21655979.2016.1149271
|View full text |Cite
|
Sign up to set email alerts
|

The importance of physicochemical characteristics and nonlinear classifiers in determining HIV-1 protease specificity

Abstract: This paper reviews recent research relating to the application of bioinformatics approaches to determining HIV-1 protease specificity, outlines outstanding issues, and presents a new approach to addressing these issues. Leading machine learning theory for the problem currently suggests that the direct encoding of the physicochemical properties of the amino acid substrates is not required for optimal performance. A number of amino acid encoding approaches which incorporate potentially relevant physicochemical p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 60 publications
(82 reference statements)
0
1
0
Order By: Relevance
“… [ 63 ] Rognvaldsson et al 64 , 65 argue that nonlinear models including neural networks should not be used for cleavage prediction, however the HIV dataset from Cai et al [ 66 ] that they used and their expanded dataset only included 299 and 746 samples, respectively. Additionally, physiochemical or structural encodings have outperformed one-hot encoding (also called orthogonal encoding) for their small HIV datasets [ 67 ] and have moreover eliminated differences between linear and nonlinear classifiers in an equivalent HCV dataset with 891 samples. [ 68 ] To my knowledge no one has expanded the 3CLpro cleavage dataset to the point where nonlinearity becomes significant, investigated the entire human proteome for 3CLpro cleavages sites with any method, or performed enrichment analysis and classification of these affected proteins.…”
Section: Introductionmentioning
confidence: 99%
“… [ 63 ] Rognvaldsson et al 64 , 65 argue that nonlinear models including neural networks should not be used for cleavage prediction, however the HIV dataset from Cai et al [ 66 ] that they used and their expanded dataset only included 299 and 746 samples, respectively. Additionally, physiochemical or structural encodings have outperformed one-hot encoding (also called orthogonal encoding) for their small HIV datasets [ 67 ] and have moreover eliminated differences between linear and nonlinear classifiers in an equivalent HCV dataset with 891 samples. [ 68 ] To my knowledge no one has expanded the 3CLpro cleavage dataset to the point where nonlinearity becomes significant, investigated the entire human proteome for 3CLpro cleavages sites with any method, or performed enrichment analysis and classification of these affected proteins.…”
Section: Introductionmentioning
confidence: 99%