2016
DOI: 10.1186/s12859-016-1114-6
|View full text |Cite
|
Sign up to set email alerts
|

Automated prediction of HIV drug resistance from genotype data

Abstract: BackgroundHIV/AIDS is a serious threat to public health. The emergence of drug resistance mutations diminishes the effectiveness of drug therapy for HIV/AIDS. Developing a computational prediction of drug resistance phenotype will enable efficient and timely selection of the best treatment regimens.ResultsA unified encoding of protein sequence and structure was used as the feature vector for predicting phenotypic resistance from genotype data. Two machine learning algorithms, Random Forest and K-nearest neighb… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
37
1

Year Published

2017
2017
2021
2021

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 27 publications
(42 citation statements)
references
References 23 publications
1
37
1
Order By: Relevance
“…The nearly uniform values of close to 1.0 for accuracy, PPV, recall, and F-score, show that the models reliably predict both resistant and non-resistant classes. These results compare favorably with our earlier results using non-generative machine learning algorithms [46, 8]. …”
Section: Resultssupporting
confidence: 87%
“…The nearly uniform values of close to 1.0 for accuracy, PPV, recall, and F-score, show that the models reliably predict both resistant and non-resistant classes. These results compare favorably with our earlier results using non-generative machine learning algorithms [46, 8]. …”
Section: Resultssupporting
confidence: 87%
“…Based on ligand-based virtual screening, kNN can be viewed as a prolongation form chemical similarity searching to supervised learning, and the top search results predicted the best bioactivities. Weber et al [79] tried two machine learning algorithms of classification (KNN and RF) to analyze genotype-phenotype datasets of HIV protease and reverse transcriptase (RT). As a result, both algorithms had high accuracies for predicting the drug resistance for protease and RT inhibitors.…”
Section: Classical Qsar Methodsmentioning
confidence: 99%
“…Further, the dataset was randomly divided into 5 subsets of approximately equal size, and 5 different ANNs were trained on datasets that comprised 4 of the 5 subsets, and then 5 different R 2 values were calculated; we then calculated the mean and the standard deviation of these 5 R 2 values. Regression performances were then compared against prediction models from the article published in 2016 by Shen and co-workers [ 19 ], in which regression machine learning models, namely the Random Forest and the K-nearest neighbor algorithms were used. The raw dataset used in this work and in ref.…”
Section: Methodsmentioning
confidence: 99%
“…Our method incorporated the following features: (a) The prediction algorithm used was a regression Artificial Neural Network (ANN); (b) because the great majority of publicly available data in the Stanford HIVdb is for subtype B HIV, only subtype B data was used in this database to train and test the network, so that the prediction algorithm is mainly applicable to subtype B sequence data; (c) in order to reduce data noise, various forms of data filtering, as described in the Methodology section, were used. Our regression ANN models compared favourably against recent work by Shen and co-workers [ 19 ], for which similar metrics were used. The ANN regression models were applied to the protease (PR) inhibitors fosamprenavir (FPV), atazanavir (ATV), indinavir (IDV), lopinavir (LPV), saquinavir (SQV), tipranavir (TPV), nelfinavir (NFV) and darunavir (DRV), and to the reverse transcriptase (RT) inhibitors lamivudine (3TC), abacavir (ABC), zidovudine (AZT), stavudine (D4T), didanosine (DDI), tenofovir (TDF), efavirenz (EFV), etravirine (ETR), nevirapine (NVP), rilpivirine (RPV).…”
Section: Introductionmentioning
confidence: 96%
See 1 more Smart Citation