2021
DOI: 10.1021/acs.jcim.1c00251
|View full text |Cite|
|
Sign up to set email alerts
|

Alignment-Free Antimicrobial Peptide Predictors: Improving Performance by a Thorough Analysis of the Largest Available Data Set

Abstract: In the last two decades, a large number of machine-learning-based predictors for the activities of antimicrobial peptides (AMPs) have been proposed. These predictors differ from one another in the learning method and in the training and testing data sets used. Unfortunately, the training data sets present several drawbacks, such as a low representativeness regarding the experimentally validated AMP space, and duplicated peptide sequences between negative and positive data sets. These limitations give a low con… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
69
2

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 32 publications
(71 citation statements)
references
References 104 publications
0
69
2
Order By: Relevance
“… Target or calibration databases: We considered five databases of APPs and non-APPs reported in ref. 30 There were different balanced and unbalanced datasets stored in five FASTA files with thousands of labeled APPs and non-APPs (SI1C-G).…”
Section: Description Of Modelsmentioning
confidence: 99%
See 4 more Smart Citations
“… Target or calibration databases: We considered five databases of APPs and non-APPs reported in ref. 30 There were different balanced and unbalanced datasets stored in five FASTA files with thousands of labeled APPs and non-APPs (SI1C-G).…”
Section: Description Of Modelsmentioning
confidence: 99%
“…To assess the relative performance of the mQSSMs, we used the five data sets of APPs and non-APPs recently provided in ref. 30 These datasets were obtained from starPepDB, whose description can be found in https://biocom-ampdiscover.cicese.mx/dataset. Each set of queries and similarity thresholds were wrapped into a calibration algorithm, comprising a modified virtual screening simulation technique.…”
Section: Selection Of the Best Of Models And Comparisons With ML Apps Prediction Serversmentioning
confidence: 99%
See 3 more Smart Citations