2017
DOI: 10.1007/978-3-319-56850-8_1
|View full text |Cite
|
Sign up to set email alerts
|

Towards the Revival of Interpretable QSAR Models

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
40
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
1
1

Relationship

3
4

Authors

Journals

citations
Cited by 28 publications
(41 citation statements)
references
References 212 publications
0
40
0
Order By: Relevance
“…Using eighteen informative PCPs could provide faster and more cost-effective models, while model developers could gain an insight into the underlying prediction processes [58,[62][63][64]; (iii) selecting a powerful method for QSP prediction. Although, iQSP displayed a superior performance over the existing methods assessed by the rigorous crossvalidation methods, there is still room for further improvements, including increasing the size of QSPs by gathering peptide sequences from various data sources, utilizing an interpretable learning algorithm, such as scoring card method [44,53], improving the interpretation of important features responsible for the biological activity [50,64] and exploring different ML algorithms, such as extreme gradient boosting [65] or deep learning [66]. To further investigate the power of the proposed iQSP, we compared its performance with other conventional classifiers, i.e., k-nearest neighbor (k-NN), decision tree (rpart), and random forest (RF).…”
Section: Comparison With Existing Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Using eighteen informative PCPs could provide faster and more cost-effective models, while model developers could gain an insight into the underlying prediction processes [58,[62][63][64]; (iii) selecting a powerful method for QSP prediction. Although, iQSP displayed a superior performance over the existing methods assessed by the rigorous crossvalidation methods, there is still room for further improvements, including increasing the size of QSPs by gathering peptide sequences from various data sources, utilizing an interpretable learning algorithm, such as scoring card method [44,53], improving the interpretation of important features responsible for the biological activity [50,64] and exploring different ML algorithms, such as extreme gradient boosting [65] or deep learning [66]. To further investigate the power of the proposed iQSP, we compared its performance with other conventional classifiers, i.e., k-nearest neighbor (k-NN), decision tree (rpart), and random forest (RF).…”
Section: Comparison With Existing Methodsmentioning
confidence: 99%
“…As noticed in Table 7, the top seven important PCPs are QIAN880137, AURR980102, ROBB760113, PRAM820101, GRAR740101, PALJ810111, and Figure 3, the superior performance of our proposed model iQSP over 10-fold CV and independent validation test might mainly be due to the following reasons: (i) Performing with multiple random sampling procedure to protect against the risk of having good predictive result by chance [39][40][41][42][43]49,50]; (ii) using an efficient feature selection method (GA-SAR) to identify m informative features from 531 PCPs. Using eighteen informative PCPs could provide faster and more cost-effective models, while model developers could gain an insight into the underlying prediction processes [58,[62][63][64]; (iii) selecting a powerful method for QSP prediction. Although, iQSP displayed a superior performance over the existing methods assessed by the rigorous cross-validation methods, there is still room for further improvements, including increasing the size of QSPs by gathering peptide sequences from various data sources, utilizing an interpretable learning algorithm, such as scoring card method [44,53], improving the interpretation of important features responsible for the biological activity [50,64] and exploring different ML algorithms, such as extreme gradient boosting [65] or deep learning [66].…”
Section: Feature Contribution Analysismentioning
confidence: 99%
“…24,53 In particular, AD allows the relative estimation of the feasibility of predictions made on query compounds on the basis of how similar they are to the compounds used to train the model. There are several approaches that have been proposed to assess the AD of compounds.…”
Section: Applicability Domain Analysismentioning
confidence: 99%
“…Quantitative structure‐activity relationship models (QSAR models) apply the toolbox of chemometrics and chemoinformatics to predict the chemical, physical, or biological properties of chemicals based on a set of descriptor variables that describe the structure of the chemicals . The main areas of the application of QSAR models are the modelling of different biological activities (eg, antimalarial, antioxidant, and anti‐HIV), chemical properties (eg, chromatographic retention data and biodegradation), and toxic endpoints (eg, genotoxicity and hepatotoxicity).…”
Section: Introductionmentioning
confidence: 99%
“…Quantitative structure-activity relationship models (QSAR models) apply the toolbox of chemometrics and chemoinformatics to predict the chemical, physical, or biological properties of chemicals based on a set of descriptor variables that describe the structure of the chemicals. 1 The main areas of the application of QSAR models are the modelling of different biological activities (eg, antimalarial, 2 antioxidant, 3 and anti-HIV 4 ), chemical properties (eg, chromatographic retention data 5 and biodegradation 6 ), and toxic endpoints (eg, genotoxicity 7 and hepatotoxicity 8 ). Moreover, besides predicting the properties of certain molecular structures, the inverse task, the design of molecular structures for certain properties is achieved as well, as these QSAR models are a time-effective and economically friendly way to design new drugs.…”
Section: Introductionmentioning
confidence: 99%