2020
DOI: 10.1021/acs.chemrestox.0c00030
|View full text |Cite
|
Sign up to set email alerts
|

Ensemble Models Based on QuBiLS-MAS Features and Shallow Learning for the Prediction of Drug-Induced Liver Toxicity: Improving Deep Learning and Traditional Approaches

Abstract: Drug-induced liver injury (DILI) is a key safety issue in the drug discovery pipeline and a regulatory concern. Thus, many in silico tools have been proposed to improve the hepatotoxicity prediction of organic-type chemicals. Here, classifiers for the prediction of DILI were developed by using QuBiLS-MAS 0−2.5D molecular descriptors and shallow machine learning techniques, on a training set composed of 1075 molecules. The best ensemble model build, E13, was obtained with good statistical parameters for the lea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
24
0
1

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 36 publications
(30 citation statements)
references
References 69 publications
0
24
0
1
Order By: Relevance
“…The applicability domain analysis used in this study is based on a consensus score of four methods. The score represents the fraction of the four methods that indicate a molecule is considered inside the applicability domain, with scores ranging from zero (molecule is identified as an outlier by all four methods) to one (molecule is inside the AD for all four methods) [21,62]. A score greater than 0.25 was used as the criteria for identifying compounds inside the AD.…”
Section: Applicability Domainmentioning
confidence: 99%
See 1 more Smart Citation
“…The applicability domain analysis used in this study is based on a consensus score of four methods. The score represents the fraction of the four methods that indicate a molecule is considered inside the applicability domain, with scores ranging from zero (molecule is identified as an outlier by all four methods) to one (molecule is inside the AD for all four methods) [21,62]. A score greater than 0.25 was used as the criteria for identifying compounds inside the AD.…”
Section: Applicability Domainmentioning
confidence: 99%
“…In silico analysis is an economic strategy for the exploration of new compounds [21,22]. Quantitative structure-activity relationship (QSAR) reveals relationships between structural descriptors and biological activities of chemical compounds.…”
Section: Introductionmentioning
confidence: 99%
“…More recently, Minerali et al 58 have generated and compared ML algorithms for predicting DILI with in‐house software Assay Central (https://www.collaborationspharma.com/assay-central), using data previously collated by Pfizer and AstraZeneca research groups, along with data from the FDA, performing an ROC of 81%, sensitivity of 74%, specificity of 76%, and accuracy of 75% for the best Bayesian model based on the DILI‐concern category from the DILI Rank database. Noncommercial software has been developed by Mora et al, freely available at http://tomocomd.com/apps/ptoxra for the DILI prediction with ML models on a training set of 1075 molecules, attaining an accuracy of 84%, a sensibility of 89%, specificity of 76%, and ROC of 90%; 59 In the pharmaceutical company AstraZeneca Williams et al 60 predicted DILI by using ML, quantifying the probability that a compound falls into either low, medium, or high‐severity categories, with an accuracy of 63%. For a binary yes/no DILI prediction, the model obtained an accuracy of 86%, sensitivity of 87%, specificity of 85%, a positive predictive value of 92%, and a negative predictive value of 78%.…”
Section: Ai‐based Toxicity Predictionmentioning
confidence: 99%
“…Well-known examples of these models are the decision tree (DT) [ 55 ], random forest (RF) [ 56 ], and adaptive boosting (AdaBoost) [ 57 ]. Tree-based models are widely used in several research areas, such as metabolomics [ 58 ], disease detection [ 59 ], toxicological predictions [ 60 62 ], and stock prediction [ 63 ], due to their simplicity, efficiency, and interpretability.…”
Section: Introductionmentioning
confidence: 99%