2021
DOI: 10.1101/2021.05.03.442477
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Predicting Environmental Chemical Carcinogenicity using a Hybrid Machine-Learning Approach

Abstract: Determining environmental chemical carcinogenicity is an urgent need as humans are increasingly exposed to these chemicals. In this study, we determined the carcinogenicity of wide variety real-life exposure chemicals in large scale. To determine chemical carcinogenicity, we have developed carcinogenicity prediction models based on the hybrid neural network (HNN) architecture. In the HNN model, we included new SMILES feature representation method, by modifying our previous 3D array representation of 1D SMILES … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 21 publications
0
5
0
Order By: Relevance
“…Instead, we utilize a dataset used by CarcinoPred-EL to design a comparable method and evaluate it on the continuous CPDB data. Furthermore we design an AdaBoost decision tree model emulating methodology from the work of Limbu and Dakshanamurthy (2021) . We also compare on an external CCRIS test set where our model outperforms all variants of CarcinoPred-EL the AdaBoost decision tree ( Table 2 ).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Instead, we utilize a dataset used by CarcinoPred-EL to design a comparable method and evaluate it on the continuous CPDB data. Furthermore we design an AdaBoost decision tree model emulating methodology from the work of Limbu and Dakshanamurthy (2021) . We also compare on an external CCRIS test set where our model outperforms all variants of CarcinoPred-EL the AdaBoost decision tree ( Table 2 ).…”
Section: Resultsmentioning
confidence: 99%
“…A number of approaches use cheminformatics features [such as molecular descriptors ( Landrum, 2016 ; Moriwaki et al , 2018 )] to construct regression models for continuous carcinogenicity prediction ( Fjodorova et al , 2010 ; Singh et al , 2013 ; Wu et al , 2015 ). Recently, Limbu and Dakshanamurthy (2021) compared various regression models of these sorts, of which AdaBoost ( Freund and Schapire, 1997 ) was found to be among the most accurate for dose-rate prediction.…”
Section: Related Workmentioning
confidence: 99%
“…Then we concatenate the CNN and FFNN models but not their individual predictions to make the final prediction. We included a new SMILES feature representation method in the HNN framework by modifying our previous 3D array representation of 1D SMILES simulated by the convolutional neural network (CNN) [32,33]. We assembled protein-ligand binding data such as Kd, Ki, and the combined Kd and Ki values from a general set and refined set obtained from the PDBBind database.…”
Section: Resultsmentioning
confidence: 99%
“…15,29,30,130−134 Limbu and Dakshanamurthy built carcinogenicity prediction models using a hybrid NN architecture with three data sets containing more than 10,000 chemicals, and 653 molecular descriptors. 130 The hybrid NN achieved an average ACC of 74.3% and mean ROC-AUC of 80.6%, which were superior to those of the AdaBoost model but not those of the Bagging and RF models using the same data. One of the leading causes of low predictive accuracy of DL models is sparse data sets.…”
Section: Carcinogenesis Predictionmentioning
confidence: 85%
“…In recent years, many AI-based models and tools have been developed to predict the carcinogenic potential of compounds. ,,, Limbu and Dakshanamurthy built carcinogenicity prediction models using a hybrid NN architecture with three data sets containing more than 10,000 chemicals, and 653 molecular descriptors . The hybrid NN achieved an average ACC of 74.3% and mean ROC-AUC of 80.6%, which were superior to those of the AdaBoost model but not those of the Bagging and RF models using the same data.…”
Section: Recent Advances In Ai-based Drug Toxicity Predictionmentioning
confidence: 99%