2021
DOI: 10.1021/acs.jcim.1c00451
|View full text |Cite
|
Sign up to set email alerts
|

ChemBioSim: Enhancing Conformal Prediction of In Vivo Toxicity by Use of Predicted Bioactivities

Abstract: Computational methods such as machine learning approaches have a strong track record of success in predicting the outcomes of in vitro assays. In contrast, their ability to predict in vivo endpoints is more limited due to the high number of parameters and processes that may influence the outcome. Recent studies have shown that the combination of chemical and biological data can yield better models for in vivo endpoints. The ChemBioSim approach presented in this work aims to enhance the performance of conformal… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
38
0
2

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1

Relationship

3
3

Authors

Journals

citations
Cited by 20 publications
(40 citation statements)
references
References 72 publications
0
38
0
2
Order By: Relevance
“…The random split into a training set (80%) and a test set (20%) was also preserved. The chemical structures were processed with a refined preprocessing protocol that was developed by Garcia de Lomana et al [ 56 ]. This protocol includes the removal of solvents and salts, annotation of aromaticity, neutralization of charges, and mesomerization.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…The random split into a training set (80%) and a test set (20%) was also preserved. The chemical structures were processed with a refined preprocessing protocol that was developed by Garcia de Lomana et al [ 56 ]. This protocol includes the removal of solvents and salts, annotation of aromaticity, neutralization of charges, and mesomerization.…”
Section: Methodsmentioning
confidence: 99%
“…A set of 750 bioactivity descriptors related to 375 predicted binary assay outcomes was calculated for all compounds of the LLNA data set and the three reference data sets (the number of bioactivity descriptors is double that of the predicted binary assay outcomes because the predicted class probabilities of the active and the inactive class were included in the descriptor set independently from each other). More specifically, class probabilities for 372 bioactivity assays were calculated with aggregated Mondrian CP models that we trained on bioactivity assay data collected from ToxCast [ 60 ], eMolTox [ 61 ], the eChemPortal [ 62 ], and literature, following the identical protocol published by Garcia de Lomana et al [ 56 ]. In addition, predicted class probabilities for three assays relevant to skin sensitization prediction (i.e., DPRA, KeratinoSens assay, h-CLAT) were computed using Mondrian CP models generated by applying the identical model generation framework as described for the other assays [ 56 ] to the three corresponding data sets retrieved from Alves et al Prior to modeling, the standard scaler of the preprocessing module of scikit-learn [ 63 ] was used (with default settings) to normalize all bioactivity descriptors.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations