2016
DOI: 10.1021/acs.chemrestox.5b00481
|View full text |Cite
|
Sign up to set email alerts
|

ToxCast EPA in Vitro to in Vivo Challenge: Insight into the Rank-I Model

Abstract: The ToxCast EPA challenge was managed by TopCoder in Spring 2014. The goal of the challenge was to develop a model to predict the lowest effect level (LEL) concentration based on in vitro measurements and calculated in silico descriptors. This article summarizes the computational steps used to develop the Rank-I model, which calculated the lowest prediction error for the secret test data set of the challenge. The model was developed using the publicly available Online CHEmical database and Modeling environment… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
20
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 26 publications
(21 citation statements)
references
References 44 publications
(98 reference statements)
1
20
0
Order By: Relevance
“…The overall idea was to build several independent models and average their output. This procedure is known to avoid overfitting and minimize prediction errors (Novotarskyi et al, ). In our approach, 30 independent models were produced through stratified random partition of the original dataset to produce two subsets, the training set and the validation set (Figure ).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The overall idea was to build several independent models and average their output. This procedure is known to avoid overfitting and minimize prediction errors (Novotarskyi et al, ). In our approach, 30 independent models were produced through stratified random partition of the original dataset to produce two subsets, the training set and the validation set (Figure ).…”
Section: Resultsmentioning
confidence: 99%
“…The overall idea was to build several independent models and average their output. This procedure is known to avoid overfitting and minimize prediction errors (Novotarskyi et al, 2016).…”
Section: Resultsmentioning
confidence: 99%
“…Lipo Lipophilicity [47] 4,200 BBBP Blood-brain barrier [43] 2,039 BACE IC50 of human β-secretase 1 (BACE-1) inhibitors [43] 1,513 JAK3 Janus kinase 3 inhibitor [48] 886 DHFR Dihydrofolate reductase inhibition [49] 739 BioDeg Biodegradability [50] 1,737 LEL Lowest effect level [51] 483 RP AR Endocrine disruptors [52] 930 To validate the model, we sampled 500,000 ChEMBL-like SMILES (only 8,617 (1.7%) of them were canonical) from a generator [53] and checked how accurately the model can restore canonical SMILES for these molecules. We intentionally selected the generated SMILES keeping in mind possible application of the proposed method in the artificial intelligence-driven pipelines of de-novo development of new drugs.…”
Section: Validation Datasetsmentioning
confidence: 99%
“…Models available on OCHEM were frequently top‐performing within different benchmarking exercises, e.g., prediction of metal complexation, environmental toxicity, readily biodegradability, endocrine disruptors, logP,,, AMES toxicity, CYP450 inhibition, etc., as well as contributed the top‐rank model for the EPA ToxCast and the best overall balanced accuracy for twelve targets of the NIH Tox21 challenges. The OCHEM is widely used for educational purposes by a number of Universities across the world.…”
Section: (Q)sar Models In Integrated Modeling Environmentsmentioning
confidence: 99%