Hepatic toxicity is a key concern for novel pharmaceutical drugs since it is difficult to anticipate in preclinical models, and it can originate from pharmacologically unrelated drug effects, such as pathway interference, metabolism, and drug accumulation. Because liver toxicity still ranks among the top reasons for drug attrition, the reliable prediction of adverse hepatic effects is a substantial challenge in drug discovery and development. To this end, more effort needs to be focused on the development of improved predictive in-vitro and in-silico approaches. Current computational models often lack applicability to novel pharmaceutical candidates, typically due to insufficient coverage of the chemical space of interest, which is either imposed by size or diversity of the training data. Hence, there is an urgent need for better computational models to allow for the identification of safe drug candidates and to support experimental design. In this context, a large data set comprising 3712 compounds with liver related toxicity findings in humans and animals was collected from various sources. The complex pathology was clustered into 21 preclinical and human hepatotoxicity endpoints, which were organized into three levels of detail. Support vector machine models were trained for each endpoint, using optimized descriptor sets from chemometrics software. The optimized global human hepatotoxicity model has high sensitivity (68%) and excellent specificity (95%) in an internal validation set of 221 compounds. Models for preclinical endpoints performed similarly. To allow for reliable prediction of "truly external" novel compounds, all predictions are tagged with confidence parameters. These parameters are derived from a statistical analysis of the predictive probability densities. The whole approach was validated for an external validation set of 269 proprietary compounds. The models are fully integrated into our early safety in-silico workflow.
One of the most appreciated capabilities
of computational toxicology
is to support the design of pharmaceuticals with reduced toxicological
hazard. To this end, we have strengthened our drug photosafety assessments
by applying novel computer models for the anticipation of in vitro
phototoxicity and human photosensitization. These models are typically
used in pharmaceutical discovery projects as part of the compound
toxicity assessments and compound optimization methods. To ensure
good data quality and aiming at models with global applicability we
separately compiled and curated highly chemically diverse data sets
from 3T3 NRU phototoxicity reports (450 compounds) and clinical photosensitization
alerts (1419 compounds) which are provided as supplements. The latter
data gives rise to a comprehensive list of explanatory fragments for
visual guidance, termed phototoxophores, by application of a Bayesian
statistics approach. To extend beyond the domain of well sampled fragments
we applied machine learning techniques based on explanatory descriptors
such as pharmacophoric fingerprints or, more important, accurate electronic
energy descriptors. Electronic descriptors were extracted from quantum
chemical computations at the density functional theory (DFT) level.
Accurate UV/vis spectral absorption descriptors and pharmacophoric
fingerprints turned out to be necessary for predictive computer models,
which were both derived from Deep Neural Networks but also the simpler
Random Decision Forests approach. Model accuracies of 83–85%
could typically be reached for diverse test data sets and other company
in-house data, while model sensitivity (the capability of correctly
detecting toxicants) was even better, reaching 86%–90%. Importantly,
a computer model-triggered response-map allowed for graphical/chemical
interpretability also in the case of previously unknown phototoxophores.
The photosafety models described here are currently applied in a prospective
manner for the hazard identification, prioritization, and optimization
of newly designed molecules.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.