2021
DOI: 10.1101/2021.08.08.455550
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Improving QSAR Modeling for Predictive Toxicology using Publicly Aggregated Semantic Graph Data and Graph Neural Networks

Abstract: Quantitative Structure-Activity Relationship (QSAR) modeling is the most common computational technique for predicting chemical toxicity, but a lack of methodological innovations in QSAR have led to underwhelming performance. We show that contemporary QSAR modeling for predictive toxicology can be substantially improved by incorporating semantic graph data aggregated from open-access public databases, and analyzing those data in the context of graph neural networks (GNNs). Furthermore, we introspect the GNNs t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2
1

Relationship

3
0

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 26 publications
(31 reference statements)
0
4
0
Order By: Relevance
“…We also acknowledge the recent development of toxicology-focused graph databases, such as ComptoxAI, which provide extensive knowledge on relations among chemicals, genes, assays, and many other entities. 42 Such a database may help researchers generate a more comprehensive feature profile for model training and thus improve the performance of DTox. In addition to better feature profiling, we think the incorporation of context-specific knowledge may further enhance the performance of DTox.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…We also acknowledge the recent development of toxicology-focused graph databases, such as ComptoxAI, which provide extensive knowledge on relations among chemicals, genes, assays, and many other entities. 42 Such a database may help researchers generate a more comprehensive feature profile for model training and thus improve the performance of DTox. In addition to better feature profiling, we think the incorporation of context-specific knowledge may further enhance the performance of DTox.…”
Section: Discussionmentioning
confidence: 99%
“…For each nuclear receptor of interest, we extract compounds with known connections (source compounds) from two resources: DrugBank 45 and ComptoxAI. 42 The performance metric was computed as the proportion of active compounds (query) that exhibit similar structure to at least one source compound. Five thresholds of Tanimoto coefficient were adopted to define structural similarity between source and query compound: 0.8, 0.85, 0.9, 0.95, and 1.…”
Section: Methodsmentioning
confidence: 99%
“…Finally, despite the incorporation of pathway ontology, DTox VNN did not significantly outperform other well-established classification algorithms, as most differences are within the 95% confidence interval of performance metrics. We noticed the recent development of toxicology-focused graph database such as ComptoxAI, which provides extensive knowledge on relations among chemicals, genes, assays, as well as many other entities 40 . Such database may help us generate more comprehensive feature profile for model training, and thus improves the predicative performance of DTox.…”
Section: Discussionmentioning
confidence: 99%
“…ComptoxAI 19, 20 is a comprehensive graph knowledge base that contains curated relationships between chemicals, genes, assays, and many other entities. It contains two types of relationships linking compound and gene nodes: physical binding (with protein product) and expression-alteration (up-regulation/down-regulation).…”
Section: Methodsmentioning
confidence: 99%