2023
DOI: 10.1039/d2dd00099g
|View full text |Cite
|
Sign up to set email alerts
|

Chemical representation learning for toxicity prediction

Abstract: A chemical language model for molecular property prediction: it outperforms prior art, is validated on a large, proprietary toxicity dataset, reveals cytotoxic motifs through attention & uses two uncertainty techniques to improve model reliability.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 100 publications
0
7
0
Order By: Relevance
“…For example, the linking of GHS hazard statements to respective chemicals (see “Terminology Component”) is essential for automating chemical hazard identification and risk assessment. Moreover, the availability of big data on chemical structures and associated properties can help in predicting potential products’ characteristics, such as toxicology . We demonstrated in “Data Access via Complex Queries”, how OntoSpecies can be used in multiobjective experimental planning tasks such as solvent selection.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, the linking of GHS hazard statements to respective chemicals (see “Terminology Component”) is essential for automating chemical hazard identification and risk assessment. Moreover, the availability of big data on chemical structures and associated properties can help in predicting potential products’ characteristics, such as toxicology . We demonstrated in “Data Access via Complex Queries”, how OntoSpecies can be used in multiobjective experimental planning tasks such as solvent selection.…”
Section: Resultsmentioning
confidence: 99%
“…Moreover, the availability of big data on chemical structures and associated properties can help in predicting potential products' characteristics, such as toxicology. 95 We demonstrated in "Data Access via Complex Queries", how OntoSpecies can be used in multiobjective experimental planning tasks such as solvent selection.…”
Section: Case 4: Check Coherence Of Data Reported In Pubchemmentioning
confidence: 99%
“…In drug discovery, these tools have impacted bioactivity prediction, 1,2 de novo molecular design, 3–8 synthesis prediction 9–15 and molecular property prediction. 16–20 In turn, the demand for high-quality data has increased beyond the extent of existing data sources 21 and there is a need to facilitate a larger number of informative experiments to generate data in a standardized format. Combinatorial chemistry is a popular method for producing large collections of compounds, motivated by material efficiency and more sustainable chemistry 22,23 since synthesis of 100 molecules using two building blocks per synthesis could in the worst case require 200 different building blocks, whereas a library of the same size using combinatorial chemistry would use 20 in a 10 × 10 design.…”
Section: Introductionmentioning
confidence: 99%
“…The second of the two environments contains just a single atom, but has five occurrences in the compound. The environment attribution is shared between those five atoms (5,6,7,8,9). To obtain the depiction, the atom attributions obtained from all bits are aggregated.…”
Section: ■ Introductionmentioning
confidence: 99%
“…Typical machine learning methods include k-nearest neighbors, support vector machines, random forest, gradient tree boosting, and more recently deep neural networks (DNN) . Chemical fingerprints are widely used as input features, however, a recent focus has been on the use of different types of descriptors including, for example, string representations such as SMILES, , depictions of chemical structures as inputs to DNNs, and 2D and 3D chemical graphs which have been used with both classical machine learning methods and with more novel graph-based DNN architectures.…”
Section: Introductionmentioning
confidence: 99%