2022
DOI: 10.1002/aic.17634
|View full text |Cite
|
Sign up to set email alerts
|

An accurate and interpretable deep learning model for environmental properties prediction using hybrid molecular representations

Abstract: Lipophilicity, as quantified by the decimal logarithm of the octanol–water partition coefficient (log KOW), is an essential environmental property. Deep neural networks (DNNs) based quantitative structure–property relationship (QSPR) studies have received more and more attention because of their excellent performance for prediction. However, the black‐box nature of DNNs limits the application range where interpretability is essential. Hence, this study aims to develop an accurate and interpretable deep neural … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
22
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 20 publications
(22 citation statements)
references
References 56 publications
0
22
0
Order By: Relevance
“…In this work, we rely on the chemical diversity analysis presented earlier and the William plot. The William plots are visual representations of the hat values and are considered a distance-based approach for identifying the applicability domain of QSPR models. ,, The hat values, also referred to as leverages, are proportional to the Hotelling T 2 and to the Mahalanobis distance which corrects for potential colinearity in the descriptors by using the covariance matrix . The leverages ( h i ) for compound i are the diagonal elements of the matrix H denoted hat matrix, which is acquired through eq , where X is the latent representation of the molecule, i.e., the output of the GNN part used as input to the MLP. H = X false( X T X false) 1 X T …”
Section: Methods and Proceduresmentioning
confidence: 99%
See 3 more Smart Citations
“…In this work, we rely on the chemical diversity analysis presented earlier and the William plot. The William plots are visual representations of the hat values and are considered a distance-based approach for identifying the applicability domain of QSPR models. ,, The hat values, also referred to as leverages, are proportional to the Hotelling T 2 and to the Mahalanobis distance which corrects for potential colinearity in the descriptors by using the covariance matrix . The leverages ( h i ) for compound i are the diagonal elements of the matrix H denoted hat matrix, which is acquired through eq , where X is the latent representation of the molecule, i.e., the output of the GNN part used as input to the MLP. H = X false( X T X false) 1 X T …”
Section: Methods and Proceduresmentioning
confidence: 99%
“…Plotting the acquired leverages against the standardized residuals results in the William plot, which has also been applied to define the domain of applicability of a wide range of QSPR models. , More details about the underlying theory can be found in the review paper on the domain of applicability of QSPR models …”
Section: Methods and Proceduresmentioning
confidence: 99%
See 2 more Smart Citations
“…Nevertheless, the UNIFAC model has not been reported in the screening and design of mixtures. Besides, the deep learning model can be used to predict the thermodynamic properties of solvents. However, the deep learning model is restricted to investigate systems lacking experimental data.…”
Section: Introductionmentioning
confidence: 99%