An Ensemble Structure and Physicochemical (SPOC) Descriptor for Machine‐Learning Prediction of Chemical Reaction and Molecular Properties

Yang, Qi; Liu, Yidi; Cheng, Junjie; Li, Yao; Liu, Siyuan; Duan, Yingdong; Zhang, Long; Luo, Sanzhong

doi:10.1002/cphc.202200255

“…The first comparison was conducted over the results presented by [ 40 ] using a random forest (i.e., RF) and Morgan MFs on BBBP (0.909 ± 0.028 AUC), Tox21 (0.819 ± 0.017), SIDER (0.687 ± 0.014), and ClinTox (0.759 ± 0.060). In [ 41 ], the authors evaluated MAACS fingerprints over the Tox21 dataset achieving an AUC of 0.805 ± 0.01, an AUC of 0.721 ± 0.004 for BBBP, and an AUC equal to 0.797 ± 0.151 for Clintox, applying an ensemble of decision trees over 5-fold cross-validation. Another paper [ 42 ] focused on the Tox21 dataset reporting the outcomes of the in silico toxicity evaluation by five classifiers on Morgan fingerprints: the LightGBM overperformed other classifiers, reaching an AUC of 0.795 on the test set (standard deviation was not reported) for NR-AR.…”

Section: Resultsmentioning

confidence: 99%

“…Standard deviation was included if reported in the original papers. AUC values from [ 40 , 41 , 42 , 43 , 44 , 45 , 47 , 48 ].…”

Section: Figurementioning

confidence: 99%

See 1 more Smart Citation

Molecular Toxicity Virtual Screening Applying a Quantized Computational SNN-Based Framework

Nascimben

¹

,

Rimondini

²

2023

Molecules

2

0

View full text Add to dashboard Cite

Spiking neural networks are biologically inspired machine learning algorithms attracting researchers’ attention for their applicability to alternative energy-efficient hardware other than traditional computers. In the current work, spiking neural networks have been tested in a quantitative structure–activity analysis targeting the toxicity of molecules. Multiple public-domain databases of compounds have been evaluated with spiking neural networks, achieving accuracies compatible with high-quality frameworks presented in the previous literature. The numerical experiments also included an analysis of hyperparameters and tested the spiking neural networks on molecular fingerprints of different lengths. Proposing alternatives to traditional software and hardware for time- and resource-consuming tasks, such as those found in chemoinformatics, may open the door to new research and improvements in the field.

show abstract

“…17 Yang et al combined fingerprint and physicochemical descriptors for reaction prediction. 18 However, these descriptors are not easy to interpret nor generalize well outside the training reaction space except the Hammett and TSEI descriptors.…”

Section: Introductionmentioning

confidence: 99%

Reaxtica: A knowledge-guided machine learning platform for fast and accurate reaction selectivity and yield prediction

Lin

¹

,

Li

²

,

Lin

³

et al. 2022

Preprint

1

0

View full text Add to dashboard Cite

Reaction selectivity and yield prediction are important for chemical synthesis. Most existing computational methods use either computational expensive and complicated quantum mechanics-based models that are not easy for experimental chemists to use or black-box deep learning models that do not generalize well outside of the training space and lack explanation. Herein, using convenient physics-based electronic descriptors and structure-based steric descriptors, we developed an explainable machine learning platform, Reaxtica, that outperformed previous methods in four different reaction types and tasks, including regioselectivity, site-selectivity, enantioselectivity, and yield predictions. Further descriptor analysis helps understand reaction mechanisms behind the data. As a practical and robust toolbox, Reaxtica can be easily applied to different chemical reactions and extended to out-of-sample reaction. To assist chemists’ daily research, we further built an easy-to-use webserver, which only takes seconds to run and can be accessed at http://www.pkumdl.cn:8000/reaxtica/.

show abstract

“…17 Luo et al combined fingerprint and physicochemical descriptors for reaction prediction. 18 However, these descriptors are not easy to interpret nor generalize well outside the training reaction space except the Hammett and TSEI descriptors.…”

Section: Introductionmentioning

confidence: 99%

Reaxtica: A knowledge-guided machine learning platform for fast and accurate reaction selectivity and yield prediction

Lin

¹

,

Li

²

,

Lin

³

et al. 2022

Preprint

1

0

View full text Add to dashboard Cite

Reaction selectivity and yield prediction are important for chemical synthesis. Most existing computational methods use either computational expensive and complicated quantum mechanics-based models that are not easy for experimental chemists to use or black-box deep learning models that do not generalize well outside of the training space and lack explanation. Herein, using convenient physics-based electronic descriptors and structure-based steric descriptors, we developed an explainable machine learning platform, Reaxtica, that outperformed previous methods in four different reaction types and tasks, including regioselectivity, site-selectivity, enantioselectivity, and yield predictions. Further descriptor analysis helps understand reaction mechanisms behind the data. As a practical and robust toolbox, Reaxtica can be easily applied to different chemical reactions and extended to out-of-sample reaction. To assist chemists’ daily research, we further built an easy-to-use webserver, which only takes seconds to run and can be accessed at http://www.pkumdl.cn:8000/reaxtica/.

show abstract

An Ensemble Structure and Physicochemical (SPOC) Descriptor for Machine‐Learning Prediction of Chemical Reaction and Molecular Properties

Cited by 17 publications

References 65 publications

Molecular Toxicity Virtual Screening Applying a Quantized Computational SNN-Based Framework

Molecular Toxicity Virtual Screening Applying a Quantized Computational SNN-Based Framework

Reaxtica: A knowledge-guided machine learning platform for fast and accurate reaction selectivity and yield prediction

Reaxtica: A knowledge-guided machine learning platform for fast and accurate reaction selectivity and yield prediction

Contact Info

Product

Resources

About