Daniyar Mazitov scite author profile

The selection of experimental conditions leading to a reasonable yield is an important and essential element for the automated development of a synthesis plan and the subsequent synthesis of the target compound. The classical QSPR approach, requiring one-to-one correspondence between chemical structure and a target property, can be used for optimal reaction conditions prediction only on a limited scale when only one condition component (e.g., catalyst or solvent) is considered. However, a particular reaction can proceed under several different conditions. In this paper, we describe the Likelihood Ranking Model representing an artificial neural network that outputs a list of different conditions ranked according to their suitability to a given chemical transformation. Benchmarking calculations demonstrated that our model outperformed some popular approaches to the theoretical assessment of reaction conditions, such as k Nearest Neighbors, and a recurrent artificial neural network performance prediction of condition components (reagents, solvents, catalysts, and temperature). The ability of the Likelihood Ranking model trained on a hydrogenation reactions dataset, (~42,000 reactions) from Reaxys® database, to propose conditions that led to the desired product was validated experimentally on a set of three reactions with rich selectivity issues.

show abstract

HyFactor: A Novel Open-Source, Graph-Based Architecture for Chemical Structure Generation

Akhmetshin

Lin

Mazitov

et al. 2022

J. Chem. Inf. Model.

View full text Add to dashboard Cite

Graph-based architectures are becoming increasingly popular as a tool for structure generation. Here, we introduce novel open-source architecture HyFactor in which, similar to the InChI linear notation, the number of hydrogens attached to the heavy atoms was considered instead of the bond types. HyFactor was benchmarked on the ZINC 250K, MOSES, and ChEMBL data sets against conventional graph-based architecture ReFactor, representing our implementation of the reported DEFactor architecture in the literature. On average, HyFactor models contain some 20% less fitting parameters than those of ReFactor. The two architectures display similar validity, uniqueness, and reconstruction rates. Compared to the training set compounds, HyFactor generates more similar structures than ReFactor. This could be explained by the fact that the latter generates many open-chain analogues of cyclic structures in the training set. It has been demonstrated that the reconstruction error of heavy molecules can be significantly reduced using the data augmentation technique.

show abstract

Named Entity Recognition in Russian Using Multi-Task LSTM-CRF

2023

View full text Add to dashboard Cite

Inverse QSAR: reversing descriptor-driven prediction pipeline using attention-based conditional variational autoencoder (ACoVAE)

Bort

Mazitov

Horvath

et al. 2022

Preprint

View full text Add to dashboard Cite

In order to better formalize the notorious Inverse-QSAR problem (finding structures of given QSAR-predicted properties) is considered in this paper as a two-step process1,2,3 including (i) finding “seed” descriptor vectors corresponding to user-constrained QSAR model output values and (ii) identifying the chemical structures best matching the “seed” vectors. The main development effort here was focused on the latter stage, proposing a new Attention-based Conditional Variational AutoEncoder (ACoVAE) neural-network architecture based on recent developments in attention-based methods. The obtained results show that this workflow was capable of generating compounds predicted to display desired activity, while being completely novel compared to the training database (ChEMBL). Moreover, the generated compounds show acceptable druglikeness and synthetic accessibility. Both pharmacophore and docking studies were carried out as “orthogonal” in silico validation methods, proving that some of de novo structures are, beyond being predicted active by 2D-QSAR models, clearly able to match binding 3D pharmacophores and bind the protein pocket

show abstract

HyFactor: Hydrogen-count labelled graph-based defactorization Autoencoder

Akhmetshin

Lin

Mazitov

et al. 2021

Preprint

View full text Add to dashboard Cite

Graph-based architectures are becoming increasingly popular as a tool for structure generation. Here, we introduce a novel open-source architecture HyFactor which is inspired by previously reported DEFactor architecture and based on the hydrogen labeled graphs. Since the original DEFactor code was not available, its new implementation (ReFactor) was prepared in this work for the benchmarking purpose. HyFactor demonstrates its high performance on the ZINC 250K MOSES and ChEMBL data set and in molecular generation tasks, it is considerably more effective than ReFactor. The code of HyFactor and all models obtained in this study are publicly available from our GitHub repository: https://github.com/Laboratoire-de-Chemoinformatique/hyfactor

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Daniyar Mazitov

Prediction of Optimal Conditions of Hydrogenation Reaction Using the Likelihood Ranking Approach

HyFactor: A Novel Open-Source, Graph-Based Architecture for Chemical Structure Generation

Named Entity Recognition in Russian Using Multi-Task LSTM-CRF

Inverse QSAR: reversing descriptor-driven prediction pipeline using attention-based conditional variational autoencoder (ACoVAE)

HyFactor: Hydrogen-count labelled graph-based defactorization Autoencoder

Contact Info

Product

Resources

About