Dmitry Zankov scite author profile

Modern QSAR approaches are widely used in drug discovery to design potentially bioactive molecules. Models based on 2D descriptors lose important information contained in the spatial structures of molecules. The major problem in constructing models with 3D descriptors is the choice of a putative bioactive conformation, which affects predictive performance. The multi-instance (MI) learning approach, which considers multiple conformations during model training, could be a reasonable solution to this problem. In this study, we implemented several multi-instance algorithms, both conventional and based on deep learning, and investigated their performance. We compared MI-QSAR models with models based on the classical single-instance QSAR (SI-QSAR) approach, in which each molecule is encoded either by 2D descriptors computed for the corresponding molecular graph or by 3D descriptors computed for a single lowest-energy conformation. The calculations were carried out on 175 data sets extracted from the ChEMBL23 database. It is demonstrated that (i) MI-QSAR outperforms SI-QSAR in numerous cases and (ii) MI algorithms can automatically identify plausible bioactive conformations.
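The multi-instance setting described in this abstract can be illustrated with a minimal sketch. It treats each molecule as a "bag" of conformer descriptor vectors and shows two simple aggregation strategies that are common in MI learning, plus a crude way to pick a candidate bioactive conformer. The abstract does not say which algorithms were implemented, so the function names, the toy linear model, and the descriptor values below are all assumptions made for illustration.

```python
from statistics import fmean

# A "bag" is one molecule: a list of conformer descriptor vectors (instances).

def bag_wrapper(bag):
    """Collapse a bag into one vector by averaging each descriptor over conformers."""
    return [fmean(column) for column in zip(*bag)]

def instance_wrapper(bag, predict_instance):
    """Score every conformer separately, then average the instance-level predictions."""
    return fmean(predict_instance(x) for x in bag)

def key_instance(bag, predict_instance):
    """Index of the highest-scoring conformer (first one wins on ties)."""
    scores = [predict_instance(x) for x in bag]
    return max(range(len(bag)), key=scores.__getitem__)

# Hypothetical toy model: a fixed linear function of two descriptors.
def toy_model(x):
    return 2.0 * x[0] - 1.0 * x[1]

# Hypothetical molecule with three conformers, two descriptors each.
molecule = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]

averaged_descriptors = bag_wrapper(molecule)
averaged_prediction = instance_wrapper(molecule, toy_model)
candidate_conformer = key_instance(molecule, toy_model)
```

In the single-instance setting only one of the three vectors would be kept, so the model would never see the other conformers. Here every conformer contributes, and the per-conformer scores give a rough indication of which conformation drives the prediction.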


Multi-Instance Learning Approach to Predictive Modeling of Catalysts Enantioselectivity

Zankov, Polishchuk, Madzhidov et al. 2021, Synlett


Here, we report an application of the multi-instance learning approach to predictive modeling of the enantioselectivity of chiral catalysts. Catalysts were represented by ensembles of conformations encoded by pmapper physicochemical descriptors, which capture the stereoconfiguration of the molecule. Each catalyzed chemical reaction was transformed into a condensed graph of reaction, for which ISIDA fragment descriptors were generated. This approach does not require alignment of conformations and can potentially be used for a diverse set of catalysts bearing different scaffolds. Its efficiency has been demonstrated in predicting the selectivity of BINOL-derived phosphoric acid catalysts in asymmetric thiol addition to N-acylimines and benchmarked against previously reported models.


Multiple Conformer Descriptors for QSAR Modeling

Nikonenko, Zankov, Baskin et al. 2021, Molecular Informatics


The most widely used QSAR approaches are mainly based on a 2D molecular representation, which ignores the stereoconfiguration and conformational flexibility of compounds. 3D QSAR uses a single conformer of each compound, which is difficult to choose reasonably. 4D QSAR uses multiple conformers to overcome the issues of 2D and 3D methods. However, many existing 4D QSAR models require conformers to be pre-aligned, while alignment-independent approaches often ignore the stereoconfiguration of compounds. In this study we propose a QSAR modeling approach based on transforming chirality-aware 3D pharmacophore descriptors of individual conformers into a set of latent variables representing the whole conformer set of a molecule. This is achieved by clustering together all conformers of all training set compounds. The final representation of a compound is a bit string encoding the cluster membership of its conformers. In our study we used Random Forest, but this representation can be combined with any machine learning method. We compared this approach with conventional 2D and 3D approaches using multiple data sets and investigated the sensitivity of the proposed approach to its tuning parameters: the number of conformers and the number of clusters.
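The encoding step described in this abstract (pool all conformers, cluster them, then represent each compound by the clusters its conformers occupy) can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the abstract does not name the clustering algorithm, so a tiny deterministic k-means stands in for it, and the two-dimensional descriptor vectors and molecule names are hypothetical.

```python
import math

def nearest(centroids, x):
    """Index of the centroid closest to x (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda k: math.dist(centroids[k], x))

def kmeans(points, k, iterations=20):
    """Tiny deterministic k-means (an assumed stand-in): first k points seed the centroids."""
    centroids = [list(p) for p in points[:k]]
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(centroids, p)].append(p)
        # Recompute each centroid; keep the old one if its cluster is empty.
        centroids = [
            [sum(col) / len(g) for col in zip(*g)] if g else c
            for g, c in zip(groups, centroids)
        ]
    return centroids

def encode(conformers, centroids):
    """Bit string with a 1 for every cluster occupied by at least one conformer."""
    bits = [0] * len(centroids)
    for x in conformers:
        bits[nearest(centroids, x)] = 1
    return bits

# Hypothetical training set: molecule name -> conformer descriptor vectors.
training = {
    "mol_a": [[0.0, 0.0], [0.2, 0.1]],
    "mol_b": [[5.0, 5.0], [5.1, 4.9]],
    "mol_c": [[0.1, 0.0], [5.0, 5.2]],
}

# Cluster the conformers of all training compounds together, then encode each compound.
pooled = [x for confs in training.values() for x in confs]
centroids = kmeans(pooled, k=2)
fingerprints = {name: encode(confs, centroids) for name, confs in training.items()}
```

The resulting fixed-length bit strings can be passed to any standard learner, such as the Random Forest mentioned in the abstract. The number of clusters `k` and the number of conformers per molecule correspond to the two tuning parameters whose sensitivity the study investigates.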



Dmitry Zankov

QSAR Modeling Based on Conformation Ensembles Using a Multi-Instance Learning Approach
