Human Resource Information Systems in Health Care: Protocol for a Systematic Review

CGRtools is an open-source Python library aimed to handle molecular and reaction information. It is the sole library developed so far which can process condensed graph of reaction (CGR) handling. CGR provides the possibility for advanced operations with reaction information and could be used for reaction descriptor calculation, structure−reactivity modeling, atom-to-atom mapping comparison and correction, reaction center extraction, reaction balancing, and some other related tasks. Unlike other popular libraries, CGRtools is fully written in Python with minor dependencies on other libraries and cross-platform. Reaction, molecule, and CGR objects in CGRtools support native Python methods and are comparable with the help of operations "equal to", "less than", and "bigger than". CGRtools supports common structural formats. CGRtools is distributed via an L-GPL license and available on https://github.com/cimmkzn/CGRtools.

show abstract

Atom‐to‐atom Mapping: A Benchmarking Study of Popular Mapping Algorithms and Consensus Strategies

Lin

Dyubankova

Madzhidov

et al. 2021

Molecular Informatics

View full text Add to dashboard Cite

In this paper, we compare the most popular Atom-to-Atom Mapping (AAM) tools: ChemAxon, [1] Indigo, [2] RDTool, [3] NameRXN (NextMove), [4] and RXNMapper [5] which implement different AAM algorithms. An open-source RDTool program was optimized, and its modified version ("new RDTool") was considered together with several consensus mapping strategies. The Condensed Graph of Reaction approach was used to calculate chemical distances and develop the "AAM fixer" algorithm for an automatized correction of erroneous mapping. The benchmarking calculations were performed on a Golden dataset containing 1851 manually mapped and curated reactions. The best performing RXNMapper program together with the AMM Fixer was applied to map the USPTO database. The Golden dataset, mapped USPTO and optimized RDTool are available in the GitHub repository https://github.com/Laboratoire-de-Chemoinformatique.

show abstract

Mapping of the Available Chemical Space versus the Chemical Universe of Lead‐Like Compounds

et al. 2018

View full text Add to dashboard Cite

This is, to our knowledge, the most comprehensive analysis to date based on generative topographic mapping (GTM) of fragment-like chemical space (40 million molecules with no more than 17 heavy atoms, both from the theoretically enumerated GDB-17 and real-world PubChem/ChEMBL databases). The challenge was to prove that a robust map of fragment-like chemical space can actually be built, in spite of a limited (≪10 ) maximal number of compounds ("frame set") usable for fitting the GTM manifold. An evolutionary map building strategy has been updated with a "coverage check" step, which discards manifolds failing to accommodate compounds outside the frame set. The evolved map has a good propensity to separate actives from inactives for more than 20 external structure-activity sets. It was proven to properly accommodate the entire collection of 40 m compounds. Next, it served as a library comparison tool to highlight biases of real-world molecules (PubChem and ChEMBL) versus the universe of all possible species represented by FDB-17, a fragment-like subset of GDB-17 containing 10 million molecules. Specific patterns, proper to some libraries and absent from others (diversity holes), were highlighted.

show abstract

Reaction Data Curation I: Chemical Structures and Transformations Standardization

Gimadiev

Lin

Afonina

et al. 2021

Molecular Informatics

View full text Add to dashboard Cite

The quality of experimental data for chemical reactions is a critical consideration for any reaction-driven study. However, the curation of reaction data has not been extensively discussed in the literature so far. Here, we suggest a 4 steps protocol that includes the curation of individual structures (reactants and products), chemical transformations, reaction conditions and endpoints. Its implementation in Python3 using CGRTools toolkit has been used to clean three popular reaction databases Reaxys, USPTO and Pistachio. The curated USPTO database is available in the GitHub repository (Laboratoire-de-Chemoinformatique/Reaction_Data_Cleaning).

show abstract

Prediction of Optimal Conditions of Hydrogenation Reaction Using the Likelihood Ranking Approach

Afonina

Mazitov

Nurmukhametova

et al. 2021

IJMS

View full text Add to dashboard Cite

The selection of experimental conditions leading to a reasonable yield is an important and essential element for the automated development of a synthesis plan and the subsequent synthesis of the target compound. The classical QSPR approach, requiring one-to-one correspondence between chemical structure and a target property, can be used for optimal reaction conditions prediction only on a limited scale when only one condition component (e.g., catalyst or solvent) is considered. However, a particular reaction can proceed under several different conditions. In this paper, we describe the Likelihood Ranking Model representing an artificial neural network that outputs a list of different conditions ranked according to their suitability to a given chemical transformation. Benchmarking calculations demonstrated that our model outperformed some popular approaches to the theoretical assessment of reaction conditions, such as k Nearest Neighbors, and a recurrent artificial neural network performance prediction of condition components (reagents, solvents, catalysts, and temperature). The ability of the Likelihood Ranking model trained on a hydrogenation reactions dataset, (~42,000 reactions) from Reaxys® database, to propose conditions that led to the desired product was validated experimentally on a set of three reactions with rich selectivity issues.

show abstract

Machine learning modelling of chemical reaction characteristics: yesterday, today, tomorrow

Madzhidov

Rakhimbekova

Afonina

et al. 2021

Mendeleev Communications

View full text Add to dashboard Cite

Chemoinformatics Meets Synthetic Medicinal Chemistry

Madzhidov¹,

Fatykhova²,

Afonina³

et al. 2021

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Valentina A. Afonina

CGRtools: Python Library for Molecule, Reaction, and Condensed Graph of Reaction Processing

Atom‐to‐atom Mapping: A Benchmarking Study of Popular Mapping Algorithms and Consensus Strategies

Mapping of the Available Chemical Space versus the Chemical Universe of Lead‐Like Compounds

Reaction Data Curation I: Chemical Structures and Transformations Standardization

Prediction of Optimal Conditions of Hydrogenation Reaction Using the Likelihood Ranking Approach

Machine learning modelling of chemical reaction characteristics: yesterday, today, tomorrow

Chemoinformatics Meets Synthetic Medicinal Chemistry

Contact Info

Product

Resources

About