Quantitative Structure-Activity Relationship modeling is one of the major computational tools employed in medicinal chemistry. However, throughout its entire history it has drawn both praise and criticism concerning its reliability, limitations, successes, and failures. In this paper, we discuss: (i) the development and evolution of QSAR; (ii) the current trends, unsolved problems, and pressing challenges; and (iii) several novel and emerging applications of QSAR modeling. Throughout this discussion, we provide guidelines for QSAR development, validation, and application, which are summarized in best practices for building rigorously validated and externally predictive QSAR models. We hope that this Perspective will help communications between computational and experimental chemists towards collaborative development and use of QSAR models. We also believe that the guidelines presented here will help journal editors and reviewers apply more stringent scientific standards to manuscripts reporting new QSAR studies, as well as encourage the use of high quality, validated QSARs for regulatory decision making.
Some organic weak bases induce a detachment from inner lysosomal membranes and subsequent inactivation of acid sphingomyelinase (ASM) and thus work as functional ASM inhibitors. The aim of the present investigation was to develop a structure-property-activity relation (SPAR) model in order to specify the structural and physicochemical characteristics of probes capable of functionally inhibiting ASM. High p K a and high log P values are necessary but not sufficient preconditions for functional inhibition of ASM. The experimental data supported the requirement of an additional factor, which is necessary for functional inhibition of ASM. This factor k is related to the steric hindrance of the most basic nitrogen atom and presumably modulates the free presentation of a protonated nitrogen atom at the inner lysosomal surface. During the course of the study, we characterized 26 new functional ASM inhibitors, including doxepine 63, fluoxetine 104, maprotiline 109, nortriptyline 114, paroxetine 118, sertraline 124, suloctidil 125, and terfenadine 127.
We describe a hitherto unknown feature for 27 small drug-like molecules, namely functional inhibition of acid sphingomyelinase (ASM). These entities named FIASMAs (Functional Inhibitors of Acid SphingoMyelinAse), therefore, can be potentially used to treat diseases associated with enhanced activity of ASM, such as Alzheimer's disease, major depression, radiation- and chemotherapy-induced apoptosis and endotoxic shock syndrome. Residual activity of ASM measured in the presence of 10 µM drug concentration shows a bimodal distribution; thus the tested drugs can be classified into two groups with lower and higher inhibitory activity. All FIASMAs share distinct physicochemical properties in showing lipophilic and weakly basic properties. Hierarchical clustering of Tanimoto coefficients revealed that FIASMAs occur among drugs of various chemical scaffolds. Moreover, FIASMAs more frequently violate Lipinski's Rule-of-Five than compounds without effect on ASM. Inhibition of ASM appears to be associated with good permeability across the blood-brain barrier. In the present investigation, we developed a novel structure-property-activity relationship by using a random forest-based binary classification learner. Virtual screening revealed that only six out of 768 (0.78%) compounds of natural products functionally inhibit ASM, whereas this inhibitory activity occurs in 135 out of 2028 (6.66%) drugs licensed for medical use in humans.
Chemotypes are a new approach for representing molecules, chemical substructures and patterns, reaction rules, and reactions. Chemotypes are capable of integrating types of information beyond what is possible using current representation methods (e.g., SMARTS patterns) or reaction transformations (e.g., SMIRKS, reaction SMILES). Chemotypes are expressed in the XML-based Chemical Subgraphs and Reactions Markup Language (CSRML), and can be encoded not only with connectivity and topology but also with properties of atoms, bonds, electronic systems, or molecules. CSRML has been developed in parallel with a public set of chemotypes, i.e., the ToxPrint chemotypes, which are designed to provide excellent coverage of environmental, regulatory, and commercial-use chemical space, as well as to represent chemical patterns and properties especially relevant to various toxicity concerns. A software application, ChemoTyper has also been developed and made publicly available in order to enable chemotype searching and fingerprinting against a target structure set. The public ChemoTyper houses the ToxPrint chemotype CSRML dictionary, as well as reference implementation so that the query specifications may be adopted by other chemical structure knowledge systems. The full specifications of the XML-based CSRML standard used to express chemotypes are publicly available to facilitate and encourage the exchange of structural knowledge.
A variety of sesquiterpene lactones (SLs) possess considerable anti-inflammatory activity. Several studies have shown that they exert this effect in part by inhibiting the activation of the transcription factor NF-kappaB. In the present study we elaborated on the investigation of a data set of 103 structurally diverse SLs for which we had previously developed several different QSAR equations dependent on the skeletal type. Use of 3D structure descriptors resulted in a single model for the entire data set. In particular, local radial distribution functions (L-RDF) were used that centered on the methylene-carbonyl substructure believed to be the site of attack of cysteine-38 of the p65/NF-kappaB subunit. The model was developed by using a counterpropagation neural network (CPGNN), attesting to the power of this method for establishing structure-activity-relationships. The investigations shed more light onto the influence of the chemical structure on NF-kappaB inhibitory activity.
Neural networks are powerful data mining tools with a wide range of applications in drug design. This paper largely concentrates on self-organizing neural networks that can be used for investigating datasets both by unsupervised and by supervised learning. The representation of chemical structures is the key to success in establishing useful relationships. Applications are shown for exploring different structure representations, for establishing quantitative structure-activity relationships and for handling compounds having multicategory activities. The applications comprise the separation of compounds according to different biological activities, the location of biologically active compounds in large chemical spaces, the analysis of high-throughput screening data and the classification of compounds according to mode of toxic action.
ChemInform is a weekly Abstracting Service, delivering concise information at a glance that was extracted from about 200 leading journals. To access a ChemInform Abstract, please click on HTML or PDF.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.