This paper describes a simple method for estimating the aqueous solubility (ESOL--Estimated SOLubility) of a compound directly from its structure. The model was derived from a set of 2874 measured solubilities using linear regression against nine molecular properties. The most significant parameter was calculated logP(octanol), followed by molecular weight, proportion of heavy atoms in aromatic systems, and number of rotatable bonds. The model performed consistently well across three validation sets, predicting solubilities within a factor of 5-8 of their measured values, and was competitive with the well-established "General Solubility Equation" for medicinal/agrochemical sized molecules.
This work provides a comprehensive overview of agrochemical properties in terms of the way they change during progression from screen hit to product and in terms of their limits as expressed in modern commercial products. Most herbicides and fungicides readily meet the Lipinski 'rule
of five' criteria for drug-like compounds with many meeting the more constrained limits reported for pharmaceutical leads.
A method for assessing the biological discriminating power of chemical similarity measures is presented. The main concern of this work was to develop an objective way of evaluating different similarity measures in terms of how well they distinguished between active and inactive compounds. In addition, we have explored the level of similarity required for optimal separation and commented on its implications for work in the field of chemical diversity studies. The results for one simple similarity measure showed that statistically significant separation could be achieved, and indicated a reasonable similarity value for future work.
Figure 1: An efficient subspace re-simulation of novel fluid dynamics. This scene was generated an order of magnitude faster than the original. The solver itself, without velocity reconstruction ( §5), runs three orders of magnitude faster.
AbstractWe present a new subspace integration method that is capable of efficiently adding and subtracting dynamics from an existing highresolution fluid simulation. We show how to analyze the results of an existing high-resolution simulation, discover an efficient reduced approximation, and use it to quickly "re-simulate" novel variations of the original dynamics. Prior subspace methods have had difficulty re-simulating the original input dynamics because they lack efficient means of handling semi-Lagrangian advection methods. We show that multi-dimensional cubature schemes can be applied to this and other advection methods, such as MacCormack advection. The remaining pressure and diffusion stages can be written as a single matrix-vector multiply, so as with previous subspace methods, no matrix inversion is needed at runtime. We additionally propose a novel importance sampling-based fitting algorithm that asymptotically accelerates the precomputation stage, and show that the Iterated Orthogonal Projection method can be used to elegantly incorporate moving internal boundaries into a subspace simulation. In addition to efficiently producing variations of the original input, our method can produce novel, abstract fluid motions that we have not seen from any other solver.
This paper discusses the use of binary kernel discrimination (BKD) for identifying potential active compounds in lead-discovery programs. BKD was compared with established virtual screening methods in a series of experiments using pesticide data from the Syngenta corporate database. It was found to be superior to methods based on similarity searching and substructural analysis but inferior to a support vector machine. Similar conclusions resulted from application of the methods to a pesticide data set for which categorical activity data were available.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.