We have investigated techniques for distinguishing between drugs and nondrugs using a set of molecular descriptors derived from semiempirical molecular orbital (AM1) calculations. The "drug" data set of 2105 compounds was derived from the World Drug Index (WDI) using a procedure designed to select real drugs. The "nondrug" data set was the Maybridge database. We have first investigated the dimensionality of physical properties space based on a set of 26 descriptors that we have used successfully to build absorption, distribution, metabolism, and excretion-related quantitative structure-property relationship models. We discuss the general nature of the descriptors for physical property space and the ability of these descriptors to distinguish between drugs and nondrugs. The third most significant principal component of this set of descriptors serves as a useful numerical index of drug-likeness, but no others are able to distinguish between drugs and nondrugs. We have therefore extended our set of descriptors to a total of 66 and have used recursive partitioning to identify the descriptors that can distinguish between drugs and nondrugs. This procedure pointed to two of the descriptors that play an important role in the principal component found above and one more from the set of 40 extra descriptors. These three descriptors were then used to train a Kohonen artificial neural net for the entire Maybridge data set. Projecting the drug database onto the map obtained resulted in a clear distinction not only between drugs and nondrugs but also, for instance, between hormones and other drugs. Projection of 42 131 compounds from the WDI onto the Kohonen map also revealed pronounced clustering in the regions of the map assigned as druglike.
The first large scale analysis of in vitro absorption, distribution, metabolism, excretion, and toxicity (ADMET) data shared across multiple major pharma has been performed. Using advanced matched molecular pair analysis (MMPA), we combined data from three pharmaceutical companies and generated ADMET rules, avoiding the need to disclose the full chemical structures. On top of the very large exchange of knowledge, all companies involved synergistically gained approximately 20% more rules from the shared transformations. There is good quantitative agreement between the rules based on shared data compared to both individual companies' rules and rules published in the literature. Known correlations between log D, solubility, in vitro clearance, and plasma protein binding also hold in transformation space, but there are also interesting exceptions. Data pools such as this allow focusing on particular functional groups and characterizing their ADMET profile. Finally the role of a corpus of robustly tested medicinal chemistry knowledge in the training of medicinal chemistry is discussed.
Despite the widespread and increasing usage of Pd-catalyzed C-N cross couplings, finding good conditions for these reactions can be challenging. Practitioners mostly rely on few methodology studies or anecdotal experience....
International audienceThe contribution deals with timestepping schemes for nonsmooth dynamical systems. Traditionally, these schemes are locally of integration order one, both in non-impulsive and impulsive periods. This is inefficient for applications with infinitely many events but large non-impulsive phases like circuit breakers, valve trains or slider-crank mechanisms. To improve the behaviour during non-impulsive episodes, we start activities twofold. First, we include the classic schemes in time discontinuous Galerkin methods. Second, we split non-impulsive and impulsive force propagation. The correct mathematical setting is established with mollifier functions, Clenshaw-Curtis quadrature rules and an appropriate impact representation. The result is a Petrov-Galerkin distributional differential inclusion. It defines two Runge-Kutta collocation families and enables higher integration order during non-impulsive transition phases. As the framework contains the classic Moreau-Jean timestepping schemes for constant ansatz and test functions on velocity level, it can be considered as a consistent enhancement. An experimental convergence analysis with the bouncing ball example illustrates the capabilities
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.