Understanding the practical limitations of chemical reactions is critically important for efficiently planning the synthesis of compounds in pharmaceutical, agrochemical, and specialty chemical research and development. However, literature reports of the scope of new reactions are often cursory and biased toward successful results, severely limiting the ability to predict reaction outcomes for untested substrates. We herein illustrate strategies for carrying out large-scale surveys of chemical reactivity by using a material-sparing nanomole-scale automated synthesis platform with greatly expanded synthetic scope combined with ultrahigh-throughput matrix-assisted laser desorption/ionization-time-of-flight mass spectrometry (MALDI-TOF MS).
A critical survey of previously reported van der Waals parameters
for alkali metal cations and halide anions
is presented. A new set of force field parameters is proposed,
derived by fitting the experimental lattice
constants and lattice energies of 20 ionic alkali halide crystals.
These parameters are constrained to satisfy
two relationships connecting the ions with the isoelectronic noble
gasesthe relative van der Waals radii R*
and the coefficients of the London dispersion energies
C
6using the experimentally determined noble
gas
van der Waals parameters. In addition to reproducing physical
trends in common with atoms of isoelectronic
species, the present parameters predict more accurate crystal
structures and energies and, when combined
with a molecular force field for water, also quite accurate gas-phase
ion−water interaction energies and aqueous
solution structures compared to the computed results previously
reported by other authors.
A class II valence force field covering a broad range of organic molecules has been derived employing ab initio quantum mechanical "observables." The procedure includes selecting representative molecules and molecular structures, and systematically sampling their energy surfaces as described by energies and energy first and second derivatives with respect to molecular deformations. In this article the procedure for fitting the force field parameters to these energies and energy derivatives is briefly reviewed. The application of the methodology to the derivation of a class II quantum mechanical force field (QMFF) for 32 organic functional groups is then described. A training set of 400 molecules spanning the 32 functional groups was used to parameterize the force field. The molecular families comprising the functional groups and, within each family, the torsional angles used to sample different conformers, are described. The number of stationary points (equilibria and transition states) for these molecules is given for each functional group. This set contains 1324 stationary structures, with 718 minimum energy structures and 606 transition states. The quality of the fit to the quantum data is gauged based on the deviations between the ab initio and force field energies and energy derivatives. The accuracy with which the QMFF reproduces the ab initio molecular bond lengths, bond angles, torsional angles, vibrational frequencies, and conformational energies is then given for each functional group. Consistently good accuracy is found for these computed properties for the various types of molecules. This demonstrates that the methodology is broadly applicable for the derivation of force field parameters across widely differing types of molecular structures. Copyright 2001 John Wiley & Sons, Inc. J Comput Chem 22: 1782-1800, 2001
The reactivity of a representative set of 17 organozinc pivalates with 18 polyfunctional druglike electrophiles (informers) in Negishi cross-coupling reactions was evaluated by high-throughput experimentation protocols. The high-fidelity scaleup of successful reactions in parallel enabled the isolation of sufficient material for biological testing, thus demonstrating the high value of these new solid zinc reagents in a drug-discovery setting and potentially for many other applications in chemistry. Principal component analysis (PCA) clearly defined the independent roles of the zincates and the informers toward druggable-space coverage.
Pfizer Global Virtual Library (PGVL) of 10(13) readily synthesizable molecules offers a tremendous opportunity for lead optimization and scaffold hopping in drug discovery projects. However, mining into a chemical space of this size presents a challenge for the concomitant design informatics due to the fact that standard molecular similarity searches against a collection of explicit molecules cannot be utilized, since no chemical information system could create and manage more than 10(8) explicit molecules. Nevertheless, by accepting a tolerable level of false negatives in search results, we were able to bypass the need for full 10(13) enumeration and enabled the efficient similarity search and retrieval into this huge chemical space for practical usage by medicinal chemists. In this report, two search methods (LEAP1 and LEAP2) are presented. The first method uses PGVL reaction knowledge to disassemble the incoming search query molecule into a set of reactants and then uses reactant-level similarities into actual available starting materials to focus on a much smaller sub-region of the full virtual library compound space. This sub-region is then explicitly enumerated and searched via a standard similarity method using the original query molecule. The second method uses a fuzzy mapping onto candidate reactions and does not require exact disassembly of the incoming query molecule. Instead Basis Products (or capped reactants) are mapped into the query molecule and the resultant asymmetric similarity scores are used to prioritize the corresponding reactions and reactant sets. All sets of Basis Products are inherently indexed to specific reactions and specific starting materials. This again allows focusing on a much smaller sub-region for explicit enumeration and subsequent standard product-level similarity search. A set of validation studies were conducted. The results have shown that the level of false negatives for the disassembly-based method is acceptable when the query molecule can be recognized for exact disassembly, and the fuzzy reaction mapping method based on Basis Products has an even better performance in terms of lower false-negative rate because it is not limited by the requirement that the query molecule needs to be recognized by any disassembly algorithm. Both search methods have been implemented and accessed through a powerful desktop molecular design tool (see ref. (33) for details). The chapter will end with a comparison of published search methods against large virtual chemical space.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.