High-throughput screening has become a mainstay of small-molecule probe and early drug discovery. The question of how to build and evolve efficient screening collections systematically for cellbased and biochemical screening is still unresolved. It is often assumed that chemical structure diversity leads to diverse biological performance of a library. Here, we confirm earlier results showing that this inference is not always valid and suggest instead using biological measurement diversity derived from multiplexed profiling in the construction of libraries with diverse assay performance patterns for cell-based screens. Rather than using results from tens or hundreds of completed assays, which is resource intensive and not easily extensible, we use high-dimensional image-based cell morphology and gene expression profiles. We piloted this approach using over 30,000 compounds. We show that small-molecule profiling can be used to select compound sets with high rates of activity and diverse biological performance.chemical diversity | biological performance diversity | biological activity | chemical similarity P rofiling small molecules based on multiple biological activity measurements can illuminate mechanisms of action by comparing profiles with compounds whose mechanisms of action are known (1-5). Here, we describe a previously unidentified use of small-molecule profiling-enabling the creation of activityenriched and performance-diverse compound libraries for smallmolecule probe and drug discovery.Biochemical and cell-based high-throughput screening (HTS) is routinely used to discover novel bioactive molecules through unbiased testing of up to several million compounds per screen (6). However, despite ongoing advances in throughput, compound libraries will always represent only a small fraction of all relevant compounds theoretically accessible through chemical synthesis (a concept often referred to as "chemical space") (7). Library composition therefore presents a strong source of bias and potential limitation for any screening endeavor.There is little dissent about the notion that a good screening collection should yield many high-quality hits for a wide range of biological targets or phenotypes. In other words, it should be enriched for bioactive compounds and have high biological performance diversity. A high percentage of compounds lacking any activity will contribute to high cost and low performance of a high-throughput screen. A practical example is a compound collection containing a high percentage of compounds that fail to penetrate cell membranes-such a library will be unlikely to perform effectively in a cell-based HTS exploring an intracellular process. Similarly, a screening collection of compounds with highly redundant biological activities will be less efficient than an equally sized library with diverse performance (Fig. 1). A systematic path to reach these goals, however, remains elusive. One common practice is analyzing structural features of compounds to maximize chemical structural diversity. Ho...