Abstract:High throughput screening (HTS) assesses compound libraries for “activity” using target assays. A subset of HTS data contains a large number of sample measurements replicated a small number of times providing an opportunity to introduce the distribution of standard deviations (DSD). Applying the DSD to some HTS data sets revealed signs of bias in some of the data and discovered a sub-population of compounds exhibiting high variability which may be difficult to screen. In the data examined, 21% of 1189 such com… Show more
“…Practitioners of HTS have learned to recognize a wide range of chemical and physical processes leading to apparent activity in HTS assays 13 . The science of active compound detection and corresponding statistical practice developed early 14,15 with many subsequent refinements [16][17][18][19] . The underpinning instrumental technologies of HTS have been influential for increasing the scale achievable in routine laboratory work and these technologies are widely deployed in the form of plate readers, lab robotics, and compound libraries accessible to researchers 20 .…”
mentioning
confidence: 99%
“…The public investment and consequential data sharing provide a record containing many large primary HTS data sets. A previous investigation into a small number of these discovered an identifiable set of active compounds having excess variance many of which contained PAIN motifs 18 . This work generated an interest in understanding statistical behavior in HTS at scale.…”
High throughput screening (HTS) interrogates compound libraries to find those that are “active” in an assay. To better understand compound behavior in HTS, we assessed an existing binomial survivor function (BSF) model of “frequent hitters” using 872 publicly available HTS data sets. We found large numbers of “infrequent hitters” using this model leading us to reject the BSF for identifying “frequent hitters.” As alternatives, we investigated generalized logistic, gamma, and negative binomial distributions as models for compound behavior. The gamma model reduced the proportion of both frequent and infrequent hitters relative to the BSF. Within this data set, conclusions about individual compound behavior were limited by the number of times individual compounds were tested (1–1613 times) and disproportionate testing of some compounds. Specifically, most tests (78%) were on a 309,847-compound subset (17.6% of compounds) each tested ≥ 300 times. We concluded that the disproportionate retesting of some compounds represents compound repurposing at scale rather than drug discovery. The approach to drug discovery represented by these 872 data sets characterizes the assays well by challenging them with many compounds while each compound is characterized poorly with a single assay. Aggregating the testing information from each compound across the multiple screens yielded a continuum with no clear boundary between normal and frequent hitting compounds.
“…Practitioners of HTS have learned to recognize a wide range of chemical and physical processes leading to apparent activity in HTS assays 13 . The science of active compound detection and corresponding statistical practice developed early 14,15 with many subsequent refinements [16][17][18][19] . The underpinning instrumental technologies of HTS have been influential for increasing the scale achievable in routine laboratory work and these technologies are widely deployed in the form of plate readers, lab robotics, and compound libraries accessible to researchers 20 .…”
mentioning
confidence: 99%
“…The public investment and consequential data sharing provide a record containing many large primary HTS data sets. A previous investigation into a small number of these discovered an identifiable set of active compounds having excess variance many of which contained PAIN motifs 18 . This work generated an interest in understanding statistical behavior in HTS at scale.…”
High throughput screening (HTS) interrogates compound libraries to find those that are “active” in an assay. To better understand compound behavior in HTS, we assessed an existing binomial survivor function (BSF) model of “frequent hitters” using 872 publicly available HTS data sets. We found large numbers of “infrequent hitters” using this model leading us to reject the BSF for identifying “frequent hitters.” As alternatives, we investigated generalized logistic, gamma, and negative binomial distributions as models for compound behavior. The gamma model reduced the proportion of both frequent and infrequent hitters relative to the BSF. Within this data set, conclusions about individual compound behavior were limited by the number of times individual compounds were tested (1–1613 times) and disproportionate testing of some compounds. Specifically, most tests (78%) were on a 309,847-compound subset (17.6% of compounds) each tested ≥ 300 times. We concluded that the disproportionate retesting of some compounds represents compound repurposing at scale rather than drug discovery. The approach to drug discovery represented by these 872 data sets characterizes the assays well by challenging them with many compounds while each compound is characterized poorly with a single assay. Aggregating the testing information from each compound across the multiple screens yielded a continuum with no clear boundary between normal and frequent hitting compounds.
“…2 A ). The average Z′ for the HTS was 0.81 with an overall hit rate of 0.01% where molecules with FP < 3× and the CV were considered hits ( 28 , 29 ) ( Fig. 2 A ).…”
Influenza hemagglutinin (HA) glycoprotein is the primary surface antigen targeted by the host immune response and a focus for development of novel vaccines, broadly neutralizing antibodies (bnAbs), and therapeutics. HA enables viral entry into host cells via receptor binding and membrane fusion and is a validated target for drug discovery. However, to date, only a very few bona fide small molecules have been reported against the HA. To identity new antiviral lead candidates against the highly conserved fusion machinery in the HA stem, we synthesized a fluorescence-polarization probe based on a recently described neutralizing cyclic peptide P7 derived from the complementarity-determining region loops of human bnAbs FI6v3 and CR9114 against the HA stem. We then designed a robust binding assay compatible with high-throughput screening to identify molecules with low micromolar to nanomolar affinity to influenza A group 1 HAs. Our simple, low-cost, and efficient in vitro assay was used to screen H1/Puerto Rico/8/1934 (H1/PR8) HA trimer against ∼72,000 compounds. The crystal structure of H1/PR8 HA in complex with our best hit compound F0045(S) confirmed that it binds to pockets in the HA stem similar to bnAbs FI6v3 and CR9114, cyclic peptide P7, and small-molecule inhibitor JNJ4796. F0045 is enantioselective against a panel of group 1 HAs and F0045(S) exhibits in vitro neutralization activity against multiple H1N1 and H5N1 strains. Our assay, compound characterization, and small-molecule candidate should further stimulate the discovery and development of new compounds with unique chemical scaffolds and enhanced influenza antiviral capabilities.
“…The 3-point method of kinetic analysis answers the current demand for new breakthroughs in the discovery of inhibitors and activators for targets of interest in pharmaceutical research. 33–35 Its first principles are those of the LM recently proposed by us as a new tool to detect enzymatic assay interferences, 22 with the main difference that LM curves are now used to detect “interferences” caused by candidate enzyme effectors. Using the screening for Atx3 77Q modulators as a practical example, major improvements in robustness are achieved through judicious elimination of outliers, by adopting advanced normalization methods, and by increasing the assay resilience to systematic and random variability.…”
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.