2013
DOI: 10.1371/journal.pone.0061007
|View full text |Cite
|
Sign up to set email alerts
|

Comparability of Mixed IC50 Data – A Statistical Analysis

Abstract: The biochemical half maximal inhibitory concentration (IC50) is the most commonly used metric for on-target activity in lead optimization. It is used to guide lead optimization, build large-scale chemogenomics analysis, off-target activity and toxicity models based on public data. However, the use of public biochemical IC50 data is problematic, because they are assay specific and comparable only under certain conditions. For large scale analysis it is not feasible to check each data entry manually and it is ve… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

13
315
1
2

Year Published

2013
2013
2023
2023

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 247 publications
(342 citation statements)
references
References 18 publications
13
315
1
2
Order By: Relevance
“…Bioactivity data from ChEMBL was limited to four different constants: K D , K i , IC 50 , and EC 50 . It has been shown previously that K i and IC 50 can be combined for modeling (55). In order to check if these constants could be used interchangeably, a statistical analysis of their pChEMBL values was performed.…”
Section: Data Selectionmentioning
confidence: 99%
“…Bioactivity data from ChEMBL was limited to four different constants: K D , K i , IC 50 , and EC 50 . It has been shown previously that K i and IC 50 can be combined for modeling (55). In order to check if these constants could be used interchangeably, a statistical analysis of their pChEMBL values was performed.…”
Section: Data Selectionmentioning
confidence: 99%
“…[45] Currently it is almost impossible to check if two IC 50 values from different publications are comparable and thus using mixed IC 50 data introduces more error to chemogenomic modelling than using K i data only. [46] 3.5 Improperly Archived Original Data A smaller, but nevertheless important issue is the lack of standards for proper scientific data archiving. A publication might make use of a third party data source that is no longer available, rendering the study irreproducible.…”
Section: Insufficient Assay Descriptionsmentioning
confidence: 99%
“…Such value pairs can be identified by looking at the values from different publications which differ only by a rounding margin (which will always be less than 0.17 log units). [31,46] An example in ChEMBL is the K i measurement for compound ID CHEMBL282021 against the homo sapiens carbonic anhydrase II (Target ID CHEMBL205). The value of 3.1 nM measured from Ilies et al [54] is cited and rounded to 3 nM by Vullo et al [55] It is important to remember that even if measurements have been replicated with very similar values in a different lab, the result might still be an experimental artefact, for example caused by aggregation.…”
Section: And5mentioning
confidence: 99%
“…For instance, binding constants from different laboratories showed an average deviation of approximately 0.5 log units, which translates to a factor of about 3 by which the measurements differ [1][2][3]. Surprisingly, this is just the average.…”
Section: Introductionmentioning
confidence: 98%
“…Data sets extracted from public databases also critically depend on the initial data quality [1][2][3]. The data sets can be curated to some extent by removing implausible data records (i.e.…”
Section: Introductionmentioning
confidence: 99%