2021
DOI: 10.48550/arxiv.2110.11875
Preprint

GeneDisco: A Benchmark for Experimental Design in Drug Discovery

Abstract: In vitro cellular experimentation with genetic interventions, using for example CRISPR technologies, is an essential step in early-stage drug discovery and target validation that serves to assess initial hypotheses about causal associations between biological mechanisms and disease pathologies. With billions of potential hypotheses to test, the experimental design space for in vitro genetic experiments is extremely vast, and the available experimental capacity - even at the largest research institutions in the …

Cited by 4 publications (12 citation statements)
References 51 publications
“…However, while our benchmark contains many large data sets, it cannot cover all possible application scenarios that the considered BMDAL methods could be applied to. For example, it is unclear whether our insights can be transferred to applications like drug discovery (Mehrjou et al, 2021) or atomistic ML (Zaverkin and Kästner, 2021), where other types of data and other NNs are employed. Even in the tabular data setting, the relevance of our results for smaller data sets or recently proposed NN architectures (e.g.…”
Section: Discussion (mentioning)
confidence: 99%
“…Zaverkin and Kästner (2021) work with atomistic data sets, which require specialized NN architectures and longer training times, and are therefore less well-suited for a large-scale benchmark. Recently, a benchmark for BMDAL for drug discovery has been proposed, which uses four counterfactual regression data sets (Mehrjou et al, 2021).…”
Section: Related Work (mentioning)
confidence: 99%
“…Note that φ is simply the identity function when computing distances in the ambient space. The coreset approach, as defined in [12], would be classified in this category in PyRelationAL, where the latent space representation of a point is defined by the associated penultimate activations in a neural network.…”
Section: A2 Diversity-based Informativeness Scores (mentioning)
confidence: 99%
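
The statement above describes scoring points by distance under a feature map φ, with φ as the identity in the ambient space or as a network's penultimate activations in a latent space. A minimal sketch of that idea follows, using a greedy k-center heuristic that is commonly associated with coreset-style selection. It is an illustration under assumptions, not the PyRelationAL API and not necessarily the exact procedure of [12]: the names `k_center_greedy`, `identity`, and the `phi` parameter are hypothetical, and Euclidean distance is assumed.

```python
import math
from typing import Callable, List, Sequence

Point = Sequence[float]


def identity(x: Point) -> Point:
    """phi as the identity: distances are computed in the ambient space."""
    return x


def k_center_greedy(
    pool: List[Point],
    labelled: List[Point],
    budget: int,
    phi: Callable[[Point], Point] = identity,
) -> List[int]:
    """Return up to `budget` pool indices (hypothetical helper).

    Each step picks the pool point whose phi-representation is farthest
    from its nearest labelled or already-picked point.
    """
    pool_z = [phi(x) for x in pool]
    centers = [phi(x) for x in labelled]
    # Distance from every pool point to its nearest center so far
    # (infinite when there are no labelled points yet).
    min_dist = [
        min((math.dist(z, c) for c in centers), default=math.inf)
        for z in pool_z
    ]
    chosen: List[int] = []
    for _ in range(min(budget, len(pool))):
        idx = max(range(len(pool)), key=lambda i: min_dist[i])
        chosen.append(idx)
        # A new center can only shrink nearest-center distances.
        min_dist = [
            min(d, math.dist(z, pool_z[idx]))
            for d, z in zip(min_dist, pool_z)
        ]
        # Mark the picked point so it is never selected again.
        min_dist[idx] = -1.0
    return chosen
```

Passing a different `phi`, for example a function that returns a model's penultimate-layer activations for a point, switches the same selection rule from the ambient space to a latent space without changing the rest of the code.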
“…Recently, there has been a renewed interest in AL, especially in combination with advances in deep learning, to address various real-world applications [8,9]. Notably, applications in medicine have seen growing interest both from academic and industrial institutes [2,3,10,11,12].…”
Section: Introduction (mentioning)
confidence: 99%