2024
DOI: 10.1021/acs.jcim.3c01753
|View full text |Cite
|
Sign up to set email alerts
|

COATI: Multimodal Contrastive Pretraining for Representing and Traversing Chemical Space

Benjamin Kaufman,
Edward C. Williams,
Carl Underkoffler
et al.

Abstract: Creating a successful small molecule drug is a challenging multiparameter optimization problem in an effectively infinite space of possible molecules. Generative models have emerged as powerful tools for traversing data manifolds composed of images, sounds, and text and offer an opportunity to dramatically improve the drug discovery and design process. To create generative optimization methods that are more useful than brute-force molecular generation and filtering via virtual screening, we propose that four i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 95 publications
0
2
0
Order By: Relevance
“…For example, this can be used to fit interatomic potentials to experimentally obtainable structural data . It is also an important component of modern generative models for drug discovery . Finally, this idea is also being applied to DFT functional development, where all of the key parameters used to define a DFT functional are treated as differentiable parameters that can be optimized with respect to some loss …”
Section: End-to-end Differentiabilitymentioning
confidence: 99%
“…For example, this can be used to fit interatomic potentials to experimentally obtainable structural data . It is also an important component of modern generative models for drug discovery . Finally, this idea is also being applied to DFT functional development, where all of the key parameters used to define a DFT functional are treated as differentiable parameters that can be optimized with respect to some loss …”
Section: End-to-end Differentiabilitymentioning
confidence: 99%
“…The ability to learn meaningful and useful representations of data is a key challenge toward applying machine learning (ML) and artificial intelligence (AI) to various real-world problems. On one hand, a useful representation extracts and organizes the discriminative information from data to support effective ML for downstream tasks. In the realm of chemistry, such representations offer promising avenues for enhancing predictions of molecular properties, forecasting reactions, predicting pharmacological activities, facilitating exploration of vast chemical spaces, and accelerating computational simulations. , On the other hand, a meaningful representation is often more interpretable, which is fundamental to helping humans trust AI. Recently, a variety of representation learning methods have been proposed based on the idea of autoencodinglearning a mapping from high dimensional data to a low dimensional representation or latent space which is able to approximately reconstruct the original data .…”
Section: Introductionmentioning
confidence: 99%