2022
DOI: 10.1016/j.bmc.2022.116808
|View full text |Cite
|
Sign up to set email alerts
|

DeepAS – Chemical language model for the extension of active analogue series

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 6 publications
(10 citation statements)
references
References 23 publications
0
10
0
Order By: Relevance
“…20,21 MMS are defined as series of compounds that are only distinguished by a chemical modification at a single site 20 and can be identified using a matched molecular pair (MMP) algorithm variant. 21 For the original development of DeepAS, 6 104 627 MMS from 2195 target-based compound activity classes were extracted from ChEMBL. 22 From these MMS, all R-groups were systematically isolated, yielding 3852 unique R-groups, which represented all R-groups occurring in AS covering the entire target space of active compounds.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…20,21 MMS are defined as series of compounds that are only distinguished by a chemical modification at a single site 20 and can be identified using a matched molecular pair (MMP) algorithm variant. 21 For the original development of DeepAS, 6 104 627 MMS from 2195 target-based compound activity classes were extracted from ChEMBL. 22 From these MMS, all R-groups were systematically isolated, yielding 3852 unique R-groups, which represented all R-groups occurring in AS covering the entire target space of active compounds.…”
Section: Methodsmentioning
confidence: 99%
“…The length of a sentence was consistently set to 35 tokens and the total number of label tokens amounted to 3855, including 3852 unique R-groups extracted from the qualifying MMS plus the three special tokens. 6…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Another CLM was developed for lead optimization in medicinal chemistry. This model was trained to iteratively extend analogue series with new potent compounds [35]. For this purpose, analogue series can be perceived as sequences of R‐groups attached to an invariant core structure, which provides the conceptual basis for series extension via CLMs.…”
Section: Exemplary Chemical Language Modelsmentioning
confidence: 99%
“…New R‐groups were then predicted by the dense layer of the model based on conditional probabilities derived from preceding R‐groups. Given the ordering of training series, iterative extension of analogue series using DeepAS was implicitly directed towards compounds with increasing potency, and the model correctly reproduced potent analogues for many test series [35].…”
Section: Exemplary Chemical Language Modelsmentioning
confidence: 99%