2023
DOI: 10.1002/minf.202200227
|View full text |Cite
|
Sign up to set email alerts
|

Overproduce and select, or determine optimal molecular descriptor subset via configuration space optimization? Application to the prediction of ecotoxicological endpoints

Abstract: Predicting the likely biological activity (or property) of compounds is a fundamental and challenging task in the drug discovery process. Current computational methodologies aim to improve their predictive accuracies by using deep learning (DL) approaches. However, non‐DL based approaches for small‐ and medium‐sized chemical datasets have demonstrated to be most suitable for. In this approach, an initial universe of molecular descriptors (MDs) is first calculated, then different feature selection algorithms ar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 92 publications
(116 reference statements)
0
1
0
Order By: Relevance
“…An informed threshold selection and/or descriptor pre-selection by experts with domain knowledge (e.g., in chromatography) can reduce the number of descriptors but will rely on personal experience and may be unintentionally biased. Therefore, a more rational definition of thresholds and descriptors would be helpful in this context as was also recently discussed in the context of small molecules and when comparing self-supervised and manually selected features [46] , [47] .…”
Section: Descriptor Availability Redundancy and Applicabilitymentioning
confidence: 99%
“…An informed threshold selection and/or descriptor pre-selection by experts with domain knowledge (e.g., in chromatography) can reduce the number of descriptors but will rely on personal experience and may be unintentionally biased. Therefore, a more rational definition of thresholds and descriptors would be helpful in this context as was also recently discussed in the context of small molecules and when comparing self-supervised and manually selected features [46] , [47] .…”
Section: Descriptor Availability Redundancy and Applicabilitymentioning
confidence: 99%