2023
DOI: 10.1021/acsomega.2c07346
|View full text |Cite
|
Sign up to set email alerts
|

Cheminformatic Analysis and Machine Learning Modeling to Investigate Androgen Receptor Antagonists to Combat Prostate Cancer

Abstract: Prostate cancer (PCa) is a major leading cause of mortality of cancer among males. There have been numerous studies to develop antagonists against androgen receptor (AR), a crucial therapeutic target for PCa. This study is a systematic cheminformatic analysis and machine learning modeling to study the chemical space, scaffolds, structure−activity relationship, and landscape of human AR antagonists. There are 1678 molecules as final data sets. Chemical space visualization by physicochemical property visualizati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 10 publications
(6 citation statements)
references
References 45 publications
(77 reference statements)
0
6
0
Order By: Relevance
“…Descriptors exhibiting zero variance across the dataset were removed, as they do not contribute to discrimination between active and inactive compounds. Additionally, we applied a threshold of 0.95 for multicollinearity, removing descriptors that were highly correlated with each other [33]. This step aimed to retain only the most relevant and non-redundant descriptors for our classification task.…”
Section: Data Preparationmentioning
confidence: 99%
“…Descriptors exhibiting zero variance across the dataset were removed, as they do not contribute to discrimination between active and inactive compounds. Additionally, we applied a threshold of 0.95 for multicollinearity, removing descriptors that were highly correlated with each other [33]. This step aimed to retain only the most relevant and non-redundant descriptors for our classification task.…”
Section: Data Preparationmentioning
confidence: 99%
“…Next, we removed duplicate data and left 6157 compounds. To carry out the classification process, we construct a class variable by converting the IC50 value to pIC50, and if the pIC50 value < 6, then the compound is assigned to an inactive class, and if pIC50 ≥, then the compound is active [24]. Among the 6157 compounds, 3591 of them (58.32%) were classified as inactive, and 2566 compounds (41.68%) were categorized as active.…”
Section: Datasetmentioning
confidence: 99%
“…We used Mordred to derive 1661 2D-molecular descriptors for each AChE inhibitor compound. Molecular descriptors with high correlation (>0.95) and low variance (<0.1) were eliminated, leaving 280 molecular descriptors [24].…”
Section: Molecular Descriptorsmentioning
confidence: 99%
“…4 Quantitative structure−activity/property relationship (QSAR/QSPR) represents an in silico mathematical model used for predicting the bioactivities or properties of compounds based on their structures and physicochemical parameters. 5,6 QSAR/QSPR is fundamentally based on two key principles: (i) the structure of a compound determines its bioactivity/property and (ii) compounds with more similar structures exhibit more similar bioactivities or properties. 6 Particularly, structural information on compounds is represented using a range of molecular descriptors or fingerprints.…”
Section: Introductionmentioning
confidence: 99%
“…5,6 QSAR/QSPR is fundamentally based on two key principles: (i) the structure of a compound determines its bioactivity/property and (ii) compounds with more similar structures exhibit more similar bioactivities or properties. 6 Particularly, structural information on compounds is represented using a range of molecular descriptors or fingerprints. These descriptors play a crucial role in determining the robustness, generalization, and predictability.…”
Section: Introductionmentioning
confidence: 99%