2018
DOI: 10.1186/s13321-017-0256-5
|View full text |Cite
|
Sign up to set email alerts
|

An automated framework for QSAR model building

Abstract: BackgroundIn-silico quantitative structure–activity relationship (QSAR) models based tools are widely used to screen huge databases of compounds in order to determine the biological properties of chemical molecules based on their chemical structure. With the passage of time, the exponentially growing amount of synthesized and known chemicals data demands computationally efficient automated QSAR modeling tools, available to researchers that may lack extensive knowledge of machine learning modeling. Thus, a full… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
75
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 97 publications
(76 citation statements)
references
References 86 publications
1
75
0
Order By: Relevance
“…Nantasenamat et al [257] reported the development of an automated data mining software for QSAR modeling called AutoWeka that is based on the machine learning software Weka [258]. Kausar and Falcao [259] presents an automated framework based on KNIME for QSAR modeling entailing data pre-processing, model building and validation. Dong et al [260] introduced an online platform for QSAR modeling known as ChemSAR that is capable of handling chemical structures, computing molecular descriptors, model building as well as producing result plots.…”
Section: Workflows For Computational Drug Discoverymentioning
confidence: 99%
“…Nantasenamat et al [257] reported the development of an automated data mining software for QSAR modeling called AutoWeka that is based on the machine learning software Weka [258]. Kausar and Falcao [259] presents an automated framework based on KNIME for QSAR modeling entailing data pre-processing, model building and validation. Dong et al [260] introduced an online platform for QSAR modeling known as ChemSAR that is capable of handling chemical structures, computing molecular descriptors, model building as well as producing result plots.…”
Section: Workflows For Computational Drug Discoverymentioning
confidence: 99%
“…In addition, we tested the workflow by developing four case studies for the prediction of antiplasmodial and antischistosomal activity, as well as cardiotoxicity and mutagenicity. Previously, other groups have proposed automated 26 and semi-automated 27 KNIME workflows for the development of ML models and cheminformatics analysis. Similar workflows 28 using the commercial Pipeline Pilot 29 software have also been published.…”
Section: Automated Frameworkmentioning
confidence: 99%
“…The Dataset containing 4,661 compounds were curated using the nM (nanomolar) activity unit by removing the empty activity value, salt, small fragments. The molecular structures were then normalized, followed by final checking for compound duplication [14,37]. The remaining 3,933 compounds were then used for the regression modeling dataset.…”
Section: Datasetmentioning
confidence: 99%
“…For the QSAR classification models 4,355 compounds were used from the ChEMBL database with a scientific literature filter. The missing value and duplication of the dataset were curated, the salt and small fragments were removed, the molecular structure was normalized, and the duplication was removed [14,37], leaving 3,740 compounds. Furthermore, the data are classified into active and inactive compounds that used for modeling, with pIC50 activity above 7.5 was active compounds, compounds with pIC50 below 6 were inactive compounds, and compounds with pIC50 between 7.5 and 6 were grey compounds that removed [28].…”
Section: Datasetmentioning
confidence: 99%
See 1 more Smart Citation