2020
DOI: 10.1038/s41598-020-67846-1
|View full text |Cite
|
Sign up to set email alerts
|

Genome-wide investigation of gene-cancer associations for the prediction of novel therapeutic targets in oncology

Abstract: A major cause of failed drug discovery programs is suboptimal target selection, resulting in the development of drug candidates that are potent inhibitors, but ineffective at treating the disease. In the genomics era, the availability of large biomedical datasets with genome-wide readouts has the potential to transform target selection and validation. In this study we investigate how computational intelligence methods can be applied to predict novel therapeutic targets in oncology. We compared different machin… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(14 citation statements)
references
References 34 publications
(37 reference statements)
0
14
0
Order By: Relevance
“…Li & Lai, 2007;Lin et al, 2019;Sikander et al, 2022;Sun et al, 2018;Zhu et al, 2009). Others only focus on a specific target or indication, such as oncology (Bazaga et al, 2020;de Falco et al, 2021;Dezső & Ceccarelli, 2020;Jeon et al, 2014), or ion channels (Huang et al, 2010). Restricting our focus to models which seek to assess the druggability of the entire proteome, we find that PINNED comfortably outperforms much of the prior literature in sensitivity, specificity, and AUC (Bull & Doig, 2015;Costa et al, 2010;Ferrero et al, 2017;Yao & Rzhetsky, 2008)…”
Section: Pinned Modelmentioning
confidence: 77%
“…Li & Lai, 2007;Lin et al, 2019;Sikander et al, 2022;Sun et al, 2018;Zhu et al, 2009). Others only focus on a specific target or indication, such as oncology (Bazaga et al, 2020;de Falco et al, 2021;Dezső & Ceccarelli, 2020;Jeon et al, 2014), or ion channels (Huang et al, 2010). Restricting our focus to models which seek to assess the druggability of the entire proteome, we find that PINNED comfortably outperforms much of the prior literature in sensitivity, specificity, and AUC (Bull & Doig, 2015;Costa et al, 2010;Ferrero et al, 2017;Yao & Rzhetsky, 2008)…”
Section: Pinned Modelmentioning
confidence: 77%
“…Finally, we averaged the AUC that is calculated for each fold for all ten folds. Furthermore, another evaluation setting used by (Bazaga et al, 2020), was also implemented to compare our results with this baseline method using their dataset and their procedure to perform a fair comparison, which is explained in more detail in the comparison section. Finally, we retrained our models using all positive and negative data samples in our datasets and then applied these models on new unseen test data to predict the labels of this new data (i.e., predict the novel therapeutic targets as positive data).…”
Section: Evaluation Protocolsmentioning
confidence: 99%
“…In [ 141 ], a DNN, along with the conventional ML models, utilized multi-omics data to predict the novel targets for a therapeutic treatment in the field of oncology. Starting from the lists of the genes that were either targeted by the FDA-approved drugs or those which, when they are mutated, may cause cancer, the researchers collected the data for the gene expression, the gene mutations (averaged over numerous patient samples for each cancer type), the gene essentiality (real-valued, mean sensitivity from knock-out experiments), and the gene interaction networks that were embedded via AE-based diffusion graphs [ 142 ].…”
Section: Omics Data and Deep Learningmentioning
confidence: 99%
“…In building a model to identify the venomous proteins from amino acid sequences [ 77 ], the researchers collected non-venomous samples by querying for the proteins that did not have the word “venom” on their metadata descriptions. In [ 141 ], the positive samples were data of genes that were targeted by drugs. For the negative samples, the researchers used a random subset, of the same size as the positive subset, of genes that were not listed as known therapeutic targets for FDA-approved drugs.…”
Section: Challengesmentioning
confidence: 99%