2021
DOI: 10.7717/peerj-cs.584
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning analysis of TCGA cancer data

Abstract: In recent years, machine learning (ML) researchers have changed their focus towards biological problems that are difficult to analyse with standard approaches. Large initiatives such as The Cancer Genome Atlas (TCGA) have allowed the use of omic data for the training of these algorithms. In order to study the state of the art, this review is provided to cover the main works that have used ML with TCGA data. Firstly, the principal discoveries made by the TCGA consortium are presented. Once these bases have been… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
15
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 22 publications
(15 citation statements)
references
References 140 publications
0
15
0
Order By: Relevance
“…Recently, there have been many studies of models for analyzing omics data. Especially the success of ML methods in processing a large amount of data has revolutionized bioinformatics and conventional forms of genetic diagnosis [ 19 ].…”
Section: Discussionmentioning
confidence: 99%
“…Recently, there have been many studies of models for analyzing omics data. Especially the success of ML methods in processing a large amount of data has revolutionized bioinformatics and conventional forms of genetic diagnosis [ 19 ].…”
Section: Discussionmentioning
confidence: 99%
“…All of this data is publicly available for mining and analysis in the interest of discovering specific set of genetic markers and targets (TCGA Research Network, 2006). As expected, current TCGA analytics mirrors the experimental feat of collecting such a big data (Cheng, Dummer and Levesque, 2015;Liñares-Blanco, Pazos and Fernandez-Lozano, 2021). However, a definitive answer about the most adequate set of genes for diagnosis and therapy is yet to come.…”
Section: Introductionmentioning
confidence: 83%
“…For example, pattern recognition and data augmentation proved to be promising approaches to assist in generating accurate diagnoses from mammography images [ 53 , 54 ]. Transcriptome data were also employed to develop ML-based analysis pipelines for breast cancer subtyping, diagnosis, patient stratification and identification of altered pathways [ 55 ], and these techniques may improve the accuracy of cancer prognosis in the future. However, shortcomings must be taken into account, as applicable also to currently available breast cancer datasets.…”
Section: Discussionmentioning
confidence: 99%