2020
DOI: 10.3389/fgene.2020.00674
|View full text |Cite
|
Sign up to set email alerts
|

Predicting Cancer Tissue-of-Origin by a Machine Learning Method Using DNA Somatic Mutation Data

Abstract: Patients with carcinoma of unknown primary (CUP) account for 3–5% of all cancer cases. A large number of metastatic cancers require further diagnosis to determine their tissue of origin. However, diagnosis of CUP and identification of its primary site are challenging. Previous studies have suggested that molecular profiling of tissue-specific genes could be useful in inferring the primary tissue of a tumor. The purpose of this study was to evaluate the performance somatic mutations detected in a tumor to ident… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6

Relationship

2
4

Authors

Journals

citations
Cited by 10 publications
(13 citation statements)
references
References 56 publications
(56 reference statements)
0
9
0
Order By: Relevance
“…Considering the sequencing covers more than thousands of genes, but most of genes did not contain informative mutations thus making classification difficult by analyzing all those genes [8] , [9] . In order to avoid the mutation data being too sparse (even all zero), most analytical methods screen genes before classification [10] , [11] . These methods are simple and effective in some cases, but important features (genes) may be removed during the screening process.…”
Section: Introductionmentioning
confidence: 99%
“…Considering the sequencing covers more than thousands of genes, but most of genes did not contain informative mutations thus making classification difficult by analyzing all those genes [8] , [9] . In order to avoid the mutation data being too sparse (even all zero), most analytical methods screen genes before classification [10] , [11] . These methods are simple and effective in some cases, but important features (genes) may be removed during the screening process.…”
Section: Introductionmentioning
confidence: 99%
“…While previous models based on transcriptomic data have been attempted for neoplastic classification, these have typically employed a single neural layer and have demonstrated modest performance metrics 25 , 26 . The architecture of our model incorporates five feed forward layers, which affords increased accuracy and a greater number of classifiers.…”
Section: Discussionmentioning
confidence: 99%
“…Using genomic profiling data as an example, bioinformatics and ML algorithms are applied to score and rank the most relevant genes for creating tumor–gene associations and constructing TOO classifiers. Several ML algorithms to identify the TOO of CUP have been applied in this context [ 28 , 29 , 33 , 42 , 44 , 45 , 52–54 , 57 , 58 , 64 , 73 , 96 ] ( Supplementary Figure 3 and Table 2 ). These associations are subsequently assessed through independent validation sets, and the classifier’s efficacy is further verified with challenging clinical cases.…”
Section: Main Textmentioning
confidence: 99%