2018
DOI: 10.1101/393801
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep convolutional neural networks for accurate somatic mutation detection

Abstract: We present NeuSomatic, the first convolutional neural network approach for somatic mutation detection, which significantly outperforms previous methods on different sequencing platforms, sequencing strategies, and tumor purities. NeuSomatic summarizes sequence alignments into small matrices and incorporates more than a hundred features to capture mutation signals effectively. It can be used universally as a stand-alone somatic mutation detection method or with an ensemble of existing methods to achieve the hig… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
31
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 21 publications
(31 citation statements)
references
References 34 publications
0
31
0
Order By: Relevance
“…We used several different training models in our analysis. First, we used the already available model published recently 11 which was trained using in silico spike-ins from the DREAM Challenge Stage 3 dataset 13 . Despite the large discrepancy between the sample types, sequencing platforms, coverages, spike-in mutation frequencies, and heterogeneity of the samples used to train the DREAM3 model, this model outperformed other conventional techniques across the real cancer datasets of diverse characteristics by more than ∼4% by the mean F1-score averaged across different samples for both SNVs and INDELs ( Figure 1a ).…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…We used several different training models in our analysis. First, we used the already available model published recently 11 which was trained using in silico spike-ins from the DREAM Challenge Stage 3 dataset 13 . Despite the large discrepancy between the sample types, sequencing platforms, coverages, spike-in mutation frequencies, and heterogeneity of the samples used to train the DREAM3 model, this model outperformed other conventional techniques across the real cancer datasets of diverse characteristics by more than ∼4% by the mean F1-score averaged across different samples for both SNVs and INDELs ( Figure 1a ).…”
Section: Resultsmentioning
confidence: 99%
“…As the baseline WGS model we employed the DREAM3 model, developed recently 11 by training on ICGC-TCGA DREAM Challenge Stage 3 data 14 . The Stage 3 dataset consists of a normal sample and a tumor sample constructed by computationally spiking 7,903 SNVs and 7,604 INDELs mutations into a healthy genome of the same normal sample with three different AFs of 50%, 33%, and 20% to create synthetic but realistic tumoral normal pairs.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations