2021
DOI: 10.1371/journal.pcbi.1009119
|View full text |Cite
|
Sign up to set email alerts
|

De novo mutational signature discovery in tumor genomes using SparseSignatures

Abstract: Cancer is the result of mutagenic processes that can be inferred from tumor genomes by analyzing rate spectra of point mutations, or “mutational signatures”. Here we present SparseSignatures, a novel framework to extract signatures from somatic point mutation data. Our approach incorporates a user-specified background signature, employs regularization to reduce noise in non-background signatures, uses cross-validation to identify the number of signatures, and is scalable to large datasets. We show that SparseS… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
54
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1

Relationship

4
1

Authors

Journals

citations
Cited by 26 publications
(54 citation statements)
references
References 39 publications
0
54
0
Order By: Relevance
“…While it can be omitted from the analysis, we suggest including it explicitly. The package provides two background signatures: the SBS5 signature extrapolated from the COSMIC database ( Lal et al., 2021 ), and a signature derived from the human germline mutation spectrum ( Rahbari et al., 2016 ). Import one of the available background signatures:
# To load the SBS5 signature from COSMIC data(background) # OR, for the human germline-derived signature data(background2)
…”
Section: Step-by-step Methods Detailsmentioning
confidence: 99%
See 4 more Smart Citations
“…While it can be omitted from the analysis, we suggest including it explicitly. The package provides two background signatures: the SBS5 signature extrapolated from the COSMIC database ( Lal et al., 2021 ), and a signature derived from the human germline mutation spectrum ( Rahbari et al., 2016 ). Import one of the available background signatures:
# To load the SBS5 signature from COSMIC data(background) # OR, for the human germline-derived signature data(background2)
…”
Section: Step-by-step Methods Detailsmentioning
confidence: 99%
“…Mutational signatures are discovered by analyzing ensemble point-mutation counts from a sample of individuals. Each individual record lists the count of mutations detected in the cancer sample, classified with respect to the trinucleotide contexts: the observed nucleotide substitution (C>A, C>G, C>T, T>A, T>C, or T>G) and the neighboring 5′ and 3′ nucleotides, for a total of 96 trinucleotide mutation types ( Lal et al., 2021 ). The counts are naturally summarized in a rectangular data-frame (an R dataframe, in our case) whose row records refer to the individual cancer samples and whose columns correspond to the 96 types.…”
Section: Before You Beginmentioning
confidence: 99%
See 3 more Smart Citations