2018
DOI: 10.1101/457606
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Gkmexplain: Fast and Accurate Interpretation of Nonlinear Gapped k-mer SVMs Using Integrated Gradients

Abstract: Support VectorMachines with gapped k-mer kernels (gkm-SVMs) have been used to learn predictive models of regulatory DNA sequence. However, interpreting predictive sequence patterns learned by gkm-SVMs can be challenging. Existing interpretation methods such as deltaSVM, in-silico mutagenesis (ISM), or SHAP either do not scale well or make limiting assumptions about the model that can produce misleading results when the gkm kernel is combined with nonlinear kernels. Here, we propose gkmexplain: a novel approach… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 8 publications
(16 reference statements)
0
3
0
Order By: Relevance
“…For example, one research scientist noted that, "Many [financial institutions] use kernel-based methods on tabular data. " As a result, there is a desire to translate explainability techniques for kernel support vector machines in genomics [58] to models trained on tabular data.…”
Section: Beyond Deepmentioning
confidence: 99%
“…For example, one research scientist noted that, "Many [financial institutions] use kernel-based methods on tabular data. " As a result, there is a desire to translate explainability techniques for kernel support vector machines in genomics [58] to models trained on tabular data.…”
Section: Beyond Deepmentioning
confidence: 99%
“…This method yields more interpretable results than individually visualizing convolutional filters, as such filters often learn distributed representations of sequence features. More detail can be found in the respective paper and in Shrikumar et al [ 53 ] and Avsec et al [ 54 ].…”
Section: Methodsmentioning
confidence: 99%
“…For example, one research scientist noted that, "Many [financial institutions] use kernel-based methods on tabular data. " As a result, there is a desire to translate explainability techniques for kernel support vector machines for genomics [54] to models trained on tabular data.…”
Section: Beyond Deep Learningmentioning
confidence: 99%