Gkmexplain: Fast and Accurate Interpretation of Nonlinear Gapped <i>k</i>-mer SVMs Using Integrated Gradients

Shrikumar, Avanti; Prakash, Eva; Kundaje, Anshul

doi:10.1101/457606

Cited by 5 publications

(3 citation statements)

References 8 publications

(16 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For example, one research scientist noted that, "Many [financial institutions] use kernel-based methods on tabular data. " As a result, there is a desire to translate explainability techniques for kernel support vector machines in genomics [58] to models trained on tabular data.…”

Section: Beyond Deepmentioning

confidence: 99%

Explainable machine learning in deployment

Bhatt

Xiang

Sharma

et al. 2020

Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency

392

283

View full text Add to dashboard Cite

Section: Beyond Deepmentioning

confidence: 99%

Explainable machine learning in deployment

Bhatt

Xiang

Sharma

et al. 2020

Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency

392

283

View full text Add to dashboard Cite

“…This method yields more interpretable results than individually visualizing convolutional filters, as such filters often learn distributed representations of sequence features. More detail can be found in the respective paper and in Shrikumar et al [ 53 ] and Avsec et al [ 54 ].…”

Section: Methodsmentioning

confidence: 99%

Predicting mean ribosome load for 5’UTR of any length using deep learning

2021

View full text Add to dashboard Cite

The 5’ untranslated region plays a key role in regulating mRNA translation and consequently protein abundance. Therefore, accurate modeling of 5’UTR regulatory sequences shall provide insights into translational control mechanisms and help interpret genetic variants. Recently, a model was trained on a massively parallel reporter assay to predict mean ribosome load (MRL)—a proxy for translation rate—directly from 5’UTR sequence with a high degree of accuracy. However, this model is restricted to sequence lengths investigated in the reporter assay and therefore cannot be applied to the majority of human sequences without a substantial loss of information. Here, we introduced frame pooling, a novel neural network operation that enabled the development of an MRL prediction model for 5’UTRs of any length. Our model shows state-of-the-art performance on fixed length randomized sequences, while offering better generalization performance on longer sequences and on a variety of translation-related genome-wide datasets. Variant interpretation is demonstrated on a 5’UTR variant of the gene HBB associated with beta-thalassemia. Frame pooling could find applications in other bioinformatics predictive tasks. Moreover, our model, released open source, could help pinpoint pathogenic genetic variants.

show abstract

“…For example, one research scientist noted that, "Many [financial institutions] use kernel-based methods on tabular data. " As a result, there is a desire to translate explainability techniques for kernel support vector machines for genomics [54] to models trained on tabular data.…”

Section: Beyond Deep Learningmentioning

confidence: 99%

Explainable Machine Learning in Deployment

Bhatt¹,

Xiang²,

Sharma³

et al. 2019

Preprint

View full text Add to dashboard Cite

Explainable machine learning seeks to provide various stakeholders with insights into model behavior via feature importance scores, counterfactual explanations, and influential samples, among other techniques. Recent advances in this line of work, however, have gone without surveys of how organizations are using these techniques in practice. This study explores how organizations view and use explainability for stakeholder consumption. We find that the majority of deployments are not for end users affected by the model but for machine learning engineers, who use explainability to debug the model itself. There is a gap between explainability in practice and the goal of public transparency, since explanations primarily serve internal stakeholders rather than external ones. Our study synthesizes the limitations with current explainability techniques that hamper their use for end users. To facilitate end user interaction, we develop a framework for establishing clear goals for explainability, including a focus on normative desiderata.

show abstract

Gkmexplain: Fast and Accurate Interpretation of Nonlinear Gapped k-mer SVMs Using Integrated Gradients

Cited by 5 publications

References 8 publications

Explainable machine learning in deployment

Explainable machine learning in deployment

Predicting mean ribosome load for 5’UTR of any length using deep learning

Explainable Machine Learning in Deployment

Contact Info

Product

Resources

About