2022
DOI: 10.31219/osf.io/4rtjs
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Listen to Interpret: Post-hoc Interpretability for Audio Networks with NMF

Abstract: This paper tackles post-hoc interpretability for audio processing networks. Our goal is to interpret decisions of a network in terms of high-level audio objects that are also listenable for the end-user. To this end, we propose a novel interpreter design that incorporates non-negative matrix factorization (NMF). In particular, a carefully regularized interpreter module is trained to take hidden layer representations of the targeted network as input and produce time activations of pre-learnt NMF components as i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(5 citation statements)
references
References 44 publications
0
5
0
Order By: Relevance
“…with a two-step optimization process [8]. In this work, we consider the sparse NMF implementation [24] as suggested in [7]. In our segmentation proxy model, the dictionary W is pre-learned while the activation H is extracted by a neural model Ψ and referred to as an embedding.…”
Section: Non-negative Matrix Factorization (Nmf)mentioning
confidence: 99%
See 4 more Smart Citations
“…with a two-step optimization process [8]. In this work, we consider the sparse NMF implementation [24] as suggested in [7]. In our segmentation proxy model, the dictionary W is pre-learned while the activation H is extracted by a neural model Ψ and referred to as an embedding.…”
Section: Non-negative Matrix Factorization (Nmf)mentioning
confidence: 99%
“…In this work, the f model is pre-trained with frozen weights and serves as a teacher for the proxy model. We use a similar approach as [7] where the proxy model is composed of two functions. Let Ψ be a function that maps a sequence of D-dimension feature vectors S ∈ R D×T to the embedding H ∈ R K×T…”
Section: Proxy Model Frameworkmentioning
confidence: 99%
See 3 more Smart Citations