2013 IEEE International Conference on Acoustics, Speech and Signal Processing 2013
DOI: 10.1109/icassp.2013.6638240
|View full text |Cite
|
Sign up to set email alerts
|

Non-negative dynamical system with application to speech and audio

Abstract: Non-negative data arise in a variety of important signal processing domains, such as power spectra of signals, pixels in images, and count data. This paper introduces a novel non-negative dynamical system (NDS) for sequences of such data, and describes its application to modeling speech and audio power spectra. The NDS model can be interpreted both as an adaptation of linear dynamical systems (LDS) to non-negative data, and as an extension of non-negative matrix factorization (NMF) to support Markovian dynamic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
51
0

Year Published

2014
2014
2018
2018

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 43 publications
(51 citation statements)
references
References 21 publications
0
51
0
Order By: Relevance
“…At testing, the proposed method, like standard NMF approaches, treats different time-frames independently, ignoring the temporal dynamics of speech signals. Recent studies have proposed regularized variants of NMF or PLCA trying to overcome this limitation, including co-occurrence statistics of the basis functions [3], smoothness of the activation coefficients [17], and learned temporal dynamics [5,18,19]. In all these methods the model is expressed as the minimization of a cost with a data fitting term and some structure-promoting penalties.…”
Section: Introductionmentioning
confidence: 99%
“…At testing, the proposed method, like standard NMF approaches, treats different time-frames independently, ignoring the temporal dynamics of speech signals. Recent studies have proposed regularized variants of NMF or PLCA trying to overcome this limitation, including co-occurrence statistics of the basis functions [3], smoothness of the activation coefficients [17], and learned temporal dynamics [5,18,19]. In all these methods the model is expressed as the minimization of a cost with a data fitting term and some structure-promoting penalties.…”
Section: Introductionmentioning
confidence: 99%
“…With respect to previous work on combining NMF with LDS: while the methods of [25] [26] are able to provide a component activation matrix that is able to evolve smoothly over time, in the present work we are primarily interested in using the LDS in a supervised scenario, to provide a mapping between the observed 'noisy' output of an event detection system and the latent 'true' sound event output, which is not possible using the aforementioned methods.…”
Section: A Motivation and System Overviewmentioning
confidence: 99%
“…Recently, two NMF-based models were proposed for speech denoising and separation tasks, which incorporated temporal constraints similar to those of an LDS. In [25], an extension of NMF was proposed which supported Markovian dynamics: the observation model operates similarly to standard NMF, while the latent dynamics capture statistical dependencies between time frames similarly to LDS. In [26], a dynamic NMF model is proposed, where the observation model is similar to NMF/PLCA and follows a multinomial distribution, and the encoding matrix dynamics are formulated using an autoregressive model.…”
Section: B Linear Dynamical Systemsmentioning
confidence: 99%
“…The general trade-off is that discrete-state approaches [4,5] can be more precise, especially in their temporal dynamics, whereas continuous approaches [6,7] can be more flexible with respect to gain and subspace variability.…”
Section: Introductionmentioning
confidence: 99%
“…Discrete state models, such as HMMs, represent dynamics using discrete state transitions over time [4,11]. Continuous state Gaussian dynamical models, such as linear dynamical systems (LDSs), have long been studied [12], and recently rich models of continuous dynamics have been extended to the NMF family using gammadistributed models [6,7] in models known as non-negative dynamical systems (NDSs). There have also been combinations with discrete dynamics and NMF observation models [13].…”
Section: Introductionmentioning
confidence: 99%