2007
DOI: 10.1109/tasl.2007.901312
|View full text |Cite
|
Sign up to set email alerts
|

Switching Linear Dynamical Systems for Noise Robust Speech Recognition

Abstract: Abstract. Real world applications such as hands-free dialling in cars may have to deal with potentially very noisy environments. Existing state-of-the-art solutions to this problem use featurebased HMMs, with a preprocessing stage to clean the noisy signal. However, the effect that raw signal noise has on the induced HMM features is poorly understood, and limits the performance of the HMM system. An alternative to feature-based HMMs is to model the raw signal, which has the potential advantage that including a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
44
0

Year Published

2008
2008
2015
2015

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 62 publications
(46 citation statements)
references
References 18 publications
0
44
0
Order By: Relevance
“…Various problems have been successfully solved using the SLDS. These include speech recognition (Mesot and Barber, 2007), econometric (Kim, 1994) and human figure tracking problems . Several other human behaviour detection applications are discussed in Turaga et al (2008).…”
Section: Methodsmentioning
confidence: 99%
“…Various problems have been successfully solved using the SLDS. These include speech recognition (Mesot and Barber, 2007), econometric (Kim, 1994) and human figure tracking problems . Several other human behaviour detection applications are discussed in Turaga et al (2008).…”
Section: Methodsmentioning
confidence: 99%
“…These include the switching Kalman filter and the switching state space model [19]. The SLDS has been successfully applied to various problems that include human motion modelling in computer vision [21], econometrics [22] and speech recognition [23]. No attempts to use SLDS as a generative model for data synthesis have been found in literature.…”
Section: Background and Related Workmentioning
confidence: 99%
“…1 A similar approximation approach to a constrained non-fully Bayesian switching LGSSM has independently been introduced in [9] in the context of supervised speech processing.…”
Section: Time-series Segmentationmentioning
confidence: 99%
“…Segmentation is then be performed based on an application of Bayes' rule to infer which set of parameters was most likely to have generated the observations at any particular time. This approach has successfully been used in several application domains such as finance, speech processing, modeling of human motion and medicine [1,10,11,9]. In our fully unsupervised scenario, the underlying number of different parameter sets is not known in advance and needs to be estimated.…”
Section: Introductionmentioning
confidence: 99%