1995
DOI: 10.1109/89.466660
|View full text |Cite
|
Sign up to set email alerts
|

Automatic word recognition in cars

Abstract: This paper compares, on a database recorded in a car, a number of signal analysis and speech enhancement techniques as well as some approaches to adapt speech recognition systems. It is shown that a new nonlinear spectral subtraction associated with Me1 frequency cepstral coefficients (MFCC) is an adequate compromise for low-cost integration. The Lombard effect is analyzed and simulated. Such a simulation is used to derive realistic training utterances from noise-free utterances. Adapting a continuous-density … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
27
0
2

Year Published

1996
1996
2020
2020

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 45 publications
(29 citation statements)
references
References 42 publications
(21 reference statements)
0
27
0
2
Order By: Relevance
“…In [32], front-end constrained iterative enhancement was shown to improve ASR in noise. However, in [58], back-end noise modeling with sufficient noise data was shown both analytically and experimentally to be preferable for ASR compared to noise suppression techniques.…”
Section: Recognizer With Codebook Of Noisy Modelsmentioning
confidence: 99%
See 1 more Smart Citation
“…In [32], front-end constrained iterative enhancement was shown to improve ASR in noise. However, in [58], back-end noise modeling with sufficient noise data was shown both analytically and experimentally to be preferable for ASR compared to noise suppression techniques.…”
Section: Recognizer With Codebook Of Noisy Modelsmentioning
confidence: 99%
“…This requirement is difficult to meet in real world scenarios where noise level and type may vary continuously, introducing mismatch between the acoustic models typically trained with either clean or partial noise data. Numerous techniques to reduce mismatch have been proposed in two domains: 1) transformation of noisy speech towards clean, such as noise suppression [32], [54], [58]; 2) noise modeling in the ASR back-end, such as updating acoustic models for noise characteristics [58], using two-dimensional HMMs to decompose speech and noise components [59], and parallel model combination (PMC) [60]. In [32], front-end constrained iterative enhancement was shown to improve ASR in noise.…”
Section: Recognizer With Codebook Of Noisy Modelsmentioning
confidence: 99%
“…These AR parameters were then used to provide car specific speech enhancement for noisy speech recognition. In another approach, Mokbel and Chollet (1995) proposed a feature enhancement/adaptation strategy where the characteristics of the Mel Frequency Cepstral Coefficients (MFCCs) in car environment were utilized to derive a non-linear Spectral Subtraction (SS) scheme. For perception centric systems, Arslan et al (1995) suggested adaptive noise suppression algorithms for mobile applications and tested them on simulated car noise.…”
Section: Speech Systems With Vehiclesmentioning
confidence: 99%
“…By setting the singular values representing the n-oise to zero, we can reconstruct a new reduced rank matrix H representing the clean speech signal R using (4). To obtain the vector jz from H , the anti-diagonal components of H are then averaged.…”
Section: Singular Value Decomposition Based Speech Enhancementmentioning
confidence: 99%
“…Spectral subtraction has been shown to work to some extent for speech recognition [2] [4]. The spectral subtraction algorithm works on the assumption that the noise contained in the noisy speech signal is additive and uncorrelated with the clean speech signal.…”
Section: Introductionmentioning
confidence: 99%