2014
DOI: 10.1109/taslp.2014.2304637
|View full text |Cite
|
Sign up to set email alerts
|

An Overview of Noise-Robust Automatic Speech Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
239
0
1

Year Published

2015
2015
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 486 publications
(240 citation statements)
references
References 209 publications
0
239
0
1
Order By: Relevance
“…More recently, Li et al (2014) reviewed new techniques that may resolve the issue of sensitivity to noise in voice-controlled systems, and that may soon be implemented in commercially available vehicle systems. With sight of such development, the present study aims to compare a noise-sensitive system that degrades in accuracy due to the presence of background noise to a noise-robust one in terms of user experience and driving performance.…”
Section: Introductionmentioning
confidence: 99%
“…More recently, Li et al (2014) reviewed new techniques that may resolve the issue of sensitivity to noise in voice-controlled systems, and that may soon be implemented in commercially available vehicle systems. With sight of such development, the present study aims to compare a noise-sensitive system that degrades in accuracy due to the presence of background noise to a noise-robust one in terms of user experience and driving performance.…”
Section: Introductionmentioning
confidence: 99%
“…However, modern recognition systems suffer from severe performance degradation in the presence of unavoidable interrupting factors like environment noise, room reverberation, disturbances from different microphones and recording non-linearities [1]. To solve these problems, many processing techniques [2,3,4], including speech enhancement algorithms [5] and new robust acoustic features [6] [7], have been developed to improve recognition performance under low signal-to-noise ratio (SNR) conditions. However these existing approaches, while achieving some improvements, are far from being a comprehensive solution.…”
Section: Introductionmentioning
confidence: 99%
“…Each topic had a mean length of 18 words, 55 percent of which were identified as keywords on average. 57 The study was a within-subjects, single-session design with three conditions: Dynamic Screen Display vs. Google Glass vs. Control. Each session lasted approximately 90 minutes.…”
Section: Methodsmentioning
confidence: 99%
“…Speech corpora generally have low diversity of speakers, therefore acoustic models generated from them might be inaccurate for transcribing speech input from nonnative speakers, speakers with accents, speakers affected with speech impairments [10], or others underrepresented in the corpora, such as older adults and children. Also, recording factors such as noise and other audio distortions can result in lower ASR performance [57].…”
Section: Acoustic Modelmentioning
confidence: 99%
See 1 more Smart Citation