2013
DOI: 10.1250/ast.34.133
|View full text |Cite
|
Sign up to set email alerts
|

Noise suppression method for preprocessor of time-lag speech recognition system based on bidirectional optimally modified log spectral amplitude estimation

Abstract: In this paper, we propose a new noise suppression method, that is best used as a preprocessor for time-lag speech recognition. Assuming that a time lag of a few seconds is acceptable in various speech recognition applications, the proposed method is realized as a combination of forward and backward estimation flows over time. Each estimation flow is based on the optimally modified log spectral amplitude (OM-LSA) speech estimator, but a look-ahead estimation mechanism is additionally equipped to make the estima… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 18 publications
(20 reference statements)
0
4
0
Order By: Relevance
“…The latency of the proposed noise suppression was 50ms due to the ΔΔ feature calculation in Eq. (7). Memory usage was 1,168,347 bytes.…”
Section: Implementation On Dspmentioning
confidence: 99%
See 1 more Smart Citation
“…The latency of the proposed noise suppression was 50ms due to the ΔΔ feature calculation in Eq. (7). Memory usage was 1,168,347 bytes.…”
Section: Implementation On Dspmentioning
confidence: 99%
“…One [1]- [10] does not require pre-training, and the other [11]- [23] does. Well-known examples of the first type include spectral subtraction (SS) [1], Wiener filtering (WF) [2], minimum mean-square error short-time spectral amplitude (MMSE STSA) [3], [4], minimum meansquare error log-spectral amplitude (MMSE LSA) [5], [8], and optimally-modified log-spectral amplitude (OM-LSA) [6], [7]. Advanced Front-end (AFE) [9], [10] of ETSI (European Telecommunications Standards Institute), which applies WF in two stages, is particularly known for its high noise robustness, and it is often used for comparison in research studies.…”
Section: Introductionmentioning
confidence: 99%
“…As a speech enhancement algorithm for our system, we implemented bidirectional OM-LSA (BOMLSA) speech estimator [23]. OM-LSA [24] is an extension of FFT-based a priori SNR estimator of Ephraim and Malah [25].…”
Section: Speech Enhancement and Merging Asr Outputsmentioning
confidence: 99%
“…Furthermore, we made HN-E (high noise, enhanced) and LN-E (low noise, enhanced) speech samples by applying a speech enhancement algorithm to HN-R and LN-R, respectively. We used the bidirectional OMLSA algorithm [23] for the enhancement.…”
Section: Interview-based Collection Of Experimental Datasetmentioning
confidence: 99%