ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9054454
|View full text |Cite
|
Sign up to set email alerts
|

Low-Frequency Compensated Synthetic Impulse Responses For Improved Far-Field Speech Recognition

Abstract: We propose a method for generating low-frequency compensated synthetic impulse responses that improve the performance of farfield speech recognition systems trained on artificially augmented datasets. We design linear-phase filters that adapt the simulated impulse responses to equalization distributions corresponding to realworld captured impulse responses. Our filtered synthetic impulse responses are then used to augment clean speech data from Lib-riSpeech dataset [1]. We evaluate the performance of our metho… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
16
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 13 publications
(17 citation statements)
references
References 32 publications
0
16
0
1
Order By: Relevance
“…• We show that, on a modified Kaldi LibriSpeech far-field ASR benchmark [2], far-field speech augmented using our improved RIRs outperforms the far-field speech augmented using unmodified RIRs by up to 19.9%.…”
Section: Introductionmentioning
confidence: 90%
See 2 more Smart Citations
“…• We show that, on a modified Kaldi LibriSpeech far-field ASR benchmark [2], far-field speech augmented using our improved RIRs outperforms the far-field speech augmented using unmodified RIRs by up to 19.9%.…”
Section: Introductionmentioning
confidence: 90%
“…On the other hand, we observe a boost or diminishing effect in the frequency response at different frequency bands in real RIRs due to wave modes created by room resonance. Some methods tend to compensate for the missing room response in synthetic RIRs using a sub-band room equalization approach [2].…”
Section: Techniques For Improving Synthetic Rirmentioning
confidence: 99%
See 1 more Smart Citation
“…The authors of the IR-GAN algorithm also evaluate the generated RIRs by performing an ASR test, using the Kaldi LibriSpeech acoustic model, which is based on the (Tang et al, 2020b). The paper states that the proposed algorithm can reduce WER by almost 9% compared to the GAS method.…”
Section: Evaluation Methodsmentioning
confidence: 99%
“…However, the collection of a sufficiently-varied dataset with multiple positions per room is a tedious task and needs to be repeated for different microphone geometries. There have been numerous studies on the differences between using simulated and measured RIRs, with several attempts to close the performance gap: point-source noises [8], directional sources [12], and artificially mimicking low-frequency wave effects [13].…”
Section: Introductionmentioning
confidence: 99%