Interspeech 2020 2020
DOI: 10.21437/interspeech.2020-2261
|View full text |Cite
|
Sign up to set email alerts
|

StoRIR: Stochastic Room Impulse Response Generation for Audio Data Augmentation

Abstract: In this paper we introduce StoRIR -a stochastic room impulse response generation method dedicated to audio data augmentation in machine learning applications. This technique, in contrary to geometrical methods like image-source or ray tracing, does not require prior definition of room geometry, absorption coefficients or microphone and source placement and is dependent solely on the acoustic parameters of the room. The method is intuitive, easy to implement and allows to generate RIRs of very complicated enclo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 16 publications
0
3
0
Order By: Relevance
“…We use the default configuration provided in the official example 4 . 4) StoRIR [17]: StoRIR uses a random energy-rescaled impulse train to estimate the RIR filter. Although it is not an ISM-based method, we select it as one of the comparable methods as it also generates the RIR filters in a stochastic way.…”
Section: Visualizationmentioning
confidence: 99%
“…We use the default configuration provided in the official example 4 . 4) StoRIR [17]: StoRIR uses a random energy-rescaled impulse train to estimate the RIR filter. Although it is not an ISM-based method, we select it as one of the comparable methods as it also generates the RIR filters in a stochastic way.…”
Section: Visualizationmentioning
confidence: 99%
“…Despite this, these datasets do not straightforwardly generalize to, e.g., multichannel settings with a specific microphone array geometry. To alleviate such stringent computational requirements, a fast stochastic RIR simulator was proposed in [13] and used to train speech enhancement models. While better generalization to real data was demonstrated w.r.t.…”
Section: Introductionmentioning
confidence: 99%
“…In recent years, an increasing number of RIR generators have been introduced to generate a realistic RIR for a given acoustic environment [5][6][7][8]. Accurate RIR generators can generate RIRs with various acoustic effects (e.g., diffraction, scattering, early reflections, late reverberations) [9].…”
Section: Introductionmentioning
confidence: 99%