2016 IEEE Spoken Language Technology Workshop (SLT) 2016
DOI: 10.1109/slt.2016.7846239
|View full text |Cite
|
Sign up to set email alerts
|

A study of speech distortion conditions in real scenarios for speech processing applications

Abstract: The growing demand for robust speech processing applications able to operate in adverse scenarios calls for new evaluation protocols and datasets beyond artificial laboratory conditions. The characteristics of real data for a given scenario are rarely discussed in the literature. As a result, methods are often tested based on the author expertise and not always in scenarios with actual practical value. This paper aims to open this discussion by identifying some of the main problems with data simulation or coll… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(6 citation statements)
references
References 43 publications
0
6
0
Order By: Relevance
“…The residual mechanism was extremely useful in this case, since the signal has always a linear shortcut and the non-linear path enhances it in certain steps by adding or subtracting corrections. In practical applications, this is a valuable property because realistic scenarios could challenge the system with many different conditions, and sometimes the real world is not so noisy as research studies consider in experimental setups [63].…”
Section: Discussionmentioning
confidence: 99%
“…The residual mechanism was extremely useful in this case, since the signal has always a linear shortcut and the non-linear path enhances it in certain steps by adding or subtracting corrections. In practical applications, this is a valuable property because realistic scenarios could challenge the system with many different conditions, and sometimes the real world is not so noisy as research studies consider in experimental setups [63].…”
Section: Discussionmentioning
confidence: 99%
“…The residual mechanism was extremely useful in this case since the signal has always a linear shortcut and the non-linear path enhances it in certain steps by adding or subtracting corrections. In practical applications, this is a valuable property because realistic scenarios could challenge the system with many different conditions [28].…”
Section: Discussionmentioning
confidence: 99%
“…Finally, the speech mixtures were reverberated using RIRs ( 26 ) dataset to simulate different room settings (e.g., small room, medium room, and large room). Based on the studies ( 27 , 28 ) that conducted comprehensive analyses of daily acoustic scenarios in terms of noise and reverberation level, our synthetic datasets were able to simulate various scenarios like bedroom, kitchen, meeting room, office, classroom, restaurant, hospital hall, etc., as shown in Figure 3 .…”
Section: Methodsmentioning
confidence: 99%
“… Simulated scenarios in our synthetic datasets, based on the comprehensive analyses of daily acoustic scenarios in terms of noise and reverberation level ( 27 , 28 ). …”
Section: Methodsmentioning
confidence: 99%