Interspeech 2018 2018
DOI: 10.21437/interspeech.2018-1768
|View full text |Cite
|
Sign up to set email alerts
|

The Fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines

Abstract: The CHiME challenge series aims to advance robust automatic speech recognition (ASR) technology by promoting research at the interface of speech and language processing, signal processing, and machine learning. This paper introduces the 5th CHiME Challenge, which considers the task of distant multimicrophone conversational ASR in real home environments. Speech material was elicited using a dinner party scenario with efforts taken to capture data that is representative of natural conversational speech and recor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
184
0
4

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 246 publications
(188 citation statements)
references
References 26 publications
0
184
0
4
Order By: Relevance
“…Depending on how many arrays were available during test time, the challenge had a single (reference) array and a multiple array track. For more details about the corpus, the reader is referred to [11].…”
Section: Chime-5 Corpus Descriptionmentioning
confidence: 99%
See 1 more Smart Citation
“…Depending on how many arrays were available during test time, the challenge had a single (reference) array and a multiple array track. For more details about the corpus, the reader is referred to [11].…”
Section: Chime-5 Corpus Descriptionmentioning
confidence: 99%
“…We perform experiments using data from the CHiME-5 challenge which focuses on distant multi-microphone conversational ASR in real home environments [11]. The CHiME-5 data is heavily degraded by reverberation and overlapped speech.…”
Section: Introductionmentioning
confidence: 99%
“…The CHiME challenge was launched in 2011 to address the problem of recognizing speech recorded with multiple microphones in real, noisy environments, such as a family's living room, a cafe, a busy intersection, on public transport and in pedestrian areas . To develop noise‐robust ASR systems, various approaches based not only on speech processing but also on sound‐source separation and machine learning have been widely investigated, and ASR performance has improved significantly as a result of past challenges.…”
Section: Recent Research Trends In Environmental Sound Processingmentioning
confidence: 99%
“…In addition, these methods were tested almost exclusively on small-scale segmented synthetic data and have not been applied to continuous conversational speech audio. Although the recently held CHiME-5 challenge helped the community make a step forward to a realistic setting, it still allowed the use of ground-truth speaker segments [22,23].…”
Section: Introductionmentioning
confidence: 99%