2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) 2015
DOI: 10.1109/asru.2015.7404863
|View full text |Cite
|
Sign up to set email alerts
|

The MGB challenge: Evaluating multi-genre broadcast media recognition

Abstract: This paper describes the Multi-Genre Broadcast (MGB) Challenge at ASRU 2015, an evaluation focused on speech recognition, speaker diarization, and "lightly supervised" alignment of BBC TV recordings. The challenge training data covered the whole range of seven weeks BBC TV output across four channels, resulting in about 1,600 hours of broadcast audio. In addition several hundred million words of BBC subtitle text was provided for language modelling. A novel aspect of the evaluation was the exploration of speec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
151
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 125 publications
(158 citation statements)
references
References 22 publications
(17 reference statements)
0
151
0
Order By: Relevance
“…The ASR system we chose was evaluated using a large multi-genre television dataset ( Bell et al, 2015 ). It had an overall word error rate of 47%, however for news content, which is clearly spoken by a native speaker, this dropped to 16%.…”
Section: Methodsmentioning
confidence: 99%
“…The ASR system we chose was evaluated using a large multi-genre television dataset ( Bell et al, 2015 ). It had an overall word error rate of 47%, however for news content, which is clearly spoken by a native speaker, this dropped to 16%.…”
Section: Methodsmentioning
confidence: 99%
“…Research in this field has made great progress thanks to real speech corpora collected for various application scenarios such as voice command for cars (Hansen et al, 2001), smart homes (Ravanelli et al, 2015), or tablets (Barker et al, 2015), and automatic transcription of lectures (Lamel et al, 1994), meetings (Renals et al, 2008), conversations (Harper, 2015), dialogues (Stupakov et al, 2011), game sessions (Fox et al, 2013), or broadcast media (Bell et al, 2015). In most corpora, the training speakers differ from the test speakers.…”
Section: Introductionmentioning
confidence: 99%
“…To train and evaluate the effectiveness of our proposed approach, we conducted experiments on a recent and very challenging dataset from the Multi-Genre Broadcast (MGB) Challenge [18]. The MGB data is a large broad and multigenre, spanning the whole range of TV output.…”
Section: Experimental Setup 41 Data and Asr Systemmentioning
confidence: 99%