2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) 2015
DOI: 10.1109/asru.2015.7404854
|View full text |Cite
|
Sign up to set email alerts
|

The 2015 sheffield system for transcription of Multi-Genre Broadcast media

Abstract: We describe the University of Sheffield system for participation in the 2015 Multi-Genre Broadcast (MGB) challenge task of transcribing multi-genre broadcast shows. Transcription was one of four tasks proposed in the MGB challenge, with the aim of advancing the state of the art of automatic speech recognition, speaker diarisation and automatic alignment of subtitles for broadcast media. Four topics are investigated in this work: Data selection techniques for training with unreliable data, automatic speech segm… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2015
2015
2019
2019

Publication Types

Select...
5
2

Relationship

6
1

Authors

Journals

citations
Cited by 19 publications
(21 citation statements)
references
References 26 publications
0
21
0
Order By: Relevance
“…• University of Sheffield (SU; spandh.dcs.shef.ac.uk) [26]: 1st pass -DNNs with PLP+bMMI features with global cMLLR xform. Speech segmented based on 1st-pass output.…”
Section: Submitted Systems and Resultsmentioning
confidence: 99%
“…• University of Sheffield (SU; spandh.dcs.shef.ac.uk) [26]: 1st pass -DNNs with PLP+bMMI features with global cMLLR xform. Speech segmented based on 1st-pass output.…”
Section: Submitted Systems and Resultsmentioning
confidence: 99%
“…The setup for ASR experiments is the same as in [23,6] with a baseline 4-gram language model built on LM 1+LM 2 text by first selecting a vocabulary of 200k words was chosen from all the words in the LM 2 text (87k) and augmented with the most frequently occuring words in LM 1. The RNNLMs were trained with a 60k vocabulary for the input word list and a 50k vocabulary for the output word list, both obtained by shortlisting the 200k vocabulary based on most frequent words.…”
Section: Methodsmentioning
confidence: 99%
“…The MGB data is a large broad and multigenre, spanning the whole range of TV output. The Automatic transcriptions were produced by the Sheffield system described in [3]. This system was built using two different types of systems.…”
Section: Experimental Setup 41 Data and Asr Systemmentioning
confidence: 99%
“…The focus of this work was on substitution and insertion errors. The proposed system was evaluated on the transcription of a dataset from Multi-Genre Broadcast Media using Sheffield ASR system [3].…”
Section: Introductionmentioning
confidence: 99%