2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) 2015
DOI: 10.1109/asru.2015.7404856
|View full text |Cite
|
Sign up to set email alerts
|

Cambridge university transcription systems for the multi-genre broadcast challenge

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
59
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 35 publications
(61 citation statements)
references
References 37 publications
2
59
0
Order By: Relevance
“…• University of Cambridge (CU; (mi.eng.cam.ac.uk)) [25]: Primarily HTK-based hybrid DNN and tandem systems via joint decoding. Trained on 700 hrs (PMER=30%).…”
Section: Submitted Systems and Resultsmentioning
confidence: 99%
“…• University of Cambridge (CU; (mi.eng.cam.ac.uk)) [25]: Primarily HTK-based hybrid DNN and tandem systems via joint decoding. Trained on 700 hrs (PMER=30%).…”
Section: Submitted Systems and Resultsmentioning
confidence: 99%
“…While for TDNNs, only the 40 dimensional log-Mel filter bank features were considered. For all experiments, the input features were normalised at the utterance level for mean and at the show-segment level for variance [25].…”
Section: Methodsmentioning
confidence: 99%
“…To evaluate the generalisation performance of the trained models, a 158k word vocabulary trigram LM was used to decode the validation and test set. 1 Note that most results in [25] use a larger 700h training set, stronger language models and other setup differences. Training configuration for SGD: The best results with SGD were achieved through annealing of the learning rates at subsequent epochs.…”
Section: Methodsmentioning
confidence: 99%
“…The development data was used as the evaluation set in order to provide fair comparison with previous work [4,22,6]. For language model experiments, the LM 2 data was partitioned into a training and development set by selecting 90% of text for each programme for training and the remaining 10% for development, after shuffling the lines for each programme.…”
Section: Multi-genre Broadcast Challenge Datamentioning
confidence: 99%