2013 IEEE International Conference on Acoustics, Speech and Signal Processing 2013
DOI: 10.1109/icassp.2013.6639314
|View full text |Cite
|
Sign up to set email alerts
|

Language model adaptation for video lectures transcription

Abstract: Videolectures are currently being digitised all over the world for its enormous value as reference resource. Many of these lectures are accompanied with slides. The slides offer a great opportunity for improving ASR systems performance. We propose a simple yet powerful extension to the linear interpolation of language models for adapting language models with slide information. Two types of slides are considered, correct slides, and slides automatic extracted from the videos with OCR. Furthermore, we compare bo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
17
0

Year Published

2014
2014
2017
2017

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 15 publications
(17 citation statements)
references
References 14 publications
0
17
0
Order By: Relevance
“…We compare our approach with a strong baseline computed from a large collection of out-of-domain and in-domain documents comprising 46 billion words. Furthermore, we compare our results with those obtained by slide adaptation [15], using as slides the text extracted from the video using OCR. We also combine both approaches to further improve adaptation which yields significant improvements with respect to both the baseline model and the slide-adapted model.…”
Section: Introductionmentioning
confidence: 95%
See 4 more Smart Citations
“…We compare our approach with a strong baseline computed from a large collection of out-of-domain and in-domain documents comprising 46 billion words. Furthermore, we compare our results with those obtained by slide adaptation [15], using as slides the text extracted from the video using OCR. We also combine both approaches to further improve adaptation which yields significant improvements with respect to both the baseline model and the slide-adapted model.…”
Section: Introductionmentioning
confidence: 95%
“…In this work, we further consider the scenario where the lecture slides can be extracted from the video using OCR and they are available to adapt the models [15], or a mixed scenario that combines both the text in the slides and the retrieved documents as follows…”
Section: Language Model Adaptation Techniquementioning
confidence: 99%
See 3 more Smart Citations