Towards methods for efficient access to spoken content in the ami corpus

Jones, Gareth J. F.; Eskevich, Maria; Gyarmati, Ágnes

doi:10.1145/1878101.1878108

Proceedings of the 2010 International Workshop on Searching Spontaneous Conversational Speech 2010

DOI: 10.1145/1878101.1878108

|View full text |Cite

Towards methods for efficient access to spoken content in the ami corpus

Gareth J. F. Jones

Maria Eskevich

Ágnes Gyarmati

Abstract: Increasing amounts of informal spoken content are being collected. This material does not have clearly defined document forms either in terms of structure or topical content, e.g. recordings of meetings, lectures and personal data sources. Automated search of this content poses challenges beyond retrieval of defined documents, including definition of search items and location of relevant content within them. While most existing work on speech search focused on clearly defined document units, in this paper we d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2013

Publication Types

Select...

Other1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

References 10 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

Presentation video retrieval using automatically recovered slide and spoken text

Cooper

2013

SPIE Proceedings

View full text Add to dashboard Cite

Video is becoming a prevalent medium for e-learning. Lecture videos contain text information in both the visual and aural channels: the presentation slides and lecturer's speech. This paper examines the relative utility of automatically recovered text from these sources for lecture video retrieval. To extract the visual information, we apply video content analysis to detect slides and optical character recognition to obtain their text. Automatic speech recognition is used similarly to extract spoken text from the recorded audio. We perform controlled experiments with manually created ground truth for both the slide and spoken text from more than 60 hours of lecture video. We compare the automatically extracted slide and spoken text in terms of accuracy relative to ground truth, overlap with one another, and utility for video retrieval. Results reveal that automatically recovered slide text and spoken text contain different content with varying error profiles. Experiments demonstrate that automatically extracted slide text enables higher precision video retrieval than automatically recovered spoken text.

show abstract

Presentation video retrieval using automatically recovered slide and spoken text

Cooper

2013

SPIE Proceedings

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Towards methods for efficient access to spoken content in the ami corpus

Cited by 1 publication

References 10 publications

Presentation video retrieval using automatically recovered slide and spoken text

Presentation video retrieval using automatically recovered slide and spoken text

Contact Info

Product

Resources

About