Proceedings of the Working Conference on Advanced Visual Interfaces (AVI '06), 2006
DOI: 10.1145/1133265.1133338
The prospects for unrestricted speech input for TV content search

Abstract: The need for effective search for television content is growing as the number of choices for TV viewing and/or recording explodes. In this paper we describe a preliminary prototype of a multimodal Speech-In List-Out (SILO) interface in which users' input is unrestricted by vocabulary or grammar. We report on usability testing with a sample of six users. The prototype enables search through video content metadata downloaded from an electronic program guide (EPG) service. Our setup for testing included adding a mi…
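The paper does not publish its implementation, but the abstract describes the SILO pattern clearly enough to sketch: the recognizer's best hypothesis is treated as an unrestricted free-text query against downloaded EPG metadata, and a ranked list of matching programs is shown for the user to pick from. The following minimal Python sketch is a hypothetical illustration of that pattern only; the names (EpgEntry, silo_search) and the word-overlap scoring are illustrative assumptions, not the authors' method.

# Hypothetical sketch of a Speech-In List-Out (SILO) search step,
# assuming EPG metadata has already been downloaded. Not the paper's code.
from dataclasses import dataclass

@dataclass
class EpgEntry:
    title: str
    description: str
    channel: str

def tokenize(text: str) -> set[str]:
    # Lowercase word set; punctuation-bearing tokens are dropped for simplicity.
    return {t for t in text.lower().split() if t.isalnum()}

def silo_search(asr_hypothesis: str, epg: list[EpgEntry], k: int = 5) -> list[EpgEntry]:
    """Rank EPG entries by word overlap with the recognized query."""
    query = tokenize(asr_hypothesis)

    def score(entry: EpgEntry) -> int:
        return len(query & (tokenize(entry.title) | tokenize(entry.description)))

    scored = sorted(((score(e), e) for e in epg), key=lambda p: p[0], reverse=True)
    # "List-Out": return the top-k hits and let the user disambiguate by choosing.
    return [e for s, e in scored[:k] if s > 0]

epg = [
    EpgEntry("Nova", "Science documentary series", "PBS"),
    EpgEntry("Evening News", "Daily news broadcast", "NBC"),
]
# The ASR hypothesis may contain recognition errors; listing candidates
# lets the user recover without constraining vocabulary or grammar.
print(silo_search("science documentary about space", epg))

Because the input is unrestricted, a real system would likely score against richer metadata fields (cast, genre, synopsis) and tolerate recognition errors, but the list-out step is the essential recovery mechanism.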

Cited by 22 publications (17 citation statements) | References 7 publications
“…In this way, speech-based queries could be better matched with speech-based metadata that pertains to the content, which in our example would be restaurant names, menu items, information about ratings, etc. Similar challenges for search and retrieval of multimedia information as described in this example also exist in the mobile and television environments; see, e.g., the work of Wittenburg et al. [39] on applying speech-based queries to EPG search on the television. While the solutions may be different depending on the context and type of media to be consumed, it is necessary to identify common requirements for future metadata standards that enable richer forms of multimodal input, such as voice and gestures, to be used for multimedia information retrieval as illustrated in Fig.…”
Section: A. Ease of Use (mentioning)
confidence: 82%
“…This is time-consuming and complex for users. Emerging multimodal [10], gestural [11], and auxiliary [12] input interfaces show promise but currently address niche requirements and require more research before they can realistically match user expectations.…”
Section: Inline Search for TV (mentioning)
confidence: 99%
“…After a large consumer survey [3], we have focused on developing a multimodal user interface for a media center. This application area is rapidly becoming popular in homes, and it provides opportunities and challenges for multimodal user interaction [1,2]. Our media center provides users with full control over digital television content, including an advanced electronic program guide (EPG).…”
Section: Media Center Application (mentioning)
confidence: 99%