At FXPAL Japan we have built an experimental Smart Conference Room (SCR) that contains multiple cameras, microphones, displays, and capture devices. Based on our experience, in this paper we discuss research and open issues in constructing SCRs like the one built at FXPAL for the purpose of automatic content analysis. Our discussion is grounded in a novel conceptual meeting model that consists of physical (from layout to cameras), conceptual (meeting types, actors), sensory (audio-visual capture), and content (syntax and semantics) components. We also discuss storage, retrieval, and deployment issues.
INTRODUCTION

Meetings are important events in any organization, and recently there has been renewed interest in building smart meeting rooms that capture meetings on video for future viewing. This is due to lower computer and video equipment costs, higher computational power, and the growing importance of keeping accurate records in companies (for knowledge, risk management, and compliance, among others). In the United States, for example, the SOX act [21] and other recent laws require accurate record keeping to ensure that the financial data the CEO and CFO sign off on is auditable. Although recording meetings is not a requirement, meeting videos may well play an important role in the future: traditional note-taking is insufficient to store all relevant meeting events, and it is subjective, often incomplete, and inaccurate.

Many smart meeting room environments [39][61][43] and portable meeting systems [38] have been developed. Most of the focus has been on developing techniques to automatically process the generated audiovisual content (e.g., face detection and action recognition [67]; speech recognition for topic detection [62]; and many others [3]). However, little attention has been given to the overall meeting capture framework, the issues around building the infrastructure necessary to deploy a real-world application, and the impact of such infrastructure on the development of automatic content analysis techniques.

In this paper, we propose a multiple-component conceptual meeting model and give an overview of the major research issues in building and deploying a smart conference room environment from the perspective of automatic content analysis. We discuss issues ranging from physical room layout and hardware infrastructure to automatic content analysis and metadata. Our model (Figure 1) consists of four components: physical structure, conceptual structure, sensory acquisition, and acquired content¹.
The physical component models the objects and layout of a smart meeting room (e.g., tables). The conceptual component models the structure of the meeting (e.g., meeting type, roles). The sensory component models the capture of the meeting using multiple sensing devices (cameras, microphones, etc.). The content component models the acquired audiovisual content, both its syntax and its semantics. The four components of our model are directly linked by a contextual mesh, which we define as the set of conditions under which the meeting takes place. As the circle in the ce...
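To make the four-component model concrete, the following is a minimal sketch of how it might be represented as data structures. All class and field names here are illustrative assumptions for exposition, not definitions from the model itself; the contextual mesh is approximated as a shared dictionary of conditions linking the components.

```python
from dataclasses import dataclass, field

@dataclass
class PhysicalComponent:
    """Objects and layout of the room (tables, displays, device positions)."""
    objects: list[str] = field(default_factory=list)
    layout: dict[str, tuple[float, float]] = field(default_factory=dict)  # object -> (x, y)

@dataclass
class ConceptualComponent:
    """Structure of the meeting itself: meeting type and participant roles."""
    meeting_type: str = "presentation"
    roles: dict[str, str] = field(default_factory=dict)  # participant -> role

@dataclass
class SensoryComponent:
    """Devices capturing the meeting (cameras, microphones, etc.)."""
    cameras: list[str] = field(default_factory=list)
    microphones: list[str] = field(default_factory=list)

@dataclass
class ContentComponent:
    """Acquired content: low-level syntax and higher-level semantics."""
    syntax: list[str] = field(default_factory=list)     # e.g., shots, speech segments
    semantics: list[str] = field(default_factory=list)  # e.g., topics, actions

@dataclass
class MeetingModel:
    """Four components tied together by a contextual mesh of meeting conditions."""
    physical: PhysicalComponent
    conceptual: ConceptualComponent
    sensory: SensoryComponent
    content: ContentComponent
    context: dict[str, str] = field(default_factory=dict)  # the contextual mesh

# Example instantiation (hypothetical values):
model = MeetingModel(
    physical=PhysicalComponent(objects=["table", "whiteboard"]),
    conceptual=ConceptualComponent(meeting_type="staff", roles={"alice": "chair"}),
    sensory=SensoryComponent(cameras=["pan-tilt-1"], microphones=["ceiling-array"]),
    content=ContentComponent(syntax=["shot boundaries"], semantics=["agenda topics"]),
    context={"schedule": "weekly", "room": "SCR"},
)
```

Separating the components this way lets content analysis modules consume only the parts they need (e.g., a speaker-tracking module reads the physical and sensory components) while the shared context remains available to all of them.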