A novel methodology to represent the contents of a video sequence is presented. The representation is used to allow the user to rapidly view a video sequence in order to find a particular point within the sequence ad/or to decide whether the contents of the sequence are relevant to his or her needs. This system, referred to as content-based browsing, forms an abstraction to represent each shot of the sequence by using a representative frame, or an Rframe, and it includes management techniques to allow the user to easily navigate the Rframes. This methodology is superior to the current techniques of fast forward and rewind because rather than using every frame to view and judge the contents, only a few abstractions are used. Therefore, the need to retrieve the video from a storage system and to transmit every frame over the network in its entirety no longer exists, saving time, expenses, and bandwidth.
Recent years have witnessed a significant price reduction in many enabling technologies for wide-spread deployment of multimedia to desktop PCs and workstations. This advancement has lead to an increasing demand for systems that can store, retrieve, and manipulate large volumes of multimedia information. For a multimedia information system to better meet information users' needs, it must provide suitable access structures and methods. The answers to this demand fall into the research area of what most people called content-based multimedia indexing and retrieval. Existing approaches to content-based indexing and retrieval have limitations. What we need is a digital-media-archiving system that is both efficient and reliable. By reliable, we mean that users should be able to retrieve documents that have the most potential for being relevant to their queries. On the other hand, an efficient digital-media-archiving system should provide an environment that allows human operators to create document indices without the need to manually watch every multimedia object and enter keyword descriptions. This can be done by providing initial structure information of the video to the user, by guiding the human operator through the indexing process, and by offering tools to create multiple media representations in a hierarchical structure.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.