“…If we consider low-level tasks by Amar et al [1], only a few of the 10 tasks can be applied to images [66]. Thus, an important challenge in interactive visualization of image data is automatic extraction of semantic information, interactive exploration of categories, or both [53,62,67].…”
Fig. 1. With DendroMap, users can explore large-scale image datasets by overviewing the overall distributions and zooming down into hierarchies of image groups at multiple levels of abstraction. In this example, we visualize images of the CIFAR-100 dataset by hierarchically clustering the image representations obtained from a ResNet50 image classification model. (B) DendroMap View displays these clusters of images organized as a hierarchical structure by adapting Treemaps. By clicking on a cluster, a user can interactively (C) Zoom into that image group, revealing subgroups that replace and fill the available space with animation (see the submitted video).The user clicked on a cluster for organism images, which creates distinct subgroups of fish, insects, worms, fruits, and flowers. With (A) Sidebar View, the user can dynamically adjust the number of clusters to be displayed and inspect the class-level statistics.
“…If we consider low-level tasks by Amar et al [1], only a few of the 10 tasks can be applied to images [66]. Thus, an important challenge in interactive visualization of image data is automatic extraction of semantic information, interactive exploration of categories, or both [53,62,67].…”
Fig. 1. With DendroMap, users can explore large-scale image datasets by overviewing the overall distributions and zooming down into hierarchies of image groups at multiple levels of abstraction. In this example, we visualize images of the CIFAR-100 dataset by hierarchically clustering the image representations obtained from a ResNet50 image classification model. (B) DendroMap View displays these clusters of images organized as a hierarchical structure by adapting Treemaps. By clicking on a cluster, a user can interactively (C) Zoom into that image group, revealing subgroups that replace and fill the available space with animation (see the submitted video).The user clicked on a cluster for organism images, which creates distinct subgroups of fish, insects, worms, fruits, and flowers. With (A) Sidebar View, the user can dynamically adjust the number of clusters to be displayed and inspect the class-level statistics.
“…The features (esp. semantically meaningful ones, such as concept labels) can be used as additional metadata (e.g., [38]) or to build a content-based index to fuel search capabilities. Indexing approaches include clustering-based approaches such as product quantization [17] or extended cluster pruning [13], and hash-based approaches [2], especially those based on locality-sensitive hashing [6].…”
Section: Related Workmentioning
confidence: 99%
“…There are approaches that incorporate interactive model building to cover a wider range of the exploration-search axis. To advance the analytic session, they usually make use of a rich set of filters on the data (e.g., [3,19,20,38,39]), an interactive (multimodal) learning model (e.g., [15,43]), or a combination of both. Whilst these techniques go beyond mere search, on the exploration-search axis, they tend to lean towards search anyway: they simply fetch what the users are looking for or what they found relevant previously.…”
Fig. 1. II-20's novel model fully supports flexible analytic categorization of image collections, closing the pragmatic gap. The user can categorize or discard images, and the system intelligently chooses the level of exploration and search for the categories. Pictured: the novel Tetris interface metaphor.
“…Multimedia analytics systems, such as Multimedia Pivot Tables [31], ICLIC [28], and Blackthorn [38] facilitate search and exploration in large collections of multimedia data as well as interactive multimodal learning. Blackthorn, for example, compresses semantic information from the visual and text domain and learns user preferences on the fly from the interactions with the system in a relevance feedback framework.…”
In this paper we present a novel interactive multimodal learning system, which facilitates search and exploration in large networks of social multimedia users. It allows the analyst to identify and select users of interest, and to find similar users in an interactive learning setting. Our approach is based on novel multimodal representations of users, words and concepts, which we simultaneously learn by deploying a general-purpose neural embedding model. We show these representations to be useful not only for categorizing users, but also for automatically generating user and community profiles. Inspired by traditional summarization approaches, we create the profiles by selecting diverse and representative content from all available modalities, i.e. the text, image and user modality. The usefulness of the approach is evaluated using artificial actors, which simulate user behavior in a relevance feedback scenario. Multiple experiments were conducted in order to evaluate the quality of our multimodal representations, to compare different embedding strategies, and to determine the importance of different modalities. We demonstrate the capabilities of the proposed approach on two different multimedia collections originating from the violent online extremism forum Stormfront and the microblogging platform Twitter, which are particularly interesting due to the high semantic level of the discussions they feature.
CCS CONCEPTS• Information systems → Multimedia and multimodal retrieval. KEYWORDS multimedia analytics, search, exploration, interactive learning, multimodal embeddings, online discussion forums, social multimedia • First, compact but meaningful multimodal content representations are needed to ensure the interactivity of the system
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.