Network models have been increasingly used in the past years to support summarization and analysis of narratives, such as famous TV series, books and news. Inspired by social network analysis, most of these models focus on the characters at play. The network model well captures all characters interactions, giving a broad picture of the narration's content. A few works went beyond by introducing additional semantic elements, always captured in a single layer network. In contrast, we introduce in this work a multilayer network model to capture more elements of the narration of a movie from its script: people, locations, and other semantic elements. This model enables new measures and insights on movies. We demonstrate this model on two very popular movies.
With the recent advances in complex networks theory, graph-based techniques for image segmentation has attracted great attention recently. In order to segment the image into meaningful connected components, this paper proposes an image segmentation general framework using complex networks based community detection algorithms. If we consider regions as communities, using community detection algorithms directly can lead to an over-segmented image. To address this problem, we start by splitting the image into small regions using an initial segmentation. The obtained regions are used for building the complex network. To produce meaningful connected components and detect homogeneous communities, some combinations of color and texture based features are employed in order to quantify the regions similarities. To sum up, the network of regions is constructed adaptively to avoid many small regions in the image, and then, community detection algorithms are applied on the resulting adaptive similarity matrix to obtain the final segmented image. Experiments are conducted on Berkeley Segmentation Dataset and four of the most influential community detection algorithms are tested. Experimental results have shown that the proposed general framework increases the segmentation performances compared to some existing methods.
Image segmentation has an important role in many image processing applications. Several methods exist for segment ing an image. However, this technique is still a relatively open topic for which various research works are regularly presented.With the recent developments on complex networks theory, image segmentation techniques based on graphs has considerably improved. In this paper, we present a new perspective of image segmentation, by applying three of the most efficient community detection algorithms, Louvain, infomap and stability optimization based on the louvain algorithm, and we extract communities in which the highest modularity feature is achieved. After we show that this measure is invariant to non-structural change on image, which mean that the image segmentation is also invariant to rotation. Finally we evaluate the three proposed algorithms for Berkeley database images, and we show that our results can outperform other segmentation methods in terms of accuracy and can achieve much better segmentation results.
Discovering content and stories in movies is one of the most important concepts in multimedia content research studies. Network models have proven to be an efficient choice for this purpose. When an audience watches a movie, they usually compare the characters and the relationships between them. For this reason, most of the models developed so far are based on social networks analysis. They focus essentially on the characters at play. By analyzing characters interactions, we can obtain a broad picture of the narration's content. Other works have proposed to exploit semantic elements such as scenes, dialogues, etc.. However, they are always captured from a single facet. Motivated by these limitations, we introduce in this work a multilayer network model to capture the narration of a movie based on its script, its subtitles, and the movie content. After introducing the model and the extraction process from the raw data, we perform a comparative analysis of the whole 6-movie cycle of the Star Wars saga. Results demonstrate the effectiveness of the proposed framework for video content representation and analysis.
International audienceWith the recent advances in complex networks, image segmentation becomes one of the most appropriate application areas. In this context, we propose in this paper a new perspective of image segmentation by applying two efficient community detection algorithms. By considering regions as communities, these methods can give an over-segmented image that has many small regions. So, the proposed algorithms are improved to automatically merge those neighboring regions agglomerative to achieve the highest modularity/stability. To produce sizable regions and detect homogeneous communities, we use the combination of a feature based on the Histogram of Oriented Gradients of the image, and feature based on color to characterize the similarity of two regions. By constructing the similarity matrix in an adaptive manner, we avoid the problem of the over-segmentation. We evaluate the proposed algorithms for Berkeley Segmentation Dataset, and we show that our experimental results can outperform other segmentation methods in terms of accuracy and can achieve much better segmentation results
Compressive Sensing is a theory that can reconstruct a signal (or image) from a very small number of measurements, beyond the limits traditionally imposed by Shannon's theorem. To make this reconstruction perfect, some conditions are necessary, the signal must be sparse in a known basis and the number of measures should be sufficient enough to be in accordance with the rate of the signal sparseness. In this paper, we propose to compare different bases of transformation for compressive sensing of images. For this purpose, we use the most popular transformations that are DWT, DCT, DT-CWT and Contourlet. For our study, we choose two of the most efficient image recovery methods. The first is the L1-dantzig selector based on convex optimization approach, and the second is the Orthogonal Matching Pursuit (OMP) based on greedy algorithms. Experimental results show the efficiency of the DT-CWT in term of Peak Signal to Noise Ratio (PSNR) and Structural SIMilarity (SSIM) and also with the visual assessment of the reconstructed images.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with đź’™ for researchers
Part of the Research Solutions Family.