A method for full-reference visual quality assessment based on the 2-D combination of two diverse metrics is described. The first metric is a measure of structural information loss based on the Fisher information about the position of the structures in the observed images. The second metric acts as a categorical indicator of the type of distortion that images underwent. These two metrics constitute the inner state of a virtual cognitive model, viewed as a system whose output is the automatic visual quality estimate. The use of a 2-D metric fills the intrinsic incompleteness of methods based on a single metric while providing consistent response across different image impairment factors and blind distortion classification capability with a modest computational overhead. The high accuracy and robustness of the method are demonstrated through cross-validation experiments.
Action recognition in videos is a relevant and challenging task of automatic semantic video analysis. Most successful approaches exploit local space-time descriptors. These descriptors are usually carefully engineered in order to obtain feature invariance to photometric and geometric variations. The main drawback of space-time descriptors is high dimensionality and efficiency. In this paper we propose a novel descriptor based on 3D Zernike moments computed for space-time patches. Moments are by construction not redundant and therefore optimal for compactness. Given the hierarchical structure of our descriptor we propose a novel similarity procedure that exploits this structure comparing features as pyramids. The approach is tested on a public dataset and compared with state-of-the art descriptors.
""In this work the authors propose a classification method based on Support Vector Machine (SVM) and key frames features extraction to classify historical sport video contents. In the context of the Italian Project, IRMA (Information Retrieval in Multimedia Archives), with the goal to recover and preserve historical videos of proven cultural interest, a data set made up of several hours of videos from the 1960 Olympic games, provided by RAI and Teche RAI, is adopted as testbed. Each video is summarized by its key frames and represented by the features vectors computed in the Laguerre Gauss transformed domain. The high-level video classification starts from these vectors that are the input of the SVM classifier. The experimental results show the effectiveness of the proposed method."
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.