Giuseppe Serra scite author profile

Giuseppe Serra

5Publications

29Citation Statements Received

54Citation Statements Given

How they've been cited

How they cite others

Affiliations

University of Florence

Publications

Order By: Most citations

Space-Time Zernike Moments and Pyramid Kernel Descriptors for Action Classification

Costantini

Seidenari

Serra

et al. 2011

View full text Add to dashboard Cite

Action recognition in videos is a relevant and challenging task of automatic semantic video analysis. Most successful approaches exploit local space-time descriptors. These descriptors are usually carefully engineered in order to obtain feature invariance to photometric and geometric variations. The main drawback of space-time descriptors is high dimensionality and efficiency. In this paper we propose a novel descriptor based on 3D Zernike moments computed for space-time patches. Moments are by construction not redundant and therefore optimal for compactness. Given the hierarchical structure of our descriptor we propose a novel similarity procedure that exploits this structure comparing features as pyramids. The approach is tested on a public dataset and compared with state-of-the art descriptors.

show abstract

OntoNotes: A Unified Relational Semantic Representation

Bagdanov

Bertini

Bimbo

et al. 2007

View full text Add to dashboard Cite

The OntoNotes project is creating a corpus of large-scale, accurate, and integrated annotation of multiple levels of the shallow semantic structure in text. Such rich, integrated annotation covering many levels will allow for richer, cross-level models enabling significantly better automatic semantic analysis. At the same time, it demands a robust, efficient, scalable mechanism for storing and accessing these complex inter-dependent annotations. We describe a relational database representation that captures both the inter-and intra-layer dependencies and provide details of an object-oriented API for efficient, multi-tiered access to this data.

show abstract

Video Event Classification Using Bag of Words and String Kernels

Ballan

Bertini

Bimbo

et al. 2009

View full text Add to dashboard Cite

Abstract. The recognition of events in videos is a relevant and challenging task of automatic semantic video analysis. At present one of the most successful frameworks, used for object recognition tasks, is the bag-ofwords (BoW) approach. However this approach does not model the temporal information of the video stream. In this paper we present a method to introduce temporal information within the BoW approach. Events are modeled as a sequence composed of histograms of visual features, computed from each frame using the traditional BoW model. The sequences are treated as strings where each histogram is considered as a character. Event classification of these sequences of variable size, depending on the length of the video clip, are performed using SVM classifiers with a string kernel that uses the Needlemann-Wunsch edit distance. Experimental results, performed on two datasets, soccer video and TRECVID 2005, demonstrate the validity of the proposed approach.

show abstract

Recognizing Human Actions by Using Effective Codebooks and Tracking

Ballan

Seidenari

Serra

et al. 2013

View full text Add to dashboard Cite

Recognition and classification of human actions for annotation of unconstrained video sequences has proven to be challenging because of the variations in the environment, appearance of actors, modalities in which the same action is performed by different persons, speed and duration and points of view from which the event is observed. This variability reflects in the difficulty of defining effective descriptors and deriving appropriate and effective codebooks for action categorization. In this chapter, we present a novel and effective solution to classify human actions in unconstrained videos. In the formation of the codebook, we employ radius-based clustering with soft assignment in order to create a rich vocabulary that may account for the high variability of human actions. We show that our solution scores very good performance with no need of parameter tuning. We also show that a strong reduction of computation time can be obtained by applying codebook size reduction with Deep Belief Networks with little loss of accuracy

show abstract

Learning Rules for Semantic Video Event Annotation

Bertini

Bimbo

Serra

2008

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Giuseppe Serra

Space-Time Zernike Moments and Pyramid Kernel Descriptors for Action Classification

OntoNotes: A Unified Relational Semantic Representation

Video Event Classification Using Bag of Words and String Kernels

Recognizing Human Actions by Using Effective Codebooks and Tracking

Learning Rules for Semantic Video Event Annotation

Contact Info

Product

Resources

About