Improving the Fisher Kernel for Large-Scale Image Classification

Perronnin, Florent; Sánchez, Jorge M. Balestena; Mensink, Thomas

doi:10.1007/978-3-642-15561-1_11

Cited by 1,963 publications

(1,954 citation statements)

References 18 publications

Supporting

Mentioning

1,911

Contrasting

Unclassified

Order By: Relevance

“…The image representation we used was the improved Fisher Vector (FV) but without spatial pyramids (cf. [12]). Before going into details regarding the experimental procedure, we give a brief overview of this state-of-the-art image signature.…”

Section: Methodsmentioning

confidence: 99%

See 1 more Smart Citation

Classifier Combination Using Random Walks on the Space of Concepts

Sánchez

Redolfi

2012

Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications

Self Cite

View full text Add to dashboard Cite

Abstract. We propose a novel approach for the combination of classifiers based on two commonly adopted strategies in multiclass classification: one-vs-all and one-vs-one. The method relies on establishing the relevance of nodes in a graph defined in the space of concepts. Following a similar approach as in the ranking of websites, the relative strength of the nodes is given by the stationary distribution of a Markov chain defined on that graph. The proposed approach do not requires the base classifiers to provide calibrated probabilities. Experiments on the challenging problem of multiclass image classification show the potentiality of our approach.

show abstract

Section: Methodsmentioning

confidence: 99%

“…Before going into details regarding the experimental procedure, we give a brief overview of this state-of-the-art image signature. Details can be found in [11,12].…”

Section: Methodsmentioning

confidence: 99%

Classifier Combination Using Random Walks on the Space of Concepts

Sánchez

Redolfi

2012

Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications

Self Cite

View full text Add to dashboard Cite

show abstract

“…Then LOAD feature is calculated on round patches densely extracted from the eigth images (the original plus the seven in different scales). The Improved Fisher Vector (IFV) by Perronnin et al (2010) is used to encode the features to preserve its discriminative power. Then, classification is performed through a linear SVM.…”

Section: Accepted Manuscriptmentioning

confidence: 99%

HEp-2 staining pattern recognition at cell and specimen levels: Datasets, algorithms and results

Hobson

Lovell

Percannella

et al. 2016

Pattern Recognition Letters

View full text Add to dashboard Cite

1 Research Highlights (Required)To create your highlights, please type the highlights against each \item command.It should be short collection of bullet points that convey the core findings of the article. It should include 3 to 5 bullet points (maximum 85 characters, including spaces, per bullet point.)• Updates the state of the art in HEp-2 cell and specimen image classification ABSTRACTThe Indirect Immunofluorescence (IIF) protocol applied on Human Epithelial type 2 (HEp-2) cells is the current gold standard for the Antinuclear Antibody (ANA) test. The formulation of the diagnosis requires the visual analysis of a patient's specimen under a fluorescence microscope in order to recognize the cells' staining pattern which could be related to a connective tissue disease. This analysis is time consuming and error prone, thus in the recent past we have witnessed a growing interest in the pattern recognition scientific community directed at the development of methods for supporting this complex task. The main driver of the interest towards this problem is represented by the series of international benchmarking initiatives organized in the last four years that allowed dozens of research groups to propose innovative methodologies for HEp-2 cells' staining pattern classification. In this paper we update the state of the art on HEp-2 cells and specimens classification, by analyzing the performance achieved by the methods participating the contest on Performance Evaluation of IIF Image Analysis Systems, hosted by the 22nd edition of the International Conference on Pattern Recognition ICPR 2014, and to the Executable Thematic Special Issue of Pattern Recognition Letters on Pattern Recognition Techniques for IIF Images Analysis, and by highlighting the trends in the design of the best performing methods.

show abstract

“…These patch statistics are then aggregated at an image level. We choose to compute the patch statistics using the Fisher Vector (FV) principle [19], since it obtained state-of-the-art results in image retrieval [5] and classification [2]. We assume that we have a generative model of patches (a Gaussian Mixture Model in our case) and measure the gradient of the log-likelihood of the descriptor with respect to the model parameters.…”

Section: Image Embeddingmentioning

confidence: 99%

“…To include spatial information about the word image into the signature, we can partition the image into regions, aggregate the per-patch statistics at a region level and then concatenate the region-level signatures as proposed for instance in [8]. See [19] for more details about the FV.…”

Section: Image Embeddingmentioning

confidence: 99%

Label embedding for text recognition

Rodríguez¹,

Perronnin²

2013

Procedings of the British Machine Vision Conference 2013

Self Cite

View full text Add to dashboard Cite

The standard approach to recognizing text in images consists in first classifying local image regions into candidate characters and then combining them with high-level word models such as conditional random fields (CRF). This paper explores a new paradigm that departs from this bottom-up view. We propose to embed word labels and word images into a common Euclidean space. Given a word image to be recognized, the text recognition problem is cast as one of retrieval: find the closest word label in this space. This common space is learned using the Structured SVM (SSVM) framework by enforcing matching label-image pairs to be closer than non-matching pairs. This method presents the following advantages: it does not require costly pre-or post-processing operations, it allows for the recognition of never-seen-before words and the recognition process is efficient. Experiments are performed on two challenging datasets (one of license plates and one of scene text) and show that the proposed method is competitive with standard bottom-up approaches to text recognition.

show abstract

Improving the Fisher Kernel for Large-Scale Image Classification

Cited by 1,963 publications

References 18 publications

Classifier Combination Using Random Walks on the Space of Concepts

Classifier Combination Using Random Walks on the Space of Concepts

HEp-2 staining pattern recognition at cell and specimen levels: Datasets, algorithms and results

Label embedding for text recognition

Contact Info

Product

Resources

About