2005
DOI: 10.1007/11575832_33
|View full text |Cite
|
Sign up to set email alerts
|

Deriving TF-IDF as a Fisher Kernel

Abstract: Abstract. The Dirichlet compound multinomial (DCM) distribution has recently been shown to be a good model for documents because it captures the phenomenon of word burstiness, unlike standard models such as the multinomial distribution. This paper investigates the DCM Fisher kernel, a function for comparing documents derived from the DCM. We show that the DCM Fisher kernel has components that are similar to the term frequency (TF) and inverse document frequency (IDF) factors of the standard TF-IDF method for r… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2009
2009
2023
2023

Publication Types

Select...
3
3
3

Relationship

0
9

Authors

Journals

citations
Cited by 28 publications
(13 citation statements)
references
References 5 publications
(5 reference statements)
0
13
0
Order By: Relevance
“…In the field of text processing, [8] connected the Fisher kernel and tf-idf scoring in the case of a specific model: the Dirichlet compound multinomial. We now show that the Fisher vector as applied to a GMM extends the tf-idf scoring to continuous events, where an event is the occurrence of a descriptor in an image.…”
Section: Fisher Vector and Tf-idfmentioning
confidence: 99%
“…In the field of text processing, [8] connected the Fisher kernel and tf-idf scoring in the case of a specific model: the Dirichlet compound multinomial. We now show that the Fisher vector as applied to a GMM extends the tf-idf scoring to continuous events, where an event is the occurrence of a descriptor in an image.…”
Section: Fisher Vector and Tf-idfmentioning
confidence: 99%
“…The Dirichlet Fisher kernel is also related to the Pólya Fisher kernel which has been first proposed in [7] for text categorization and then presented in [4] for visual classification. The Pólya model accounts for symbol (word) burstiness, which is measured discretely, by means of the compound of Dirichlet and multinomial models [30].…”
Section: Discussionmentioning
confidence: 99%
“…In contrast, the proposed method deals with various types of features into which those countable/uncountable histograms are L 1 -normalized. In addition, the Pólya method [7,4] inevitably requires to learn hyper parameters, which is computationally exhaustive in the case of large-scale high-dimensional histogram features.…”
Section: Discussionmentioning
confidence: 99%
“…The role of the Fisher information matrix, however, is less significant, as shown in [67], and then, can be approximated by the identity matrix. This approach has been used in [68], for instance, where a Fisher kernel was developed in the case of probabilistic latent semantic analysis, and in [69], where it was used to justify the success of the term frequency and inverse document frequency (TF-IDF) representation for documents. In the following, we shall derive the Fisher kernel for our model.…”
Section: Application To Svmsmentioning
confidence: 99%