Fisher kernel

From Wikipedia, the free encyclopedia

In mathematics, the Fisher kernel, named in honour of Sir Ronald Fisher, is a kernel useful in information retrieval. It was introduced in 1998 by Tommi Jaakola [1].

The Fisher kernel is the kernel for a generative probabilistic model. As such, it constitutes a bridge between generative and probabilistic models of documents[2]. Fisher kernels exist for numerous models, notably tf–idf [3], Naive Bayes and PLSI.

Contents

[edit] Fisher score

The Fisher kernel makes use of the Fisher score, defined as

U_X = \nabla_{\theta} \log P(X|\theta)

with θ begin a set (vector) of parameters. logP(X | θ) is the log-likelihood of the probabilistic model.

[edit] Fisher kernel

The Fisher kernel is defined as

K(X_i, X_j) = U_{X_{i}}^T I U_{X_{j}}

with I the Fisher information matrix

[edit] See also

[edit] Notes and references

  1. ^ Exploiting Generative Models in Discriminative Classifiers (1998) PS, Citeseer
  2. ^ Generative vs Discriminative Approaches to Entity Recognition from Label-Deficient Data (2003) PDF, Citeseer
  3. ^ Deriving TF-IDF as a fisher kernel (2005) PDF [1]