Mean reciprocal rank

Mean reciprocal rank is a statistic measure for evaluating any process that produces a list of possible responses to a sample of queries, ordered by probability of correctness. The reciprocal rank of a query response is the multiplicative inverse of the rank of the first correct answer. The mean reciprocal rank is the average of the reciprocal ranks of results for a sample of queries Q:[1]

 \text{MRR} = \frac{1}{|Q|} \sum_{i=1}^{|Q|} \frac{1}{\text{rank}_i}. \!

where  \text{rank}_i refers to the rank position of the first relevant document for the ith query.

The reciprocal value of the mean reciprocal rank corresponds to the harmonic mean of the ranks.

Example

For example, suppose we have the following three sample queries for a system that tries to translate English words to their plurals. In each case, the system makes three guesses, with the first one being the one it thinks is most likely correct:

Query Results Correct response Rank Reciprocal rank
cat catten, cati, cats cats 3 1/3
torus torii, tori, toruses tori 2 1/2
virus viruses, virii, viri viruses 1 1

Given those three samples, we could calculate the mean reciprocal rank as (1/3 + 1/2 + 1)/3 = 11/18 or about 0.61.

This basic definition does not specify what to do if...

  1. none of the proposed results are correct (use reciprocal rank 0), or if
  2. there are multiple correct answers in the list. Consider using mean average precision (MAP).

See also Information retrieval and Question answering.[2]

References

  1. E.M. Voorhees (1999). "Proceedings of the 8th Text Retrieval Conference". TREC-8 Question Answering Track Report. pp. 7782.
  2. D. R. Radev, H. Qi, H. Wu, W. Fan (2002). "Evaluating web-based question answering systems". Proceedings of LREC.
This article is issued from Wikipedia - version of the Friday, January 22, 2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.