ROUGE (metric)

ROUGE, or Recall-Oriented Understudy for Gisting Evaluation,^[1] is a set of metrics and a software package used for evaluating automatic summarization and machine translation software in natural language processing. The metrics compare an automatically produced summary or translation against a human-produced reference summary or translation, or against a set of such references.

Metrics

The following five evaluation metrics^[2] are available.

ROUGE-N: N-gram^[3] based co-occurrence statistics.
ROUGE-L: Longest Common Subsequence (LCS)^[4] based statistics. The longest common subsequence naturally takes sentence-level structure similarity into account and automatically identifies the longest in-sequence co-occurring n-grams.
ROUGE-W: Weighted LCS-based statistics that favor consecutive LCSes.
ROUGE-S: Skip-bigram^[5] based co-occurrence statistics. A skip-bigram is any pair of words in their sentence order, allowing arbitrary gaps between them.
ROUGE-SU: Skip-bigram plus unigram-based co-occurrence statistics.
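
As an illustration of the definitions above, the following is a minimal Python sketch of the recall side of ROUGE-N, ROUGE-L and ROUGE-S for a single candidate and a single reference. It is not the ROUGE software package itself. The function names are hypothetical, the inputs are assumed to be already tokenized into lists of words, and details such as stemming, stop-word removal, multiple references, precision and F-measure are left out. ROUGE-W and ROUGE-SU are not covered.

```python
from collections import Counter
from itertools import combinations


def ngrams(tokens, n):
    """Return a multiset (Counter) of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def overlap_recall(cand_counts, ref_counts):
    """Clipped overlap divided by the number of items in the reference."""
    total = sum(ref_counts.values())
    if total == 0:
        return 0.0
    # Each reference item is matched at most as often as it occurs
    # in the candidate (a missing key in a Counter counts as 0).
    matched = sum(min(c, cand_counts[item]) for item, c in ref_counts.items())
    return matched / total


def rouge_n_recall(candidate, reference, n=1):
    """ROUGE-N recall (simplified): n-gram co-occurrence."""
    return overlap_recall(ngrams(candidate, n), ngrams(reference, n))


def lcs_length(a, b):
    """Length of the longest common subsequence, by dynamic programming."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_recall(candidate, reference):
    """ROUGE-L recall (simplified): LCS length over reference length."""
    if not reference:
        return 0.0
    return lcs_length(candidate, reference) / len(reference)


def rouge_s_recall(candidate, reference):
    """ROUGE-S recall (simplified): skip-bigram co-occurrence.

    combinations() yields pairs in positional order, so every pair
    keeps the words in their sentence order, with any gap allowed.
    """
    cand_pairs = Counter(combinations(candidate, 2))
    ref_pairs = Counter(combinations(reference, 2))
    return overlap_recall(cand_pairs, ref_pairs)


# Usage with an illustrative sentence pair (whitespace tokenization assumed).
reference = "police killed the gunman".split()
candidate = "police kill the gunman".split()

print(rouge_n_recall(candidate, reference, n=1))  # 0.75 (3 of 4 unigrams)
print(rouge_l_recall(candidate, reference))       # 0.75 (LCS has length 3)
print(rouge_s_recall(candidate, reference))       # 0.5  (3 of 6 skip-bigrams)
```

In this sketch ROUGE-N and ROUGE-S share one counting routine because both reduce to clipped co-occurrence counts over a multiset; only the unit being counted (contiguous n-grams or in-order word pairs) differs.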

The ROUGE software package can be downloaded via the berouge download link.

References

External links

ROUGE web site

This article is issued from Wikipedia. The text is available under the Creative Commons Attribution/Share Alike; additional terms may apply for the media files.
