Semantic similarity

From Wikipedia, the free encyclopedia

Semantic similarity, also called semantic relatedness or semantic closeness, proximity or nearness, is a concept whereby a set of documents, or of terms within term lists, is assigned a metric based on the likeness of their meaning or semantic content.

An intuitive way of displaying terms according to their semantic similarity is to group closely related terms together and to space more distantly related ones farther apart. This is common, if sometimes subconscious, practice in mind maps and concept maps.

Concretely, this can be achieved, for instance, by defining a topological similarity, by using ontologies to define a distance between words, or by using statistical means to correlate words and textual contexts from a suitable text corpus (co-occurrence). A naive ontology-based metric for terms arranged as nodes in a directed acyclic graph, such as a hierarchy, is the minimal number of edges separating the two term nodes.
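
The two approaches named above, edge counting in a hierarchy and co-occurrence statistics, can be illustrated with a minimal Python sketch. The toy hierarchy, the three-sentence corpus, and the function names (edge_distance, cooccurrence_vectors, cosine) are assumptions made up for this example and are not drawn from any particular ontology or library. Cosine similarity is used here as one possible way to compare co-occurrence counts; other measures could be substituted.

```python
from collections import deque
from math import sqrt

# Hypothetical toy hierarchy (child -> list of parents), invented for illustration.
PARENTS = {
    "animal": [],
    "mammal": ["animal"],
    "bird": ["animal"],
    "dog": ["mammal"],
    "cat": ["mammal"],
    "sparrow": ["bird"],
}

# Hypothetical toy corpus, one sentence per string.
CORPUS = [
    "the dog chased the cat",
    "the cat chased the mouse",
    "the sparrow sang",
]


def edge_distance(a, b, parents=PARENTS):
    """Minimal number of edges between terms a and b, ignoring edge direction."""
    # Turn the child -> parents mapping into an undirected adjacency map.
    neighbours = {}
    for child, child_parents in parents.items():
        neighbours.setdefault(child, set())
        for parent in child_parents:
            neighbours[child].add(parent)
            neighbours.setdefault(parent, set()).add(child)
    if a not in neighbours or b not in neighbours:
        return None
    # Breadth-first search finds the shortest path in an unweighted graph.
    queue = deque([(a, 0)])
    seen = {a}
    while queue:
        node, dist = queue.popleft()
        if node == b:
            return dist
        for nxt in neighbours[node]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return None  # the two terms are not connected


def cooccurrence_vectors(sentences):
    """For each word, count how many sentences it shares with every other word."""
    vectors = {}
    for sentence in sentences:
        words = set(sentence.lower().split())
        for word in words:
            counts = vectors.setdefault(word, {})
            for other in words:
                if other != word:
                    counts[other] = counts.get(other, 0) + 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors stored as dicts."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm = sqrt(sum(x * x for x in u.values())) * sqrt(sum(x * x for x in v.values()))
    return dot / norm if norm else 0.0


if __name__ == "__main__":
    print(edge_distance("dog", "cat"))      # 2 (dog - mammal - cat)
    print(edge_distance("dog", "sparrow"))  # 4 (dog - mammal - animal - bird - sparrow)
    vectors = cooccurrence_vectors(CORPUS)
    print(cosine(vectors["dog"], vectors["cat"]))
    print(cosine(vectors["dog"], vectors["sparrow"]))
```

In this sketch a smaller edge distance and a larger cosine value both indicate closer terms: "dog" lies nearer to "cat" than to "sparrow" in the hierarchy, and its co-occurrence vector is also more similar to that of "cat". A corpus this small is dominated by frequent words such as "the", so a realistic co-occurrence measure would need a much larger text collection and some form of weighting.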