Tag (metadata)

From Wikipedia, the free encyclopedia

This article is about the information technology term. For other uses, see Tag (disambiguation).

A tag is a (relevant) keyword or term associated with or assigned to a piece of information (like picture, article, or video clip), thus describing the item and enabling keyword-based classification of information it is applied to.

Tags are usually chosen informally and personally by the author/creator of the item — i.e. not usually as part of some formally defined classification scheme. Tags are typically used in dynamic, flexible, automatically generated internet taxonomies for online resources such as computer files, web pages, digital images, and internet bookmarks (both in social bookmarking services, and in the current generation of web browsers - see Flock). For this reason, "tagging" has become associated with the Web 2.0 buzz.

Typically, an item will have one or more "tags" associated with it, as part of some classification software or system. The software will provide links to other items that share that keyword tag, or even to specified collections of tags. This allows for multiple "browse-able paths" through the items which can quickly and easily be altered by the collection's administrator, with minimal effort and planning.

Contents

[edit] Tagging example

A web page hosted on a web server or blog server which supports tagging, might have the tags Baseball, Red Sox, Tickets, Away Games, and Discounts. A human reader can probably tell the purpose of the page by quickly scanning the list of tags. Typically, the server would display the tags in a list on that page, with each tag displayed as a web link leading to an index page listing all web pages which use that tag. This allows a reader to locate quickly all pages which have been associated with the term Red Sox. If the server supports tag searching, a reader would be able to find all pages that use a particular set of tags, such as Red Sox and Tickets.

If the page's author wishes to reclassify the page, all that is required is for them to change the list of tags. In this case, the author could add the tags Blue Jays, and Paypal to their page. All connections between pages are automatically tracked and updated by the server software. There is no need to relocate the page within a complex hierarchy of categories.

While using tags in such an organizational system is flexible and easy, tagging is not without its drawbacks. Typically there is no information about the meaning or semantics of a tag. For example, the tag "apple" might refer to the fruit, Apple Computer, the Beatles' music label, or Gwyneth Paltrow's baby. This lack of semantic distinction in tags can lead to inappropriate connections between items. Additionally, selection of "tag terms" is highly individualistic. Different people may use drastically different terms to describe the same concept: for example items related to a version of Apple Computer's operating system might be tagged both "Mac OS X", "Tiger", and possibly many other terms. Users of tagging systems must make judgements, based on the number of connections and the choices of "tag terms", whether possible connections between items are valid for their interests.

Tag classification, and the concept of connecting sets of tags between web/blog servers, has lead to the rise of folksonomy classification over the internet, the concept of social bookmarking, and other forms of social software. Larger-scale folksonomies tend to address some of the problems of tagging, as astute users of tagging system will monitor/search the current use of "tag terms" within these systems, and tend to use existing tags in order to easily form connections to related items. In this way, evolving folksonomies define a set of tagging conventions through eventual group consensus, rather than by use of a formalized standard.

Although "tagging" is often promoted as an alternative to organization by a hierarchy of categories, more and more online resources seem to use a hybrid system, where items are organized into broad categories, with finer classification distinctions being made by the use of tags.

[edit] Other forms of categorization

  • Hierarchy - not always applicable, but often more accurate
  • Ordered list - seldom applicable for large object sets
  • Network - Always applicable, but may result in enormous taxonomies to be able to define all types of relations

[edit] Syntax

Some tagging systems provide a single text box to enter textual tags. To be able to tokenize the string, a separator must be used. A popular separator is the space character. To enable the use of separators in the tags, a system may allow for higher-level separators (such as quotation marks) or escape characters. Systems can avoid the use of separators by allowing only one tag to be added to each input widget at a time, although this makes adding multiple tags inefficient.

Another syntax for use within HTML is to use the attribute rel="tag" to indicate that the linked-to page acts as a tag for the current context. For example, to tag this page with 'folksonomy' you would add <a href="http://en.wikipedia.org/wiki/folksonomy" rel="tag">folksonomy</a>.

More detail is available in the rel tag specification.

[edit] Early and popular examples of websites that use tags

  • Del.icio.us - A social bookmarking site that allows users to bookmark many sites and then tag them with many descriptive words, allowing other people to search by those terms to find pages that other people found useful.
  • Flickr - A service that allows users to tag images with many specific nouns, verbs, and adjectives that describe the picture. This is then search-able.
  • Gmail - A webmail site that was one of the first to allow categorization of objects using tags, known as "labels" on emails.
  • Technorati - A weblog search engine.

[edit] See also

[edit] External links

In other languages