Semantic publishing

From Wikipedia, the free encyclopedia

Semantic publishing on the Web or semantic web publishing refers to publishing information as data objects using a semantic web language or as documents with explicit semantic markups. Semantic publication is intended for computers to understand the structure and even the meaning of the published information, making information search and data integration more efficient.

Although semantic publishing is not specific to the Web, it has been driven by the rising of the semantic web – a web of data. In order to make the semantic web work and realize its potentials, information must be presented (i.e. published) in semantic format on the web. Thus, as the semantic web is further developed and adopted, semantic publishing will become a main form of web publishing.

Semantic publishing is not only changing the face of web publishing but also scientific publishing. Tim Berners-Lee predicted in 2001 that the semantic web “will likely profoundly change the very nature of how scientific knowledge is produced and shared, in ways that we can now barely imagine” [1]. Revisiting the semantic web in 2006, he and his colleagues believed the semantic web “could bring about a revolution in how, for example, scientific content is managed throughout its life cycle” [2]. One simple idea that may radically change scientific communication is for researchers to directly self-publish their experiment data in semantic format on the web. In one scenario, a scientist could design and run an experiment, and share the experiment information with the world in real time by publishing the data as a semantic object on the web. Semantic search engines will make these semantic data available at everyone’s finger tips. W3C interest group in healthcare and life sciences is exploring this idea of self-publishing of experiment now, for which a demo is available.

Contents

[edit] Two different approaches to semantic publishing

  • Publish information as data objects using semantic web languages like RDF and OWL. Ontology is usually developed for specific information domain, which is then used to formally represent the data in such domain.
  • Embed formal metadata in documents using new markup languages like RDFa and Microformats.

[edit] Examples of ontologies and vocabularies for publishing

[edit] Examples of free or open source tools and services

  • Semantic MediaWiki software: A single solution for semantic annotation that fits the needs of most Wikimedia projects and still meets the Wiki-specific requirements of usability and performance.
  • Swoogle: A search engine for ontologies and instance data on the Web.
  • Web2x: A new platform for semantic publishing and search, with current focus in experiment data and product information.

[edit] See also

[edit] References