Taverna Workbench |
|
Developer(s) | myGrid |
Stable release | 2.3 / July 14, 2011 |
Development status | Active |
Written in | Java |
Operating system | Linux, Mac OS X, Microsoft Windows |
License | LGPL |
Website | http://www.taverna.org.uk |
Taverna Workbench is an open source software[1] tool for designing and executing workflows, created by the myGrid project and funded through the OMII-UK. Taverna allows users to integrate many different software components, including SOAP or REST Web services, such as those provided by the National Center for Biotechnology Information, the European Bioinformatics Institute, the DNA Databank of Japan (DDBJ), SoapLab, BioMOBY and EMBOSS. The set of available services is not finite and users can import new service descriptions into the Taverna Workbench.
Taverna Workbench provides a desktop authoring environment and enactment engine for scientific workflows. The Taverna workflow enactment engine is also available separately, as a command line tool or as a server.
Taverna is used by users in many domains, such as bioinformatics, cheminformatics, medicine, astronomy, social science and music.
Some of the services for the use in Taverna workflows can be discovered through the BioCatalogue - a public, centralised and curated registry of Life Science Web services. Taverna workflows can also be shared with other people through the myExperiment social web site for scientists. BioCatalogue and myExperiment are another two product from the myGrid consortium.
Taverna is used in over 350 organizations around the world, both academic and commercial. As of 2011, there have been over 80,000 downloads of Taverna across different versions.
Contents |
Taverna workflows can invoke general SOAP/WSDL or REST Web services, and more specific SADI, BioMart, BioMoby and SoapLab Web services. It can also invoke R statistical services, local Java code, external tools on remote machines (via ssh), do XPath and other text manipulation, import a spreadsheet and include sub-workflows.
Taverna Workbench includes the ability to monitor the running of a workflow and to examine the provenance of the data produced. The provenance system for Taverna 2.1 is being co-ordinated with the Open Provenance Model.
Taverna includes the ability to search for services described in BioCatalogue to include within workflows. However, services do not need to be described within BioCatalogue to be included in workflows.
Taverna also includes the capability to search for workflows on myExperiment. You can download, modify and run the workflows discovered on myExperiment from within the Taverna Workbench. You can also upload you workflows from the Workbench to myExperiment in order to share them with others.
Taverna workflows do not need to be executed within the Taverna Workbench. Workflows can also be run by:
Taverna 2 (the second generation of the Taverna software) allows pipelining and streaming of data. This means that services downstream a workflow can start as soon as the first piece of data is received, without waiting for the whole piece of data to become available. It also has improved memory usage allowing the handling of much larger data sets.
Taverna allows developers to plugin new functionality and also to use Taverna within their own products. Taverna has been extended to allow additional components within workflows, for example those from the Chemistry Development Kit, SADI semantic Web services, and caGrid. It has also been bundled with other products, for example the Taverna-LC plugin for OpenOffice Calc allows calling services as spreadsheet functions.
Various projects and institutions have run Taverna workflows on grids or have used Taverna to access services on grids, such as KnowARC, NGS (National Grid Service), EGEE (Enabling Grids for E-sciencE) and caGrid.
External tools can be included within Taverna workflows either scripts such as Java Beanshell, though the use of an API Consumer service that generates services for the methods exposed by the tool written in Java or via external tools plugin, which allows users to run tools on a grid or remote/local machine using grid or ssh authentication.