Developer(s) | Unipro |
---|---|
Stable release | 1.10 / 9 December 2011 |
Written in | C++, QtScript |
Operating system | Cross-platform |
Available in | English, Russian, Czech, Chinese |
Type | Bioinformatics toolkit |
License | GPL |
Website | http://ugene.unipro.ru |
UGENE is free open-source cross-platform bioinformatics software.[1]
It integrates dozens of well-known biological tools and algorithms, providing both graphical user and command line interfaces. Using UGENE Workflow Designer one can arrange the required tools and algorithms into a workflow schema.[2]
In order to provide maximum possible performance UGENE utilizes multicore CPUs and GPUs to optimize some of its computational routines. Another way to speed up computations is to use Amazon EC2 cloud resources.[3]
Contents |
The software supports the following features:
The software has three main views to display biological data on the user’s screen.
1. The Sequence view is used to visualize, analyze and modify nucleic acid or protein sequences. Depending on the sequence type and the options selected the followings views can be presented inside the Sequence view window:
2. The Alignment editor is used to visualize, analyze and modify a nucleic acid or protein multiple sequence alignment.
3. The Assembly Browser allows to visualize and browse next-generation sequencing data.
4. The Phylogenetic tree viewer.
UGENE Workflow Designer allows creating and running complex computational workflow schemas.[4]
The elements that a schema consists of correspond to the bulk of algorithms integrated into UGENE. Using the Workflow Designer one can also create custom workflow elements.
The workflow schemas can be run both locally and remotely, either using the graphical interface or launched from the command line.
UGENE Query Designer allows a user to analyze a nucleotide sequence using different algorithms (Repeats finder, ORF finder, Weight matrix matching, etc.) at the same time imposing constraints on the positional relationship of the results obtained from the algorithms.
A schema of the algorithms and constraints is either created from the GUI or edited as a plain text.
The results are saved as a set of annotations to a specified file in the GenBank format.
UGENE Assembly Browser project was started in 2010 as an entry for Illumina iDEA Challenge 2011. The Assembly Browser allows a user to visualize and browse large (up to hundreds of millions of short reads) next generation sequence assemblies. The only format currently supported is BAM (which is the binary version of SAM). To browse assembly data in UGENE an input file should be converted to a UGENE database file. This approach has both advantages and disadvantages. The disadvantages are that the conversion may take time for a large BAM file and there should be enough disk space to store the database. On the other hand this allows to overview the whole assembly, navigate in it and go to well-covered regions rather rapidly. In addition before the conversion the user can choose contigs to be extracted from the BAM file. By this mean it is possible to open big files such as 1000 Genomes Project data.
UGENE is primarily developed by Unipro LLC. Each iteration lasts about 6 weeks. By the end of iteration a release comes out. One can also download a development snapshot of the software.
The features to be included into the next release are mostly initiated by users.