Developer(s) | Andreas Biegert and Johannes Soeding |
---|---|
Stable release | 1.03 / April 14, 2009 |
Preview release | 1.1 / April 14, 2009 |
Written in | C++ |
Available in | English |
Type | Bioinformatics tool |
License | Creative Commons Attribution-NonCommercial-3.0 |
Website | ftp://toolkit.lmb.uni-muenchen.de/csblast/ |
CS-BLAST (context-specific BLAST), an improved version[1] of BLAST (Basic Local Alignment Search Tool),[2] is a program for protein sequence searching.
Sequence searches are frequently performed by biologists to infer the function of an unknown protein from its sequence. For this purpose, the protein's sequence is compared to the sequences of millions of other protein sequences in public databases. The functions of the protein can often be inferred from the functions of the most similar sequences found in such a search.[3]
Contents |
CS-BLAST is implemented as a wrapper around BLAST. It finds up to twice as many distantly related protein sequences as BLAST at the same speed and error rate.[4] This is achieved by considering the letters that make up protein sequences, the amino acids, in the context of the 12 neighboring amino acids (six on either side). Whereas the probabilities for amino acids to mutate into other types of amino acids in related proteins depend only on the single amino acid in BLAST and other sequence search programs, mutation probabilities in CS-BLAST depend on the local sequence contexts.
An extension of CS-BLAST for iterative search with position-specific scoring matrices is also available. This program (CSI-BLAST) is similar to the popular PSI-BLAST (position-specific iterative BLAST) program.[5] Two search iterations of CSI-BLAST are more sensitive than five search iterations of PSI-BLAST.[4]