SEQUEST

From Wikipedia, the free encyclopedia

SEQUEST
Design by Jimmy Eng and John Yates
Genre Protein identification
License U.S. patents 6,017,693 and 5,538,897
Website http://fields.scripps.edu/sequest/

SEQUEST is a tandem mass spectrometry data analysis program used for protein identification.[1] Sequest identifies collections of tandem mass spectra to peptide sequences that have been generated from databases of protein sequences.

[edit] Applications

This tool is most useful in the context of proteomics. Starting with a complex mixture of proteins, this strategy typically employs trypsin to digest proteins. These peptides are separated by liquid chromatography en route to a tandem mass spectrometer. The mass spectrometer then isolates ions of a particular peptide, subjects them to collision-induced dissociation, and records the produced fragments in a tandem mass spectrum. This process, repeated for several hours, will produce thousands of tandem mass spectra. Identifying such a data collection requires automation, and Sequest was the first software to fill that need.

Sequest identifies each tandem mass spectrum individually. The software evaluates protein sequences from a database to compute the list of peptides that could result from each. The peptide's intact mass is known from the mass spectrum, and Sequest uses this information to determine the set of candidate peptides sequences that could meaningfully be compared to the spectrum by including only those which are near the mass of the observed peptide ion. For each candidate peptide, Sequest projects a theoretical tandem mass spectrum, and Sequest compares these theoretical spectra to the observed tandem mass spectrum by the use of cross correlation. The candidate sequence with the best matching theoretical tandem mass spectrum is reported as the best identification for this spectrum.

[edit] References

  1. ^ Jimmy K. Eng, Ashley L. McCormack, and John R. Yates, III (1994). "An Approach to Correlate Tandem Mass Spectral Data of Peptides with Amino Acid Sequences in a Protein Database". J Am Soc Mass Spectrom 5: 976–989. doi:10.1016/1044-0305(94)80016-2. 

[edit] External links