GutenTag

Software for automated sequence tag identification of peptides

Tandem mass spectra can reveal the sequences of peptides. Generally, these spectra have been matched to peptide sequences through database search algorithms such as SEQUEST or Matrix Sciences' Mascot. An alternative method was proposed in 1994 by Matthias Mann (Mann, M.; Wilm, M. Anal. Chem. 1994, 66: 4390-4399). This technique, called "sequence tagging," infers a short region of sequence (the tag) directly from the spectrum. Then it searches a sequence database for sequences which match this sequence tag and the flanking masses. Several problems have prevented this technique from achieving wide use:

GutenTag addresses all of these problems. It automates the process of sequence tag inference with a more accurate model of fragment ions than found elsewhere. It can retain multiple tags for each spectrum, ranking them by their scores. It can search a sequence database for all of these tags in a single pass. It can evaluate the candidate sequences returned from the database to determine which is the correct match for each spectrum. In short, GutenTag makes the sequence tagging approach usable on real-world data.

The program can now be licensed for use at your university, institute, or company. Academic and nonprofit use is free of charge. Commercial use is available for a license fee. To start the licensing process, fill out this form.

The manual for GutenTag is available here.


GutenTag is copyright 2003 by David L. Tabb, The Scripps Research Institute