|
A method for performing protein identification & peptide sequencing by utilizing mass spectrometry fragmentation patterns to search protein and nucleotide databases has been developed by our lab. Our program, SEQUEST, converts the character-based representation of amino acid sequences in a protein database to fragmentation patterns which are compared against the MS/MS spectrum generated on the target peptide. The algorithm initally identifies amino acid sequences in the database that match the measured mass of the peptide, compares fragment ions against the MS/MS spectrum, and generates a preliminary score for each amino acid sequence. A cross correlation analysis is then performed on the top 500 preliminary scoring peptides by correlating theoretical, reconstructed spectra against the experimental spectrum. Output results are displayed accordingly.
In short, SEQUEST performs automated peptide/protein sequencing via database searching of MS/MS spectra without the need for any manual sequence interepretation, though it can make use of interpreted sequence information if available. |
|
|
DTASelect and Contrast were designed to make interpretation and comparison of proteomic data faster and more effective. DTASelect organizes and filters SEQUEST identifications, reducing the time required to interpret the results for each sample. Contrast differentiates multiple samples and comprises a powerful meta-analytical tool. |
![[DTASelect logo]](DTASelect/20010712-DTASelect-Logo.gif) |
|
Quantitative Analysis Tool for both labeling and labeling free analysis. Visit Census web page for more info. |
|
ProLuCID is a fast and sensitive tandem mass spectra-based protein identification program recently developed in the Yates laboratory at The Scripps Research Institute.
|
|
GutenTag is software to identify peptides by the sequence tagging technique. SEQUEST searches a sequence database by mass, but GutenTag searches with short sequences derived directly from the spectrum. |
|
RawExtractor is a program to extract MS and MS/MS spectra from RAW files generated by Thermo mass spectrometers, such as LTQ, LTQ-Orbitrap, LCQ, and stores the spectra in ms1, ms2 or mzXML file format. The spectra files generated by RawExtractor program are used as input for protein identification programs SEQUEST, ProLuCID and quantitatation program Census.
|
|
For truly complex protein samples, separation prior to mass spectrometry is increasingly necessary. MudPIT describes the process of digesting, separating, and identifying the components of samples consisting of thousands of proteins. Our protocol uses nanoscale strong cation exchange liquid chromatography upstream of reversed phase liquid chromatography online with microelectrospray. |
|
A modification of the SEQUEST algorithm allows the software to be run in parallel, sharing the protein identification task across several computers. Our Beowulf cluster, Shamu, has processed millions of spectra to date.
|
|
|
Research Computing at TSRI has three SGI SuperComputers (2x64 CPU and 1x128 CPU, SGI Origin 2400 and 3800 respectively) and a LINUX Cluster (128 nodes, 256 2.4 Ghz Intel XEON CPU's). run_ms2 and PEP_PROBE have been ported to run on these clusters. Group members can obtain information on how to use RC computers here.
|
|
|
Biological mass spectrometry need not be limited to peptides, of course. DFCalc is software designed to assist the interpretation of tandem mass spectra from DNA molecules. The program predicts the fragment ions for known sequences, producing a list to be compared against a spectrum. [link fixed 6/11/02] |
|
NoDupe identifies similarity among uninterpreted tandem mass spectra. Optionally, the program can remove duplicate copies of spectra. |
|
QCorr Add QCorr description here. |