NEXT

40
Validating sequence sequence assignments for assignments for peptide fragmentation peptide fragmentation patterns patterns by Karen Jonscher by Karen Jonscher www.proteomesoftware.com www.proteomesoftware.com 503-244-6027 503-244-6027 NEXT

description

Validating sequence assignments for peptide fragmentation patterns by Karen Jonscher www.proteomesoftware.com 503-244-6027. NEXT. With the advent of high throughput proteomics, data is being generated at an astonishing rate. - PowerPoint PPT Presentation

Transcript of NEXT

Page 1: NEXT

Validating sequence sequence assignments for peptide assignments for peptide fragmentation patternsfragmentation patterns

by Karen Jonscherby Karen Jonscher

www.proteomesoftware.comwww.proteomesoftware.com503-244-6027503-244-6027

NEXT

Page 2: NEXT

With the advent of high throughput proteomics, data is being generated at an astonishing rate.

It has become clear that validating peptide sequence assignments generated by database search engines is an increasingly important, but often overlooked aspect of protein identification using tandem mass spectrometry.

In this tutorial you will learn about some of the factors important in low energy peptide fragmentation and how to use this information to accept or reject database search engine peptide sequence assignments.

NEXT

Page 3: NEXT

Mascot Score = 101Mascot Score = 101

Mascot Score = 102Mascot Score = 102

Why is validating data important?Why is validating data important?

• Every database search, for a variety of reasons, generates false positive and false negative assignments. We would like to at least reduce, if not eliminate, these incorrect hits.• Decisions, often involving a great deal of money for bioassays, will be made downstream of our peptide identification. In this era of tight funding, it is crucial that the data upon which these crucial decisions rest are completely accurate.

High quality spectrum High quality spectrum with good signal-to-with good signal-to-noise and all of the noise and all of the dominant ions dominant ions assignedassigned..

Poor quality spectrum Poor quality spectrum with low signal-to-with low signal-to-noise. Fragment ions noise. Fragment ions are most likely are most likely randomly assigned to randomly assigned to noise peaks.noise peaks.

These two fragment ion spectra from a multi-dimensional LC proteomics experiment using a quadrupole ion trap were searched with Mascot. They both had similar scores. Are these two spectra equivalent?

NEXT

Page 4: NEXT

How do peptides break apart?How do peptides break apart?

NEXT

In order to assess whether a sequence assignment is correct or not, it is important to understand how and why peptides break apart.

Under low energy dissociation conditions, peptides primarily fragment at the C – N bond. If the charge is retained on the N-terminal end of the peptide, the ion is known as a b-type ion. If the charge is retained on the C-terminal end, the ion is termed a y-type ion. The fragmentation energy in some instruments, especially triple quadrupoles or quadrupole-time-of-flight hybrids (Q-TOFs) is often sufficient to generate cleavage at the C-C bond as well, causing loss of CO from the b ion. These ions are known as a-type ions.

Page 5: NEXT

HH22N-AN-A++ = b = b11

HH22N-ABN-AB++ = b = b22

HH22N-ABCN-ABC++ = b = b33

HH22N-ABCDN-ABCD++ = b = b44

HH22N-ABCDEN-ABCDE++ = b = b55

HH22N-ABCDEFN-ABCDEF++ = b = b66

HH22N-ABCDEFGN-ABCDEFG++ = b = b77

yy11 = = ++G-COOHG-COOH

yy22 = = ++FG-COOHFG-COOH

yy33 = = ++EFG-COOHEFG-COOH

yy44 = = ++DEFG-COOHDEFG-COOH

yy55 = = ++CDEFG-COOHCDEFG-COOH

yy66 = = ++BCDEFG-COOHBCDEFG-COOH

yy77 = = ++ABCDEFG-COOHABCDEFG-COOH

Peptides dissociate into Peptides dissociate into nested sets of fragmentsnested sets of fragments

NEXT

AA BB CC DD EE FF GGHH22NN COOHCOOH-- --

bb11 bb22 bb33 bb44 bb55 bb66

yy66 yy55 yy44 yy33 yy22 yy11

If A, B, C, D, E, F and G represent different amino acids, this peptide can dissociate to form the following fragments:

Page 6: NEXT

Rel

ati

ve

Ab

un

da

nce

m/z

AB

b2

C

b3

D

b4

E

b5

Fb6

G

FG

y2

E

y3

D

y4

C

y5

B

y6

A

The peptide amino acid sequence can be deduced by calculating the difference in mass between peaks. If the mass corresponds to an amino acid residue (a table is shown in the next slide) then that amino acid is assigned to the peak representing the difference. The largest y-type ion will appear anywhere between 57 to 186 amu below the mass of the precursor ion. The smallest y ion will appear at the amino acid residue mass plus 19 amu. The largest b ion will be at 18 amu plus the residue mass below the precursor and the smallest b ion will be at the residue mass + 1. Using this information, we can interpret fragmentation spectra to deduce the amino acid sequence. Keep clicking to view an ion series.

Mass difference reflects Mass difference reflects peptide sequencepeptide sequence

AA BB CC DD EE FF GGHH22NN COOHCOOH-- --

bb11 bb22 bb33 bb44 bb55 bb66

yy66 yy55 yy44 yy33 yy22 yy11

NEXT

Page 7: NEXT

Name Symbol Mass (-H2O) Side Chain Immonium/Related Ions

Alanine A, Ala 71.079 CH3- 44

Arginine R, Arg 156.188 HN=C(NH2)-NH-(CH2)3- 129/59, 70. 73, 87, 100, 112

Asparagine N, Asn 114.104 H2N-CO-CH2- 87/70

Aspartic acid D, Asp 115.089 HOOC-CH2- 88/70

Cysteine C, Cys 103.145 HS-CH2- 76

Glutamine Q, Gln 128.131 H2N-CO-(CH2)2- 101

Glutamic acid E, Glu 129.116 HOOC-(CH2)2- pro102/56, 84, 129

Glycine G, Gly 57.052 H- 30

Histidine H, His 137.141N=CH-NH-CH=C-CH2- |

__________|110/82, 121, 123, 138, 166

Isoleucine I, Ile 113.160 CH3-CH2-CH(CH3)- 86/44, 72

Leucine L, Leu 113.160 (CH3)2-CH-CH2- 86/44, 72

Lysine K, Lys 128.17 H2N-(CH2)4- 101/70, 84, 112, 129

Methionine M, Met 131.199 CH3-S-(CH2)2- 104/61

Phenylalanine F, Phe 147.177 Phenyl-CH2- 120/91

Proline P, Pro 97.117 -N-(CH2)3-CH- |_________| 70

Serine S, Ser 87.078 HO-CH2- 60

Threonine T, Thr 101.105 CH3-CH(OH)- 74

Tryptophan W, Trp 186.213Phenyl-NH-CH=C-CH2- |

___________|159/77, 117, 130, 123, 170, 171

Tyrosine Y, Tyr 163.176 4-OH-Phenyl-CH2- 136/91, 107

Valine V, Val 99.133 CH3-CH(CH2)- 72/41, 55, 69

basic

basic

basic

acidic

acidic

lose water

lose water

suppress b

suppress b

typically dominant

lose ammonia

lose ammonia

lose ammonia

lose ammonia

-H2S=34

-CH3SH=48

Not usually observedin ion traps due to low masscutoff. Same with b1/y1.

abundant y

isobaric

isobaric

NEXT

The table of common amino acids provides molecular weights for the residues, structures of the side chains, and masses for the low mass immonium ions that result from side chain loss.

These amino acids have chemical properties that need to be considered when validating sequence assignments.

Page 8: NEXT

Other things to keep in mindOther things to keep in mind

G-G = 114 = NG-A = 128 = K/QV-G = 156 = RG-E = 186 = WA-D = 186 = WS-V = 186 = WAcG = 99 = VAcA = 113 = L/IAcS = 129 = EAcN = 156 = R

NEXT

Some pairs of amino acids add up Some pairs of amino acids add up to the mass of a different amino to the mass of a different amino acid. The same can happen with acid. The same can happen with acetylated amino acids, a common acetylated amino acids, a common modification. modification.

Basic amino acids can generate doubly-charged ions. Ion signal can be intense for cleavages C-terminal to acidic amino acids. These residues also tend to lose water and cyclize to randomly eject portions of the sequence. Isobaric amino acids cannot be differentiated using low energy fragmentation instruments. Loss of water from threonine is particularly intense if the amino acid is near a terminal end of the peptide. If a peptide is tryptic, y1 will either be lysine at 147 or arginine at 175.

Page 9: NEXT

13

37

.5

12

76

.51

23

8.6

12

19

.4

11

39

.4

10

91

.4

99

2.4

NEXT

We’re going to manually interpret an MS/MS spectrum generated by a quadrupole ion trap.

The deconvoluted mass of the precursor is 1449.38 (the observed ion was doubly charged).

We’ll start by looking at the dominant peaks that are below the mass of the precursor ion.

We’ll look for possible y ions between 57 and 186 amu below 1450 (the (M+H)+ ion) and possible b ions in a window offset by another 18 amu.

Lxx Val Val Phe K/Q Gly Arg

The first ion is at 1337.5. 1450-1337=113=Lxx, therefore we assign the largest y ion as either leucine or isoleucine.

The next ion is at 1276. 1450-18-1276=156=Arg, therefore we assign the largest b ion as arginine. Since the sample was digested with trypsin, we would expect lysine or arginine as the first y ion.

Next we look at the ion at 1238. Assuming it is a y ion, we note that 1337-1238=99=Val, so we assign the next y ion as Val.

Continuing in this manner, we find b ions at 1276-1219=57=Gly and 1219-1091=128=Lys/Gln and y ions at 1238-1139=99=Val and 1139-992=147=Phe.

Page 10: NEXT

13

37

.5

12

76

.512

38.

61

21

9.4

11

39

.4

10

91

.4

99

2.4

97

5.49

44

.39

27

.3

86

3.4

21

3.0

23

2.0

31

1.8

36

0.2

44

3.3

45

9.0

NEXT

Lxx Val Val Phe K/Q Gly Arg

To verify the high mass y ion assignments, we look for the complimentary low mass b ions.

Since the data is from an ion trap, we will probably not see b1.

Therefore, we start by looking for b2.

Since the largest y ion was either leucine or isoleucine and the next y ion in the series was valine, we look for the complementary ion at 113+1+99=213=b2.

The next b ion will result from addition of another valine, therefore we’d expect a signal at 213+99=312=b3.

When phenylalanine is added, we find the b4 ion at 312+147=459.

Page 11: NEXT

13

37

.5

12

76

.512

38.

61

21

9.4

11

39

.4

10

91

.4

99

2.4

97

5.49

44

.39

27

.3

86

3.4

21

3.0

23

2.0

31

1.8

36

0.2

44

3.3

45

9.0

NEXT

To verify the high mass b ion assignments, we look for the complimentary low mass y ions.

Since the data is from an ion trap, we will probably not see y1, however the assignment of arginine makes sense given that the sample was digested using trypsin.

Therefore, we start by looking for y2.

Assuming arginine is y1, we look for a signal resulting from the addition of glycine at 156+1+18+57=232=y2. We add the 19 amu to account for the carboxyl group.

The next complementary y ion should be at 232+128=360=y3 resulting from the addition of either lysine or glutamine. Since the sample was digested using trypsin, it is unlikely that the amino acid is lysine, since we would have expected a cleavage there.

Lxx Val Val Phe K/Q Gly ArgQ

Page 12: NEXT

1091-975=1161091-944=147=Phe1091-927=164

13

37

.5

12

76

.512

38.

61

21

9.4

11

39

.4

10

91

.4

99

2.4

97

5.49

44

.39

27

.3

86

3.4

b 2y 2

b 3y 3

44

3.3

b 4

83

0.2

48

9.0

50

7.2

54

3.3

58

8.0

NEXT

Lxx Val Val Phe GlyArgGlnGlu PheAsn

We look for the next largest b ion below 1091. There are three choices:

1091-975=1161091-944=147=Phe1091-927=164

Only the signal at 944 corresponds to an amino acid residue mass, therefore the next b ion is Phe.

The next largest y ion will appear below 992 Since 992-863=129=Glu, we assign the next y ion as glutamic acid.

The next dominant peak should correspond to a b ion so we look below 944 and note that 944-830=114=Asn.

Validating our high mass ion assignments, we expect to find b5 resulting from addition of glutamic acid at 459+129=588=b5 and

y4 resulting from addition of phenylalanine, 360+147=507=y4.

Page 13: NEXT

1091-975=1161091-944=147=Phe1091-927=164

13

37

.5

12

76

.512

38.

61

21

9.4

11

39

.4

10

91

.4

99

2.4

97

5.49

44

.39

27

.3

86

3.4

b 2y 2

b 3y 3

44

3.3

b 4

83

0.2

48

9.0

54

3.3

y 4

b 5

75

0.3

70

1.1

621.3

NEXT

The next largest y ion will appear below 863. Since 863-750=113=Lxx, we assign the next y ion as leucine/isoleucine.

Lxx Val Val Phe Gly ArgGlnGlu PheAsnLxx

The next dominant peak should correspond to a b ion so we look below 830 and note that 830-701=129=Glu.

Glu

The next y ion will appear below 750. Since 750-621=129=Glu, we assign the next y ion as glutamic acid. However, since this is complementary to the b ion we just assigned, our sequence is complete.

Page 14: NEXT

1091-975=1161091-944=147=Phe1091-927=164

b 2y 2

b 3y 3

44

3.3

b 44

89

.0

54

3.3

y 4

b 5

b 5

b 7

b 8

b 9

b 10

b 11

y 5

y 6

y 7

y 8

y 9

y 10

y 11

b

yL V V F E L E N F Q G R

NEXT

In this example we have observed a complete series of complementary b and y ions. Purple bars indicate b ions while orange bars are for y ions.

Page 15: NEXT

# Rank/Sp Id# (M+H)+ deltCn XCorr Sp Ions Reference Peptide --- -------- -------- -------- ------ ------ ---- ---- --------- ------- 1. 1 / 1 0 1450.7694 0.0000 4.7567 2559.1 20/22 CRB1_HUMAN R.LVVFELENFQGR.R 2. 2 / 2 0 1451.7534 0.0254 4.6357 2541.5 20/22 CRB1_HUMAN R.LVVFELEN*FQGR.R 3. 3 / 2 0 1451.7534 0.0571 4.4851 2541.5 20/22 CRB1_HUMAN R.LVVFELENFQ*GR.R 4. 4 / 3 0 1452.7374 0.2804 3.4230 2036.0 18/22 CRB1_HUMAN R.LVVFELEN*FQ*GR.R 5. 5 / 6 0 1451.7569 0.4038 2.8358 1057.7 15/22 TP3B_HUMAN K.LN*M#VKFLQ*VEGR.G 6. 6 / 5 0 1450.7729 0.4619 2.5595 1426.3 17/22 TP3B_HUMAN K.LN*M#VKFLQVEGR.G 7. 7 / 13 0 1450.6549 0.4654 2.5430 840.0 13/22 NEUM_HUMAN R.TKQ*VEKN*DDDQ*K.I 8. 8 / 13 0 1449.6709 0.4752 2.4964 840.0 13/22 NEUM_HUMAN R.TKQ*VEKNDDDQ*K.I 9. 9 / 15 0 1449.6709 0.4757 2.4941 817.4 13/22 NEUM_HUMAN R.TKQ*VEKN*DDDQK.I 10. 10 / 11 0 1451.7055 0.5042 2.3586 843.6 13/22 ING_HUMAN K.S]VETIKEDM#NVK.F 11. 11 / 16 0 1451.8011 0.5145 2.3093 817.3 14/22 GGT5_HUMAN R.VNVYHHLVETLK.F 12. 12 / 4 0 1451.7494 0.5179 2.2932 1458.5 16/20 DESP_HUMAN R.LTYEIEDEKRR.R

Incorrect Identification

NEXT

Sequest tenatively matches each spectrum to 12 peptides. Matches are rated with the Xcorr value (Sequest’s score criterion). Here is a poorly rated match. Although most of the dominant ions were assigned, they were not assigned to b and y ions but to water loss and other ions that are generally less abundant.

Page 16: NEXT

# Rank/Sp Id# (M+H)+ deltCn XCorr Sp Ions Reference Peptide --- -------- -------- -------- ------ ------ ---- ---- --------- ------- 1. 1 / 1 0 1450.7694 0.0000 4.7567 2559.1 20/22 CRB1_HUMAN R.LVVFELENFQGR.R 2. 2 / 2 0 1451.7534 0.0254 4.6357 2541.5 20/22 CRB1_HUMAN R.LVVFELEN*FQGR.R 3. 3 / 2 0 1451.7534 0.0571 4.4851 2541.5 20/22 CRB1_HUMAN R.LVVFELENFQ*GR.R 4. 4 / 3 0 1452.7374 0.2804 3.4230 2036.0 18/22 CRB1_HUMAN R.LVVFELEN*FQ*GR.R 5. 5 / 6 0 1451.7569 0.4038 2.8358 1057.7 15/22 TP3B_HUMAN K.LN*M#VKFLQ*VEGR.G 6. 6 / 5 0 1450.7729 0.4619 2.5595 1426.3 17/22 TP3B_HUMAN K.LN*M#VKFLQVEGR.G 7. 7 / 13 0 1450.6549 0.4654 2.5430 840.0 13/22 NEUM_HUMAN R.TKQ*VEKN*DDDQ*K.I 8. 8 / 13 0 1449.6709 0.4752 2.4964 840.0 13/22 NEUM_HUMAN R.TKQ*VEKNDDDQ*K.I 9. 9 / 15 0 1449.6709 0.4757 2.4941 817.4 13/22 NEUM_HUMAN R.TKQ*VEKN*DDDQK.I 10. 10 / 11 0 1451.7055 0.5042 2.3586 843.6 13/22 ING_HUMAN K.S]VETIKEDM#NVK.F 11. 11 / 16 0 1451.8011 0.5145 2.3093 817.3 14/22 GGT5_HUMAN R.VNVYHHLVETLK.F 12. 12 / 4 0 1451.7494 0.5179 2.2932 1458.5 16/20 DESP_HUMAN R.LTYEIEDEKRR.R

Correct Identification

NEXT

When we compare the incorrect with the correct identification, we see that all of the dominant ions are assigned to b and y ions, with water losses accounting for many of the lower abundance peaks. The rank is 1 and the Xcorr is high in the Sequest output file for the correct ID.

Page 17: NEXT

Bad Spectral QualityBad Spectral Quality

Good Spectral QualityGood Spectral Quality

Good ScoreGood Score Bad ScoreBad Score

Bad IDBad IDGood IDGood ID

? ID? ID

Bad IDBad ID

Bad IDBad ID

Good IDGood ID

Bad IDBad ID

Score vs. Spectral QualityScore vs. Spectral Quality

NEXT

As we have seen, a high score may be an indicator that an identification is correct.

However, this does not hold true in all cases. In the next few examples, we will see instances where good scores actually corresponded to bad identifications, and bad scores corresponded to good identifications.

The importance of spectral quality is demonstrated and the case of questionable identification is reviewed.

Page 18: NEXT

Bad Spectral QualityBad Spectral Quality

Good Spectral QualityGood Spectral Quality

Good ScoreGood Score Bad ScoreBad Score

Bad IDBad ID

NEXT

Example 1Example 1

Proteomics DataProteomics DataMascot SearchMascot Search

Page 19: NEXT

1. IPI00001661 Mass: 45425 Total score: 111 Peptides matched: 1 Tax_Id=9606 Regulator of chromosome condensation Check to include this hit in error tolerant search or archive report

  Query   Observed   Mr(expt)   Mr(calc)   Delta  Miss Score  Rank   Peptide

   1     950.77    1899.53    1899.00     0.53    0   111     1    VVQVSAGDSHTAALTDDGR

Great score, nice spectrumGreat score, nice spectrum

NEXT

The Mascot score is 111. Usually, scores over 40 or 50 typically generate correct identifications.

The score is well beyond the 95% confidence level and is well separated from the other possibilities, generally a positive indicator of a correct ID.

The mass spectrum has a lot of fragment ions and has good signal to noise. Oftentimes, good quality spectra like this provide good search results.

Page 20: NEXT

# b b++ b* b*++ b0 b0++ Seq. y y++ y* y*++ y0 y0++ #

1 100.14 50.57         V             19

2 199.27 100.14         V 1800.88 900.94 1783.85 892.43 1782.87 891.94 18

3 327.40 164.21 310.37 155.69     Q 1701.75 851.38 1684.72 842.86 1683.73 842.37 17

4 426.54 213.77 409.51 205.26     V 1573.62 787.31 1556.59 778.80 1555.60 778.30 16

5 513.61 257.31 496.58 248.80 495.60 248.30 S 1474.48 737.75 1457.45 729.23 1456.47 728.74 15

6 584.69 292.85 567.66 284.34 566.68 283.84 A 1387.41 694.21 1370.38 685.69 1369.39 685.20 14

7 641.75 321.38 624.72 312.86 623.73 312.37 G 1316.33 658.67 1299.30 650.15 1298.31 649.66 13

8 756.83 378.92 739.80 370.41 738.82 369.91 D 1259.28 630.14 1242.24 621.63 1241.26 621.13 12

9 843.91 422.46 826.88 413.94 825.90 413.45 S 1144.19 572.60 1127.16 564.08 1126.17 563.59 11

10 981.05 491.03 964.02 482.52 963.04 482.02 H 1057.11 529.06 1040.08 520.54 1039.09 520.05 10

11 1082.16 541.58 1065.13 533.07 1064.14 532.58 T 919.97 460.49 902.94 451.97 901.95 451.48 9

12 1153.24 577.12 1136.21 568.61 1135.22 568.12 A 818.86 409.94 801.83 401.42 800.85 400.93 8

13 1224.32 612.66 1207.29 604.15 1206.30 603.65 A 747.78 374.40 730.75 365.88 729.77 365.39 7

14 1337.48 669.24 1320.45 660.73 1319.46 660.23 L 676.70 338.86 659.67 330.34 658.69 329.85 6

15 1438.58 719.79 1421.55 711.28 1420.57 710.79 T 563.55 282.28 546.51 273.76 545.53 273.27 5

16 1553.67 777.34 1536.64 768.82 1535.65 768.33 D 462.44 231.72 445.41 223.21 444.42 222.72 4

17 1668.76 834.88 1651.73 826.37 1650.74 825.88 D 347.35 174.18 330.32 165.66 329.34 165.17 3

18 1725.81 863.41 1708.78 854.89 1707.79 854.40 G 232.26 116.64 215.23 108.12     2

19             R 175.21 88.11 158.18 79.59     1

The masses of the expected sequence ions are summarized in the data table. Masses labeled in red were observed in the experiment. As we can see, we have a fairly long run of contiguous sequence for the y ions and a shorter run for the b ions. There is some overlap of the b and y ions, however, so we have a complementary set.

NEXT

Page 21: NEXT

Can’t account for dominant ions!!!

Incorrect IDIncorrect ID

NEXT

BUT…BUT…

Despite the many of the factors which seemed to lead to a correct identification, it is wrong. Since this is a good quality spectrum, it would be worth pursuing other interpretation options. For one, the peptide may be modified and it would be worth re-searching the data using a database including modifications such as phosphorylation or glycosylation, among others. Several searches may be required. De novo searching would also be a possible approach if modification searches do not provide acceptable results.

Page 22: NEXT

Bad Spectral QualityBad Spectral Quality

Good Spectral QualityGood Spectral Quality

Good ScoreGood Score Bad ScoreBad Score

Bad IDBad ID

NEXT

Example 2Example 2

Page 23: NEXT

No significant hits to reportUnassigned queries: (no details means no match)  Query   Observed   Mr(expt)   Mr(calc)   Delta  Miss Score  Rank   Peptide   

1     868.07    1734.13    1730.98     3.15    2    29     1    KGVASTDNTLIARSLGK

All All dominant ionsdominant ions are are unidentifiedunidentified, , the the spectrumspectrum is of is of good qualitygood quality, , with good signal to noise and well-separated fragment ionswith good signal to noise and well-separated fragment ions

NEXT

Mascot score is below generally accepted thresholds.

Page 24: NEXT

# b b++ b* b*++ b0 b0++ Seq. y y++ y* y*++ y0 y0++ #

1 129.18 65.10 112.15 56.58     K             17

2 186.23 93.62 169.20 85.11     G 1603.82 802.41 1586.79 793.90 1585.80 793.40 16

3 285.37 143.19 268.34 134.67     V 1546.76 773.89 1529.73 765.37 1528.75 764.88 15

4 356.45 178.73 339.42 170.21     A 1447.63 724.32 1430.60 715.80 1429.62 715.31 14

5 443.52 222.27 426.49 213.75 425.51 213.26 S 1376.55 688.78 1359.52 680.27 1358.54 679.77 13

6 544.63 272.82 527.60 264.30 526.61 263.81 T 1289.48 645.24 1272.44 636.73 1271.46 636.23 12

7 659.72 330.36 642.69 321.85 641.70 321.36 D 1188.37 594.69 1171.34 586.17 1170.35 585.68 11

8 773.82 387.41 756.79 378.90 755.81 378.41 N 1073.28 537.14 1056.25 528.63 1055.27 528.14 10

9 874.93 437.97 857.90 429.45 856.91 428.96 T 959.18 480.09 942.15 471.58 941.16 471.09 9

10 988.09 494.55 971.06 486.03 970.07 485.54 L 858.07 429.54 841.04 421.02 840.06 420.53 8

11 1101.25 551.13 1084.21 542.61 1083.23 542.12 I 744.91 372.96 727.88 364.45 726.90 363.95 7

12 1172.32 586.67 1155.29 578.15 1154.31 577.66 A 631.75 316.38 614.72 307.87 613.74 307.37 6

13 1328.51 664.76 1311.48 656.24 1310.50 655.75 R 560.67 280.84 543.64 272.33 542.66 271.83 5

14 1415.59 708.30 1398.56 699.78 1397.57 699.29 S 404.49 202.75 387.46 194.23 386.47 193.74 4

15 1528.75 764.88 1511.72 756.36 1510.73 755.87 L 317.41 159.21 300.38 150.69     3

16 1585.80 793.40 1568.77 784.89 1567.79 784.40 G 204.25 102.63 187.22 94.11     2

17             K 147.20 74.10 130.17 65.59     1

There are only short runs of contiguous sequence, there is little complementarity between the b and y ions. With dominant ions unidentified the assignment is obviously incorrect. Since the spectrum is of good quality, the next step should be to consider modifications.

NEXT

Page 25: NEXT

Bad Spectral QualityBad Spectral Quality

Good Spectral QualityGood Spectral Quality

Good ScoreGood Score Bad ScoreBad Score

Good IDGood ID

NEXT

Example 3Example 3

Page 26: NEXT

Score = 31 below threshold

# b b0 Seq. y y* y0 #

1 72.09   A       15

2 129.14   G 1409.58 1392.55 1391.57 14

3 186.19   G 1352.53 1335.50 1334.52 13

4 257.27   A 1295.48 1278.45 1277.46 12

5 328.35   A 1224.40 1207.37 1206.38 11

6 427.48   V 1153.32 1136.29 1135.31 10

7 526.61   V 1054.19 1037.16 1036.17 9

8 639.77   I 955.06 938.03 937.04 8

9 740.88 722.86 T 841.90 824.87 823.88 7

10 869.99 851.98 E 740.79 723.76 722.78 6

11 967.11 949.10 P 611.68 594.65 593.66 5

12 1096.23 1078.21 E 514.56 497.53 496.54 4

13 1233.37 1215.35 H 385.44 368.41 367.43 3

14 1334.47 1316.46 T 248.30 231.27 230.29 2

15     K 147.20 130.17   1

All dominant ions accounted forLargest peak corresponds to y5 – cleavage at proline

BUT…

•Chemistry is plausible•A Sequest search provided the same identification as the Mascot search•Likely correct assignment

NEXT

No water loss for b ions until b9 when Thr appears.

Water loss for y ions also starts after the threonine.

Possibly the presence of the basic histidine and the acidic glutamic acid inhibit the water loss at y3 and y4.

Page 27: NEXT

Bad Spectral QualityBad Spectral Quality

Good Spectral QualityGood Spectral Quality

Good ScoreGood Score Bad ScoreBad Score

Bad IDBad ID

NEXT

Example 4Example 4

Page 28: NEXT

Score = 18 Very weak spectrumScore = 18 Very weak spectrum

There is no baseline, thus we must assume this is a noise spectrum. The ions are likely assigned by random chance.

NEXT

Page 29: NEXT

# b b++ b* b*++ b0 b0++ Seq. y y++ y* y*++ y0 y0++ #

1 58.06 29.53         G             25

2 157.19 79.10         V 2413.73 1207.37 2396.70 1198.85 2395.71 1198.36 24

3 228.27 114.64         A 2314.60 1157.80 2297.57 1149.29 2296.58 1148.79 23

4 299.35 150.18         A 2243.52 1122.26 2226.49 1113.75 2225.50 1113.25 22

5 412.51 206.76         L 2172.44 1086.72 2155.41 1078.21 2154.42 1077.72 21

6 513.61 257.31     495.60 248.30 T 2059.28 1030.14 2042.25 1021.63 2041.26 1021.14 20

7 600.69 300.85     582.68 291.84 S 1958.17 979.59 1941.14 971.08 1940.16 970.58 19

8 715.78 358.39     697.77 349.39 D 1871.10 936.05 1854.06 927.54 1853.08 927.04 18

9 812.90 406.95     794.88 397.95 P 1756.01 878.51 1738.98 869.99 1737.99 869.50 17

10 883.98 442.49     865.96 433.48 A 1658.89 829.95 1641.86 821.43 1640.87 820.94 16

11 983.11 492.06     965.09 483.05 V 1587.81 794.41 1570.78 785.89 1569.80 785.40 15

12 1111.24 556.12 1094.21 547.61 1093.23 547.12 Q 1488.68 744.84 1471.65 736.33 1470.66 735.84 14

13 1182.32 591.66 1165.29 583.15 1164.30 582.66 A 1360.55 680.78 1343.52 672.26 1342.53 671.77 13

14 1295.48 648.24 1278.45 639.73 1277.46 639.24 I 1289.47 645.24 1272.44 636.72 1271.45 636.23 12

15 1394.61 697.81 1377.58 689.29 1376.60 688.80 V 1176.31 588.66 1159.28 580.14 1158.29 579.65 11

16 1507.77 754.39 1490.74 745.87 1489.76 745.38 L 1077.18 539.09 1060.15 530.58 1059.16 530.08 10

17 1622.86 811.93 1605.83 803.42 1604.84 802.93 D 964.02 482.51 946.99 474.00 946.00 473.51 9

18 1723.96 862.49 1706.93 853.97 1705.95 853.48 T 848.93 424.97 831.90 416.45 830.91 415.96 8

19 1795.04 898.03 1778.01 889.51 1777.03 889.02 A 747.82 374.42 730.79 365.90 729.81 365.41 7

20 1882.12 941.56 1865.09 933.05 1864.11 932.56 S 676.74 338.88 659.71 330.36 658.73 329.87 6

21 1997.21 999.11 1980.18 990.59 1979.19 990.10 D 589.67 295.34 572.64 286.82 571.65 286.33 5

22 2096.34 1048.68 2079.31 1040.16 2078.33 1039.67 V 474.58 237.79 457.55 229.28 456.56 228.79 4

23 2209.50 1105.26 2192.47 1096.74 2191.49 1096.25 L 375.45 188.23 358.42 179.71 357.43 179.22 3

24 2324.59 1162.80 2307.56 1154.28 2306.58 1153.79 D 262.29 131.65 245.26 123.13 244.27 122.64 2

25             K 147.20 74.10 130.17 65.59     1

Although there are some complementary runs of contiguous sequence ions, it is unlikely they are significant, given the quality of the mass spectrum.

We do see a dominant fragment ion at the y17 proline, however other large signals do not correspond to expected cleavages C-terminal of the acidic amino acids.

The assignment of doubly-charged ions without the presence of a basic ion is highly unlikely, therefore this identification is incorrect.

This spectrum could likely be discarded.NEXT

Page 30: NEXT

Bad Spectral QualityBad Spectral Quality

Good Spectral QualityGood Spectral Quality

Good ScoreGood Score Bad ScoreBad Score? ID? ID

NEXT

Example 5Example 5

Page 31: NEXT

# b b++ b* b*++ b0 b0++ Seq. y y++ y* y*++ y0 y0++ #

1 132.20 66.60         M             9

2 261.32 131.16     243.30 122.15 E 945.06 473.04 928.03 464.52 927.05 464.03 8

3 358.43 179.72     340.42 170.71 P 815.95 408.48 798.92 399.96 797.93 399.47 7

4 472.54 236.77 455.51 228.26 454.52 227.76 N 718.83 359.92 701.80 351.40 700.82 350.91 6

5 559.61 280.31 542.58 271.80 541.60 271.30 S 604.73 302.87 587.70 294.35 586.71 293.86 5

6 672.77 336.89 655.74 328.38 654.76 327.88 L 517.65 259.33 500.62 250.81 499.63 250.32 4

7 828.96 414.98 811.93 406.47 810.95 405.98 R 404.49 202.75 387.46 194.23 386.47 193.74 3

8 930.07 465.54 913.04 457.02 912.05 456.53 T 248.30 124.66 231.27 116.14 230.29 115.65 2

9             K 147.20 74.10 130.17 65.59     1

Score = 38

Most abundant peak results from loss of water from serine, not proline cleavage as expected.The doubly-charged y8 ion is reasonable, given the presence of the arginine. However, the b2 doubly-charged ion is unlikely.

Score slightly below threshold, spectral quality OK but fragmentation is limited, questionable ID

The identification could possibly be disregarded if the protein assignment was confirmed by another peptide.

NEXT

Page 32: NEXT

Bad Spectral QualityBad Spectral Quality

Good Spectral QualityGood Spectral Quality

Good ScoreGood Score Bad ScoreBad Score

Good IDGood ID

NEXT

Example 6Example 6

Page 33: NEXT

# b b++ b* b*++ b0 b0++ Seq. y y++ y* y*++ y0 y0++ #

1 132.20 66.60         M             14

2 233.31 117.16     215.29 108.15 T 1689.83 845.42 1672.79 836.90 1671.81 836.41 13

3 348.39 174.70     330.38 165.69 D 1588.72 794.86 1571.69 786.35 1570.71 785.86 12

4 476.52 238.77 459.49 230.25 458.51 229.76 Q 1473.63 737.32 1456.60 728.80 1455.62 728.31 11

5 605.64 303.32 588.61 294.81 587.63 294.32 E 1345.50 673.25 1328.47 664.74 1327.49 664.25 10

6 676.72 338.86 659.69 330.35 658.70 329.86 A 1216.39 608.70 1199.35 600.18 1198.37 599.69 9

7 789.88 395.44 772.85 386.93 771.86 386.44 I 1145.31 573.16 1128.28 564.64 1127.29 564.15 8

8 918.01 459.51 900.98 450.99 899.99 450.50 Q 1032.15 516.58 1015.12 508.06 1014.13 507.57 7

9 1033.10 517.05 1016.07 508.54 1015.08 508.05 D 904.02 452.51 886.99 444.00 886.00 443.50 6

10 1146.26 573.63 1129.23 565.12 1128.24 564.63 L 788.93 394.97 771.90 386.45     5

11 1332.47 666.74 1315.44 658.22 1314.46 657.73 W 675.77 338.39 658.74 329.87     4

12 1460.60 730.80 1443.57 722.29 1442.59 721.80 Q 489.55 245.28 472.52 236.77     3

13 1646.82 823.91 1629.78 815.40 1628.80 814.90 W 361.42 181.22 344.39 172.70     2

14             R 175.21 88.11 158.18 79.59     1

Score = 79Complete b and y seriesPlausible ion chemistryCorrect ID!Correct ID!

NEXT

Abundant ions at D and W cleavages

Page 34: NEXT

The following examples compare spectra that were acquired on a Q-TOF and an ion trap.

Both correct and incorrect identifications are shown.

The different appearance of the fragmentation spectra, and the presence of immonium ions and low mass fragment ions in the Q-TOF, means that different considerations may need to be taken into account when different instruments are being used.

NEXT

Page 35: NEXT

QTOF

Ion Trap

NEXT

Correct Identification

Note the presence of y1 and the immonium ion for F in the QTOF spectrum. The QTOF spectrum shows b ion suppression except for b2.

Page 36: NEXT

QTOF

Ion Trap

NEXT

Incorrect Identification

Both spectra have many dominant ions unaccounted for. The dominant ions are assigned as b type in the QTOF spectrum, which is unlikely.

Page 37: NEXT

Look at the next two slides.

Can you guess which are the correct identifications and which are incorrect?

NEXT

Page 38: NEXT

QTOF

Ion Trap

NEXT

Page 39: NEXT

QTOF

Ion Trap

NEXT

Page 40: NEXT

When interpreting the results of search engines it is important to – Look at the score– Look at the sequence runs– Consider the ion fragmentation chemistry– What about the instrument?– Does it all make sense?

www.proteomesoftware.com

Summary