STRAP PTM: Differential Characterization by PTM Counting ... · STRAP PTM: Differential...

1
STRAP PTM: Differential Characterization by PTM Counting and Much More Jean L. Spencer, Vivek N. Bhatia, Amanuel Kehasse, Stephen A. Whelan, Christian F. Heckendorf, Catherine E. Costello and Mark E. McComb Cardiovascular Proteomics Center, Center for Biomedical Mass Spectrometry, Boston University School of Medicine, Boston, MA 02118 Overview Purpose: Software tool required to perform fast and easy characterization of global PTM changes in large proteomics data sets from LC-MS/MS experiments. Methods: In-house application (STRAP PTM) developed to use spectral counting and novel scoring algorithm to identify and rank differential PTMs. Results: Differential PTMs readily detected in diverse data sets and easily sorted for biologically relevant modifications. Introduction Protein post-translational modifications (PTMs) play significant roles in disease pathology, and the identification of PTMs is an increasingly important component of proteomics and biomarker discovery. There are few tools for performing relatively fast and easy characterization of global PTM changes in large proteomics data sets and differential comparison of PTMs across groups. A software program called the Software Tool for Rapid Annotation of Proteins: Post-Translational Modification edition (STRAP PTM) was developed in response to this challenge. STRAP PTM uses a novel counting- based scoring algorithm that facilitates multi-sample PTM comparisons through collation and visualization. Here we explore the utility of STRAP PTM through various scoring factors and across different PTM/proteomics experiments. Methods STRAP PTM: PTM Counting Workflow Setup Window - Enter protXML data files. - Select peptide/protein parameters. - Download PTM database. - Enter protein sequence database. - Select PTM score factors. - Execute analysis. Results Windows Methods STRAP PTM: PTM Scoring PTM Score (S): Overall score for a specific PTM (m) on a specific site (i) of a specific protein (p) based on user-selectable factors relevant to the system (max value = 100) Quality (Q): Goodness of database search results assigned to the MS/MS spectrum for a specific PTM on a specific site of a specific protein (max value = 1) Grouping (G): Variation of a specific PTM on a specific site of a specific protein across groups (max value = 1) Occupancy (W): Degree of modification of a specific site of a specific protein with a specific PTM (max value = 1) Uniqueness (U): Rarity of a specific PTM on a specific protein (max value = 1) Results Example 2: EGFR Phosphorylation PTM Map (phosphorylated sites on EGFR) Legend: Control ATP EGF MRPSGT ∙∙∙ WMIDADSRPK FRELIIEFSK MARDPQRYLV IQGDERMHLP SPTDSNFYRA LMDEEDMDDV VDADEYLIPQ QGFFSSPSTS RTPLLSSLSA TSNNSTVACI DRNGLQSCPI KEDSFLQRYS SDPTGALTED SIDDTFLPVP EYINQSVPKR PAGSVQNPVY HNQPLNPAPS RDPHYQDPHS TAVGNPEYLN TVQPTCVNST FDSPAHWAQK GSHQISLDNP DYQQDFFPKE AKPNGIFKGS TAENAEYLRV APQSSEFIGA PTM Scoring (all PTMs) Sites involved in cellular signaling processes (Y998, Y1110, Y1172, Y1197) ranked highest. Distinct patterns observed at two sites (Y998, Y1172) for ATP and EGF stimulation. Label-Free Comparison (Progenesis LC-MS) Results Example 3: CD40L In Vitro Oxidation PTM Map (all modified sites on CD40L) Legend: 1 μM 5 μM 20 μM 50 μM MENSFEMQKG DQNPQIAAHV ISEASSKTTS VLQWAEKGYY TMSNNLVTLE NGKQLTVKRQ GLYYIYAQVT FCSNREASSQ APFIASLCLK SPGRFERILL RAANTHSSAK PCGQQSIHLG GVFELQPGAS VFVNVTDPSQ VSHGTGFTSF GLLKL PTM Scoring (all PTMs) Sites involved in CD40L structural functionality (Y39, Y40, C88) ranked high. Redox-sensitive residues observed for two nitrotyrosines (Y39, Y40) associated with receptor binding. Conclusions STRAP PTM is a powerful counting approach for ranking differential PTMs in large MS-based proteomics data sets. STRAP PTM applies a novel multi-component score to PTMs with results clearly visualized in interactive tables and maps. STRAP PTM software is easy to implement on a PC and provides fast turnaround for large data sets. STRAP PTM produces semi-quantitative results with trends substantiated by label-free analysis. Acknowledgments NIH-NHLBI contract HHSN268201000031C NIH grants P41 RR010888/GM104603 and S10 RR020946 10 7 10 6 10 5 10 4 10 3 EGFR Epidermal growth factor receptor HCLE cells Human corneal-limbal epithelial cells Control + ATP + EGF 100 μM 5 nM Stimulation (5 min) Cell extraction Lysis / digestion with trypsin Phosphoenrichment IP / IMAC Increasing peroxynitrite 0 1 5 20 50 Conc (μM) CD40L fragment R&D Systems P = probability of modified peptides P 0 = probability of unmodified peptides = G = 0/2.1 = 0 G = 2.1/2.1 = 1 Group A Group B Greater variation = = + M = total PTMs N = modified peptide counts N 0 = unmodified peptide counts = × × × × Quality Grouping Uniqueness Occupancy User-selectable factors =− = = = I = total sites M = total PTMs N = modified peptide counts σ = std dev of counts max σ = max σ of all proteins = (Y1110) H 2 PO 3 (Y1197) H 2 PO 3 (Y998) H 2 PO 3 (Y1172) H 2 PO 3 Results Example 1: PTM Standards in Plasma PTM Map (modified sites on peptide standards) Legend: A (125 nM) B (62 nM) C (25 nM) 280 SYSMEHFRWG 412 RPKPQQFFGL M 580 EMFTYICNHI K 301 DRVYIHPF 500 EAISPPDAAS AAPLR 321 DRVYIHPFHL 532 DFNKFHTFPQ TAIGV PTM Scoring (top 7 PTMs) All PTM standards ranked at top of list. Correct trend of decreasing counts observed for all PTMs. Two potential sites (S4, S10) indicated for glycosylation on Std #500; ranking determined by average PTM score (30.7). Label-Free Comparison (Progenesis LC-MS) (Y39) NO 2 (C88) SO 3 H PTM Standards Protea Biosciences Digested proteins from mouse plasma (S) GalNAc (C) NO (M) OMe (Y) NO 2 (M) O (Y) H 2 PO 3 (K) Ac (Y) SO 3 H C A Decreasing conc 125 62 25 Conc (nM) B (M4) O (Y4) NO 2 (S4) GalNAc (S10) GalNAc (K4) Ac (M11) OMe (C7) NO (Y4) H 2 PO 3 Protein Summary PTM Map Global PTMs Peptide Summary PTM Overview W = 1/6 = 0.17 W = 2/6 = 0.33 Greater modification U = 1 - 1/4 = 0.75 U = 1 - 3/4 = 0.25 Less frequent 0.2 0.8 0.8 0.5 1.0 1.0 Probability Q = = 0.7 2.0/2 2.1/3 Q = = 0.2 2.0/2 0.2/1 Better results MS/MS Data Database Search Engine TPP ProtXML Mascot Data (for example) Control Group Spectra Disease Group Spectra 1 o Results 1° Results 2 o Results 2 o Results STRAP PTM Phosphorylated Tyrosines Normalized Abundance (Y40) NO 2 Normalized Abundance PTM Standards 10 7 10 6 10 5 10 4 Std # * * *

Transcript of STRAP PTM: Differential Characterization by PTM Counting ... · STRAP PTM: Differential...

Page 1: STRAP PTM: Differential Characterization by PTM Counting ... · STRAP PTM: Differential Characterization by PTM Counting and Much More Jean L. Spencer, Vivek N. Bhatia, Amanuel Kehasse,

STRAP PTM: Differential Characterization by PTM Counting and Much More

Jean L. Spencer, Vivek N. Bhatia, Amanuel Kehasse, Stephen A. Whelan, Christian F. Heckendorf,

Catherine E. Costello and Mark E. McComb

Cardiovascular Proteomics Center, Center for Biomedical Mass Spectrometry, Boston University School of Medicine, Boston, MA 02118

Overview

Purpose: Software tool required to perform fast and easy

characterization of global PTM changes in large proteomics

data sets from LC-MS/MS experiments.

Methods: In-house application (STRAP PTM) developed to

use spectral counting and novel scoring algorithm to identify

and rank differential PTMs.

Results: Differential PTMs readily detected in diverse data

sets and easily sorted for biologically relevant modifications.

Introduction

Protein post-translational modifications (PTMs) play significant

roles in disease pathology, and the identification of PTMs is an

increasingly important component of proteomics and biomarker

discovery. There are few tools for performing relatively fast and

easy characterization of global PTM changes in large

proteomics data sets and differential comparison of PTMs

across groups. A software program called the Software Tool

for Rapid Annotation of Proteins: Post-Translational

Modification edition (STRAP PTM) was developed in

response to this challenge. STRAP PTM uses a novel counting-

based scoring algorithm that facilitates multi-sample PTM

comparisons through collation and visualization. Here we

explore the utility of STRAP PTM through various scoring

factors and across different PTM/proteomics experiments.

Methods

STRAP PTM: PTM Counting

Workflow

Setup Window

- Enter protXML data files.

- Select peptide/protein parameters.

- Download PTM database.

- Enter protein sequence database.

- Select PTM score factors.

- Execute analysis.

Results Windows

Methods

STRAP PTM: PTM Scoring

PTM Score (S): Overall score for a specific PTM (m) on a

specific site (i) of a specific protein (p) based on user-selectable

factors relevant to the system (max value = 100)

Quality (Q): Goodness of database search results assigned

to the MS/MS spectrum for a specific PTM on a specific site

of a specific protein (max value = 1)

Grouping (G): Variation of a specific PTM on a specific site

of a specific protein across groups (max value = 1)

Occupancy (W): Degree of modification of a specific site of

a specific protein with a specific PTM (max value = 1)

Uniqueness (U): Rarity of a specific PTM on a specific

protein (max value = 1)

Results

Example 2: EGFR Phosphorylation

PTM Map (phosphorylated sites on EGFR)

Legend: Control ATP EGF

MRPSGT ∙∙∙ WMIDADSRPK FRELIIEFSK MARDPQRYLV IQGDERMHLP SPTDSNFYRA LMDEEDMDDV VDADEYLIPQ QGFFSSPSTS RTPLLSSLSA TSNNSTVACI DRNGLQSCPI KEDSFLQRYS SDPTGALTED SIDDTFLPVP EYINQSVPKR PAGSVQNPVY HNQPLNPAPS RDPHYQDPHS TAVGNPEYLN TVQPTCVNST FDSPAHWAQK GSHQISLDNP DYQQDFFPKE AKPNGIFKGS TAENAEYLRV APQSSEFIGA

PTM Scoring (all PTMs)

Sites involved in cellular signaling processes (Y998, Y1110,

Y1172, Y1197) ranked highest.

Distinct patterns observed at two sites (Y998, Y1172) for ATP

and EGF stimulation.

Label-Free Comparison (Progenesis LC-MS)

Results

Example 3: CD40L In Vitro Oxidation

PTM Map (all modified sites on CD40L)

Legend: 1 µM 5 µM 20 µM 50 µM

MENSFEMQKG DQNPQIAAHV ISEASSKTTS VLQWAEKGYY TMSNNLVTLE NGKQLTVKRQ GLYYIYAQVT FCSNREASSQ APFIASLCLK SPGRFERILL RAANTHSSAK PCGQQSIHLG GVFELQPGAS VFVNVTDPSQ VSHGTGFTSF GLLKL

PTM Scoring (all PTMs)

Sites involved in CD40L structural functionality (Y39, Y40,

C88) ranked high.

Redox-sensitive residues observed for two nitrotyrosines

(Y39, Y40) associated with receptor binding.

Conclusions

STRAP PTM is a powerful counting approach for ranking

differential PTMs in large MS-based proteomics data sets.

STRAP PTM applies a novel multi-component score to PTMs

with results clearly visualized in interactive tables and maps.

STRAP PTM software is easy to implement on a PC and

provides fast turnaround for large data sets.

STRAP PTM produces semi-quantitative results with trends

substantiated by label-free analysis.

Acknowledgments

NIH-NHLBI contract HHSN268201000031C

NIH grants P41 RR010888/GM104603 and S10 RR020946

107

106

105

104

103

EGFR Epidermal growth

factor receptor

HCLE cells Human corneal-limbal

epithelial cells

Control + ATP + EGF

100 μM 5 nM

Stimulation (5 min)

Cell extraction Lysis / digestion with trypsin

Phosphoenrichment IP / IMAC

Increasing peroxynitrite

0 1 5 20 50

Conc (µM)

CD40L

fragment

R&D Systems

P = probability of modified peptides

P

0 = probability of unmodified peptides 𝐐𝒎𝒊𝒑 = 𝐏𝒎𝒊𝒑

𝐏𝟎𝒊𝒑

G = 0/2.1 = 0

G = 2.1/2.1 = 1

Group A Group B

Greater

variation

𝐖𝒎𝒊𝒑 =𝐍𝒎𝒊𝒑

𝐍𝒎𝒊𝒑𝑴𝒎=𝟏 + 𝐍𝟎𝒊𝒑

M = total PTMs

N = modified peptide counts

N0 = unmodified peptide counts

𝐒𝒎𝒊𝒑 = 𝟏𝟎𝟎 × 𝐐𝒎𝒊𝒑 × 𝐆𝒎𝒊𝒑 × 𝐖𝒎𝒊𝒑 × 𝐔𝒎𝒑

Quality Grouping Uniqueness Occupancy

User-selectable factors

𝐔𝒎𝒑 = 𝟏 − 𝐍𝒎𝒊𝒑𝑰𝒊=𝟏

𝐍𝒎𝒊𝒑𝑰𝒊=𝟏

𝑴𝒎=𝟏

I = total sites

M = total PTMs

N = modified peptide counts

σ = std dev of counts

max σ = max σ of all proteins 𝐆𝒎𝒊𝒑 =𝛔𝒎𝒊𝒑

𝐦𝐚𝐱 𝛔

(Y1110) H2PO3 (Y1197) H2PO3 (Y998) H2PO3 (Y1172) H2PO3

Results

Example 1: PTM Standards in Plasma

PTM Map (modified sites on peptide standards)

Legend: A (125 nM) B (62 nM) C (25 nM)

280 SYSMEHFRWG

412 RPKPQQFFGL M

580 EMFTYICNHI K

301 DRVYIHPF

500 EAISPPDAAS AAPLR

321 DRVYIHPFHL

532 DFNKFHTFPQ TAIGV

PTM Scoring (top 7 PTMs)

All PTM standards ranked at top of list.

Correct trend of decreasing counts observed for all PTMs.

Two potential sites (S4, S10) indicated for glycosylation on

Std #500; ranking determined by average PTM score (30.7).

Label-Free Comparison (Progenesis LC-MS)

(Y39) NO2 (C88) SO3H

PTM Standards Protea Biosciences

Digested proteins

from mouse plasma

(S) GalNAc

(C) NO

(M) OMe

(Y) NO2

(M) O

(Y) H2PO3

(K) Ac

(Y) SO3H

C A

Decreasing conc

125 62 25

Conc (nM)

B

(M4) O

(Y4) NO2

(S4) GalNAc

(S10) GalNAc

(K4) Ac

(M11) OMe

(C7) NO

(Y4) H2PO3

Protein Summary PTM Map

Global PTMs

Peptide Summary

PTM Overview

W = 1/6 = 0.17

W = 2/6 = 0.33 Greater modification

U = 1 - 1/4 = 0.75

U = 1 - 3/4 = 0.25

Less frequent

0.2

0.8

0.8

0.5

1.0

1.0

Probability

Q = = 0.7 2.0/2

2.1/3

Q = = 0.2 2.0/2

0.2/1

Better results

MS/MS Data

Database

Search

Engine

TPP ProtXML Mascot Data

(for example)

Control Group

Spectra

Disease Group

Spectra

1o Results

1° Results

2o Results

2o Results

STRAP PTM

Phosphorylated Tyrosines N

orm

ali

zed

Ab

un

dan

ce

(Y40) NO2

No

rmali

zed

Ab

un

dan

ce

PTM Standards

107

106

105

104

Std #

*

*

*