Proteomics & Bioinformatics: A Canadian Perspective

78
Proteomics & Bioinformatics Part II David Wishart 3-41 Athabasca Hall [email protected]

Transcript of Proteomics & Bioinformatics: A Canadian Perspective

Page 1: Proteomics & Bioinformatics: A Canadian Perspective

Proteomics & Bioinformatics Part II

David Wishart3-41 Athabasca Hall

[email protected]

Page 2: Proteomics & Bioinformatics: A Canadian Perspective

3 Kinds of Proteomics*• Structural Proteomics

– High throughput X-ray Crystallography/Modelling– High throughput NMR Spectroscopy/Modelling

• Expressional or Analytical Proteomics– Electrophoresis, Protein Chips, DNA Chips, 2D-HPLC– Mass Spectrometry, Microsequencing

• Functional or Interaction Proteomics– HT Functional Assays, Ligand Chips– Yeast 2-hybrid, Deletion Analysis, Motif Analysis

Page 3: Proteomics & Bioinformatics: A Canadian Perspective

Historically...

• Most of the past 100 years of biochemistry has focused on the analysis of small molecules (i.e. metabolism and metabolic pathways)

• These studies have revealed much about the processes and pathways for about 400 metabolites which can be summarized with this...

Page 4: Proteomics & Bioinformatics: A Canadian Perspective
Page 5: Proteomics & Bioinformatics: A Canadian Perspective

More Recently...

• Molecular biologists and biochemists have focused on the analysis of larger molecules (proteins and genes) which are much more complex and much more numerous

• These studies have primarily focused on identifying and cataloging these molecules (Human Genome Project)

Page 6: Proteomics & Bioinformatics: A Canadian Perspective

Nature’s Parts Warehouse

The protein universe

Living cells

Page 7: Proteomics & Bioinformatics: A Canadian Perspective

The Protein Parts List

Page 8: Proteomics & Bioinformatics: A Canadian Perspective

However...

• This cataloging (which consumes most of bioinformatics) has been derogatively referred to as “stamp collecting”

• Having a collection of parts and names doesn’t tell you how to put something together or how things connect -- this is biology

Page 9: Proteomics & Bioinformatics: A Canadian Perspective

Remember: Proteins Interact*

Page 10: Proteomics & Bioinformatics: A Canadian Perspective

Proteins Assemble*

Page 11: Proteomics & Bioinformatics: A Canadian Perspective

For the Past 10 Years...• Scientists have increasingly focused on

“signal transduction” and transient protein interactions

• New techniques have been developed which reveal which proteins and which parts of proteins are important for interaction

• The hope is to get something like this..

Page 12: Proteomics & Bioinformatics: A Canadian Perspective
Page 13: Proteomics & Bioinformatics: A Canadian Perspective

Protein Interaction Tools and Techniques -

Experimental Methods

Page 14: Proteomics & Bioinformatics: A Canadian Perspective

3D Structure Determination*• X-ray crystallography

– grow crystal– collect diffract. data– calculate e- density– trace chain

• NMR spectroscopy– label protein– collect NMR spectra– assign spectra & NOEs– calculate structure using

distance geom.

Page 15: Proteomics & Bioinformatics: A Canadian Perspective

Quaternary Structure

Some interactionsare real

Others are not

Page 16: Proteomics & Bioinformatics: A Canadian Perspective

Protein Interaction Domains*

http://pawsonlab.mshri.on.ca/ 82 domains

Page 17: Proteomics & Bioinformatics: A Canadian Perspective

Protein Interaction Domains

http://pawsonlab.mshri.on.ca/

Page 18: Proteomics & Bioinformatics: A Canadian Perspective

Yeast Two-Hybrid Analysis*• Yeast two-hybrid

experiments yield information on protein protein interactions

• GAL4 Binding Domain• GAL4 Activation Domain• X and Y are two proteins of

interest• If X & Y interact then

reporter gene is expressed

Page 19: Proteomics & Bioinformatics: A Canadian Perspective

Invitrogen Yeast 2-Hybrid

LexA

lacZLexA

X

Y

Y

B42

B42

lacZ

lacZLexA

X

Page 20: Proteomics & Bioinformatics: A Canadian Perspective

Example of 2-Hybrid Analysis*

• Uetz P. et al., “A Comprehensive Analysis of Protein-Protein Interactions in Saccharomyces cerevisiae” Nature 403:623-627 (2000)

• High Throughput Yeast 2 Hybrid Analysis• 957 putative interactions• 1004 of 6000 predicted proteins involved

Page 21: Proteomics & Bioinformatics: A Canadian Perspective

Example of 2-Hybrid Analysis

• Rain JC. et al., “The protein-protein interaction map of Helicobacter pylori” Nature 409:211-215 (2001)

• High Throughput Yeast 2 Hybrid Analysis• 261 H. pylori proteins scanned against genome• >1200 putative interactions identified• Connects >45% of the H. pylori proteome

Page 22: Proteomics & Bioinformatics: A Canadian Perspective

Another Way?*• Ho Y, Gruhler A, et al. Systematic identification

of protein complexes in Saccharomyces cerevisiae by mass spectrometry. Nature 415:180-183 (2002)

• High Throughput Mass Spectral Protein Complex Identification (HMS-PCI)

• 10% of yeast proteins used as “bait”• 3617 associated proteins identified• 3 fold higher sensitivity than yeast 2-hybrid

Page 23: Proteomics & Bioinformatics: A Canadian Perspective

Affinity Pull-down*

Page 24: Proteomics & Bioinformatics: A Canadian Perspective

HMS-PCI*

Page 25: Proteomics & Bioinformatics: A Canadian Perspective
Page 26: Proteomics & Bioinformatics: A Canadian Perspective

Synthetic Genetic Interactions*

• Two mutations are synthetically lethal if cells with either of the single mutations are viable but cells with both mutations are non-viable

• Two types of synthetic lethal genetic interactions (lethal, slow growth)

• Mate two mutants without phenotypes to get a daughter cell with a phenotype

• Genetic interactions provide functional data on protein interactions or redundant genes

• About 23% of known SLs (1295 - YPD+MIPS) are known protein interactions in yeast

Page 27: Proteomics & Bioinformatics: A Canadian Perspective

Synthetic Lethality*

Page 28: Proteomics & Bioinformatics: A Canadian Perspective

Cell PolarityCell Wall Maintenance Cell StructureMitosisChromosome StructureDNA Synthesis DNA RepairUnknownOthers

Synthetic Genetic Interactions in Yeast

Page 29: Proteomics & Bioinformatics: A Canadian Perspective

Protein Chips*

Antibody Array Antigen Array Ligand Array

Detection by: SELDI MS, fluorescence, SPR, electrochemical, radioactivity, microcantelever

Page 30: Proteomics & Bioinformatics: A Canadian Perspective

Protein (Antigen) Chips

His6

GST

ORF

Nickel coating

H Zhu, J Klemic, S Chang, P Bertone, A Casamayor, K Klemic, D Smith, M Gerstein, M Reed, & M Snyder (2000).Analysis of yeast protein kinases using protein chips. Nature Genetics 26: 283-289

Page 31: Proteomics & Bioinformatics: A Canadian Perspective

Protein (Antigen) Chips

Nickel coating

Page 32: Proteomics & Bioinformatics: A Canadian Perspective

Arraying Process

Page 33: Proteomics & Bioinformatics: A Canadian Perspective

Probe with anti-GST Mab

Nickel coating

Page 34: Proteomics & Bioinformatics: A Canadian Perspective

Anti-GST Probe

Page 35: Proteomics & Bioinformatics: A Canadian Perspective

Probe with Cy3-labeled Calmodulin

Nickel coating

Page 36: Proteomics & Bioinformatics: A Canadian Perspective

“Functional” Protein Array*

Nickel coating

Page 37: Proteomics & Bioinformatics: A Canadian Perspective

Antigen Array (ELISA Chip)*

Mezzasoma et al. Clinical Chem. 48:121 (2002)

Page 38: Proteomics & Bioinformatics: A Canadian Perspective

Diagnostic Antigen Array

Page 39: Proteomics & Bioinformatics: A Canadian Perspective

Protein Chips

Antibody Array Antigen Array Ligand Array

Page 40: Proteomics & Bioinformatics: A Canadian Perspective

Ciphergen “Ligand” Chips*• Hydrophobic (C8) Arrays

• Hydrophilic (SiO2) Arrays• Anion exchange Arrays• Cation exchange Arrays• Immobilized Metal Affinity

(NTA-nitroloacetic acid) Arrays

• Epoxy Surface (amine and thiol binding) Arrays

Page 41: Proteomics & Bioinformatics: A Canadian Perspective

Ciphergen (BioRad) ProteinChip*

Page 42: Proteomics & Bioinformatics: A Canadian Perspective

Peptide/Protein ProfileE. coli

Salmonella

Page 43: Proteomics & Bioinformatics: A Canadian Perspective

Protein Interaction Tools and Techniques -

Computational Methods

Page 44: Proteomics & Bioinformatics: A Canadian Perspective

Sequence Searching Against Known Domains*

http://pawsonlab.mshri.on.ca/

Page 45: Proteomics & Bioinformatics: A Canadian Perspective

Motif Searching Using Known Motifs

Page 46: Proteomics & Bioinformatics: A Canadian Perspective

Text Mining*• Searching Medline or Pubmed for

words or word combinations• “X binds to Y”; “X interacts with Y”;

“X associates with Y” etc. etc.• Requires a list of known gene names

or protein names for a given organism (a protein/gene thesaurus)

Page 47: Proteomics & Bioinformatics: A Canadian Perspective

iHOP (Information hyperlinked over proteins)

http://www.ihop-net.org/UniPub/iHOP/

Page 48: Proteomics & Bioinformatics: A Canadian Perspective

PolySearch*

http://wishart.biology.ualberta..ca/polysearch

Page 49: Proteomics & Bioinformatics: A Canadian Perspective

Rosetta Stone Method

Page 50: Proteomics & Bioinformatics: A Canadian Perspective

Interologs, Homologs, Paralogs*...• Homolog

– Common Ancestors– Common 3D Structure– Common Active Sites

• Ortholog– Derived from Speciation

• Paralog– Derived from Duplication

• Interolog– Protein-Protein Interaction

YM2

Page 51: Proteomics & Bioinformatics: A Canadian Perspective

Finding Interologs*• If A and B interact in organism X, then if

organism Y has a homolog of A (A’) and a homolog of B (B’) then A’ and B’ should interact too!

• Makes use of BLAST searches against entire proteome of well-studied organisms (yeast, E. coli)

• Requires list of known interacting partners

Page 52: Proteomics & Bioinformatics: A Canadian Perspective

A Flood of Data

• High throughput techniques are leading to more and more data on protein interactions

• This is where bioinformatics can play a key role

• Some suggest that this is the “future” for bioinformatics

Page 53: Proteomics & Bioinformatics: A Canadian Perspective

Interaction Databases• DIP

– http://dip.doe-mbi.ucla.edu/dip/Main.cgi

• MINT– http://mint.bio.uniroma2.it/mint/

• String– http://string.embl.de/

• IntAct– http://www.ebi.ac.uk/intact/

main.xhtml

Page 54: Proteomics & Bioinformatics: A Canadian Perspective

DIP Database of Interacting Proteins

http://dip.doe-mbi.ucla.edu/dip/Main.cgi

Page 55: Proteomics & Bioinformatics: A Canadian Perspective

DIP Query Page

CGPC

Page 56: Proteomics & Bioinformatics: A Canadian Perspective

DIP Results Page

click

Page 57: Proteomics & Bioinformatics: A Canadian Perspective

DIP Results Page

Page 58: Proteomics & Bioinformatics: A Canadian Perspective

MINT Molecular Interaction Database

http://mint.bio.uniroma2.it/mint/

Page 59: Proteomics & Bioinformatics: A Canadian Perspective

MINT Resultsclick

Page 60: Proteomics & Bioinformatics: A Canadian Perspective
Page 61: Proteomics & Bioinformatics: A Canadian Perspective

IntAct*

Page 62: Proteomics & Bioinformatics: A Canadian Perspective

IntAct

Page 63: Proteomics & Bioinformatics: A Canadian Perspective

KEGG Kyoto Encyclopedia of Genes and Genomes*

http://www.genome.ad.jp/kegg/kegg2.html

Page 64: Proteomics & Bioinformatics: A Canadian Perspective

KEGG

Page 65: Proteomics & Bioinformatics: A Canadian Perspective

KEGG

Page 66: Proteomics & Bioinformatics: A Canadian Perspective

TRANSPATH

http://www.gene-regulation.com/pub/databases.html

Page 67: Proteomics & Bioinformatics: A Canadian Perspective

BIOCARTA*

• www.biocarta.com• Go to “Pathways”• Web interactive links to many

signalling pathways and other eukaryotic protein-protein interactions

Page 68: Proteomics & Bioinformatics: A Canadian Perspective
Page 69: Proteomics & Bioinformatics: A Canadian Perspective

Visualizing Interactions

DIP

MINT

Page 70: Proteomics & Bioinformatics: A Canadian Perspective

Visualizing Interactions*

Cytoscape (www.cytoscape.org) Osprey http://biodata.mshri.on.ca/osprey/servlet/Index

Page 71: Proteomics & Bioinformatics: A Canadian Perspective

Pathway Visualization with BioCarta*

http://www.biocarta.com/genes/allpathways.asp

Page 72: Proteomics & Bioinformatics: A Canadian Perspective

Pathway Database Comparison*KEGG BioCyc GenMAPP Reactome BioCarta TransPATH

Organisms 181 (varied)

E.Coli, human (20

others)

Human, mouse, rat, fly, yeast

Human, rat, mouse,

chicken, fugu, zebrafish

Human, mouse

Human, mouse

Pathway types

Metabolic, genetic,

signaling,complexes

Metabolic, complexes

Metabolic, signaling,

complexes

Metabolic, signaling,

complexes

Metabolic, signaling,

complexes

Signaling, genetic

Tools/ viewing

linked to from many

Pathway Tools

GenMAPP PathView applets

none Pathway Builder

Images Static box flow

diagrams

Detailed flow

diagrams

Static box flow

diagrams

“starry sky” “Graphics rich” cell diagrams

Graphics rich cell

diagrams

Download Formats

KGMLXML

SBML

BioPaxSBML

MAPP format

SBMLMySQL

Just images

Propietary XML files

Page 73: Proteomics & Bioinformatics: A Canadian Perspective

Other Databases

http://www.imb-jena.de/jcb/ppi/jcb_ppi_databases.html

Page 74: Proteomics & Bioinformatics: A Canadian Perspective

Functional Proteomics

• Mixture of experimental and computational techniques

• Trying to reach a point where functions and interactions can be predicted and modelled

• The future of proteomics (and bioinformatics)

Page 75: Proteomics & Bioinformatics: A Canadian Perspective

Final Exam• Short answer to long answer format• Bring calculators• Typically one question from each of the

lectures in the last ½ of the course• Some questions/answers will involve recall• Most questions require analysis or some

thinking or explaining• Dec. 13, 9:00 am - 2 hours not 3 hours• This room, M-229

Page 76: Proteomics & Bioinformatics: A Canadian Perspective

Typical Questions• What is the correlation between protein

expression and transcript expression? Provide three reasons to explain the difference

• Describe the algorithm or diagram a flow chart for XXXXX

• Explain the differences and similarities between functional proteomics and structural proteomics

Page 77: Proteomics & Bioinformatics: A Canadian Perspective

Typical Questions• Here is some YYYY data from some XXXX

experiment – interpret it and explain what it means

• Explain the difference between the XXX algorithm and the YYY algorithm. Give some examples or provide an illustration

• Here are two small molecules, calculate their difference distance matrix, show calculations. What is the difference between the two?

Page 78: Proteomics & Bioinformatics: A Canadian Perspective

Typical Questions• Define normalization. Provide 3

examples. Show equations or algorithms• What are the three different kinds of

proteomics, compare and contrast• Show the equations and explain the

algorithm you would use to rotate, expand and translate this small molecule