Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key...

76
Lecture 3.3 1 Superposition & Threading Gary Van Domselaar University of Alberta [email protected] Slides adapted from David Wishart

Transcript of Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key...

Page 1: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 1

Superposition & Threading†

Gary Van DomselaarUniversity of Alberta

[email protected]

†Slides adapted from David Wishart

Page 2: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 2

Outline

• Vectors, matrices and other geometry issues

• General Superposition concepts

• Threading and threading methods

Page 3: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 3

Vectors Define Bonds and Atomic Positions

x

y

z H3N+

O

ORH

Origin

CO bond

Page 4: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 4

Review - Vectors

(1,2,1)

(0,0,0)

u

u = 1i + 2j + 1k^ ^ ^

u =

121

= (1-0)2 + (2-0)2 + (1-0)2 = 6u

Vectors have a length & a direction

x

y

z

Page 5: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 5

Review - Vectors

• Vectors can be added together

• Vectors can be subtracted

• Vectors can be multiplied (dot or cross or by a matrix)

• Vectors can be transformed (resized)

• Vectors can be translated

• Vectors can be rotated

Page 6: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 6

Matrices

• A matrix is a table or “array” of characters

• A matrix is also called a tensor of “rank 2”

2 4 6 8 9 41 3 5 7 9 31 0 1 0 1 09 4 6 4 3 53 4 3 4 3 4

row

colu

mn

A 5 x 6 Matrix

# co

lum

ns

# ro

ws

Page 7: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 7

Different Types of Matrices

2 4 6 8 9 41 3 5 7 9 31 0 1 0 1 09 4 6 4 3 53 4 3 4 3 43 6 7 9 1 0

2 4 6 8 9 44 3 5 7 9 36 5 1 0 1 08 7 0 4 3 59 9 1 3 3 44 3 0 5 4 0

135973

A squareMatrix

A symmetricMatrix

A columnMatrix

(A vector)

Page 8: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 8

Different Types of Matrices

A B C D E FG H I J K LM N O P Q RS T U V W X

cosθ sinθ 0

sinθ -cosθ 0

0 0 1

A rectangularMatrix

A rotationMatrix

A rowMatrix

(A vector)

2 4 6 8 9

Page 9: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 9

Review - Matrix Multiplication

2 4 01 3 11 0 0

1 0 22 1 30 1 0

2x1 + 4x2 + 0x02x0 + 4x1 + 0x12x2 + 4x3 + 0x01x1 + 3x2 + 1x01x0 + 3x1 + 1x11x2 + 3x3 + 1x01x1 + 0x2 + 0x01x0 + 0x1 + 0x11x2 + 0x3 + 0x0

x10 4 16 7 4 11 1 0 0

Page 10: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 10

Rotation

1 0 0

0 cosθ sinθ 0 -sinθ cosθ

cosφ sinφ 0-sinφ cosφ 0 0 0 1

Rotateabout x

Rotateabout z

θ

φx

z

y

Page 11: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 11

Rotation

1 0 0

0 cosθ sinθ 0 -sinθ cosθ

cosφ sinφ 0-sinφ cosφ 0 0 0 1

Clockwise about x Clockwise about z

1 0 0

0 cosθ -sinθ 0 sinθ cosθ

cosφ -sinφ 0 sinφ cosφ 0 0 0 1

Counterclockwise about x Counterclockwise about z

Page 12: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 12

Rotation

X =

X =

x

y

z

x

y

z

1 0 0

0 cosθ sinθ 0 -sinθ cosθ

1 0 0

0 cosθ sinθ 0 -sinθ cosθ

Page 13: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 13

Rotation (Detail)

X =

x

y

z

x

y

z

= 1 cosθ + sinθ-sinθ + cosθ

111

1 0 0

0 cosθ sinθ 0 -sinθ cosθ

1 0 0

0 cosθ sinθ 0 -sinθ cosθ

Page 14: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 14

Superposition

• Objective is to match or overlay 2 or more similar objects

• Requires use of translation and rotation operators (matrices/vectors)

• Recall that very three dimensional object can be represented by a plane defined by 3 points

Page 15: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 15

Superposition

x

y

z

a

b

c

a’b’

c’

x

y

z

a

b

c

a’b’

c’

Identify 3 “equivalence” points in objects to be aligned

Page 16: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 16

b’

c’

Superposition

x

y

z

x

y

z

a

b

c

a’b’

c’

a

b

c

Translate points a,b,c and a’,b’,c’ to origin

Page 17: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 17

b’

c’

Superposition

x

y

z

a

b

c

b’

c’

x

y

z

θ a

b

c

Rotate the a,b,c plane clockwise by θ about x axis

Page 18: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 18

Superposition

b’

c’

x

y

z

a

b

c

b’

c’

x

y

z

a

bc

φ φ

Rotate the a,b,c plane clockwise by φ about z axis

Page 19: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 19

Superposition

b’

c’

x

y

z

a

bc

b’

c’

x

y

z

a

bc

ψ

Rotate the a,b,c plane clockwise by ψ about x axis

Page 20: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 20

Superposition

b’

c’

x

y

z

a

bc

b’

c’

x

y

z

a

bc

θ ’

Rotate the a’,b’,c’ plane anticlockwise by θ ’ about x axis

Page 21: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 21

Superposition

b’

c’

x

y

z

a

bc

b’c’

x

y

z

a

bc

φ ‘

Rotate the a’,b’,c’ plane anticlockwise by φ ’ about z axis

Page 22: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 22

Superposition

b’c’

x

y

z

a

bc

Rotate the a’,b’,c’ plane clockwise by ψ ’ about x axis

b’c’

x

y

z

a

bc

ψ ’

Page 23: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 23

Superposition

Apply all rotations and translations to remaining points

b’c’

x

y

z

a

bcb’c’

x

y

z

a

bc

Page 24: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 24

Superposition

Before After

b’c’

x

y

z

a

bcx

y

z

a

b

c

a’b’

c’

Page 25: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 25

Returning to the “red” frame

Before After

y

z

x

b’c’

x

y

z

a

bc

a

b

c

Page 26: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 26

Returning to the “red” frame

• Begin with the superimposed structures on the x-y plane

• Apply counterclockwise rot. By ψ• Apply counterclockwise rot. By φ• Apply counterclockwise rot. By θ• Apply red translation to red origin

Just do things in reverse order!

Page 27: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 27

Superposition - Applications

• Ideal for comparing or overlaying two or more protein structures

• Allows identification of structural homologues (CATH and SCOP)

• Allows loops to be inserted or replaced from loop libraries (comparative modelling)

• Allows side chains to be replaced or inserted with relative ease

Page 28: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 28

Side Chain Placement

http://www.fccc.edu/research/labs/dunbrack/scwrl/

SCWRL

Page 29: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 29

CCOOHH2N

H

NH3+

Amino Acid Side Chains

Page 30: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 30

Adding a Side Chain

x

y

z

x

y

z

x

y

z

Page 31: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 31

Adding a Side Chain

x

y

z

x

y

z

y

Page 32: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 32

Adding a Side Chain

x

y

z

x

y

z

y

Page 33: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 33

Adding a Side Chain

x

y

z

x

y

z

y

Page 34: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 34

Adding a Side Chain

x

y

z

x

y

z

y

Page 35: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 35

Superposition• The concept of superposition is key to

many aspects of protein structure generation and comparison

• Superposition may be used to insert side chains and loops (for homology models)

• Side chains require more consideration as side chain packing ultimately determines the 3D structure of proteins

Page 36: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 36

Superposition - RMSD• The degree of similarity between two or

more structures is described by its average root mean square deviation (RMSD):

x1

RMSD N;x ,y =∑i=1N

∥xi−yi∥2

N

x1

x5

x4

x3

x2

y1

y2

y3 y

4

y5

0"

0.3"

0.8 "

0.7"

1"

Page 37: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 37

Superposition Software

• Swiss PDB Viewer– Aligns 2

homologous structures

Page 38: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 38

Superposition Software• CE: Structure Comparison by

Combinatorial Extension

• http://cl.sdsc.edu/ce.html

• Superposition for 2 chains and for multiple chains (new)

Page 39: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 39

Superposition Software• SuperPose

http://wishart.biology.ualberta.ca/SuperPose/

• Superposition for 2 chains and for multiple chains

• Subdomain superposition

• Superposition of structures with low sequence identity

Page 40: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 40

Definition

• Threading - A protein fold recognition technique that involves incrementally replacing the sequence of a known protein structure with a query sequence of unknown structure. The new “model” structure is evaluated using a simple heuristic measure of protein fold quality. The process is repeated against all known 3D structures until an optimal fit is found.

Page 41: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 41

Why Threading?

• Secondary structure is more conserved than primary structure

• Tertiary structure is more conserved than secondary structure

• Therefore very remote relationships can be better detected through 2o or 3o structural homology instead of sequence homology

Page 42: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 42

Visualizing Threading

TH

READ

THREADINGSEQNCEECNQESGNIERHTHREADINGSEQNCETHREADGSEQNCEQCQESGIDAERTHR...

Page 43: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 43

Visualizing Threading

TH

RE

THREADINGSEQNCEECNQESGNIERHTHREADINGSEQNCETHREADGSEQNCEQCQESGIDAERTHR...

Page 44: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 44

Visualizing Threading

TH

THREADINGSEQNCEECNQESGNIERHTHREADINGSEQNCETHREADGSEQNCEQCQESGIDAERTHR...

Page 45: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 45

Visualizing Threading

THREADINGSEQNCEECNQESGNIERHTHREADINGSEQNCETHREADGSEQNCEQCQESGIDAERTHR...

Page 46: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 46

Visualizing ThreadingTHREAD..SEQNCEECN..

Page 47: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 47

Threading• Database of 3D structures and sequences

– Protein Data Bank (or non-redundant subset)

• Query sequence– Sequence < 25% identity to known structures

• Alignment protocol– Dynamic programming

• Evaluation protocol– Distance-based potential or secondary structure

• Ranking protocol

Page 48: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 48

2 Kinds of Threading

• 2D Threading or Prediction Based Methods (PBM)– Predict secondary structure (SS) or ASA of query

– Evaluate on basis of SS and/or ASA matches

• 3D Threading or Distance Based Methods (DBM)– Create a 3D model of the structure

– Evaluate using a distance-based “hydrophobicity” or pseudo-thermodynamic potential

Page 49: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 49

2D Threading Algorithm

• Convert PDB to a database containing sequence, SS and ASA information

• Predict the SS and ASA for the query sequence using a “high-end” algorithm

• Perform a dynamic programming alignment using the query against the database (include sequence, SS & ASA)

• Rank the alignments and select the most probable fold

Page 50: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 50

Database Conversion>Protein1THREADINGSEQNCEECNQESGNIHHHHHHCCCCEEEEECCCHHHHHHERHTHREADINGSEQNCETHREADHHCCEEEEECCCCCHHHHHHHHHH

>Protein2QWETRYEWQEDFSHAECNQESGNIEEEEECCCCHHHHHHHHHHHHHHHYTREWQHGFDSASQWETRACCCCEEEEECCCEEEEECC

>Protein3LKHGMNSNWEDFSHAECNQESGEEECCEEEECCCEEECCCCCCC

Page 51: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 51

Secondary Structure

Structure Phi (Φ) Psi(Ψ)

Antiparallel β-sheet -139 +135Parallel β-Sheet -119 +113Right-handed α-helix +64 +40

310 helix -49 -26

π helix -57 -70Polyproline I -83 +158Polyproline II -78 +149Polyglycine II -80 +150

Phi & Psi angles for Regular Secondary Structure Conformations

Table 10

- -

Page 52: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 52

2o Structure Identification

• DSSP - Database of Secondary Structures for Proteins (swift.embl-heidelberg.de/dssp)

• VADAR - Volume Area Dihedral Angle Reporter (redpoll.pharmacy.ualberta.ca)

• PDB - Protein Data Bank (www.rcsb.org)

QHTAWCLTSEQHTAAVIWDCETPGKQNGAYQEDCAHHHHHHCCEEEEEEEEEEECCHHHHHHHCCCCCCC

Page 53: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 53

Accessible Surface Area

Solvent ProbeAccessible Surface

Van der Waals Surface

Reentrant Surface

Page 54: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 54

ASA Calculation• DSSP - Database of Secondary Structures for Proteins

(swift.embl-heidelberg.de/dssp)

• VADAR - Volume Area Dihedral Angle Reporter (www.redpoll.pharmacy.ualberta.ca/vadar/)

• GetArea - www.scsb.utmb.edu/getarea/area_form.html

QHTAWCLTSEQHTAAVIWDCETPGKQNGAYQEDCAMD BBPPBEEEEEPBPBPBPBBPEEEPBPEPEEEEEEEEE1056298799415251510478941496989999999

Page 55: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 55

Other ASA sites• Connolly Molecular Surface Home Page

– http://www.biohedron.com/

• Naccess Home Page – http://sjh.bi.umist.ac.uk/naccess.html

• ASA Parallelization– http://cmag.cit.nih.gov/Asa.htm

• Protein Structure Database – http://www.psc.edu/biomed/pages/research/PSdb/

Page 56: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 56

2D Threading Algorithm

• Convert PDB to a database containing sequence, SS and ASA information

• Predict the SS and ASA for the query sequence using a “high-end” algorithm

• Perform a dynamic programming alignment using the query against the database (include sequence, SS & ASA)

• Rank the alignments and select the most probable fold

Page 57: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 57

ASA Prediction

• PredictProtein-PHDacc (58%)– http://cubic.bioc.columbia.edu/predictprotein

• PredAcc (70%?)– condor.urbb.jussieu.fr/PredAccCfg.html

QHTAW... QHTAWCLTSEQHTAAVIWBBPPBEEEEEPBPBPBPB

Page 58: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 58

2D Threading Algorithm

• Convert PDB to a database containing sequence, SS and ASA information

• Predict the SS and ASA for the query sequence using a “high-end” algorithm

• Perform a dynamic programming alignment using the query against the database (include sequence, SS & ASA)

• Rank the alignments and select the most probable fold

Page 59: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 59

G E N E T I C S

| | | | * | |

G E N E S I S

G E N E T I C SG 10 0 0 0 0 0 0 0E 0 10 0 10 0 0 0 0N 0 0 10 0 0 0 0 0E 0 0 0 10 0 10 0 0S 0 0 0 0 0 0 0 10I 0 0 0 0 0 10 0 0S 0 0 0 0 0 0 0 10

G E N E T I C SG 60 40 30 20 20 0 10 0E 40 50 30 30 20 0 10 0N 30 30 40 20 20 0 10 0E 20 20 20 30 20 10 10 0S 20 20 20 20 20 0 10 10I 10 10 10 10 10 20 10 0S 0 0 0 0 0 0 0 10

Page 60: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 60

Sij (Identity Matrix) A C D E F G H I K L M N P Q R S T V W YA 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0C 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0D 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0E 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0F 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0G 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0H 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0I 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0K 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0L 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0M 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0N 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0P 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0Q 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0R 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0S 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0T 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0V 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0W 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0Y 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1

Page 61: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 61

A A T V DA 1VVD

A A T V DA 1 1 VVD

A A T V DA 1 1 0 0 0VVD

A A T V DA 1 1 0 0 0V 0VD

A A T V DA 1 1 0 0 0V 0 1 1VD

A A T V DA 1 1 0 0 0V 0 1 1 2VD

Page 62: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 62

A Simple Example...

A A T V DA 1 1 0 0 0V 0 1 1 2 1VD

A A T V DA 1 1 0 0 0V 0 1 1 2 1V 0 1 1 2 2D 0 1 1 1 3

A A T V DA 1 1 0 0 0V 0 1 1 2 1V 0 1 1 2 2D 0 1 1 1 3

A A T V D | | | |A - V V D

A A T V D | | | | A V V D

A A T V D | | | |A V - V D

Page 63: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 63

Let’s Include 2o info & ASA

H E CH 1 0 0E 0 1 0C 0 0 1

E P BE 1 0 0P 0 1 0B 0 0 1

Sij = k1Sij + k2Sij + k3Sijseq strc asatotal

Sijstrc Sij

asa

Page 64: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 64

A A T V DA 2VVD

A A T V DA 2 2 VVD

A A T V DA 2 2 1 0 0VVD

A A T V DA 2 2 1 0 0V 1VD

A A T V DA 2 2 1 0 0V 1 3 3VD

A A T V DA 2 2 1 0 0V 1 3 3 3VD

E E E C C E E E C C E E E C C

E E E C C E E E C C E E E C C

EECC

EECC

EECC

EECC

EECC

EECC

Page 65: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 65

A Simple Example...

A A T V DA 2 2 1 0 0V 1 3 3 3 2VD

A A T V DA 2 2 1 0 0V 1 3 3 3 2V 0 2 3 5 4D 0 2 3 4 7

A A T V DA 2 2 1 0 0V 1 3 3 3 2V 0 2 3 5 4D 0 2 3 4 7

E E E C C E E E C C E E E C C

EECC

EECC

EECC

A A T V D | | | |A - V V D

A A T V D | | | | A V V D

A A T V D | | | |A V - V D

Page 66: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 66

2D Threading Performance

• In test sets 2D threading methods can identify 30-40% of proteins having very remote homologues (i.e. not detected by BLAST) using “minimal” non-redundant databases (<700 proteins)

• If the database is expanded ~4x the performance jumps to 70-75%

• Performs best on true homologues as opposed to postulated analogues

Page 67: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 67

2D Threading Advantages

• Algorithm is easy to implement

• Algorithm is very fast (10x faster than 3D threading approaches)

• The 2D database is small (<500 kbytes) compared to 3D database (>1.5 Gbytes)

• Appears to be just as accurate as DBM or other 3D threading approaches

• Very amenable to web servers

Page 68: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 68

Servers - PredictProtein

Page 69: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 69

Servers - 123D

Page 70: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 70

Servers - GenThreader

Page 71: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 71

More Servers - www.bronco.ualberta.ca

Page 72: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 72

2D Threading Disadvantages

• Reliability is not 100% making most threading predictions suspect unless experimental evidence can be used to support the conclusion

• Does not produce a 3D model at the end of the process

• Doesn’t include all aspects of 2o and 3o structure features in prediction process

• PSI-BLAST may be just as good (faster too!)

Page 73: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 73

Making it Better

• Include 3D threading analysis as part of the 2D threading process -- offers another layer of information

• Include more information about the “coil” state (3-state prediction isn’t good enough)

• Include other biochemical (ligands, function, binding partners, motifs) or phylogenetic (origin, species) information

Page 74: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 74

3D Threading Servers

• Generate 3D models or coordinates of possible models based on input sequence

• Loopp (version 2) – http://ser-loopp.tc.cornell.edu/loopp.html

• 3D-PSSM– http://www.sbg.bio.ic.ac.uk/~3dpssm/

• All require email addresses since the process may take hours to complete

Page 75: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 75

Page 76: Superposition & Threading - Bioinformatics...Superposition • The concept of superposition is key to many aspects of protein structure generation and comparison • Superposition

Lecture 3.3 76