Using Motion Planning to Study Protein Folding Pathways

26
USING MOTION PLANNING TO STUDY PROTEIN FOLDING PATHWAYS by Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi

description

Using Motion Planning to Study Protein Folding Pathways. by Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi. outline. Introduction- Definition Motion planning Probabilistic Roadmap Method Denaturation C-space - PowerPoint PPT Presentation

Transcript of Using Motion Planning to Study Protein Folding Pathways

Page 1: Using Motion Planning to Study Protein Folding Pathways

USING MOTION PLANNING TO

STUDY PROTEIN FOLDING

PATHWAYSby Guang Song and Nancy M. Amato

Journal of Computational Biology,April 1, 2002

Presentation by Athina Ropodi

Page 2: Using Motion Planning to Study Protein Folding Pathways

OUTLINE Introduction- Definition

Motion planning Probabilistic Roadmap Method Denaturation C-space

Probabilistic Roadmap Method Basic steps Degrees of freedom/C-space Node Generation Sampling strategy Potential Energy Computations

Results

Page 3: Using Motion Planning to Study Protein Folding Pathways

INTRODUCTION- MOTION PLANNING Motion planning is a term used in

robotics for the process of detailing a task into atomic robotic motions.

This issue is also known as the “navigation problem”:

Given an environment, a start and a goal position of an object, the objective is to find a valid path (continuous sequence of validconfigurations) from start to goal.

Page 4: Using Motion Planning to Study Protein Folding Pathways

PRM- DENATURATION Probabilistic Roadmap (PRM) motion

planning techniques are applied to “small” proteins (up to 60 residues) in order to compute folding pathways from a denaturated state to its native fold.

Denaturate: To cause the structure to unfold, so that some of its original properties, especially its biological activity, are diminished or eliminated. Usually caused by extreme conditions, e.g. high temperature.

Page 5: Using Motion Planning to Study Protein Folding Pathways

HOW DENATURATION OCCURS AT LEVELS OF PROTEIN STRUCTURE In quaternary structure denaturation, protein sub-units

are dissociated and/or the spatial arrangement of protein subunits is disrupted.

Tertiary structure denaturation involves the disruption of: Covalent interactions between amino acid side chains (such as

disulfide bridges between cysteine groups) Noncovalent dipole-dipole interactions between polar amino acid

side chains (and the surrounding solvent) Van der Waals (induced dipole) interactions between nonpolar

amino acid side chains. In secondary structure denaturation, proteins lose all

regular repeating patterns such as alpha-helices and beta-pleated sheets, and adopt a random coil configuration.

Primary structure, such as the sequence of amino acids held together by covalent peptide bonds, is not disrupted by denaturation

Page 6: Using Motion Planning to Study Protein Folding Pathways

BASIC ASSUMPTION The article investigates the folding

mechanisms of a protein assuming we know its native fold.

Results are validated by comparing the formation order to pulse-labeling experimental results.

The configuration space (C-Space) of a movable object is the space consisting of all positions and orientations of that object.

Pulse labeling is a biochemistry technique of identifying the target molecule presence by inclusion of a pulse of a radioactive compound.

Page 7: Using Motion Planning to Study Protein Folding Pathways

OUTLINE Introduction- Definition

Motion planning Probabilistic Roadmap Method Denaturation C-space

Probabilistic Roadmap Method Basic steps Degrees of freedom/C-space Node Generation Sampling strategy Potential Energy Computations

Results

Page 8: Using Motion Planning to Study Protein Folding Pathways

PROBABILISTIC ROADMAP METHOD FOR PROTEIN FOLDING Any complete motion planner would require

time exponential in the number of degrees of freedom (dof).

Several methods such as energy minimization, molecular dynamics, Monte-Carlo and genetic algorithms have been used.

This method tries to simulate the true dynamics of the folding process using the classical Newton’s motion equations.

However, an exact simulation would depend on the start conformation and could result in local minima.

Page 9: Using Motion Planning to Study Protein Folding Pathways

PROBABILISTIC ROADMAP METHOD FOR PROTEIN FOLDINGThe basic steps are: First of all, PRM samples points randomly

from C-space and retains those that satisfy certain requirements.

Then, the points are connected and form a graph using a simple planning method.

Finally, paths connecting the start and goal configurations are extracted using standard graph search techniques.

[Kavraki, Svestka, Latombe,Overmars 1996]

In this case, low-energy conformatio

ns are preferable.

Page 10: Using Motion Planning to Study Protein Folding Pathways

PROBABILISTIC ROADMAP METHOD FOR PROTEIN FOLDING

Page 11: Using Motion Planning to Study Protein Folding Pathways

C-SPACES All atomic bond lengths and angles are

considered to be constants. We only consider 2 dofs, φ and ψ angles. Side-chains are modeled as spheres

with no dof. Fold positions (atomic bonds)

correspond to joints and atoms correspond to links.

Thus, for k residues our model has 2k links and 2k revolute joints…

Page 12: Using Motion Planning to Study Protein Folding Pathways

NODE GENERATION Different configurations are produced by

assigning possible angle values. But, the nodes are accepted or rejected

based on their potential energy:

where Emin=50000 and Emax=89000KJouls/mol

A configuration with higher potential is

more likely to be rejected.

Page 13: Using Motion Planning to Study Protein Folding Pathways

SAMPLING STRATEGY Due to the high dimensionality of the

problem, a very dense uniform sampling is required.

Since prior knowledge of the native fold is assumed, a sampling strategy biased to the native fold is applied:

Sampling is performed from a set of normal distributions around the native fold. The standard deviations used are {5, 10, 20, 40, 80,160 degrees}.Small STDs capture the

detail around the goal, and larger ensure

adequate roadmap coverage

Page 14: Using Motion Planning to Study Protein Folding Pathways

CONSTRUCTION OF ROADMAP

For each node, the k-nearest neighbors are found(k=20 and the metric is Euclidean).

For each connection a feasibility check is performed. (two nodes are connected by a straight line)

For 2 consecutive intermediate conformations, i and i+1, we first check their potential energies and then the probability of moving from i to i+1:

RMSD metric proved inferior

Page 15: Using Motion Planning to Study Protein Folding Pathways

CONSTRUCTION OF ROADMAP

The total weight of the edge is:

Dijkstra’s algorithm is then used to find the smallest weight path.

Path optimization: resampling is performed around the nodes of paths with high potential.

Page 16: Using Motion Planning to Study Protein Folding Pathways

POTENTIAL ENERGY COMPUTATIONS

To reduce the cost of calculations, an approximation function is used. We only consider contribution from side chains and those are modeled as spheres.

The cost is then reduced by 2 orders of magnitude.

The 1st term represents

constraints that favor secondary structure,

hydrogen and disulphide bonds

The 2nd the van der Waals interactions

among atoms.

Page 17: Using Motion Planning to Study Protein Folding Pathways

OUTLINE Introduction- Definition

Motion planning Probabilistic Roadmap Method Denaturation C-space

Probabilistic Roadmap Method Basic steps Degrees of freedom/C-space Node Generation Sampling strategy Potential Energy Computations

Results

Page 18: Using Motion Planning to Study Protein Folding Pathways

RESULTS

Page 19: Using Motion Planning to Study Protein Folding Pathways

RESULTS Folding process of GB1:

Page 20: Using Motion Planning to Study Protein Folding Pathways

RESULTS Protein GB1(56

residues): 1 α-helix and 4 β-strands

Protein A (60 residues): 3 α-helices

Page 21: Using Motion Planning to Study Protein Folding Pathways

PROTEIN GB1: PHI/PSI DISTRIBUTION OF ROADMAP NODES

Page 22: Using Motion Planning to Study Protein Folding Pathways

PROTEIN GB1: POTENTIAL VS RMSDDISTRIBUTION OF ROADMAP NODES

Page 23: Using Motion Planning to Study Protein Folding Pathways

PROTEIN GB1: EFFECT OF RESOLUTION POTENTIAL PROFILES FOR DIFFERENT SIZE ROADMAPS

Peaks show where atoms are close and

Van der Waals interactions dominate.

Bigger roadmaps have smoother

paths.

Page 24: Using Motion Planning to Study Protein Folding Pathways

EFFECT OF RE-SAMPLING AROUND THE PEAKS

More samples around the peaks improve the path.

Page 25: Using Motion Planning to Study Protein Folding Pathways

WHAT ABOUT DIFFERENT START CONFORMATIONS?

Different pathways tend to come together and appear to have some common channels, as they approach the native fold.

Page 26: Using Motion Planning to Study Protein Folding Pathways

THANK YOU!Bibliography:1. Probabilistic roadmaps for path

planning in high-dimensional configuration spacesL. Kavraki, P. Svestka, J-C. Latombe, M. H. Overmars, 1996

2. Protein Folding by Restrained Energy Minimization and Molecular Dynamics Michael Levitt,1983

3. http://faculty.cs.tamu.edu/amato/dsmft/research/folding/index.shtml.OLD2