Probabilistic Approximations of Bio-Pathways Dynamics
description
Transcript of Probabilistic Approximations of Bio-Pathways Dynamics
![Page 1: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/1.jpg)
Probabilistic Approximations of Bio-Pathways Dynamics
P.S. ThiagarajanSchool of Computing, National University of Singapore
Joint Work with: Liu Bing, David Hsu
![Page 2: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/2.jpg)
Bio-pathways
Intracellular: Gene regulatory networks Metabolic networks signalling pathways.
![Page 3: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/3.jpg)
A Common Modeling Approach
View a bio-pathway as a network of bio-chemical reactions Model the network as a system of ODEs
One for each molecular species Mass action, Michelis-Menten, Hill, etc.
Study the ODE system to understand the dynamics of the bio-chemical network
![Page 4: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/4.jpg)
Major Hurdles Many rate constants not known
must be estimated noisy data; limited precision; population-based
High dimensional system closed-form solutions are impossible Must resort to numerical simulations Point values of initial states/data will not be available. a large number of numerical simulations needed for
answering each question
![Page 5: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/5.jpg)
Our work:
Decompose Analyze Compose (ISMB’06, WABI’07)
Probabilistic models and methods to: approximate the ODE dynamics(CMSB’09,
TCS’11). update models (RECOMB’10)
![Page 6: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/6.jpg)
The “Opinion Poll” Idea
Start with an ODEs system. Discretize the time and value domains. Assume a (uniform) distribution of initial states Generate a “sufficiently” large number of trajectories
by sampling the initial states and numerical simulations.
Encode this collection of trajectories as a dynamic Bayesian network.
ODEs DBN
![Page 7: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/7.jpg)
The “opinion poll” Idea
Pay the one-time cost of constructing the Bayesian network.
Amortize this cost: perform multiple analysis tasks using
inferencing algorithms for DBNs.
![Page 8: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/8.jpg)
Time Discretization
Observe the system only at discrete time points; in fact, only at finitely many time points
x(t)
t0 t1 t2tmax... ... ... ...
x(t) = t3 + 4t + x(0)
![Page 9: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/9.jpg)
Value Discretization
Observe only with bounded precision
x(t)
t0 t1 t2 tmax... ... ... ...
A
B
C
D
E
![Page 10: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/10.jpg)
Discrete Trajectories
A trajectory now is sequence of discrete values
C D D E D C B
x(t)
t0 t1 t2 tmax... ... ... ...
A
B
C
D
E
![Page 11: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/11.jpg)
Discretized Behavior
Assume a prior distribution of the initial states. Behavior = (Uncountably) many sequences of discrete
values.
t
x(t)
t0 t1 t2 tmax... ... ... ...
A
B
C
D
E
C D D E D C B
D ... ...
![Page 12: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/12.jpg)
Representation of Behavior
In fact, a probabilistic transition system. Pr( (D, 2) (E, 3) ) is
the “fraction” of the
trajectories residing in D
at t = 2 that land in E at
t = 3.
C D D E D C B
D
0.8
0.2
tt0 t1 t2 tmax... ... ... ...
A
B
C
D
E
![Page 13: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/13.jpg)
Mathematical Justification The value space of the variables is assumed to be a
compact subset C of In Z’ = F(Z), F is assumed to be differentiable
everywhere in C. Mass-law, Michaelis-Menton,… for every rate constant k.
Then the solution (flow) t : C C (for each t) exists, is unique, a bijection, continuous and hence measurable.
But the transition probabilities can not be computed.
![Page 14: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/14.jpg)
C D D E D C B
D
0.8
0.2
(s, i) – States; (s, i) (s’, i+1) -- Transitions
Sample, say, 1000 times the initial states.
Through numerical simulation, generate 1000 trajectories.
Pr((s, i) (s’ i+1)) is the fraction of the trajectories that are in s at ti which land in s’ at t i+1. tt0 t1 t2 tmax... ... ... ...
A
B
C
D
E
Computational Approximation
![Page 15: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/15.jpg)
Infeasible Size! But the transition system will be huge.
O(T . kn) k 2 and n ( 50-100) can be large.
![Page 16: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/16.jpg)
Compact Representation Exploit the network structure (additional
independence assumptions) to construct a dynamic Bayesian network instead.
The DBN is a factored form of a probabilistic transition system (Markov chain).
![Page 17: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/17.jpg)
The DBN Representation
S0
E0
ES0
P0
S1
E1
ES1
P1
S2
E2
ES2
P2
S3
E3
ES3
P3
... ...
... ...
... ...
... ...
![Page 18: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/18.jpg)
The DBN Representation
S0
E0
ES0
P0
S1
E1
ES1
P1
S2
E2
ES2
P2
S3
E3
ES3
P3
... ...
... ...
... ...
... ...
ABC
![Page 19: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/19.jpg)
Each node has a CPT associated with it.
This specifies the local (probabilistic) dynamics.
S0
E0
ES0
P0 P1
E2
ES2
P2
S3
E3
ES3
P3
... ...
... ...
... ...
... ...
ABC
A
BCC
B
BCB
C
BBB
C
CBC
C
CAC
... ...
S1
E1
ES1
S2
.
P(S2=C|S1=B,E1=C,ES1=B)= 0.2
P(S2=C|S1=B,E1=C,ES1=C)= 0.1
P(S2=A|S1=A,E1=A,ES1=C)= 0.05
. . .
![Page 20: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/20.jpg)
Fill up the entries in the CPTs by sampling, simulations and counting
S0
E0
ES0
P0 P1
E2
ES2
P2
S3
E3
ES3
P3
... ...
... ...
... ...
... ...
ABC S1
E1
ES1
S2
.
P(S2=C|S1=B,E1=C,ES1=B)= 0.2
P(S2=C|S1=B,E1=C,ES1=C)= 0.1
P(S2=A|S1=A,E1=A,ES1=C)= 0.05
. . .
![Page 21: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/21.jpg)
Computational Approximation
Fill up the entries in the CPTs by sampling, simulations and counting
S0
E0
ES0
P0 P1
E2
ES2
P2
S3
E3
ES3
P3
... ...
... ...
... ...
... ...
ABC S1
E1
ES1
S2
500 100
1000
![Page 22: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/22.jpg)
The Technique
Fill up the entries in the CPTs by sampling, simulations and counting
S0
E0
ES0
P0 P1
E2
ES2
P2
S3
E3
ES3
P3
... ...
... ...
... ...
... ...
ABC S1
E1
ES1
S2
500 100
P(S2=C|S1=B,E1=C,ES1=B)= 100/500= 0.2
![Page 23: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/23.jpg)
The Technique
S0
E0
ES0
P0 P1
E2
ES2
P2
S3
E3
ES3
P3
... ...
... ...
... ...
... ...
ABC S1
E1
ES1
S2
500 100
P(S2=C|S1=B,E1=C,ES1=B)= 100/500= 0.2
The size of the DBN is:
O(T . n . kd)
d will be usually much smaller than n.
![Page 24: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/24.jpg)
Unknown rate constants
S0
E0
ES0
P0
S1
E1
ES1
P1
S2
E2
ES2
P2
S3
E3
ES3
P3
... ...
... ...
... ...
... ...
k2
![Page 25: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/25.jpg)
Unknown rate constants
During the numerical
generation of a
trajectory, the value
of k2 does not change
after sampling.
S0
E0
ES0
P0
S1
E1
ES1
P1
S2
E2
ES2
P2
S3
E3
ES3
P3
... ...
... ...
... ...
... ...
k2 k2 k2k2
![Page 26: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/26.jpg)
Unknown rate constants
Sample uniformly
across all the
Intervals.
S0
E0
ES0
P0
S1
E1
ES1
P1
S2
E2
ES2
P2
S3
E3
ES3
P3
... ...
... ...
... ...
... ...
k2 k2 k2k2
![Page 27: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/27.jpg)
Bayesian inference
S0
E0
ES0
P0
S1
E1
ES1
P1
S2
E2
ES2
P2
S3
E3
ES3
P3
... ...
... ...
... ...
... ...
k2 k2 k2k2
This is the Factored Frontier algorithm (Murphy & Weiss)
Approximate but fast:
![Page 28: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/28.jpg)
Parameter Estimation
S0
E0
ES0
P0
S1
E1
ES1
P1
S2
E2
ES2
P2
S3
E3
ES3
P3
... ...
... ...
... ...
... ...
k2 k2 k2k2
For each choice of (jnterval) values for unknown parameters, present experimental data as evidence and assign a score using FF.
Return parameter estimates as maximal likelihoods.
Similar application of FF to do sensitivity analysis.
![Page 29: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/29.jpg)
DBN based Analysis
Our experiments with signalling pathways models (taken from the biomodels data base) suggest:The one-time cost of constructing the DBN
can be easily recovered by using it to do parameter estimation and sensitivity analysis.
![Page 30: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/30.jpg)
Segmentation Clock Network Governs the periodic formation
process of somites during embryogenesis
The underlying signaling network couples three oscillating pathways: FGF, Wnt and Notch.
22 ODEs; 75 unknown parameters
5 equal intervals; t = 5 min; 100 time points; 2.5 million trajectories; 3.1 hours on a 10 PC cluster.
![Page 31: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/31.jpg)
Results
Global sensitivity analysis ODE based: 81 hours DBN based: 3 hours
LM: Levenberg–Marquardt
GA: Genetic Algorithm
SRES: Evolutionary Strategy
PSO: Particle Swarm Optimization
BDM: our method
40 unknown parameters;Synthetic data: 8 proteins, 10 time points (noise with 5% variance added)
![Page 32: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/32.jpg)
ODE model (Brown et al. 2004) 32 species 48 parameters Features:
Good size Feedback loops
DBN construction Settings
5 intervals 1 min time-step, 100 minutes 3 x 106 samples
Runtime 4 hours on a cluster of 10 PCs
EGF-NGF Pathway
![Page 33: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/33.jpg)
LM: Levenberg–Marquardt
GA: Genetic Algorithm
SRES: Evolutionary Strategy
PSO: Particle Swarm Optimization
BDM: our method
Parameter Estimation20 (out of 48) unknown parameters Synthetic data: 9 proteins, 10 time points
![Page 34: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/34.jpg)
Sensitivity Analysis
Running time ODE based: 22 hours DBN based: 34 minutes
![Page 35: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/35.jpg)
35
Complement System
Complement system is a critical part of the immune system
Ricklin et al. 2007
![Page 36: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/36.jpg)
Classical pathwayLectin pathwayAmplified responseInhibition
![Page 37: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/37.jpg)
37
Goals
Quantitatively understand the regulatory mechanisms of complement system How is the complement activity enhanced under inflammation
condition? How is the excessive response of the complement avoided?
The model: Classical pathway + the lectin pathway
enhancement
Inhibitory mechanisms C4BP
![Page 38: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/38.jpg)
Complement System
ODE Model 42 Species 45 Reactions
Mass law Michaelis-Menten kinetics
92 Parameters (71 unknown) DBN Construction
Settings 6 intervals 100s time-step, 12600s 1.2 x 106 samples
Runtime 12 hours on a cluster of 20 PCss
![Page 39: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/39.jpg)
39
Parameter estimation Training data
4 proteins, 7 time points, 4 conditions
* inflammation condition: pH 6.5 calcium 2 nM
![Page 40: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/40.jpg)
40
Model Calibration (parameter estimation)
![Page 41: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/41.jpg)
41
Model validation
Validated the model using previous published data (Zhang et al 2009)
![Page 42: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/42.jpg)
42
Enhancement mechanism
The antimicrobial response is sensitive to the pH and calcium level
![Page 43: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/43.jpg)
43
Model prediction: The regulatory effect of C4BP
C4BP maintains classical complement activation but delays the maximal response time.
But attenuates the lectin pathway activation.
Classical pathway Lectin pathway
![Page 44: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/44.jpg)
44
The regulatory mechanism of C4BP The major inhibitory role of C4BP is to facilitate the decay of
C3 convertase
A
BD
C
A B
C D
![Page 45: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/45.jpg)
Results
Both predictions concerning C4BP were experimentally verified.
“A Computational and Experimental Study of the Regulatory Mechanisms of the Complement System” Liu et.al: PLoS Comp.Bio7(2011)
BioModels database (303.Liu).
![Page 46: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/46.jpg)
Current Collaborators
Immune signalling duringMultiple infections
DNA damage/response pathway Chromosome co-localizations and co-regulations
Ding Jeak Ling Marie-Veronique Clement G V Shivashankar
![Page 47: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/47.jpg)
Current work
Parametrized version of FF Reduce errors by investing more
computational time. Probabilistic verification methods.
Monte Carlo, FF based, … Implementation on a GPU platform.
![Page 48: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/48.jpg)
Conclusion
The DBN approximation method is useful and efficient.
This is all very well in practice but will it work in theory?
![Page 49: Probabilistic Approximations of Bio-Pathways Dynamics](https://reader036.fdocuments.net/reader036/viewer/2022070418/568159fb550346895dc74777/html5/thumbnails/49.jpg)
Our group:
David Hsu SucheePalaniappan
Liu Bing Abhinav Dubey Blaise GenestS. Akshay Benjamin Gyori
GireedharVenkatachalam
Wang JunjieP.S. Thiagarajan