EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11,...

59
EnKF and filter divergence David Kelly Andrew Stuart Kody Law Courant Institute New York University New York, NY dtbkelly.com December 11, 2014 Applied and computational mathematics seminar, NIST. David Kelly (NYU) EnKF December 11, 2014 1 / 28

Transcript of EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11,...

Page 1: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

EnKF and filter divergence

David Kelly Andrew Stuart Kody Law

Courant InstituteNew York University

New York, NY

dtbkelly.com

December 11, 2014

Applied and computational mathematics seminar, NIST.

David Kelly (NYU) EnKF December 11, 2014 1 / 28

Page 2: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Talk outline

1. What is EnKF?

2. What is known about EnKF?

3. How can we use stochastic analysis to betterunderstand EnKF?

David Kelly (NYU) EnKF December 11, 2014 2 / 28

Page 3: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

The filtering problem

We have a deterministic model

dv

dt= F (v) with v0 ∼ N(m0,C0) .

We will denote v(t) = Ψt(v0). Think of this as very high dimensionaland nonlinear.

We want to estimate v j = v(jh) for some h > 0 and j = 0, 1, . . . , J giventhe observations

y j = Hv j + ξj for ξj iid N(0, Γ).

David Kelly (NYU) EnKF December 11, 2014 3 / 28

Page 4: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

The filtering problem

We have a deterministic model

dv

dt= F (v) with v0 ∼ N(m0,C0) .

We will denote v(t) = Ψt(v0). Think of this as very high dimensionaland nonlinear.

We want to estimate v j = v(jh) for some h > 0 and j = 0, 1, . . . , J giventhe observations

y j = Hv j + ξj for ξj iid N(0, Γ).

David Kelly (NYU) EnKF December 11, 2014 3 / 28

Page 5: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

We can write down the conditionaldensity using Bayes’ formula ...

But for high dimensional nonlinearsystems it’s horrible.

David Kelly (NYU) EnKF December 11, 2014 4 / 28

Page 6: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

We can write down the conditionaldensity using Bayes’ formula ...

But for high dimensional nonlinearsystems it’s horrible.

David Kelly (NYU) EnKF December 11, 2014 4 / 28

Page 7: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Bayes’ formula filtering update

Let Y j = {y0, y1, . . . , y j}. We want to compute the conditional densityP(v j+1|Y j+1), using P(v j |Y j) and y j+1.

By Bayes’ formula, we have

P(v j+1|Y j+1) = P(v j+1|Y j , y j+1) ∝ P(y j+1|v j+1)P(v j+1|Y j)

But we need to compute the integral

P(v j+1|Y j) =

∫P(v j+1|Y j , v j)P(v j |Y j)dv j .

For high dimensional nonlinear systems, this is computationally infeasible.

David Kelly (NYU) EnKF December 11, 2014 5 / 28

Page 8: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Bayes’ formula filtering update

Let Y j = {y0, y1, . . . , y j}. We want to compute the conditional densityP(v j+1|Y j+1), using P(v j |Y j) and y j+1.

By Bayes’ formula, we have

P(v j+1|Y j+1) = P(v j+1|Y j , y j+1) ∝ P(y j+1|v j+1)P(v j+1|Y j)

But we need to compute the integral

P(v j+1|Y j) =

∫P(v j+1|Y j , v j)P(v j |Y j)dv j .

For high dimensional nonlinear systems, this is computationally infeasible.

David Kelly (NYU) EnKF December 11, 2014 5 / 28

Page 9: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Bayes’ formula filtering update

Let Y j = {y0, y1, . . . , y j}. We want to compute the conditional densityP(v j+1|Y j+1), using P(v j |Y j) and y j+1.

By Bayes’ formula, we have

P(v j+1|Y j+1) = P(v j+1|Y j , y j+1) ∝ P(y j+1|v j+1)P(v j+1|Y j)

But we need to compute the integral

P(v j+1|Y j) =

∫P(v j+1|Y j , v j)P(v j |Y j)dv j .

For high dimensional nonlinear systems, this is computationally infeasible.

David Kelly (NYU) EnKF December 11, 2014 5 / 28

Page 10: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

The Ensemble Kalman Filter (EnKF)is a lower dimensional algorithm.

(Evensen ’94)

EnKF generates an ensemble ofapproximate samples from the

posterior.

David Kelly (NYU) EnKF December 11, 2014 6 / 28

Page 11: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

The Ensemble Kalman Filter (EnKF)is a lower dimensional algorithm.

(Evensen ’94)

EnKF generates an ensemble ofapproximate samples from the

posterior.

David Kelly (NYU) EnKF December 11, 2014 6 / 28

Page 12: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

For linear models, one can draw samples,using the Randomized Maximum

Likelihood method.

David Kelly (NYU) EnKF December 11, 2014 7 / 28

Page 13: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

RML method

Let u ∼ N(m, C ) and η ∼ N(0, Γ). We make an observation

y = Hu + η .

We want the conditional distribution of u|y . This is called an inverseproblem.

RML takes a sample

{u(1), . . . , u(K)} ∼ N(m, C )

and turns them into a sample

{u(1), . . . , u(K)} ∼ u|y

David Kelly (NYU) EnKF December 11, 2014 8 / 28

Page 14: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

RML method

Let u ∼ N(m, C ) and η ∼ N(0, Γ). We make an observation

y = Hu + η .

We want the conditional distribution of u|y . This is called an inverseproblem.

RML takes a sample

{u(1), . . . , u(K)} ∼ N(m, C )

and turns them into a sample

{u(1), . . . , u(K)} ∼ u|y

David Kelly (NYU) EnKF December 11, 2014 8 / 28

Page 15: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

RML method: How does it work?

Along with the prior sample {u(1), . . . , u(K)}, we create artificialobservations {y (1), . . . , y (K)} where

y (k) = y + η(k) where η(k) ∼ N(0, Γ) i.i.d

Then define u(k) using the Bayes formula update, with (u(k), y (k))

u(k) = u(k) + G (u)(y (k) − Hu(k)) .

Where the “Kalman Gain” G (u) is computing using the covariance of theprior u.

The set {u(1), . . . , u(K)} are exact samples from u|y .

David Kelly (NYU) EnKF December 11, 2014 9 / 28

Page 16: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

RML method: How does it work?

Along with the prior sample {u(1), . . . , u(K)}, we create artificialobservations {y (1), . . . , y (K)} where

y (k) = y + η(k) where η(k) ∼ N(0, Γ) i.i.d

Then define u(k) using the Bayes formula update, with (u(k), y (k))

u(k) = u(k) + G (u)(y (k) − Hu(k)) .

Where the “Kalman Gain” G (u) is computing using the covariance of theprior u.

The set {u(1), . . . , u(K)} are exact samples from u|y .

David Kelly (NYU) EnKF December 11, 2014 9 / 28

Page 17: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

EnKF uses the same method, but with anapproximation of the covariance in the

Kalman gain.

David Kelly (NYU) EnKF December 11, 2014 10 / 28

Page 18: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

The set-up for EnKF

Suppose we are given the ensemble {u(1)j , . . . , u(K)j } at time j . For each

ensemble member, we create an artificial observation

y(k)j+1 = y j+1 + ξ

(k)j+1 , ξ

(k)j+1 iid N(0, Γ).

We update each particle using the Kalman update

u(k)j+1 = Ψh(u

(k)j ) + G (uj)

(y(k)j+1 − HΨh(u

(k)j )),

where G (uj) is the Kalman gain computed using the forecastedensemble covariance

Cj+1 =1

K

K∑k=1

(Ψh(u(k)j )−Ψh(uj))T (Ψh(u

(k)j )−Ψh(uj)) .

David Kelly (NYU) EnKF December 11, 2014 11 / 28

Page 19: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

The set-up for EnKF

Suppose we are given the ensemble {u(1)j , . . . , u(K)j } at time j . For each

ensemble member, we create an artificial observation

y(k)j+1 = y j+1 + ξ

(k)j+1 , ξ

(k)j+1 iid N(0, Γ).

We update each particle using the Kalman update

u(k)j+1 = Ψh(u

(k)j ) + G (uj)

(y(k)j+1 − HΨh(u

(k)j )),

where G (uj) is the Kalman gain computed using the forecastedensemble covariance

Cj+1 =1

K

K∑k=1

(Ψh(u(k)j )−Ψh(uj))T (Ψh(u

(k)j )−Ψh(uj)) .

David Kelly (NYU) EnKF December 11, 2014 11 / 28

Page 20: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

The set-up for EnKF

Suppose we are given the ensemble {u(1)j , . . . , u(K)j } at time j . For each

ensemble member, we create an artificial observation

y(k)j+1 = y j+1 + ξ

(k)j+1 , ξ

(k)j+1 iid N(0, Γ).

We update each particle using the Kalman update

u(k)j+1 = Ψh(u

(k)j ) + G (uj)

(y(k)j+1 − HΨh(u

(k)j )),

where G (uj) is the Kalman gain computed using the forecastedensemble covariance

Cj+1 =1

K

K∑k=1

(Ψh(u(k)j )−Ψh(uj))T (Ψh(u

(k)j )−Ψh(uj)) .

David Kelly (NYU) EnKF December 11, 2014 11 / 28

Page 21: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

The set-up for EnKF

Suppose we are given the ensemble {u(1)j , . . . , u(K)j } at time j . For each

ensemble member, we create an artificial observation

y(k)j+1 = y j+1 + ξ

(k)j+1 , ξ

(k)j+1 iid N(0, Γ).

We update each particle using the Kalman update

u(k)j+1 = Ψh(u

(k)j ) + G (uj)

(y(k)j+1 − HΨh(u

(k)j )),

where G (uj) is the Kalman gain computed using the forecastedensemble covariance

Cj+1 =1

K

K∑k=1

(Ψh(u(k)j )−Ψh(uj))T (Ψh(u

(k)j )−Ψh(uj)) .

David Kelly (NYU) EnKF December 11, 2014 11 / 28

Page 22: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

The set-up for EnKF

Suppose we are given the ensemble {u(1)j , . . . , u(K)j } at time j . For each

ensemble member, we create an artificial observation

y(k)j+1 = y j+1 + ξ

(k)j+1 , ξ

(k)j+1 iid N(0, Γ).

We update each particle using the Kalman update

u(k)j+1 = Ψh(u

(k)j ) + G (uj)

(y(k)j+1 − HΨh(u

(k)j )),

where G (uj) is the Kalman gain computed using the forecastedensemble covariance

Cj+1 =1

K

K∑k=1

(Ψh(u(k)j )−Ψh(uj))T (Ψh(u

(k)j )−Ψh(uj)) .

David Kelly (NYU) EnKF December 11, 2014 11 / 28

Page 23: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

What do we know about EnKF?Not much.

Theorem : For linear forecast models,ENKF → KF as N →∞

(Le Gland et al / Mandel et al. 09’).

David Kelly (NYU) EnKF December 11, 2014 12 / 28

Page 24: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

What do we know about EnKF?Not much.

Theorem : For linear forecast models,ENKF → KF as N →∞

(Le Gland et al / Mandel et al. 09’).

David Kelly (NYU) EnKF December 11, 2014 12 / 28

Page 25: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

What do we know about EnKF?Not much.

Theorem : For linear forecast models,ENKF → KF as N →∞

(Le Gland et al / Mandel et al. 09’).

David Kelly (NYU) EnKF December 11, 2014 12 / 28

Page 26: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Ideally, we would like a theorem aboutlong time behaviour of the filter for a

finite ensemble size.

David Kelly (NYU) EnKF December 11, 2014 13 / 28

Page 27: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Filter divergence

In certain situations, it has been observed (?) that the ensemble canblow-up (ie. reach machine-infinity) in finite time, even when the modelhas nice bounded solutions.

This is known as catastrophic filter divergence.

We would like to investigate whether this has a dynamical justificationor if it is simply a numerical artefact.

? Harlim, Majda (2010), Gottwald (2011), Gottwald, Majda (2013).

David Kelly (NYU) EnKF December 11, 2014 14 / 28

Page 28: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Filter divergence

In certain situations, it has been observed (?) that the ensemble canblow-up (ie. reach machine-infinity) in finite time, even when the modelhas nice bounded solutions.

This is known as catastrophic filter divergence.

We would like to investigate whether this has a dynamical justificationor if it is simply a numerical artefact.

? Harlim, Majda (2010), Gottwald (2011), Gottwald, Majda (2013).

David Kelly (NYU) EnKF December 11, 2014 14 / 28

Page 29: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Filter divergence

In certain situations, it has been observed (?) that the ensemble canblow-up (ie. reach machine-infinity) in finite time, even when the modelhas nice bounded solutions.

This is known as catastrophic filter divergence.

We would like to investigate whether this has a dynamical justificationor if it is simply a numerical artefact.

? Harlim, Majda (2010), Gottwald (2011), Gottwald, Majda (2013).

David Kelly (NYU) EnKF December 11, 2014 14 / 28

Page 30: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Assumptions on the dynamics

We make a dissipativity assumption on the model. Namely that

dv

dt+ Av + B(v , v) = f

with A linear elliptic and B bilinear, satisfying certain estimates andsymmetries.

This guarantees uniformly bounded solutions.

Eg. 2d-Navier-Stokes, Lorenz-63, Lorenz-96.

David Kelly (NYU) EnKF December 11, 2014 15 / 28

Page 31: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Discrete time results

For a fixed observation frequency h > 0 we can prove

Theorem (AS,DK,KL)

If H = Γ = Id then there exists constant β > 0 such that

E|u(k)j |2 ≤ e2βjhE|u(k)0 |

2 + 2Kγ2(e2βjh − 1

e2βh − 1

)

Rmk. This becomes useless as h→ 0

David Kelly (NYU) EnKF December 11, 2014 16 / 28

Page 32: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Discrete time results

For a fixed observation frequency h > 0 we can prove

Theorem (AS,DK,KL)

If H = Γ = Id then there exists constant β > 0 such that

E|u(k)j |2 ≤ e2βjhE|u(k)0 |

2 + 2Kγ2(e2βjh − 1

e2βh − 1

)

Rmk. This becomes useless as h→ 0

David Kelly (NYU) EnKF December 11, 2014 16 / 28

Page 33: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Discrete time results

For a fixed observation frequency h > 0 we can prove

Theorem (AS,DK,KL)

If H = Γ = Id then there exists constant β > 0 such that

E|u(k)j |2 ≤ e2βjhE|u(k)0 |

2 + 2Kγ2(e2βjh − 1

e2βh − 1

)

Rmk. This becomes useless as h→ 0

David Kelly (NYU) EnKF December 11, 2014 16 / 28

Page 34: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Discrete time results with variance inflationSuppose we replace

Cj+1 7→ α2I + Cj+1

at each update step. This is known as additive variance inflation.

Theorem (AS,DK,KL)

If H = Id and Γ = γ2Id then there exists constant β > 0 such that

E|e(k)j |2 ≤ θjE|e(k)0 |

2 + 2Kγ2(

1− θj

1− θ

)where θ = γ2e2βh

α2+γ2. In particular, if we pick α large enough (so

that θ < 1) then

limj→∞

E|e(k)j |2 ≤ 2Kγ2

1− θ

David Kelly (NYU) EnKF December 11, 2014 17 / 28

Page 35: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Discrete time results with variance inflationSuppose we replace

Cj+1 7→ α2I + Cj+1

at each update step. This is known as additive variance inflation.

Theorem (AS,DK,KL)

If H = Id and Γ = γ2Id then there exists constant β > 0 such that

E|e(k)j |2 ≤ θjE|e(k)0 |

2 + 2Kγ2(

1− θj

1− θ

)where θ = γ2e2βh

α2+γ2. In particular, if we pick α large enough (so

that θ < 1) then

limj→∞

E|e(k)j |2 ≤ 2Kγ2

1− θ

David Kelly (NYU) EnKF December 11, 2014 17 / 28

Page 36: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Discrete time results with variance inflationSuppose we replace

Cj+1 7→ α2I + Cj+1

at each update step. This is known as additive variance inflation.

Theorem (AS,DK,KL)

If H = Id and Γ = γ2Id then there exists constant β > 0 such that

E|e(k)j |2 ≤ θjE|e(k)0 |

2 + 2Kγ2(

1− θj

1− θ

)where θ = γ2e2βh

α2+γ2. In particular, if we pick α large enough (so

that θ < 1) then

limj→∞

E|e(k)j |2 ≤ 2Kγ2

1− θ

David Kelly (NYU) EnKF December 11, 2014 17 / 28

Page 37: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

For observations with h� 1, we needanother approach.

David Kelly (NYU) EnKF December 11, 2014 18 / 28

Page 38: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

The EnKF equations look like a discretization

Recall the ensemble update equation

u(k)j+1 = Ψh(u

(k)j ) + G (uj)

(y(k)j+1 − HΨh(u

(k)j ))

= Ψh(u(k)j ) + Cj+1H

T (HT Cj+1H + Γ)−1(y(k)j+1 − HΨh(u

(k)j ))

Subtract u(k)j from both sides and divide by h

u(k)j+1 − u

(k)j

h=

Ψh(u(k)j )− u

(k)j

h

+ Cj+1HT (hHT Cj+1H + hΓ)−1

(y(k)j+1 − HΨh(u

(k)j ))

Clearly we need to rescale the noise (ie. Γ).

David Kelly (NYU) EnKF December 11, 2014 19 / 28

Page 39: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

The EnKF equations look like a discretization

Recall the ensemble update equation

u(k)j+1 = Ψh(u

(k)j ) + G (uj)

(y(k)j+1 − HΨh(u

(k)j ))

= Ψh(u(k)j ) + Cj+1H

T (HT Cj+1H + Γ)−1(y(k)j+1 − HΨh(u

(k)j ))

Subtract u(k)j from both sides and divide by h

u(k)j+1 − u

(k)j

h=

Ψh(u(k)j )− u

(k)j

h

+ Cj+1HT (hHT Cj+1H + hΓ)−1

(y(k)j+1 − HΨh(u

(k)j ))

Clearly we need to rescale the noise (ie. Γ).

David Kelly (NYU) EnKF December 11, 2014 19 / 28

Page 40: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

The EnKF equations look like a discretization

Recall the ensemble update equation

u(k)j+1 = Ψh(u

(k)j ) + G (uj)

(y(k)j+1 − HΨh(u

(k)j ))

= Ψh(u(k)j ) + Cj+1H

T (HT Cj+1H + Γ)−1(y(k)j+1 − HΨh(u

(k)j ))

Subtract u(k)j from both sides and divide by h

u(k)j+1 − u

(k)j

h=

Ψh(u(k)j )− u

(k)j

h

+ Cj+1HT (hHT Cj+1H + hΓ)−1

(y(k)j+1 − HΨh(u

(k)j ))

Clearly we need to rescale the noise (ie. Γ).

David Kelly (NYU) EnKF December 11, 2014 19 / 28

Page 41: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

The EnKF equations look like a discretization

Recall the ensemble update equation

u(k)j+1 = Ψh(u

(k)j ) + G (uj)

(y(k)j+1 − HΨh(u

(k)j ))

= Ψh(u(k)j ) + Cj+1H

T (HT Cj+1H + Γ)−1(y(k)j+1 − HΨh(u

(k)j ))

Subtract u(k)j from both sides and divide by h

u(k)j+1 − u

(k)j

h=

Ψh(u(k)j )− u

(k)j

h

+ Cj+1HT (hHT Cj+1H + hΓ)−1

(y(k)j+1 − HΨh(u

(k)j ))

Clearly we need to rescale the noise (ie. Γ).

David Kelly (NYU) EnKF December 11, 2014 19 / 28

Page 42: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Continuous-time limitIf we set Γ = h−1Γ0 and substitute y

(k)j+1, we obtain

u(k)j+1 − u

(k)j

h=

Ψh(u(k)j )− u

(k)j

h+ Cj+1H

T (hHT Cj+1H + Γ0)−1(Hv + h−1/2Γ

1/20 ξj+1 + h−1/2Γ

1/20 ξ

(k)j+1 − HΨh(u

(k)j ))

But we know thatΨh(u

(k)j ) = u

(k)j + O(h)

and

Cj+1 =1

K

K∑k=1

(Ψh(u(k)j )−Ψh(uj))T (Ψh(u

(k)j )−Ψh(uj))

=1

K

K∑k=1

(u(k)j − uj)

T (u(k)j − uj) + O(h) = C (uj) + O(h)

David Kelly (NYU) EnKF December 11, 2014 20 / 28

Page 43: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Continuous-time limitIf we set Γ = h−1Γ0 and substitute y

(k)j+1, we obtain

u(k)j+1 − u

(k)j

h=

Ψh(u(k)j )− u

(k)j

h+ Cj+1H

T (hHT Cj+1H + Γ0)−1(Hv + h−1/2Γ

1/20 ξj+1 + h−1/2Γ

1/20 ξ

(k)j+1 − HΨh(u

(k)j ))

But we know thatΨh(u

(k)j ) = u

(k)j + O(h)

and

Cj+1 =1

K

K∑k=1

(Ψh(u(k)j )−Ψh(uj))T (Ψh(u

(k)j )−Ψh(uj))

=1

K

K∑k=1

(u(k)j − uj)

T (u(k)j − uj) + O(h) = C (uj) + O(h)

David Kelly (NYU) EnKF December 11, 2014 20 / 28

Page 44: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Continuous-time limit

We end up with

u(k)j+1 − u

(k)j

h=

Ψh(u(k)j )− u

(k)j

h− C (uj)H

TΓ−10 H(u(k)j − vj)

+ C (uj)HTΓ−10

(h−1/2ξj+1 + h−1/2ξ

(k)j+1

)+ O(h)

This looks like a numerical scheme for Ito S(P)DE

du(k)

dt= F (u(k))− C (u)HTΓ−10 H(u(k) − v) (•)

+ C (u)HTΓ−1/20

(dB

dt+

dW (k)

dt

).

David Kelly (NYU) EnKF December 11, 2014 21 / 28

Page 45: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Continuous-time limit

We end up with

u(k)j+1 − u

(k)j

h=

Ψh(u(k)j )− u

(k)j

h− C (uj)H

TΓ−10 H(u(k)j − vj)

+ C (uj)HTΓ−10

(h−1/2ξj+1 + h−1/2ξ

(k)j+1

)+ O(h)

This looks like a numerical scheme for Ito S(P)DE

du(k)

dt= F (u(k))− C (u)HTΓ−10 H(u(k) − v) (•)

+ C (u)HTΓ−1/20

(dB

dt+

dW (k)

dt

).

David Kelly (NYU) EnKF December 11, 2014 21 / 28

Page 46: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Nudging

du(k)

dt= F (u(k))− C (u)HTΓ−10 H(u(k) − v) (•)

+ C (u)HTΓ−1/20

(dB

dt+

dW (k)

dt

).

1 - Extra dissipation term only sees differences in observed space

2 - Extra dissipation only occurs in the space spanned by ensemble

David Kelly (NYU) EnKF December 11, 2014 22 / 28

Page 47: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Nudging

du(k)

dt= F (u(k))− C (u)HTΓ−10 H(u(k) − v) (•)

+ C (u)HTΓ−1/20

(dB

dt+

dW (k)

dt

).

1 - Extra dissipation term only sees differences in observed space

2 - Extra dissipation only occurs in the space spanned by ensemble

David Kelly (NYU) EnKF December 11, 2014 22 / 28

Page 48: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Nudging

du(k)

dt= F (u(k))− C (u)HTΓ−10 H(u(k) − v) (•)

+ C (u)HTΓ−1/20

(dB

dt+

dW (k)

dt

).

1 - Extra dissipation term only sees differences in observed space

2 - Extra dissipation only occurs in the space spanned by ensemble

David Kelly (NYU) EnKF December 11, 2014 22 / 28

Page 49: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Kalman-Bucy limit

If F were linear and we write m(t) = 1K

∑Kk=1 u

(k)(t) then

dm

dt= F (m)− C (u)HTΓ−10 H(m − v)

+ C (u)HTΓ−1/20

dB

dt+ O(K−1/2) .

This is the equation for the Kalman-Bucy filter, with empirical covarianceC (u). The remainder O(K−1/2) can be thought of as a sampling error.

David Kelly (NYU) EnKF December 11, 2014 23 / 28

Page 50: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Kalman-Bucy limit

If F were linear and we write m(t) = 1K

∑Kk=1 u

(k)(t) then

dm

dt= F (m)− C (u)HTΓ−10 H(m − v)

+ C (u)HTΓ−1/20

dB

dt+ O(K−1/2) .

This is the equation for the Kalman-Bucy filter, with empirical covarianceC (u). The remainder O(K−1/2) can be thought of as a sampling error.

David Kelly (NYU) EnKF December 11, 2014 23 / 28

Page 51: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Continuous-time results

Theorem (AS,DK)

Suppose that{u(k)}Kk=1 satisfy (•) with H = Γ = Id. Let

e(k) = u(k) − v .

Then there exists constant β > 0 such that

1

K

K∑k=1

E|e(k)(t)|2 ≤(

1

K

K∑k=1

E|e(k)(0)|2)

exp (βt) .

David Kelly (NYU) EnKF December 11, 2014 24 / 28

Page 52: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Why do we need H = Γ = Id ?

In the equation

du(k)

dt= F (u(k))− C (u)HTΓ−10 H(u(k) − v)

+ C (u)HTΓ−1/20

(dW (k)

dt+

dB

dt

).

The energy pumped in by the noise must be balanced by contraction of(u(k) − v). So the operator

C (u)HTΓ−10 H

must be positive-definite.

Both C (u) and HTΓ−10 H are pos-def, but this doesn’t guarantee the samefor the product!

David Kelly (NYU) EnKF December 11, 2014 25 / 28

Page 53: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Why do we need H = Γ = Id ?

In the equation

du(k)

dt= F (u(k))− C (u)HTΓ−10 H(u(k) − v)

+ C (u)HTΓ−1/20

(dW (k)

dt+

dB

dt

).

The energy pumped in by the noise must be balanced by contraction of(u(k) − v). So the operator

C (u)HTΓ−10 H

must be positive-definite.

Both C (u) and HTΓ−10 H are pos-def, but this doesn’t guarantee the samefor the product!

David Kelly (NYU) EnKF December 11, 2014 25 / 28

Page 54: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Why do we need H = Γ = Id ?

In the equation

du(k)

dt= F (u(k))− C (u)HTΓ−10 H(u(k) − v)

+ C (u)HTΓ−1/20

(dW (k)

dt+

dB

dt

).

The energy pumped in by the noise must be balanced by contraction of(u(k) − v). So the operator

C (u)HTΓ−10 H

must be positive-definite.

Both C (u) and HTΓ−10 H are pos-def, but this doesn’t guarantee the samefor the product!

David Kelly (NYU) EnKF December 11, 2014 25 / 28

Page 55: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Testing stability on the fly

Suppose we can actually measure the spectrum of the operator

C (u)HTΓ−10 H

whilst the algorithm is running. If we know that it is pos-def, then thefilter must not be blowing up.

If we knew thatC (u)HTΓ−10 H ≥ λ(t) > 0 .

Then we can say even more (eg. stability).

David Kelly (NYU) EnKF December 11, 2014 26 / 28

Page 56: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Testing stability on the fly

Suppose we can actually measure the spectrum of the operator

C (u)HTΓ−10 H

whilst the algorithm is running. If we know that it is pos-def, then thefilter must not be blowing up.

If we knew thatC (u)HTΓ−10 H ≥ λ(t) > 0 .

Then we can say even more (eg. stability).

David Kelly (NYU) EnKF December 11, 2014 26 / 28

Page 57: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Summary + Future Work

(1) Writing down an SDE/SPDE allows us to see the importantquantities in the algorithm.

(2) Does not “prove” that catastrophic filter divergence is anumerical phenomenon, but is a decent starting point.

(1) Improve the condition on H.

(2) If we can measure the important quantities, then we can test theperformance during the algorithm.

(3) Make use of controllability and observability.

David Kelly (NYU) EnKF December 11, 2014 27 / 28

Page 58: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Summary + Future Work

(1) Writing down an SDE/SPDE allows us to see the importantquantities in the algorithm.

(2) Does not “prove” that catastrophic filter divergence is anumerical phenomenon, but is a decent starting point.

(1) Improve the condition on H.

(2) If we can measure the important quantities, then we can test theperformance during the algorithm.

(3) Make use of controllability and observability.

David Kelly (NYU) EnKF December 11, 2014 27 / 28

Page 59: EnKF and filter divergence - math.nist.gov · 12/11/2014  · David Kelly (NYU)EnKFDecember 11, 2014 14 / 28. Filter divergence In certain situations, it has been observed (?) that

Thank you!

Well-posedness and accuracy of the ensembleKalman filter in discrete and continuous time.

D. Kelly, K.Law, A. Stuart.Nonlinearity 2014.www.dtbkelly.com

David Kelly (NYU) EnKF December 11, 2014 28 / 28