DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr....

56
DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA

description

DoD CG/AR at Colorado State University AT737 November 19, The Purpose of Data Assimilation u Why do data assimilation? 1.I want better model initial conditions for better model forecasts 2.I want better calibration and validation (cal/val) 3.I want better acquisition guidance

Transcript of DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr....

Page 1: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 1

What is Data Assimilation?

Dr. Andrew S. JonesCSU/CIRA

Page 2: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 2

Data Assimilation

Outline Why Do Data Assimilation? Who and What Important Concepts Definitions Brief History Common System Issues / Challenges

Page 3: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 3

The Purpose of Data Assimilation

Why do data assimilation?1. I want better model initial conditions for better model forecasts2. I want better calibration and validation (cal/val)3. I want better acquisition guidance

Page 4: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 4

Why do data assimilation?1. I want better model initial conditions for better model forecasts2. I want better calibration and validation (cal/val)3. I want better acquisition guidance4. I want better scientific understanding of

Model errors (and their probability distributions) Data errors (and their probability distributions) Combined Model/Data correlations DA methodologies (minimization, computational optimizations,

representation methods, various method approximations) Physical process interactions (i.e., sensitivities and feedbacks)

Leads toward better future modelsVIRTUOUS CYCLE

The Purpose of Data Assimilation

Page 5: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 5

The Data Assimilation Community

What skills are needed by each involved group? NWP Data Assimilation Experts (DA system methodology) NWP Modelers (Model + Physics + DA system) Application and Observation Specialists (Instrument capabilities) Physical Scientists (Instrument + Physics + DA system) Radiative Transfer Specialists (Instrument config. specifications) Applied Mathematicians (Control theory methodology) Computer Scientists (DA system + OPS time requirements) Science Program Management (Everything + $$ + Good People) Forecasters (Everything + OPS time reqs. + Easy/fast access) Users and Customers (Could be a wide variety of responses)

e.g., NWS / Army / USAF / Navy / NASA / NSF / DOE / ECMWF

Page 6: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 6

The Data Assimilation Community

Are you part of this community? Yes, you just may not know it yet.

Page 7: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 7

Bayes Theorem

Maximum Conditional Probability is given by:

P (x | y) ~ P (y | x) P (x)

Assuming Gaussian distributions…

P (y | x) ~ exp {-1/2 [y – H (x)]T R-1 [y – H (x)]}P (x) ~ exp {-1/2 [x –xb]T B-1 [x – xb]}

e.g.,3DVAR

Lorenc (1986)

Page 8: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 8

Minimization Process

TRUTH

Jacobian of the Cost Function is used in the minimization procedure

Minima is at J/ x = 0

Issues:Is it a global minima?Are we converging rapid

or slow?

J

x

Page 9: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 9

The Building Blocks of Data Assimilation

NWP Model

Observations

NWPAdjoint

Minimization

Observation ModelAdjoint

Control Variablesare the initial modelstate variablesthat are optimizedusing the new datainformation as a guide

They can also includeboundary conditioninformation, modelparameters for“tuning”, etc.

Observation Model

Start

Page 10: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 10

What Are We Minimizing?

)x(xB)x(x21y))](x(h[MRy))](x(h[M

21 b0

1Tb00i0,

1T0i0,

(time) i

J Minimize discrepancy between model and observation data over time

The Cost Function, J, is the link between the observational data and the model variables

Observations are either assumed unbiased, or are “debiased” by some adjustment method

Page 11: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 11

How are Data used in Time?

Assimilation time window

observations

)(ε)(xx

ε)](x[y

x01-i0,i

y0i0,

GM

Mh

Observation model

Cloud resolving model

time

forecast

Page 12: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 12

Assimilation time window

observationstime

forecast

A “Smoother” Uses All Data Availablein the Assimilation Window

(a “Simultaneous” Solution)

)(ε)(xx

ε)](x[y

x01-i0,i

y0i0,

GM

Mh

Observation model

Cloud resolving model

Page 13: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 13

Assimilation time window

observationstime

forecast

A “Filter” Sequentially Assimilates Dataas it Becomes Available in each Cycle

)(ε)(xx

ε)](x[y

x01-i0,i

y0i0,

GM

Mh

Observation model

Cloud resolving model

Page 14: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 14

Assimilation time window

observationstime

forecastCycle PreviousInformation

)(ε)(xx

ε)](x[y

x01-i0,i

y0i0,

GM

Mh

Observation model

Cloud resolving model

A “Filter” Sequentially Assimilates Dataas it Becomes Available in each Cycle

Page 15: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 15

Assimilation time window

observationstime

forecastCycle PreviousInformation

)(ε)(xx

ε)](x[y

x01-i0,i

y0i0,

GM

Mh

Observation model

Cloud resolving model

A “Filter” Sequentially Assimilates Dataas it Becomes Available in each Cycle

Page 16: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 16

Cycle Physics “Barriers”What Can Overcome the Barrier?1. Linear Physics Processes and2. Propagated Forecast Error Covariances

time

forecast

)(ε)(xx

ε)](x[y

x01-i0,i

y0i0,

GM

Mh

Observation model

Cloud resolving model

A “Filter” Sequentially Assimilates Dataas it Becomes Available in each Cycle

Page 17: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 17

Who are the Candidates for “Truth”? Minimize discrepancy between model and observation data over time

Candidate 1: Background Term“x0” is the model state vector at the initial time

t0

this is also the “control variable”,the object of the minimization process

“xb” is the model background state vector“B” is the background error covariance

of the forecast and model errors

)x(xB)x(x21y))](x(h[MRy))](x(h[M

21 b0

1Tb00i0,

1T0i0,

(time) i

J

Page 18: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 18

What Do We Trust for “Truth”? Minimize discrepancy between model and observation data over time

Candidate 1: Background TermThe default condition for the assimilation when1. data are not available or2. the available data have no significant sensitivity

to the model state or3. the available data are inaccurate

)x(xB)x(x21y))](x(h[MRy))](x(h[M

21 b0

1Tb00i0,

1T0i0,

(time) i

J

Page 19: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 19

Model Error Impacts our “Trust” Minimize discrepancy between model and observation data over time

Candidate 1: Background TermModel error issues are importantModel error varies as a function of the model timeModel error “grows” with timeTherefore the background term should be trusted

more at the initial stages of the model run and trusted less at the end of the model run

)x(xB)x(x21y))](x(h[MRy))](x(h[M

21 b0

1Tb00i0,

1T0i0,

(time) i

J

Page 20: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 20

)x(xB)x(x21y))](x(h[MRy))](x(h[M

21 b0

1Tb00i0,

1T0i0,

(time) i

J

How to Adjust for Model Error? Minimize discrepancy between model and observation data over time

Candidate 1: Background Term1. Add a model error term to the cost function so that the

weight at that specific model step is appropriately weighted or

2. Use other possible adjustments in the methodology, i.e., “make an assumption” about the model error impacts

If model error adjustments or controls are used the DA system is said to be “weakly constrained”

Page 21: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 21

What About Model Error Errors? Minimize discrepancy between model and observation data over time

Candidate 1: Background TermModel error adjustments to the weighting can be “wrong”

In particular, most assume some type of linearity Non-linear physical processes may break these

assumptions and be more complexly interrelatedA data assimilation system with no model error control is said to

be “strongly constrained” (perfect model assumption)

)x(xB)x(x21y))](x(h[MRy))](x(h[M

21 b0

1Tb00i0,

1T0i0,

(time) i

J

Page 22: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 22

What About Model Error Errors?

A StronglyConstrainedSystem?

“I just can’t run like I used to.”

Model“Little Data People”

Page 23: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 23

What About Model Error Errors?

A StronglyConstrainedSystem?

Can Data Over Constrain?

“We’ll… no one’s perfect.”

DA expert

Page 24: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 24

)x(xB)x(x21y))](x(h[MRy))](x(h[M

21 b0

1Tb00i0,

1T0i0,

(time) i

J

Candidate 2: Observational Term“y” is the observational vector, e.g., the satellite input data

(typically radiances), salinity, sounding profiles“M0,i(x0)” is the model state at the observation time “i”“h” is the observational operator, for example the

“forward radiative transfer model”“R” is the observational error covariance matrix that specifies

the instrumental noise and data representation errors (currently assumed to be diagonal…)

Who are the Candidates for “Truth”? Minimize discrepancy between model and observation data over time

Page 25: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 25

What Do We Trust for “Truth”? Minimize discrepancy between model and observation data over time

Candidate 2: Observational TermThe non-default condition for the assimilation when

1. data are available and2. data are sensitive to the model state and3. data are precise (not necessarily “accurate”) and4. data are not thrown away by DA “quality control”

methods

)x(xB)x(x21y))](x(h[MRy))](x(h[M

21 b0

1Tb00i0,

1T0i0,

(time) i

J

Page 26: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 26

What About other DA Errors?

Overlooked Issues?1. Data debiasing relative to the DA system

“reference”. It is not the “Truth”,however it is self-consistent.

2. DA Methodology Errors?1. Assumptions: Linearization, Gaussianity, Model

errors2. Representation errors (space and time)3. Poorly known background error covariances4. Imperfect observational operators5. Overly aggressive data “quality control”6. Historical emphasis on dynamical impact vs. physical

Synoptic vs. Mesoscale?

Page 27: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 27

DA Theory is Still MaturingThe Future: Lognormal DA (Fletcher and Zupanski, 2006, 2007)

Gaussian systems typically force lognormal variables to become Gaussian introducing an avoidable data assimilation system bias

Many important variablesare lognormally distributed

Gaussian data assimilation systemvariables are “Gaussian”

Add DA

Bias Here!

LognormalVariablesCloudsPrecipitationWater vaporEmissivities

Many otherhydrologicfields

Mode Mean

Page 28: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 28

What “Truth” Do We Have? Minimize discrepancy between model and observation data over time

DATA MODELCENTRIC CENTRIC

TRUTH

)x(xB)x(x21y))](x(h[MRy))](x(h[M

21 b0

1Tb00i0,

1T0i0,

(time) i

J

Page 29: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 29

What Approach Should We Use?

DATA MODELCENTRIC CENTRIC

TRUTH

Page 30: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 30

What Approach Should We Use?

DATA MODEL

CENTRIC CENTRIC

TRUTH

Page 31: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 31

My My PrecioPrecious…us…

We Trust the We Trust the Model!Model!

Data hurts Data hurts us!,us!,

Yes…Yes…

What Approach Should We Use?

DATA MODEL

CENTRIC CENTRIC

TRUTH

Page 32: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 32

MODEL CENTRIC FOCUS

DATA MODELCENTRIC CENTRIC

FOCUS ON“B” Background Error

Improvements are Needed“xb” Associated background

states and “Cycling” are more heavily emphasized

DA method selection tends toward sequential estimators, “filters”, and improved representation of the forecast model error covariances

E.g., Ensemble Kalman Filters,other Ensemble Filter systems

Page 33: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 33

What Approach Should We Use?

DATA MODELCENTRIC CENTRIC

TRUTH

This is not to say thatall model-centric improvementsare bad…

Page 34: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 34

What Approach Should We Use?

DATA

MODEL

CENTRIC

CENTRIC

TRUTH

Page 35: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 35

My Precious…My Precious…We Trust the Data!We Trust the Data!Models unfair and Models unfair and

hurts us!,hurts us!,Yes…Yes…

What Approach Should We Use?

DATA

MODEL

CENTRIC

CENTRIC

TRUTH

Page 36: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 36

DATA CENTRIC FOCUS

DATA DATACENTRIC CENTRIC

FOCUS ON“h” Observational Operator

Improvements are Needed“M0,i(x0)” Model state capabilities and

independent experimental validation is more heavily emphasized

DA method selection tends toward “smoothers” (less focus on model cycling), more emphasis on data quantity and improvements in the data operator and understanding of data representation errors e.g., 4DVAR systems

Page 37: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 37

DUAL-CENTRIC FOCUS

Best of both worlds?Solution: Ensemble based forecast covariance

estimates combined with 4DVAR smoother for research and 4DVAR filter for operations?

Several frameworks to combine the two approaches are in various stages of development now…

Page 38: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 38

What Have We Learned?

DATA MODELCENTRIC CENTRIC

TRUTH

Your Research Objective is CRITICAL to making the right choices…

1. Operational choices may supercede good research objectives2. Computational speed is always critical for operational purposes3. Accuracy is critical for research purposes

Page 39: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 39

DA Theory is Still Maturing

A Brief History of DA1. Hand Interpolation2. Local polynomial interpolation schemes

(e.g., Cressman)3. Use of “first guess”, i.e., a background4. Use of an “analysis cycle” to regenerate

a new first guess5. Empirical schemes, e.g., nudging6. Least squares methods

1. Variational DA (VAR)2. Sequential DA (KF)3. Monte Carlo Approx. to Seq. DA (EnsKF)

Page 40: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 40

Optimal Interpolation (OI)

)]([a bb h xyWxx

OI merely means finding the “optimal” Weights, W

Eliassen (1954), Bengtsson et al. (1981), Gandin (1963)Became the operational scheme in early 1980s and early 1990s

1)( RHBHBHW TT

A better name would have been “statistical interpolation”

Page 41: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 41

Variational Techniques

Finds the maximum likelihood (if Gaussian, etc.)(actually it is a minimum variance method)

Comes from setting the gradient of the cost function equal to zero

Control variable is xa

Major Flavors: 1DVAR (Z), 3DVAR (X,Y,Z), 4DVAR (X,Y,Z,T)Lorenc (1986) and others…

Became the operational scheme in early 1990s to the present day

xH

xyRHHRHBxx

hhTT

ba )]([)( 111

Page 42: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 42

What is a Hessian?

A Rank-2 Square MatrixContaining the Partial Derivatives of the Jacobian

G(f)ij(x) = DiDj f(x)

The Hessian is used in some minimization methods,e.g., quasi-Newton…

)()( 11 nnnn ff xxGxx

Page 43: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 43

The Role of the Adjoint, etc.Adjoints are used in the cost function minimization procedure

But first…

Tangent Linear Models are used to approximate the non-linear model behaviors

L x’ = [M(x1) – M(x2)] / L is the linear operator of the perturbation modelM is the non-linear forward model

is the perturbation scaling-factorx2 = x1 + x’

Page 44: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 44

Useful Properties of the Adjoint

<Lx’, Lx’> <LTLx’, x’> LT is the adjoint operator of the perturbation model

Typically the adjoint and the tangent linear operator can be automatically created using automated compilers

y = (x1, …, xn, y)

*xi = *xi + *y /xi

*y = *y /y where *xi and *y are

the “adjoint” variables

Page 45: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 45

Useful Properties of the Adjoint

<Lx’, Lx’> <LTLx’, x’> LT is the adjoint operator of the perturbation model

Typically the adjoint and the tangent linear operator can be automatically created using automated compilers

Of course, automated methods fail for complex variable types

(See Jones et al., 2004)

E.g., how can the compiler know when the variable is complex, when codes are decomposed into real and imaginary parts as common practice? (It can’t.)

Page 46: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 46

Sequential Techniques

B is no longer static, B => Pf = forecast error covariancePa (ti) is estimated at future times using the model

K = “Kalman Gain” (in blue boxes)

Extended KF, Pa is found by linearizing the model about the nonlinear trajectory of the model between ti-1 and ti

Kalman (1960) and many others…These techniques can evolve the forecast error covariance fields

similar in concept to OI

)()}])([)({()(

)]}([}{])([)({)()(1

1

if

iTii

fii

Tii

fi

a

ifT

iif

iiTii

fi

fi

a

tttt

tHtttt

PHHPHRHPIP

xyHPHRHPxx

Page 47: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 47

Sequential Techniques

f is a particular forecast instancel is the reference state forecastPf is estimated at future times using the modelK number model runs are required(Q: How to populate the seed perturbations?)

Sampling allows for use of approximate solutions Eliminates the need to linearize the model (as in Extended KF) No tangent linear or adjoint models are needed

Ensembles can be used in KF-based sequential DA systemsEnsembles are used to estimate Pf through Gaussian “sampling” theory

K

lk

Tfl

fk

fl

fk

fl K

)()(2

1 xxxxP

Page 48: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 48

Sequential Techniques

Notes on EnsKF-based sequential DA systems1. EnsKFs are an approximation2. Underlying theory is the KF (circa 1960)3. Assumes Gaussian distributions4. Many ensemble samples are required5. Can significantly improve Pf

6. Where does H fit in? Is it fully “resolved”?7. What about the “Filter” aspects?Future Directions Research using Hybrid EnsKF-Var techniques

Page 49: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 49

Sequential Techniques

NE is the number of ensemblesS is the state-space dimensionEach ensemble is carefully selected to represent the

degrees of freedom of the systemSquare-root filter is built-in to the algorithm assumptions

Zupanski (2005): Maximum Likelihood Ensemble Filter (MLEF)Structure function version of Ensemble-based DA(Note: Does not use sampling theory, and ismore similar to a variational DA scheme usingprinciple component analysis (PCA)

][ 2121 f

Nff

f EpppP

fiS

fi

fi

fi

p

pp

,

,2

,1

p

Page 50: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 50

Where is “M” in all of this?

3DDA Techniques have no explicit model time tendency information, it is all done implicitly with cycling techniques, typically focusing only on the Pf term

4DDA uses M explicitly via the model sensitivities, L, and model adjoints, LT,as a function of time

Kalman Smoothers (e.g., also 4DEnsKS) would likewise also need to estimate L and LT

No Mused

M used

Page 51: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 51

4DVAR Revisited(for an example see Poster NPOESS P1.16 by Jones et al.)

LT is the adjoint which is integrated from ti to t0

Adjoints are NOT the “model running in reverse”,but merely the model sensitivities being integratedin reverse order, thus all adjoints appear to functionbackwards. Think of it as accumulating the“impacts” back toward the initial control variables.

Automatically propagates the Pf within the cycle, however can not save the result for the next analysis cycle (memory of “B” info becomes lost in the next cycle) (Thepaut et al., 1993)

)(

})]([{

00

0,01

0

0,0,01

0

t

M

h

MhJ

ii

ii

iiiTi

N

i

Tib

xxx

L

xH

yxRHLxxBx

Page 52: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 52

obs1

obs2

xtime

Geographically distant observations can bring more information than close-by observations, if in a dynamically significant region

grid-point

Ensembles: Flow-dependent forecast error covariance and spread of information from observations

t0

t1

t2

Isotropic

correlations

From M. Zupanski

Page 53: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 53

J=const.

min

x0

xmin

J=const.

Physical space

Preconditioning space

-g

-gx

Preconditioning the Space

“Preconditioners” transform thevariable space so that fewer iterationsare required while minimizing the cost function

x ->

Result: faster convergence

From M. Zupanski

Page 54: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 54

Incremental VARCourtier et al. (1994)

Most Common 4D framework in operational use

Incremental form performs Linear minimization within a lower dimensional space (the inner loop minimization)

Outer loop minimization is at the full model resolution(non-linear physics are added back in this stage)

Benefits:Smoothes the cost function and assures better

minimization behaviorsReduces the need for explicit preconditioning

Issues: Extra linearizations occur It is an approximate form of VAR DA

Page 55: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 55

Types of DA Solution Spaces1. Model Space (x)2. Physical Space (y)3. Ensemble Sub-space

e.g., Maximum Likelihood Ensemble Filter (MLEF)

Types of Ensemble Kalman Filters1. Perturbed observations (or stochastic)2. Square root filters (i.e., analysis perturbations are

obtained from the Square root of the Kalman Filter analysis covariance)

Page 56: DoD CG/AR at Colorado State University AT737 November 19, 2008 1 What is Data Assimilation? Dr. Andrew S. Jones CSU/CIRA.

DoD CG/AR at Colorado State University AT737 November 19, 2008 56

Data AssimilationConclusions Broad, Dynamic,

Evolving, Foundational Science Field!

Flexible unified frameworks, standards, and funding will improve training and education

Continued need for advanced DA systemsfor research purposes(non-OPS)

Can share OPS framework components,e.g., JCSDA CRTM

Data Assimilation

Thanks! ([email protected])

For more information…Great NWP DA Review Paper (By Mike Navon)ECMWF DA training materialsJCSDA DA workshophttp://people.scs.fsu.edu/~navon/pubs/JCP1229.pdfhttp://www.ecmwf.int/newsevents/training/rcourse_notes/http://www.weatherchaos.umd.edu/workshop/