Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick...

22
Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander, Susan Sahm

Transcript of Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick...

Page 1: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

Verification Summit

AMB verification:rapid feedback to guide

model development decisions

Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

Susan Sahm

Page 2: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

MotivationThere is a critical need for both rapid and comprehensive statistical and graphical verification of model forecasts from various AMB experimental models:

RUC, RR, and HRRR

- Real-time parallel cycles as well as retrospective runs

- Two primary types:- Station verification : Upper-air, surface and clouds- Gridded verification: Precipitation, radar reflectivity,

convective probabilities

- Illuminate model biases and patterns to errors

- Essential for evaluating model/assimilation configuration changes

Rapid verification feedback enables timely improvement in forecast skill

Page 3: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

Design GoalsFast computation and display of verification results

(real-time for real-time cycles, day or two for retros)

Simple procedures, but with sufficient options to elucidate key aspects (quantify visual impressions)

Built-in capabilities to allow quick stratification by key parameters (metric, threshold, scale, valid time, initial time, forecast length, region)

Easily accessible web-based presentation of verification results ability to quickly examine aggregate statistics AND single-case plots in complementary manner

Verification design driven by needs of forecast system developers

Page 4: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

Design DetailsUse modified NCEP IPOLATES routines for

interpolation and upscaling of input fields to multiple common grids.

Calculate contingency table fields (YY, YN, NY, NN) for multiple scales, domains, and thresholds:

-- database storage for statistical aggregation -- graphics for each event for detailed evaluation

Web-based interface for aggregate statistics and event graphics

Apply to multiple gridded fields (reflectivity, precipitation, probabilities) and multiple model runs (several version each of RUC, RR, HRRR as well as RCPF, HCPF, etc.)

Page 5: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

Statistics WebpagesComposite Reflectivity

Time Series: http://ruc.noaa.gov/stats/radar/beta/timeseries Valid Times: http://ruc.noaa.gov/stats/radar/beta/validtimes Lead Times: http://ruc.noaa.gov/stats/radar/beta/leadtimes

24 Hour Precipitation Time Series: http://ruc.noaa.gov/stats/precip/beta/timeseries Thresholds: http://ruc.noaa.gov/stats/precip/beta/thresholds

Convective Probabilities Time Series: http://ruc.noaa.gov/stats/prob/beta/timeseries CSI vs Bias: http://ruc.noaa.gov/stats/prob/beta/csibias Reliability Diagrams:

http://ruc.noaa.gov/stats/radar/prob/reliabilitydiagrams ROC Curves: http://ruc.noaa.gov/stats/prob/beta/roc

Page 6: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

Sample “time-series” stats interface

Model RegionScale Averaging

period

Metric

ThresholdForecastLength

Validtime

DateRange

ManyR/T runs

andretros

Page 7: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

RR-dev w/ Pseudo-obs

HRRR-devHRRR

HRRR-dev better

HRRR better

Reflectivity (> 25 dBZ)CSI Eastern US on 40 km grid

(3-day avg)

Models

Thresh

“Time series” mode

Metric

Region

Scale

Sample application of “time-series” stats

Difference

Page 8: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

HRRR-devHRRR

RR-dev w/ Pseudo-obs

HRRR-dev better

HRRR better

Reflectivity (> 25 dBZ)CSI Eastern US on 40 km grid

(3-day avg)

Models

Thresh

“Time series” mode

Metric

Region

Scale

Sample application of “time-series” stats

Difference

Page 9: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

HRRR-devHRRR

RR-dev w/ Pseudo-obs

HRRR-dev better

HRRR better

Reflectivity (> 25 dBZ)CSI Eastern US on 40 km grid

(3-day avg)

Models

Thresh

“Time series” mode

Metric

Region

Scale

Sample application of “time-series” stats

DifferenceImplemented in RR-prim

HRRR-devLongertime-step

Page 10: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

HRRR-devHRRR

RR-dev w/ Pseudo-obs

HRRR-dev better

HRRR better

Reflectivity (> 25 dBZ)CSI Eastern US on 40 km grid

(3-day avg)

Models

Thresh

“Time series” mode

Metric

Region

Scale

Sample application of “time-series” stats

DifferenceImplemented in RR-prim

HRRR-devLongertime-step

RR-devAdded shorter vert.length-scales in RR-dev/GSI

Imple-mented In HRRR

Page 11: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

CSI 25 dBZ 40-km EUS +6h fcst 8-22 Aug

RUCHRRR Better

RRHRRR better

Sample “time-series” stats to examine scatter in forecast differences

August

Page 12: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

Sample application of “lead-time” stats illustrating CSI and bias

“die-off” for different strengths of radar heating

CSI (X100)

Bias (X100)

Forecast Length (hours)0 2 4 6 8 10 0 2 4 6 8 10

Page 13: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

Upscaled verification (especially to 40km and 80km) reveals “neighborhood” skill in HRRR forecasts, especially around the time of convective initiation

20-km

80-km

40-km

3-km

HRRR 25dBZ, 6-h fcst

Valid Time (GMT)

CSI (

x 10

0)

Sample application of “valid time” stats illustrating diurnal variation

in scale-dependent skill

Convective

Initiationtime

00z 04z 08z 12z 16z 20z 00z

Page 14: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

Reflectivity Graphics Webpage

http://ruc.noaa.gov/crefVerif/Welcome.cgi

Page 15: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

12z + 6 hr

3-km40-kmMiss FA Hit

Single case plots showing

“neighborhood” skill

Obs Refl. HRRR fcst

Page 16: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

13-km CONUSComparison

2 X 12 hr fcstvs. CPC 24-h analysis

1 – 31 Dec 2010Matched

RR vs. RUC PrecipitationVerification

RR

RUC

| | | | | | | |0.01 0.10 0.25 0.50 1.00 1.50 2.00 3.00 in.

| | | | | | | |0.01 0.10 0.25 0.50 1.00 1.50 2.00 3.00 in.

CSI(x 100)

RUC

RR

100(1.0)

bias(x 100)

Sample application of “threshold” stats to show skill for range of precip amounts

Page 17: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

Precipitation Graphics Webpageshttp://ruc.noaa.gov/precipVerif

Page 18: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

CPC24-h

precip

RUC

Thrs CSI Bias1.00 .45 1.222.00 .29 1.95

observed

Thrs CSI Bias1.00 .31 0.692.00 .21 0.58

2 x 12h fcst interpolatedto 20-km grid

RR vs. RUC 24-h precip. verif

Single case plots showing

forecast skill for precip.

RR

Page 19: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

RRRUC

Thrs CSI Bias1.00 .45 1.222.00 .29 1.95

Thrs CSI Bias1.00 .31 0.692.00 .21 0.58

1” threshold

Miss FA Hit

CPC24-h

precip

observed

2 x 12h fcst interpolatedto 20-km grid

RR vs. RUC 24-h precip. verif

Single case plots showing

forecast skill for precip.

Page 20: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

2-h fcst4-h fcst6-h fcst

ROC curve CSI vs. bias

Sample display of probability

verification statistics

Work in progress, have display for CCFP and CoSPA probabilities

Plan to add HCPF, RCPF,expand to probabilitiesof other hazards (fog,

high echo-tops, etc.)

Page 21: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

2-h fcst4-h fcst6-h fcst

Sample Reliability Diagram

All plots can zoom

Page 22: Verification Summit AMB verification: rapid feedback to guide model development decisions Patrick Hofmann, Bill Moninger, Steve Weygandt, Curtis Alexander,

Conclusion

• The verification system, including both the statistical and graphical webpages, greatly aids evaluation of model performance within AMB and facilitates rapid assessment of experimental configurations and improvements in real-time.

• We are also able to verify retrospective cases of scientific interest in very quick succession for use in presentations and publications for outreach endeavors.