COPS data management: cops@zmaw · COPS data management: [email protected] 25.-26.09.06 / 2 Data archive...

29
COPS data management: [email protected] 25.-26.09.06 / 1 Data management and archiving for COP / GOP / D_PHASE 4 th COPS Workshop 25./26.09. 2006 Stuttgart Claudia Wunram Hannes Thiemann

Transcript of COPS data management: cops@zmaw · COPS data management: [email protected] 25.-26.09.06 / 2 Data archive...

Page 1: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 1

Data management and archivingfor

COP / GOP / D_PHASE

4th COPS Workshop25./26.09. 2006 Stuttgart Claudia Wunram

Hannes Thiemann

Page 2: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 2

Data archive

Long term data archive for

COPS, GOP and D-PHASE

hosted at

World Data Centre for Climate (WDCC)

run by the group

“Model and Data” (M&D)

at

Max Planck Institute for Meteorology,

in

Hamburg, Germany.

Page 3: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 3

Content

• WDCC as data archive in COPS-campaign• Common data policy with interlinked projects• Tasks of data archive and expected storage amounts• Data transfer, responsabilities for quality control• Data formats• Meta data description• Data structure• Data access• Next steps: test runs• Outlook• Contact info

Page 4: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 4

WDCC Content

Data fromEarth SystemModelling andRelatedObservations

• Mission: collect, store and disseminate data for climate research• Approved in January 2003• March 2006: 220 TB / 566 Experiments / 77.000 Data Sets

ERA40

IPCC

CEOPBALTEX

HOAPS

CARIBIC

WOCE

ERA15/40NCEP

GEBCO

COSMOS

Simulations @ MPI, GKSS,…

EH5/MPI-OMIPCC-AR4

ENSEMBLES

IPCC-DDC

COPS

GOP

DPHASE

Page 5: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 5

WDCC as data archive

in COPS campaignand interlinked

projects

Page 6: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 6

Common data policy

• As announced in data implementation plan

• Agreed on by all PIs and M&D

• All investigators deliver promptly their data to the archive (final version 03/2008)

• M&D gives access rights according to announcements of COPS coordinator (groups and timeline)

Page 7: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 7

• archive instrument data, model data, quicklooks and alerts forobservation periods:

• GOP: JAN 07 – DEC 07• COPS: JUN 07 – AUG 07 • DPHASE: JUN 07 – NOV 07

• define meta data layout and handle implementation• offer service within the frame of data storage at WDCC and

help to access to data base• no real time data handling can be done by M&D• host data base link to external data:

• EUMETSAT, 3D radar (DWD)• LMK (high resolution forecast model)

Tasks as COPS-data archive

Page 8: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 8

• Data storage volume for COPS, GOP and D-PHASE:

• 20 TB

• Estimated data volume:

• GOP: 3+ TB

• COPS instruments: 2 TB

• COPS models: 10 TB

• D-PHASE: 5 TB

• Plus processing area on M&D work group server:

•~500 GB + CPU (visualization tasks, quick access)

Storage amounts:

Page 9: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 9

AMF data

• Observation period: APR 07 to DEC 07

• Data volume: ~ 150 GB

• Data transfer: at the end of observation period

(shipped on disk, …)

Page 10: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 10

Data transfer

WDCC data baseCERA

checksum

checksumupload areain file system

data

ftp

meta data

ftp

data provider

unix account

user instruction- data structure- data upload

Page 11: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 11

processing area

ssh

D-PHASE PI‘s/UHOH

500GB

Data flow: visualization

WDCC data baseCERA

meta dataftp

COPS OCssh

sftp

pics

ftp

upload areain file system

data

ftp

Page 12: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 12

Data control

M&D:• technical controls (time stamp, consistency of time series)

Data providers:• responsible for quality of data file content and meta data content• responsible for data transfer (checksum tests)

Page 13: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 13

Accepted data formats:

model data

instrument data

quicklooks

meta data xml

GRIB1, netCDF/CF

netCDF/CF

jpg, gif, png, eps, …

CF-convention for meta data description is strongly advised:Variable names are described by CF-standard names

-> search in data base and intercomparison

Page 14: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 14

Entry

Reference

Status

Distribution

Contact Coverage

Parameter

SpatialReference

Data Org

Meta data information

Page 15: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 15

Meta data formular (1)

output is xml-file

webbased or local fill in

Page 16: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 16

Meta data formular (2)

Page 17: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 17

Data structure 1

Upload data structuredefines the access optionsfor downloading

WDCC data baseCERA

download

Data sets

upload

Page 18: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 18

Data structure 2WDCC

data base

CERA

Examples for download structure/data set definition:

A: focus on case studies (COPS, D-PHASE ?)

• Specific day -> all instruments, models, pics

B: focus on statistics (GOP ?)

• Specific parameter -> timeseries of observation period

C: other

• vertical model profiles / subregions

According to user needs

Page 19: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 19

view meta datadownload data via

web interface

CERA data base

download data in

batch mode

data userCERA user account

set access rightsaccording to data policy

Data access

Page 20: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 20

• Define data structure model (-> investigators)• Provide meta data formular to investigators

• Test runs for data delivery and upload are needed• Prior to campaign start of each project • Each data group has to deliver representative test data

• and full meta data description

• Test run timeline• GOP: NOV 2006• DPHASE: FEB 2007• COPS: APR 2007

Next steps

Page 21: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 21

• Registration of data as DOI (digital object identifier) is strongly advised

• Advantages:• data in final version are peer reviewed by review agency• citation of published data is possible like a reviewed scientific article• completeness of data set descriptions (metadata) is needed• quality of data values (precision, sequence and ranges) is needed

Outlook

Page 22: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 22

contact information

Service email adress:[email protected]

User information on:cops.wdc-climate.de

Page 23: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 23

COPS data management web infocops.wdc-climate.de

Page 24: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 24

M&D webpagewww.mad.zmaw.de

Page 25: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 25

CERA interface (1)• browse / login

Page 26: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 26

COPS

CERA interface (2)• select experiment

Page 27: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 27

CERA interface (3)• select data set

Page 28: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 28

CERA interface (4)• view meta data

Page 29: COPS data management: cops@zmaw · COPS data management: cops@zmaw.de 25.-26.09.06 / 2 Data archive Long term data archive for COPS, GOP and D-PHASE hosted at World Data Centre for

COPS data management: [email protected] / 29

End