Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g....
Transcript of Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g....
Data Publication at GFZ
Kirsten Elger, Damian Ulbricht, Roland Bertelmann
Deutsches GeoForschungsZentrum GFZ
• Helmholtz Zentrum für die Erforschung der festen Erde („vom All bis zum Erdkern”)
• ~1200 Angestellte
• FB Erde und Umwelt, Energie• Methodische Kernkompetenzen:
Satellitentechnologien, geodätisch-geophysikalische Messnetze, Tomographie der festen Erde; Forschungsbohrungen, Labor- und Experimentiertechnik; Modellierung von Geoprozessen, usw.
• Die Entwicklung von Datensystemen zur Archivierung, Verbreitung und Publikation von Forschungsdaten ist ein wichtiges Standbein und Service für Wissenschaft und Gesellschaft.
…to long tail data
From global networks…DATA SERVICES
GFZ Data Services
• Beratung rund um Daten• Data Repository• Open Access Verlag (internal Review von STR Data)• DOI Service für andere Netzwerke und Datenzentren• Produkte
– Datenpublikation als supplementary material zu wiss. Artikeln– Datenpublikation im Rahmen von Data Papers– Datenpublikation mit begleitendem Report
•
Unser Repository Service
• Seit 2004 Registrierung von Daten DOI (>450 registrierte Datensätze und Data Collections
• ~30 Data Reports seit 2011 (15 in 2015)• DOI Registrierung für andere GFZ Netzwerke (z.B. GEOFON: >5500
DOI für seismische Events, neu 2015: DOI für seismische Netzwerke, 15)
• 2016ff: FID Geo (Fachinformationsdienst Geowissenschaften der festen Erde, DFG Projekt)– Bundesweites Angebot der Publikation von Datensupplementen für
die Geowissenschaften an Hochschulen und in Forschungsinstituten (soweit keine institutionelle Lösung existiert).
PanMetaDocs/ eSciDoc/ DOIDB –a modular approach
File System
PubMan other systems
DOIDB metadata store
DataCite metadata storeData Portals
eSciDoc
PanMetaDocs
Dataset files
Basis, intern
Anwendungs-ebene
Extern
PMD Metadata Editor
mandatory fieldoptional field
(recommended)
A new line will automatically appear when filling the first line Click here todelete an entry
ClearentriesLoad a previousversionSafe youractualversion
Information on field definition
drop-down menusappear when clicking at the arrow
Stopping the mousepointer over (or clickinginto) a field or a drop-down parameter showsexplanatory informationor definitions
Spatial Domain – visual control via map
Opens:
movedraw
bounding box
draw point
Enter coordinates manually (decimaldegree with at least 4 decimal digits, DD.dddd)
or Select from map
• Manual changes of coordina-tes will be immediately dis-played in the bounding box and vice versa
• You may define the acquisi-tion time of each spatialelement in the same line
Metadata Standards
• Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow databaseinteroperability and metadata exchange between portals.
• However, having standardised metadata does not necessarilyinvolve displaying each variable on landing pages (e.g. a global dataset does not require a map; for seismic networks a stationmap is essential).
• Due to the large variety of geoscientific disciplines, thecommon bracket would not be more than Dublin Core Metadata, which is often not sufficient to identify a suitabledataset.
Some examples
No map required global dataset
structural metadata
International Centre for Global Earth Models: c.150 global gravitationalmodels since the1960s
Seismic Networks - GEOFONGE Net (permanent global network)
station map
citation
citation: GEOFON Data Centre(1993) GEOFON Seismic Network, GFZ Data Services. DOI: 10.14470/…
DOIDatacite metadataStructural metadata
network station list
GIPP Example: Data Report + Data
Data Report (= data description) Datasets
Example STR – Data: GIPP (MINAS) – template
• Abstract (+ coordinates, keywords, related sources)• Introduction• Data Acquisition
– Experiment design and schedule– Geometry/Location and Instrumentation– Acquisition parameters
• Data Processing• Data Description
– File Format– Data content and structure
• Data Quality/Accuracy• Data Availability/Access (“data is restricted until May 2017”)• Acknowledgements, References• Figures on data completeness, logger GPS quality, probabilistic Power
Spectral Densities for subarrays
EnMAP (hyperspectral satellite mission)
Project-specificdesign
Data requestvia form (large
datasets)
Datenpublikation Tereno NO
automatisch-generierte Metadaten
Next step
•International Geo Sample Number IGSN – uniqueidentifier forphysical objects
Data publication with assigned DOI
citableDOI have emerged as the leading system for text and data publication (COPDESS 2015).
persistentlong-term data access guaranteed (by the publisher) despite servers being changed or switched off or people change affiliations and email addresses.
with metadata and data descriptionessential for data re-use and discovery, a comprehensive data description should be made a condition for assigning a DOI to a dataset.