Marco Pellegrino, [email protected]

22
METIS work session on statistical metadata Luxembourg, 9 to 11 April 2008 1 UNECE SDMX as a source of standardised terminology: MCV and cross-domain concepts Marco Pellegrino, [email protected]

description

Marco Pellegrino, [email protected]. SDMX as a source of standardised terminology: MCV and cross-domain concepts. Please pass on my regards to former colleagues in SDMX and METIS. Good luck with your meetings. Best regards Denis Ward. - PowerPoint PPT Presentation

Transcript of Marco Pellegrino, [email protected]

Page 1: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

METIS work session on statistical metadataLuxembourg, 9 to 11 April 2008 1

UNECE

SDMX as a source of standardised terminology:MCV and cross-domain concepts

Marco Pellegrino, [email protected]

Page 2: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

2Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

Please pass on my regards

to former colleagues in

SDMX and METIS.

Good luck with your

meetings.

Best regards

Denis Ward

Page 3: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

3Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

Starting point for the MCV: the Tower of Babel

Metadata concepts used for identifying/describing statistics Tower of Babel: same name for a different concept or different

name for the same concept. Code lists jungle. Different metadata and quality frameworks Metadata more and more demanded to assist data

interpretation, but… Metadata still hard to exchange in an automated way

From the Tower of Babel to “lingua franca”? • Syntax Technical standards, SDMX-ML• Semantics Cross-domain concepts, located in the MCV

Page 4: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

4Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

The SDMX Content-Oriented Guidelines

Set of recommended practices - applicable across several statistical subject-matter domains - for creating data and metadata sets using the SDMX standards

Version 1 of the COG is available at www.sdmx.org for public comments up to 31 May 2008

Send comments to: [email protected]

Cc: [email protected]

Page 6: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

6Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

The UNSC Commission…

1. Welcomed the SDMX initiative and recognized with appreciation the sponsors’ leadership in heading an important initiative for more efficient data communication at national and international levels

2. Recognized and supported SDMX as the preferred standard for the exchange and sharing of data and metadata

3. Requested that the sponsors continue their work on this initiative and encouraged further SDMX implementations

4. Emphasized the need to further involve national and international agencies by enabling opportunities for collaboration with the sponsoring organisations in order to influence decision-making and its governance to address their needs, especially in the area of developing cross-domain concepts.

Page 7: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

7Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

Organising cross domain concepts

Collect CDCs that are used across SDMX organisations and their constituencies (an evolving list)

Provide definition and context explanations (linked to Metadata Common vocabulary)

Document usage for data and/or metadata structures

Link to code lists for coded concepts

Map to existing frameworks (e.g. IMF DQAF, Eurostat Metadata Structure, OECD Metastore)

Page 8: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

8Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

Cross-domain concepts (CDC database)

For each concept:– Name and ID– Description and explanation of context– Representation (free text, code list)– Possible role (as a dimension, or attribute, in a DSD or

MSD)– Link to IMF-Eurostat-OECD metadata frameworks

CDCs are not:– a requisite for SDMX technical conformance– an imposition to statistical organisations

CDC are:– a framework to promote reusability of exchanged data and

metadata

Page 9: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

9Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

Page 10: Marco Pellegrino,  marco.pellegrino@ec.europa.eu
Page 11: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

11Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

Page 12: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

12Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

Use of cross-domain concepts

Page 13: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

13Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008 13Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

MCV: Expected benefits and use

Improved visibility for existing definitions (building on existing sources where feasible to avoid a proliferation of “standard” terminologies)

Improved accessibility to a set of standard definitions of metadata terms through a single web address

Facilitate mapping of different metadata systems, including those at national level, independently from any specific metadata model

Support to standardisation and consistency of metadata compiled

Support to XML structures and web services for searching and comparing statistical data and metadata with minimum need to determine “semantic equivalence”

Page 14: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

14Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

Page 15: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

15Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

MCV and general glossaries

MCV(411)

General glossaries(7 000)

SDMX

concepts

(130)

SDMX

concepts

(130)

International

(e.g. Eurostat / OECD)

Terminology

International

(e.g. Eurostat / OECD)

TerminologyNational

terminologyNational

terminology

Page 16: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

16Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

MCV STRUCTURE (February 2008)

Glossary fields

• Title (mandatory)

• Definition (mandatory)

• Context for the definition (optional, but widely used)

• Definition source (mandatory)

• Links to related terms within the glossary (optional)

• URL to more detailed information (optional)

Page 17: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

17Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

RAMON http://ec.europa.eu/eurostat/ramon

CODED

Page 18: Marco Pellegrino,  marco.pellegrino@ec.europa.eu
Page 19: Marco Pellegrino,  marco.pellegrino@ec.europa.eu
Page 20: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

20Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008 20Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

MCV: Issues for discussion

Link between MCV and cross-domain concepts

Scope of the MCV glossary: interaction with other general and domain-specific glossaries, including those at national level

Extent of usage and relevance of terms currently in the MCV. Suggestions for definitions and additional terms

Use of MCV concepts in connection with national metadata systems and national glossaries (translation, mapping)

MCV “flat” structure (term, definition, context, source, related terms, hyperlinks)

Page 21: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

21Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008 21Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

MCV: Issues for discussion (2)

Maintenance and periodic revisions (frequency?)

Use of registry facilities for notifying interest and launching a public review. Notification about amendments to the glossary

Involvement of NSIs and other stakeholders in the MCV revisions

Need for versioning of definitions in MCV – some definitions will evolve / change

Focus on concepts first, and then on translations

Page 22: Marco Pellegrino,  marco.pellegrino@ec.europa.eu

22Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg, 9 to 11 April 2008

Nothing is more practical than a good theory

We are continually faced with a series of great opportunities brilliantly disguised as insoluble problems

Reasonable people adapt themselves to the world Unreasonable people attempt to adapt the world to themselves

All progress, therefore, depends on unreasonable people(George Bernard Shaw)