COAR Resource Types
COAR Resource Types – a SKOSified Vocabulary for Open Repositories
Jochen Schirrwagen, Bielefeld University Library, Germany
Imma Subirats, Food and Agriculture Organization (FAO) of the United Nations, Italy
Kathleen Shearer, Confederation of Open Access Repositories (COAR)
OR 2016 Conference, Dublin, 15 Jun 2016
COAR Interest Group “Controlled Vocabulary for Repository Assets“
COAR at a Glance
“COAR aims to facilitate the vision by bringing together research repositories as part of a global infrastructure; to link across continents and around the world, enabling new forms of research and supporting new models of scholarly communication.”
• More than 100 member organizations worldwide
• Major activities:
– International voice
– Alignment and interoperability
– Cultivating relationships
– Building capacity
– Adopting value-added services
About the COAR Interest Group “Controlled Vocabularies” and Editorial Board
Set up in 2014 by COAR members and external experts
Two-fold strategy (from a neutral perspective):
• Establish a forum to discuss vocabulary issues and make recommendations for repository managers and information specialists
• Define a set of controlled vocabularies (based on the info:eu-repo application profile)
Editorial Board formed by volunteering IG members:
• For definition and maintenance of concepts
• For label translations
• For provision of the vocabularies
• For outreach and collaboration with the repository developer community
Vocabularies in Scope and Under Review
What can be maintained by COAR?
• Resource (publication) types
• Access rights
• Document version types
• Date types (incl. dates to express embargo periods)
What alternatives can be recommended by COAR?
• Authority Files for Funder (or Organizations) and GrantIds
• LOC Identifier Vocabulary to express resource identifier schemes
• Authority Files for Author and Contributor IDs
• LOC Classification scheme vocabulary
• Rights and License statements (like creative commons, rightsstatements.org)
Context and Scope – Capturing the Diversity of Vocabularies about Resource (Publication) Types
• Established vocabularies: CASRAI, CERIF, DCMI-Terms, PubMed, DataCite Schema, e-LIS, PURE, info:eu-repo/semantics
• And thousands of arbitrary strings in multiple languages
Methodological Approach
• Revision of vocabularies and terms from “info:eu-repo”
• Comparison (and matching) with other established vocabularies and dictionaries
• Statistical analysis of the terms used in repository metadata
• Workflow-controlled, web-based editorial process with the help of VocBench (originally used for Agrovoc)
Most Frequently Used Terms in dc:type
dc:type analysis over 81 million records from 3,870 data providers in BASE (http://basesearch.net), November 2015
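A term-frequency analysis of this kind can be sketched in a few lines. The snippet below is only an illustration under stated assumptions: the record layout (a dict with a "dc:type" list), the normalization rule, and the toy records are all invented here and are not the BASE data or the method used for the analysis above.

```python
from collections import Counter


def normalize(value: str) -> str:
    """Lower-case, trim, and collapse whitespace so trivial variants merge."""
    return " ".join(value.strip().lower().split())


def top_types(records, n=3):
    """Count normalized dc:type values over all records; return the n most common."""
    counts = Counter()
    for record in records:
        for raw in record.get("dc:type", []):
            counts[normalize(raw)] += 1
    return counts.most_common(n)


# Toy records, invented for illustration only (NOT taken from BASE).
records = [
    {"dc:type": ["Article", "info:eu-repo/semantics/article"]},
    {"dc:type": ["article "]},
    {"dc:type": ["Doctoral Thesis"]},
    {"dc:type": ["ARTICLE", "Text"]},
]

result = top_types(records, n=2)
print(result)
```

Even this simple normalization merges "Article", "article " and "ARTICLE" into one term, which hints at why free-text dc:type values are hard to aggregate without a controlled vocabulary.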
SKOS – Super Briefly Explained
Image credit: Florian Thiery, http://i3mainz.hs-mainz.de/sites/default/files/public/data/predicatecanon.png
Common data model for knowledge organization systems
“to provide a bridge between these communities and the Semantic Web by transferring existing models of knowledge organization to the Semantic Web technology context, and by providing a low-cost migration path for porting existing knowledge organization systems to RDF.”
“to provide a bridge between different communities of practice within the library and information sciences involved in the design and application of knowledge organization systems.”
VocBench: Vocabulary Editing and Workflow Tool
Screenshot callouts: Concept, Multilingual Labels, Mappings
Implementation
Linked Data Frontend Serving Humans …
Concept URI
Concept Definition
Multilingual Labels
Hierarchy and Matches (Mappings)
…and Machines
COAR Resource Type Controlled Vocabulary
• More than 50 concepts supported
• Labels available in (currently) 12 languages:
– English, German, French, Spanish, Catalan, Italian, Chinese, Japanese, Russian, Portuguese, Dutch, Turkish
• Concepts are assigned permanent identifiers (URIs)
• Hierarchical structure
• Mappings (‘matches’) to terms of other controlled vocabularies that mean the same or similar thing
• Published under CC-BY 4.0
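To make the features listed above more concrete, here is a minimal sketch of how one concept with a permanent URI, multilingual labels, a broader concept, and a mapping could be written out as SKOS in Turtle syntax. Only the concept URI comes from these slides; the labels, the broader concept, and the match target are placeholders assumed for illustration, and the `Concept` class is a hypothetical helper, not part of any COAR tooling.

```python
from dataclasses import dataclass, field

SKOS = "http://www.w3.org/2004/02/skos/core#"


@dataclass
class Concept:
    uri: str
    pref_labels: dict                       # language tag -> preferred label
    broader: list = field(default_factory=list)
    exact_matches: list = field(default_factory=list)

    def to_turtle(self) -> str:
        """Serialize the concept as one Turtle subject with SKOS predicates."""
        lines = [f"<{self.uri}> a skos:Concept"]
        for lang, label in sorted(self.pref_labels.items()):
            # Toy serializer: labels containing quotes would need escaping.
            lines.append(f'    skos:prefLabel "{label}"@{lang}')
        for target in self.broader:
            lines.append(f"    skos:broader <{target}>")
        for target in self.exact_matches:
            lines.append(f"    skos:exactMatch <{target}>")
        return f"@prefix skos: <{SKOS}> .\n\n" + " ;\n".join(lines) + " .\n"


concept = Concept(
    uri="http://purl.org/coar/resource_type/c_5ce6",         # URI shown in these slides
    pref_labels={"en": "software", "de": "Software"},        # assumed labels
    broader=["http://example.org/placeholder/parent"],       # placeholder, not a real COAR URI
    exact_matches=["http://example.org/other-vocab/software"],  # placeholder mapping target
)

turtle = concept.to_turtle()
print(turtle)
```

The hierarchy is carried by skos:broader and the mappings by skos:exactMatch; looser correspondences would use skos:closeMatch instead.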
Concepts in the Resource Type Vocabulary v1
Usage Scenarios And Added Value for Open Access Repositories
Supporting consistent and multilingual browsing in repository or aggregator user interfaces
Consistent use in repository metadata and metadata transfer across repository networks globally
Proper resource typing is a prerequisite for calculating reliable altmetrics (see, e.g., the activity on non-traditional output types: http://www.niso.org/topics/tl/altmetrics_initiative/)
Adoption By Repositories and in Metadata Guidelines
EPrints plugin: mapping of EPrints types to COAR Resource Types (http://bazaar.eprints.org/422/), tested e.g. in the E-LIS repository
Implementation approach for Phaidra International digital repositories
DSpace prototype implementation provided by the University of Minho
Supported in the upcoming release of the OpenAIRE Repository Manager Guidelines
COAR Vocabs. -> DSpace Workflow Approach
DSpace supports controlled vocabularies in the search and submission processes.
• Supported controlled vocabularies are expressed in a simple XML format (“DSpace node schema”).
• All information about a term is enclosed in a <node> element.
• Only hierarchical relationships can be expressed, using the <isComposedBy> subelement.
• The <hasNote> subelement provides a simple annotation mechanism.
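The node format described above can be illustrated by generating a small vocabulary file. This is a sketch under assumptions: the `id` and `label` attributes on <node>, the idea of storing the COAR URI in <hasNote>, the term label, and the input dictionary layout are illustrative choices made here, not the prototype's actual conversion.

```python
import xml.etree.ElementTree as ET


def build_node(term: dict) -> ET.Element:
    """Turn a nested term dictionary into a <node> element tree."""
    # Attribute names "id" and "label" are assumed for this sketch.
    node = ET.Element("node", {"id": term["id"], "label": term["label"]})
    if term.get("note"):
        # <hasNote> carries a free-text annotation; here, the concept URI.
        ET.SubElement(node, "hasNote").text = term["note"]
    children = term.get("children", [])
    if children:
        # <isComposedBy> wraps narrower terms, expressing the hierarchy.
        container = ET.SubElement(node, "isComposedBy")
        for child in children:
            container.append(build_node(child))
    return node


# Toy vocabulary: one root with a single child term (label assumed).
vocabulary = {
    "id": "resource_types",
    "label": "Resource Types",
    "children": [
        {
            "id": "c_5ce6",
            "label": "software",
            "note": "http://purl.org/coar/resource_type/c_5ce6",
        },
    ],
}

root = build_node(vocabulary)
xml_text = ET.tostring(root, encoding="unicode")
print(xml_text)
```

Because the format only expresses hierarchy and notes, SKOS features such as multilingual labels and mappings have no direct place in it and would be lost or squeezed into annotations.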
DSpace OAI Interface
• Context (set): OpenAIRE
• Change the info:eu-repo namespace
• Expose dc:type as a COAR PURL:
http://purl.org/coar/resource_type/c_5ce6
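The idea of exposing dc:type as a COAR PURL can be sketched as a lookup applied to each outgoing value. The mapping table below is hypothetical: pairing the local string "software" with c_5ce6 is an assumption made for illustration, and `expose_types` is an invented helper, not a DSpace function.

```python
COAR_NS = "http://purl.org/coar/resource_type/"

# Hypothetical mapping from local type strings to COAR concept URIs.
# The pairing below is an assumption for illustration only.
LOCAL_TO_COAR = {
    "software": COAR_NS + "c_5ce6",
}


def expose_types(dc_types, mapping=LOCAL_TO_COAR):
    """Replace known local type strings with COAR URIs; keep unknown values as they are."""
    exposed = []
    for value in dc_types:
        key = value.strip().lower()
        exposed.append(mapping.get(key, value))
    return exposed


print(expose_types(["Software", "Unmapped local type"]))
```

Passing unmapped values through unchanged keeps the output complete while the mapping table is still being filled in; a stricter setup might log or reject them instead.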
Challenges – Community Help Needed
• In particular: which concepts are important in the domain of research data and other non-textual research output?
• Community feedback for gradual improvements and extensions of Resource Type and other vocabularies used by Open Access Repositories
• Collaboration with / technical support by repository platform developers
• Capacity building / organizing webinars on:
– LOD and SKOS
– Best practices on vocabulary design
Do Not Miss: “Next Generation Repositories”
Join the plenary tomorrow on:
“Next generation repositories: building the repository of the future”
Panel 6: Repositories of the Future
Time: 16/Jun/2016: 11:00am-12:30pm
Location: Joly Theatre
Presented by: Eloy Rodrigues, Paul Walk, Kathleen Shearer, Pandelis Perakakis
Thank You For Your Attention!
About COAR Controlled Vocabularies: http://purl.org/coar/igcv
About COAR: https://www.coar-repositories.org, @coar_ev