Dewey in Sweden, Sweden in Dewey: Classification in a Local/Global Context
Seminar on Dewey and the role of classification nationally and internationally
Stockholm, 5 February 2009
Joan S. Mitchell, Editor in Chief, DDC, OCLC
Outline
• Dewey’s benefits
• What we are doing to keep (and increase) Dewey’s usefulness
• Some interesting applications
• What can Swedish librarians do right now?
• Discussion
1. What are Dewey’s benefits?
• Language-independent representation
• Large amount of categorized content
• Interoperable translations
• Mappings and crosswalks
• Organizational support
• Worldwide user community
Language-independent representation
Vietnamese: Lập trình (005.1)
Swedish & Norwegian: Programmering (005.1)
Spanish: Programación (005.1)
Russian: Программирование (005.1)
Italian: Programmazione (005.1)
Greek: Προγραμματισμός (005.1)
German: Programmierung (005.1)
French: Programmation (005.1)
English: Programming (005.1)
Arabic: بَرْمَجة (005.1)
Large amount of categorized content
• DDC used by 200,000+ libraries in 138 countries
• ~25% of WorldCat records include explicit Dewey numbers (more about derived numbers later)
• The number is growing, e.g., Deutsche Nationalbibliothek is now adding Dewey numbers to WorldCat at nearly the same rate as the Library of Congress
Translations
• Translations published in the following languages since 1998: Arabic, French, German, Greek, Hebrew, Icelandic, Italian, Norwegian, Russian, Spanish, Turkish, and Vietnamese
• Updated top levels available in: Arabic, Chinese, Czech, French, German, Hebrew, Italian, Norwegian, Portuguese, Russian, Spanish, Swedish, and Vietnamese
• Discussions under way: Indonesian abridged edition, French and Greek full web versions, approaches to web versions in Norway and Sweden
Translations: Localization
DDK 5
781.621-781.729
02 Stilistisk innflytelse fra andre musikktradisjoner
Til 02 legges sifrene som følger etter 781.6 i 781.63-781.69, f.eks. jazzens innflytelse på skandinavisk folkemusikk 781.62395025
DDC 22
781.621-781.729
02 Stylistic influence of other traditions of music
Add to 02 the numbers following 781.6 in 781.63-781.69, e.g., influence of jazz on Spanish folk music 781.6261025 . . .
Translations: Interoperable Expansions
—43551 Regierungsbezirk Köln
—435511 Aachen
—435512 Kreise Aachen, Heinsberg, Düren, Euskirchen
—4355122 Kreis Aachen
—4355124 Kreis Heinsberg
—4355126 Kreis Düren
—4355128 Kreis Euskirchen
—435513 Rhein-Erft-Kreis
—435514 Köln
—435515 Leverkusen
—435516 Rheinisch-Bergischer-Kreis, Oberbergischer Kreis
—4355163 Rheinisch-Bergischer-Kreis
—4355167 Oberbergischer Kreis
—435518 Bonn
—435519 Rhein-Sieg-Kreis
Mappings to subject headings (1)
Library of Congress Subject Headings (LCSH)
Medical Subject Headings (MeSH)
Canadian Subject Headings (CSH)
Sears List of Subject Headings (Sears)
Book Industry Standards and Communications (BISAC) Subject Headings
Mappings to subject headings (2)
RAMEAU [French]
Schlagwortnormdatei (SWD) [German]
Nuovo Soggettario [Italian]
Sears Lista de Encabezamientos de Materia [Spanish]
(more about derived mappings later)
Mappings in WebDewey
LCSH - DDC mappings
Mappings in Abridged WebDewey
Mappings in MelvilClass
SWD - DDC mappings
Nuovo Soggettario - DDC
RAMEAU - DDC (broad level only)
620 $aAnesthesia$vLCSH (en ligne), 2005-02-21
622 1 $aAnesthesia$vMeSH (en ligne), 2005-02-21
624 $a610
Crosswalks between schemes
LCC – DDC (ClassWeb, Classify)
UDC – DDC (IZUM, Czech National Library)
SAB – DDC (Electronic updated version at National Library of Sweden)
Organizational Support
Permanent editorial staff at LC and OCLC
International advisory board (EPC)
International user community
Research (OCLC + partners around the world)
Dewey Community
ACOC, ALA, CILIP, NKKI, SABINET, . . .
National Libraries, Translation Teams, EDUG, Research Partners
Editorial Policy Committee
LC & OCLC
DDC Editors
Dewey Users around the World
2. What are we doing to Dewey?
• Content (updates and transformations)
• New forms of representation
Content
Continuous updating (short-term)
• Translations
• New topics (Semantic web), expansions (blogs / social networks), events (elections), boundaries (Italian provinces), views (abortion), etc.
Transformations (long-term)
• Education
• Religion
• Law
• Foods/Meals
• Music
• Groups of people
Content: Annual Additions
Schedule and table numbers: 120/year
Built numbers: 450/year
Mapped headings: 1800/year*
*as of July 2008
Content: Full Edition Database (December 2008)
Schedule numbers: 26,715
New Schedule Number: 006.752 Blogs
Tables 1-6: 9,356
New Table Number: T2—45674 Fermo province
Built schedule numbers: 13,310
New Built Schedule Number: 782.42162916
Built table numbers: 609
New Built Table Number: T5—9276264
Content: Abridged Edition Database (December 2008)
Schedule numbers: 4,937
Tables 1-4: 522
Built schedule numbers: 401
Built table numbers: 9
Transformations
In many areas, the standard sequence assumes an underlying “universal” viewpoint that is not universal, e.g., food, religion, education, music
Food and meals
Rethink food and meals in a global context
What is a sandwich?
Smörgåsar (Swedish sandwiches, typically open-faced)
DDC 22:
641.84 Sandwiches
Including burritos, tacos, wraps; submarine sandwiches
Sandwiches: Proposed Update
641.84 Sandwiches and related dishes
Standard subdivisions are added for either or both topics in heading
Class here sandwiches and related dishes of any type, e.g., open-faced sandwiches, grilled sandwiches, wraps
Meals: Current outline
641.52 Breakfasts
641.53 Luncheons, lunches, brunches, teas, suppers, snacks
641.54 Dinners
Meals: Proposed outline
641.52 First meal of the day
641.53 Light meals and snacks
641.54 Main meal of the day
200 Religion
200 Religion
210 Philosophy and theory of religion
220 Bible
230-280 Christianity
290 Other religions
Class 2 in UDC: Chronological/Regional Development
21 Prehistoric religions
22 Religions of Far East origin
23 Religions originating in Indian subcontinent
24 Buddhism
25 Religions of antiquity. Minor cults and religions
26 Judaism
27 Christianity
28 Islam
29 Modern spiritual movements
New View of 200 Religion (excerpt)
Taoism (299.514)
Confucianism (299.512)
Hinduism (294.5)
Jainism (294.4)
Buddhism (294.3)
Wicca (299.94)
Zulu (African people)—religion (299.683986)
Voodoo (299.675)
Ras Tafari (299.676)
Bible (220)
Judaism (296)
Christianity (230)
Islam (297)
Scientology (299.936)
370 Education
Can we provide a global framework that addresses local and global needs (e.g., levels of education, curricula, policies)?
Levels of primary education
DDK 5
372.241 Småskoletrinnet (1.-4. klasse)
372.242 Mellomtrinnet (5.-7. klasse)
372.243 Ungdomstrinnet (8.-10. klasse)
DDC 22
372.241 Lower level (grades 1-3)
372.242 Upper level (grades 4-6)
Class middle schools (grades 5-8), junior high schools in 373.236
780 Music
Evolution of musical styles brings:
compression of styles
expansion of styles
hybridization of styles
Approaches
• Shallow developments with deep indexing and mappings
• Expansions
• Synthesis (number building) for hybrid styles
Example: Shallow developments
Current entry
781.66 Rock (Rock ‘n’ roll)
Including acid, folk, hard, punk, soft rock
Proposed entry
781.66 Rock (Rock ‘n’ roll)
Class here specific rock styles
. . .
Example: Indexing
Relative Index entries at 781.66 to include:
New wave music
Soft rock
Krautrock
Rockabilly music
Hard rock
Punk rock
Alternative rock music
Psychedelic rock
Acid rock
Example: Current Mappings at 781.66
Example: Expansion
781.648 Electronica
Class here specific electronica styles
Class comprehensive works on electronic music in 786.7
Example: Hybrid music styles
In add table under 781.63–781.69:
17 Hybrid styles
Fusion of two or more styles from different traditions of music to create a new style
Add to 17 the numbers following 781.6 in 781.62–781.69, e.g., fusion with folk music 172, folk rock 781.66172
See Manual at 781.6: Hybrid styles
. . .
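The add instruction above amounts to string concatenation on the notation. The following is a minimal Python sketch, assuming notation is handled as plain strings and that the decimal point always follows the third digit. The function names are illustrative and do not come from any DDC tool.

```python
def digits_following(stem: str, number: str) -> str:
    """Return the digits that follow `stem` in `number` (decimal points ignored)."""
    stem_digits = stem.replace(".", "")
    number_digits = number.replace(".", "")
    if not number_digits.startswith(stem_digits) or number_digits == stem_digits:
        raise ValueError(f"{number} does not extend {stem}")
    return number_digits[len(stem_digits):]


def format_ddc(digits: str) -> str:
    """Insert the decimal point after the third digit."""
    return digits if len(digits) <= 3 else f"{digits[:3]}.{digits[3:]}"


def build_number(base: str, facet: str, source: str, stem: str = "781.6") -> str:
    """Append `facet`, then the digits following `stem` in `source`, to `base`."""
    digits = base.replace(".", "") + facet + digits_following(stem, source)
    return format_ddc(digits)


# Folk rock: rock (781.66) + 17 + the "2" that follows 781.6 in folk music (781.62)
print(build_number("781.66", "17", "781.62"))  # 781.66172
```

The same function covers other add notes of this shape, for example notation 02 (stylistic influence) added to a folk music number. It does not check that the source number actually falls inside the span named in the instruction.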
Representation
• Use of and extensions to MARC 21 formats for representation of the DDC
• Development of a Uniform Resource Identifier (URI) structure for the DDC (Michael Panzer)
• Experimentation with DDC in SKOS (Michael Panzer)
• Investigation of formal specification of relationships in the DDC (Rebecca Green and Michael Panzer)
Representation: Dewey in MARC 21 Formats
• Decision to use MARC 21 formats for classification and authority data in new Editorial Support System (standard, detailed representation, flexibility to drive other representations)
• Development of proposed extensions to support representation and access in cooperation with DNB, LC, and OCLC
Dewey Class Record in MARC Classification Format
Dewey Class Record in XML (Partial)
Dewey Class Record (Formatted View)
Relative Index Record in MARC Authority Format
Some Extensions to MARC 21
• Identification of notation in internal add tables
• Representation of component parts of numbers
• Accommodation of full and partial “access” numbers
Notation in Internal Add Tables: New $y Subfield
Example from MARC Classification format:
153 ## $a 930 $c 990 $y 1 $a 004 $j Ethnic and national groups
[Notation 004 in the internal add table located at 930-990]
Component Parts of Numbers
Inclusion of component parts of numbers in bibliographic records using a new 085 field, based on the 765 field in the classification format
Component Parts Example: Feminist Criticism of Television
082 01 $8 1 $a 791.45082 $2 22
085 ## $8 1.1 $b 791.45 $z 1 $s 082
791.45 (Television) + 082 (Feminist)
Access Numbers
Provision for assignment of access numbers (additional DDC numbers, notation from Tables 1-6, internal table notation) in bibliographic records
Access Numbers: Examples (1)
Tunnels in the Swiss Alps
082 00 $a 388.13 $2 22
083 0# $z 2 $a 4947 $2 22/ger $q DE-101b
T2—4947 (Swiss Alps)
German DDC 22
Assigned by Deutsche Nationalbibliothek
Access Numbers: Examples (2)
History of Norway, Sweden, and Denmark
082 00 $a 948 $2 22 (Scandinavia)
083 0# $a 948.1 $2 22 (Norway)
083 0# $a 948.5 $2 22 (Sweden)
083 0# $a 948.9 $2 22 (Denmark)
Access Numbers: Examples (3)
Lyng, Selma Therese, 1972-
Være eller lære? : om elevroller, identitet og læring i ungdomsskolen / Selma Therese Lyng. - Oslo : Universitetsforl., cop. 2004. - 215 s. - (370.153) (372.243)
ISBN 82-15-00597-7 (h.) : Nkr 249.00
082 14 $a 372.243 $2 DDK5
082 04 $a 373.236 $2 22
083 1# $a 370.153 $2 DDK5
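The 082 and 083 lines in these examples follow a simple display convention: a tag, two indicator positions, then subfields introduced by `$`. The following is a minimal Python sketch for reading that display form. It assumes the spacing used on the slides and parses only this notation, not binary MARC 21 records. `Field` and `parse_field` are illustrative names.

```python
import re
from typing import NamedTuple


class Field(NamedTuple):
    tag: str
    indicators: str
    subfields: list  # (code, value) pairs in their original order


def parse_field(line: str) -> Field:
    """Parse one display line such as '082 14 $a 372.243 $2 DDK5'."""
    head, _, rest = line.partition("$")
    parts = head.split()
    tag = parts[0]
    indicators = parts[1] if len(parts) > 1 else "##"
    pairs = re.findall(r"\$(\w)\s*([^$]*)", "$" + rest)
    return Field(tag, indicators, [(code, value.strip()) for code, value in pairs])


record = [
    "082 14 $a 372.243 $2 DDK5",
    "082 04 $a 373.236 $2 22",
    "083 1# $a 370.153 $2 DDK5",
]
for field in map(parse_field, record):
    sub = dict(field.subfields)
    print(field.tag, sub["a"], sub["2"])
```

Keeping subfields as an ordered list rather than a dictionary preserves repeated codes. Turning them into a dictionary, as in the loop, is only safe when each code occurs once.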
Representation: Dewey URIs
Design goals for Dewey URI structure:
• Common locator for Dewey concepts and associated resources for use in web services and web applications
• Retraceable path to concept rather than abstract identification
• Classes as center of identification for DDC concepts
Dewey URI Examples
Generic URI
http://dewey.info/class/338.4
Specific time
http://dewey.info/class/338.4/2007/05/25
http://dewey.info/class/338.4/e22
Specific time & language
http://dewey.info/class/338.4/2007/05/25/about.en
Specific time, language & format
http://dewey.info/class/338.4/2007/05/25/about.en.skos
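The examples suggest that a URI is composed from left to right: class number, then an optional version (a date path or an edition tag), then an optional language and format. The following is a minimal Python sketch of that pattern, inferred only from the four examples above. The function name and parameters are assumptions, not part of a published dewey.info API.

```python
from typing import Optional

BASE = "http://dewey.info/class/"


def dewey_uri(number: str, version: Optional[str] = None,
              language: Optional[str] = None, fmt: Optional[str] = None) -> str:
    """Compose a URI following the pattern of the slide examples.

    version:  a date path such as "2007/05/25" or an edition tag such as "e22"
    language: a language tag such as "en"
    fmt:      a format suffix such as "skos" (shown only after a language)
    """
    uri = BASE + number
    if version:
        uri += "/" + version
    if language:
        uri += "/about." + language
        if fmt:
            uri += "." + fmt
    elif fmt:
        raise ValueError("the examples only show a format suffix after a language")
    return uri


print(dewey_uri("338.4"))
print(dewey_uri("338.4", "e22"))
print(dewey_uri("338.4", "2007/05/25", "en", "skos"))
```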
Dewey in SKOS/RDF
SKOS (Simple Knowledge Organization System) provides a standard way to represent knowledge organization systems (KOS) using the Resource Description Framework (RDF)
Problem: Dewey is more complex than many KOS (e.g., thesauri)
• Schedules, auxiliary tables, internal tables, Relative Index
• Standard numbers, optional numbers; number spans, centered entries
• Elaborate note structure in tables and schedules + lengthy notes in the Manual
Initial Design Driven by Linked Data Needs
Linked Data:
Use URIs as names for things
Use HTTP URIs so that people can look up those names
When someone looks up a URI, provide useful information
Include links to other URIs so that they can discover more things
Tim Berners-Lee http://www.w3.org/DesignIssues/LinkedData.html
Analyzing DDC for modeling in RDF/SKOS (1)
Singled out as skos:Concepts right now:
• Listed schedule numbers (including synthesized numbers)
• Number spans
• Centered entries
• Relative Index terms (in different namespace)
Analyzing DDC for modeling in RDF/SKOS (2)
370.11 Education for specific objectives
370.113 Vocational education
370.113085 Parents--vocational education
370.1130941 Vocational education--Great Britain
370.1130973 Vocational education--United States
Relative Index (ddc:topic):
Career development
Career education
Education of employees
Employee development
Human resource development …
Mapped LCSH (skos:closeMatch):
Career education
Career education--United States
Career education--United States--Curricula
Core competencies
…
Vocational education
Vocational training centers
Analyzing DDC for modeling in RDF/SKOS (3)
370.113 Vocational education
Class here career education, occupational training, vocational schools
Class on-the-job training, vocational training provided by industry in 331.2592
For vocational education at secondary level, see 373.246; for adult vocational education, see 374.013
See also 331.702 for choice of vocation; also 371.425 for vocational guidance in schools
skos:notation
skos:prefLabel
skos:related
RDF model: Class
<class/370.113/2007/12> a skos:Concept ;
    skos:inScheme <scheme/2007/12> ;
    dct:created "1996-06-01T00:00:00.0-05:00"^^<http://purl.org/dc/terms/W3CDTF> ;
    dct:modified "2003-03-26T00:00:00.0-05:00"^^<http://purl.org/dc/terms/W3CDTF> ;
    skos:notation "370.113"^^<schema-terms/Notation> ;
    skos:prefLabel "Vocational education"@en ;
    skos:broader <class/370.11/2007/12> ;
    skos:narrower <class/370.113085/2007/12> ,
        <class/370.1130941/2007/12> ,
        <class/370.1130973/2007/12> ;
    skos:narrowerStructural <class/373.246/2007/12> ,
        <class/374.013/2007/12> ;
    skos:related <class/331.2592/2007/12> .
RDF model: Relative Index terms
<class/370.113/2007/12> ddc:topic <index/Career%20development> ,
    <index/Career%20education> ,
    <index/Education%20of%20employees> ,
    <index/Employee%20development> ,
    <index/Human%20resource%20development> ,
    <index/Job%20training> ,
    <index/Occupational%20training> ,
    <index/Retraining%E2%80%94vocational%20education> ,
    <index/Staff%20development> ,
    <index/Training%E2%80%94employee%20education> ,
    <index/Vocational%20education> ,
    <index/Vocational%20schools> ,
    <index/Vocational%20training> ,
    <index/Work%20training> .
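The index URIs above appear to be the Relative Index terms percent-encoded as UTF-8, with spaces as `%20` and the dash between term and subheading as `%E2%80%94`. The following is a minimal Python sketch of that encoding using the standard library. The `index/` prefix is copied from the slide, and the function names are illustrative.

```python
from urllib.parse import quote, unquote

PREFIX = "index/"


def index_uri(term: str) -> str:
    """Percent-encode a Relative Index term into a relative URI."""
    return PREFIX + quote(term, safe="")


def index_term(uri: str) -> str:
    """Recover the Relative Index term from a relative URI."""
    if not uri.startswith(PREFIX):
        raise ValueError(f"not an index URI: {uri}")
    return unquote(uri[len(PREFIX):])


print(index_uri("Career development"))
print(index_uri("Retraining\u2014vocational education"))
```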
RDF model: Mapped LCSH
<class/370.113/2007/12> skos:closeMatch
    <http://tspilot.oclc.org/lcsh/sh%2085020255%20> ,
    <http://tspilot.oclc.org/lcsh/sh%2000002431%20> ,
    <http://tspilot.oclc.org/lcsh/sh%2085144178%20> ,
    <http://tspilot.oclc.org/lcsh/sh%2096002453%20> .
The Big Question
How can we make Dewey in its various representations plus mapped terminologies and associated content work harder?
3. Some interesting applications
• Dewey.info and history-of-concepts Dewey web services (Michael Panzer, OCLC)
• MelvilSearch and Multilingual MelvilClass (Lars Svensson, DNB)
• DeweyBrowser, Classify, Shelfview (Diane Vizine-Goetz, OCLC)
Dewey.info
Putting the RDF/SKOS representation to work for humans and machines
370.113: Class + Upward/Downward Hierarchies + Mapped LCSH
Dewey.info: http://dewey.info/615.4/about
Generic view in HTML of class across all editions/versions in all languages: http://dewey.info/615.4/about
View of all English-language versions of that class: http://dewey.info/615.4/about.en
HTML of a specific version of a class in a specific language: http://dewey.info/615.58/2007/02/about.fr(.html)
HTML format is obtained via content negotiation: The server determines that HTML is the appropriate format for this user agent (i.e., a web browser)
HTML is annotated with RDFa! Clicking the RDF logo produces an RDF version of the HTML view
<span class="notation" property="skos:notation" datatype="ddc:Notation">615.58</span>
<a id="class" resource="http://dewey.info/class/615.58/2007/02/about.fr" property="skos:prefLabel" xml:lang="fr" href="http://dewey.info/class/615.58/2007/02/about.fr">Pharmacothérapie</a>…
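Content negotiation, as mentioned above, means the server picks a representation from the `Accept` header the client sends. The following is a simplified Python sketch of that selection step. It is not the dewey.info implementation, the list of offered media types is an assumption, and it ignores the specificity rules of full HTTP negotiation.

```python
def parse_accept(header: str) -> list:
    """Split an Accept header into (media_range, q) pairs."""
    preferences = []
    for item in header.split(","):
        parts = [p.strip() for p in item.split(";")]
        if not parts[0]:
            continue
        q = 1.0  # default quality when no q parameter is given
        for param in parts[1:]:
            if param.startswith("q="):
                try:
                    q = float(param[2:])
                except ValueError:
                    q = 0.0
        preferences.append((parts[0], q))
    return preferences


def matches(media_range: str, media_type: str) -> bool:
    """True if `media_type` falls within `media_range` (supports */* and type/*)."""
    if media_range in ("*/*", media_type):
        return True
    return media_range.endswith("/*") and media_type.startswith(media_range[:-1])


def negotiate(header: str, available: list):
    """Return the offered media type with the highest q value, or None."""
    best, best_q = None, 0.0
    for media_range, q in parse_accept(header):
        for media_type in available:
            if matches(media_range, media_type) and q > best_q:
                best, best_q = media_type, q
                break
    return best


OFFERED = ["application/rdf+xml", "text/html"]  # hypothetical list of formats
browser = "text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8"
print(negotiate(browser, OFFERED))                # text/html
print(negotiate("application/rdf+xml", OFFERED))  # application/rdf+xml
```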
History-of-concepts web service
History of changes in the DDC:
DDC changes are exposed to users record-by-record in notes from one edition to the next
DDC changes from one edition to the next are also summarized in Lists of Changes in the print edition and as a downloadable table
Hidden from users (human and machine) is a rich set of information on changes in the underlying data file
685 MARC History Note
006.7 Multimedia systems
685 01 @t Multimedia systems, interactive video, comprehensive works on computer graphics and computer sound synthesis @i all formerly located in @b 006.6 @d1996 @221
(this information is no longer exposed in the print DDC 22 or WebDewey record)
Tracking changes
• of the scheme as a whole (snapshots/editions)
• of individual classes (contents of a class)
• of individual topics associated with a class
for
linking/updating class numbers, updating translations, maintenance of mappings, query expansion . . .
Change in knowledge organization systems
How to expose history information for machine access (1)
Standard identifier (URL): <http://dewey.info/class/004.165/>
Type of history note: <http://dewey.info/class/004.165/> ddc:relocationNote []
Normalized date: dcterms:issued "2008-08-01"^^xs:date
Relationships of note to scheme: dcterms:isPartOf <http://dewey.info/scheme/e22/>
Result of change: dcterms:description "Partially changed number"@en
How to expose history information for machine access (2)
DDC numbers of affected classes: ddc:oldNumber "004.165"^^<schema-terms/Notation>
ddc:newNumber "004.1675"^^<schema-terms/Notation>
Affected topic: ddc:hasTopic "Specific handheld devices"@en
Complete note in human-readable form: rdf:value "Specific handheld devices relocated to 004.1675"@en
Use: Update Mappings
Mapping relationship: “BlackBerry” to “004.165”
Timestamp: 2007-02-04
Using history information:
685 20 $tSpecific handheld devices $irelocated to $b004.1675 $d200808 $222
Mapping Update: 004.165 [< 2008-08]; 004.1675 [>= 2008-08]
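The mapping update can be read as a date-dependent lookup: before the relocation takes effect the old number holds, and from that date on the new one. The following is a minimal Python sketch under two assumptions. The relocation is stored as a small dictionary, which is an illustrative structure and not the editorial system's format. An editor has also already decided that the mapped heading falls under the relocated topic, since the change to 004.165 is only partial.

```python
from datetime import date

# Relocation from the 685 note: effective date taken from the normalized date 2008-08-01
RELOCATIONS = [
    {
        "old": "004.165",
        "new": "004.1675",
        "topic": "Specific handheld devices",
        "effective": date(2008, 8, 1),
    },
]


def current_number(number: str, as_of: date, relocations=RELOCATIONS) -> str:
    """Follow every relocation of `number` that is in effect on `as_of`."""
    for relocation in sorted(relocations, key=lambda r: r["effective"]):
        if relocation["old"] == number and relocation["effective"] <= as_of:
            number = relocation["new"]
    return number


# The "BlackBerry" mapping was made on 2007-02-04, before the relocation
print(current_number("004.165", date(2007, 2, 4)))  # 004.165
print(current_number("004.165", date(2009, 2, 5)))  # 004.1675
```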
Use: Query Expansion
Search term: “Information theory”
Resulting DDC number: 003.54
Using history information:
“Relocations and Discontinuations” (Ed. 20): Ed. 19: [001.539]; Ed. 20: 003.54
685 01 $tInformation theory $iformerly located in $b001.539 $d19890306 $220
Expanded query:
{001.539 19; 003.54 20; 003.54 21; 003.54 22}
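The expanded query pairs each edition with the number that held the topic in that edition. The following is a minimal Python sketch of that expansion. It assumes each relocation is recorded with the first edition in which the new number applies (20 here, from the `$2` subfield of the 685 note). The data structure and function name are illustrative.

```python
def expand_query(number: str, relocations: list, editions: list) -> list:
    """Return (number, edition) pairs covering every edition in `editions`.

    relocations: dicts with 'old', 'new' and 'edition', where 'edition' is
    the first edition in which the topic is classed at 'new'.
    """
    pairs = []
    for edition in editions:
        effective = number
        for relocation in relocations:
            if relocation["new"] == number and edition < relocation["edition"]:
                effective = relocation["old"]
        pairs.append((effective, edition))
    return pairs


relocations = [{"old": "001.539", "new": "003.54", "edition": 20}]
print(expand_query("003.54", relocations, [19, 20, 21, 22]))
# [('001.539', 19), ('003.54', 20), ('003.54', 21), ('003.54', 22)]
```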
Titles under “Alpiner Skilauf (Abfahrtslauf)” (796.935)
Titles under “Alpiner Skilauf (Abfahrtslauf)” + (796.935*)
Record from 796.935* Search
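The two searches differ only in the trailing `*`, which presumably requests right truncation: every number that begins with 796.935 rather than that number alone. The slides do not describe how MelvilSearch implements this, so the following Python sketch shows only the general idea. The titles and record numbers are invented placeholders.

```python
def search(records, query: str) -> list:
    """records: iterable of (title, ddc_number); a trailing * means truncation."""
    if query.endswith("*"):
        prefix = query[:-1]
        return [r for r in records if r[1].startswith(prefix)]
    return [r for r in records if r[1] == query]


# Hypothetical records, for illustration only
records = [
    ("Title A", "796.935"),
    ("Title B", "796.93509494"),
    ("Title C", "796.93"),
]
print(len(search(records, "796.935")))   # 1
print(len(search(records, "796.935*")))  # 2
```

Because DDC notation is hierarchical, a prefix match on the number retrieves the class together with its subdivisions and built numbers.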
Multilingual MelvilClass: English
Multilingual MelvilClass: German
DeweyBrowser
DeweyBrowser (Svenska)
Dewey in WorldCat.org?
Classify: http://deweyresearch.oclc.org/classify2/
• DDC/LCC/NLM classifier
• Developed by OCLC Office of Research
• Based on FRBR cluster data
• Human interface + web service
Virtual reshelving
006.74
4. What can Swedish librarians do right now?
• Experiment with contributing Dewey numbers to WorldCat
• Create mappings for access vocabulary
• Load Dewey numbers into SAO authority records
• Begin planning for translation
Subject Heading/DDC Mappings
• Based on likelihood of use of heading with number
• No explicit definition of relationship beyond concurrent use
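A mapping based on concurrent use can be derived by counting how often a heading and a number appear in the same bibliographic record. The following is a minimal Python sketch of that idea. The 0.5 threshold, the function name, and the sample records are assumptions made for illustration. The slides do not state how the likelihood is actually computed.

```python
from collections import Counter, defaultdict


def derive_mappings(records, min_share: float = 0.5) -> dict:
    """records: iterable of (headings, ddc_number), one per bibliographic record.

    Returns {heading: (number, share)} for headings whose most frequent
    number accounts for at least `min_share` of the heading's uses.
    """
    counts = defaultdict(Counter)
    for headings, number in records:
        for heading in set(headings):
            counts[heading][number] += 1
    mappings = {}
    for heading, counter in counts.items():
        number, hits = counter.most_common(1)[0]
        share = hits / sum(counter.values())
        if share >= min_share:
            mappings[heading] = (number, share)
    return mappings


# Made-up sample records, for illustration only
sample = [
    (["Anesthesia"], "617.96"),
    (["Anesthesia"], "617.96"),
    (["Anesthesia", "Surgery"], "617"),
]
print(derive_mappings(sample))
```

As the second bullet notes, a mapping derived this way records only concurrent use. It says nothing about whether the heading equals, approximates, or is merely included in the class.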
Mappings for Access
Study derivation of mappings from SAB - DDC, LCSH - DDC, SAO - LCSH
Tools:
SAB - DDC
SAO – LCSH - DDC
DDC - DDC terminology files (RI terms, LCSH, MeSH) - DDC
Dewey Numbers in Subject Heading Authority Records
• Subject entity represented by heading equals or approximates the whole of the DDC class or is in standing room
• Definition of relationship between heading and number is found in Dewey number record
Draft Guidelines for Adding Dewey Numbers to Authority Files
• The subject entity represented by the LCSH equals or approximates the whole of the Dewey class
• The subject entity represented by the LCSH is explicitly in standing room at the number
• The geographic entity represented by the LCSH has an implicit relationship to the Dewey class
• The genus/species represented by the LCSH has an implicit relationship to the Dewey class
• If the subject entity represented by the LCSH matches more than one Dewey number according to the aforementioned rules, multiple Dewey numbers may be added to the authority record
Nuovo Soggettario - DDC
BISAC - DDC
003 OCoLC¶
005 20090120222758.0¶
008 090120n| anznnbabn |n ana d¶
040 .. ‡aOCoLC-O‡beng‡cOCoLC-O¶
039 .. ‡a(OCoLC-O)MED-006000¶
039 .. ‡a(OCoLC-O)MED-6000¶
072 .7 ‡aMED‡x006000‡2bisacsh‡92198¶
083 04 ‡a617.96‡222‡5OCoLC-D¶
150 .. ‡aMEDICAL‡xAnesthesiology‡9medical anesthesiology¶
667 .. ‡aUsually do not map *ology to 362.1‡9Conversion note¶
667 .. ‡aBISAC Subject Code: MED006000, Sequence Number: 002198¶
Some Translation Planning Activities
• Undertake pilot study to test mixed Swedish-English approach
• Decide on Swedish terms for standard Dewey instructions (Including, Class here, etc.)
• Develop interoperable expansions in geography, history, etc.
• Continue to participate in EDUG working groups (education, law, archaeology)
• Plan technical environment for translation support and web version
• Study end-user tools (Swedish MelvilSearch?)
• Create an advisory board