FAST (Faceted Application of Subject Terminology): A Vocabulary to Facilitate Faceted Browsing Eric...

Post on 27-Mar-2015

215 views 0 download

Tags:

Transcript of FAST (Faceted Application of Subject Terminology): A Vocabulary to Facilitate Faceted Browsing Eric...

FAST (Faceted Application of Subject Terminology): A Vocabulary to Facilitate Faceted Browsing

FAST (Faceted Application of Subject Terminology): A Vocabulary to Facilitate Faceted Browsing

Eric ChildressConsulting Project ManagerOCLC Programs & Research

Everything Need Not Be Miscellaneous: Controlled Vocabularies And Classification In A Web World

Sponsored by OCLC, ISKO-NA and Université de Montréal

Université de Montréal

5 August 2008

OCLC / ISKO-NA Preconference

OutlineOutline

• A Changing World

• About FAST

• Illustrative uses of FAST

A Changing WorldA Changing World

• Search & Retrieval evolves

• Interface expectations are set by others

• Presentation and navigation patterns flow…

• Data/Metadata production sources expand

• Machine services are increasingly important

Search & Retrieval evolvesSearch & Retrieval evolves

19th Century

Author, Title, Subject

Card Catalogs

Manual

Complex, Non-intuitive

Controlled Vocabulary

Library Library catalogscatalogs

21st Century

Key Word, Relevance Weighted

Computers/Web

Automated & Social

Simpler, User-Friendly

Uncontrolled Vocabulary

low unit costshigh unit costs

Interface expectations are set by othersInterface expectations are set by others

Top Web properties by visitor June 2008 (SEW)

Presentation and navigation patterns flow…Presentation and navigation patterns flow…

Losing favor:

•Text-heavy interfaces

•All options, all the time

•Jargon-laden labels

Gaining favor:

•Single-search box simplicity

•White-space and eye-catching graphics

•Side-bar navigation

•Tagclouds

Closer to, but not libraries…

Traditional library interfaces

Newer library system interfaces

Tagclouds

Data/Metadata production sources expandData/Metadata production sources expand

Institution-based

Sophisticated rules

Expert-built

Authority

Formal distribution channels

Institutional networks

Chiefly individuals

No/lightweight rules

UGC (User-Generated Content)

Wisdom of the crowds

“Metadata in the wild”

The Network

cost diffusioncost accounting

Machine services are increasingly importantMachine services are increasingly important

About FASTAbout FAST

• Arose from expert recommendations

• Project of OCLC Research in consultation with LC

• Complement of faceted vocabularies based on LCSH

• Suitable for non-expert application

• Machine-to-machine friendly

A New Frontier for Controlled VocabulariesA New Frontier for Controlled Vocabularies

Expert group studied issues related to controlled vocabularies and the Web environment

• American Library Association’s ALCTS/SAC/Subcommittee on Metadata and Subject Analysis (1997-2001)

Conclusion: For certain circumstances a controlled vocabulary was needed which was:

• Web-friendly

• Low-learning curve/non-expert user-friendly

Existing major vocabularies were not ready as-is

Adapting an existing vocabulary might be a pragmatic option

Requirements for a New VocabularyRequirements for a New Vocabulary

• Simple in structure and syntax

• Usable by non-catalogers and in non-library environments

• Compatible with MARC, Dublin Core, and other popular metadata schemas

• Easy maintainability

• Machine-compatible

Launching FASTLaunching FAST

OCLC Research project launched in 1998

Advisory group: ALA/ALCTS/SAC Subcommittee on FAST (Faceted Application of Subject Terminology)

Team:

OCLC: Eric Childress, Kay Clapton, Becky Dean, Anya Dyer, Kerre Kammerer, Ed O’Neill (Lead), Diane Vizine-Goetz

LC CSPO: Lynn El-Hoshy (now retired), Janice Young

Consultant: Lois Mai Chan (University of Kentucky)

Why Adapt LCSH?Why Adapt LCSH?

• Rich vocabulary covering all subject areas

• Synonym and homograph control

• Extensive hierarchical and associative references among terms

• De facto standard controlled vocabulary, extensively used by libraries, contained in millions of bibliographic records

• Long and well-documented history

• Strong institutional support of the Library of Congress

What is FAST?What is FAST?

• OCLC FAST (Faceted Application of Subject Terminology)

• A faceted vocabulary based on LCSH

• Modular – each facet may be used independently

• Supports post-coordinate search & retrieval

• Designed for use by non-expert assigners

• Machine-friendly controlled vocabulary

Authority Control: LCSH vs. FAST Authority Control: LCSH vs. FAST

LCSH FAST

Very large number (billions plus) of possible headings

Faceting limits the number of possible headings to a few million

Common headings are established; most assigned headings are synthesized by catalogers based on rules

All headings (except chronological) are established

Most headings are distinct (based on NACO normalization rules*); some conflicts occur particularly with $x & $v

All normalized headings are distinct; tagging and subfield coding provides no unique information (with the exception of forms)

*http:\\www.loc.gov/catdir/pcc/naco/normrule.html

FAST - Eight FacetsFAST - Eight Facets

Topical

Subject headings ―Evaluation

Form (Genre)

Guidebooks

Chronological

1939 - 1945

Geographic

New York (State) ―New York

Personal Names Kilgour, Frederick G.

Corporate Names

Oregon Library Association

Events

Olympic Games

Uniform Titles Dead Sea scrolls

How is FAST built?How is FAST built?

• OCLC Research-built software processes LCSH authority file and LCSH present in WorldCat bibliographic records to automatically build:

• FAST MARC authority file covering 8 facets

• FAST MARC authority reference records to assist with LCSH conversion

Sample Authority Record - GeographicSample Authority Record - Geographic

001    2130675003    OCoLC 005 20040512160245.0008    040512nneanz||babn n ana d 040    OCoLC   $b eng   $c OCoLC   $f fast043    n-us-ak151    Pacific Ocean $z Rowan Bay670    GNIS, Feb. 10, 2004   $b (Rowan Bay; bay;7 mi. N of Tebenkof Bay, on W coast of Kuiu I., Alex. Arch.; Wrangell-Petersburg Census Area, Alaska;56º40'02" N, 134º14'34" W; another Rowan Bay, pop. place in Wrangell-Petersburg Census Area)688    LC subject usage: 0 (2006)688    WC subject usage: 2 (2006)751  0 Rowan Bay (Alaska : Bay)$0 (DLC)sh2004005090

Sources of FAST HeadingsSources of FAST Headings

Library of Congress Subject Authority File

LC headings that combine different facets are deconstructed into discrete headings, each containing only one facet.

Headings assigned to bibliographic records in OCLC’s WorldCat

Many complex headings, i.e., those containing more than one element in the heading string, are based on literary warrant. They are derived from subject fields in the records in OCLC’s WorldCat.

Headings created for FAST

In some cases, faceting has required FAST headings to be created when no LCSH equivalents exists.

Example of a FAST-only heading – Events FacetExample of a FAST-only heading – Events Facet

In LCSH, it is common to established events as a combination of a geographic heading and a chronological ($y) subdivision:

Buffalo (N.Y.) $x History $y Civil War, 1861-1865

Grenada $x History $y American Invasion, 1983

For these subdivisions, a FAST topical heading is also created:

American Civil War, 1861-1865American Invasion of Grenada, 1983

LCSH to FAST ComparisonLCSH to FAST Comparison

600 Lincoln, Abraham, $d 1809-1865

648 1861 - 1865

650 Political leadership

650 Genius

650 Friendship

650 Presidents

650 Political science

651 United States

655 Case studies

655 Biography

FA

ST

600 Lincoln, Abraham, $d 1809-1865

650 Political leadership $z United States $v Case studies

650 Genius $v Case studies

600 Lincoln, Abraham, $d 1809-1865 $x Friends

and associates

650 Presidents $z United States $v Biography

651 United States $x Politics and government $y 1861-1865

LCS

H

Current FAST Authority FileCurrent FAST Authority File

Personal name headings 699,200

Corporate name headings 351,494

Topical headings 407,772

Geographic name headings 148,952

Chronological headings 676

Event headings 12,225

Title headings 48,245

Form headings 711

Total FAST authorities 1,669,375

July 2008

Future Development PlansFuture Development Plans

• Update and resynchronize all FAST headings with LCSH (In process)

• Improve the LCSH to FAST conversion (In process)

• Complete the FAST manual (In process)

• Expand the geographic names based on usage data and add information from the Geographic Names Information System (GNIS)

• Revise and expand the form (genre) facet

Sample OCLC applications of FASTSample OCLC applications of FAST

WorldCat Identities

FictionFinder

FA

ST h

ead

ing

s

FA

ST h

ead

ing

s

http://fictionfinder.oclc.org/

More informationMore information

OCLC FAST

• Project page: http://www.oclc.org/research/projects/fast/

• Search interface: http://fast.oclc.org/

OCLC WorldCat Identities

• http://orlabs.oclc.org/Identities/

OCLC FictionFinder

• http://fictionfinder.oclc.org/