A Browse Able Audiovisual Database Derived From Catalog Data

Click here to load reader

download A Browse Able Audiovisual Database Derived From Catalog Data

of 28

  • date post

    22-Jun-2015
  • Category

    Technology

  • view

    932
  • download

    0

Embed Size (px)

Transcript of A Browse Able Audiovisual Database Derived From Catalog Data

  • 1. A browse-able audiovisual database derived from catalog data Mark Dahl and Laura Ayling Lewis & Clark College http://www.lclark.edu/~dahl/presentations

2. A two-part presentation

  • The searchable/browse-able A/V database (Mark)
  • A genre headings enrichment project undertaken by our catalogers (Laura)

3. Audiovisual lists

  • Faculty, students like to browse videos (by language), CDs (by artist)
  • Video/CD collection behind circulation desk
  • III OPAC doesnt facilitate an easy way to browse CDs and videos/DVDs by language and genre

4. Old system

  • Use create lists in III to generate lists
  • Output lists in comma-delimited format
  • Manipulate them into web pages using search/replace
  • We provided:
    • Lists of videos in certain foreign languages
    • Lists of CDs by artist
  • Manually updated

5. New system

  • Allows browsing our collection of:
    • CDS
    • DVDs & Videos
    • Videos
    • DVDs
  • Browse by:
    • Title
    • Person (author, artist, film director, etc.)

6. New system

  • The following pre-limits may be made on these browses
    • Language (Videos & DVDs only)
    • Film genre (Videos & DVDs only)
    • Musical genre (CDs only)
  • Have a look:http://library.lclark.edu/dynamic/videoscds
  • Detailed genre browse for CDs (new):
    • http://library. lclark . edu /dynamic/ videoscds / detailbrowse . htm
  • Detailed genre browse for films on DVD/Video (new):
    • http://library. lclark . edu /dynamic/ videoscds / detailfilmbrowse . htm

7. The technology

  • Extracting data via create lists/Expect script (weekly)
  • Loading data into PostgreSQL database via Perl (weekly)
  • Searching/browsing SQL database via PHP script

8. Expect script (running as a cron job) logs into ILS and instructs system to produce text file list and ftp it to external server ILS Text list ftped to server in delimitedtext format Text file opened up by Perl script and loaded into external SQL database SQL database on external server Web form sends query toPHP page PHP page queries SQL database PHP page produces html with results ofquery 9. Extracting the data

  • Expect scripting language allows you to automate the process of creating lists, outputting results, ftping results to server
  • Expect acts like a robot that goes into Innopac and does what you tell it
  • Expect scripts can run on Linux/Unix machines with the Expect language loaded on them

10. Searching for records

  • We have itypes for videos, DVDs and CDs
  • We search for bibliographic records that have an item record with this itype

11. Searching for audiovisual stuff by itype 12. Data extracted in create lists (from bib record)

  • Title (t-field group, repeatable)
  • Author (a-field group, repeatable)
  • Subjectincludes local genre headings 655 (d field group, repeatable)
  • Format (from itype)
  • Language
  • .b number

13. Output user selected format Output FTPed in comma-delimited format to account on Linux server. 14. Database loader

  • Written in Perl
  • Depends on Perl-PostgreSQL module
  • Runs as cron job (weekly, daily)
  • Takes comma-delimited file from III and loads data into database
  • Recreates database from scratch when run

15. SQL relational database structure

  • .b number
  • format
  • language

Bib table Titles table Authors table Subjects table

  • title
  • .b number
  • author
  • .b number
  • subject
  • .b number

16. PHP Search/Browse

  • PHP built for querying databases and turning results into web pages
  • Similar to Cold Fusion or ASP
  • Functions within PHP particularly good at talking to databases, including PostgreSQL

17. PHP search/browse

  • Query class (does all the talking to database)
  • Browse class extends query class
    • Allows you to browse collection
    • Jump to letter of alphabet
    • Accepts limits (format, genre, language)
  • Search class extends query class
    • Allows you to enter search term
    • You may choose format

18. Browse system

  • Format pre-limiting achieved by SQL querying for certain itype codes
  • Language pre-limiting achieved by SQL querying for certain language codes
  • Genre pre-limiting by SQL querying for the presence of certain words within the SQL table subject entries

19. Languages

  • English" => "eng
  • "French" => "fre
  • "German" => "ger
  • "Spanish" => "spa"
  • "Japanese" => "jpn
  • "Chinese" => "chi
  • "Russian" => "rus"

20. Genre limiting

  • Genre limit based on an SQL query that looks at the first part of the subject/genre heading
  • If we limit on jazz we include records with any of these headings
    • Jazz 1971-1980
    • Jazz Instruction and Study
    • Jazz vocals

21. Genre synonyms

  • The system also allows for defining synonyms to make more inclusive genre browses
    • "songs" => "songs,vocal music,song cycles
    • "rap/hip-hop" => "rap
    • "techno" => "underground,techno"
    • "history" => "%history"

22. Web interface

  • Browse parameters (format, field (person, title), language, genre) chosen by:
    • Web form
    • Link with parameters

23. By web form: 24. http://library. lclark . edu / dyanmic / videoscds / detailbrowse . php Big-Band 25. Alternative systems

  • III scope
  • Featured lists (can use Expect to update)
  • Custom search interface to III OPAC using web forms
  • III XML server
  • Direct queries to Oracle database

26. Reflections

  • Low cost technology available to build online databases/browsing systems
  • These technologies are very flexible
    • Index what YOU want
    • Display it how YOU want it

27. Reflections

  • The idea of a library catalog is changing
    • A single search interface isnt enough
    • Nice to search/browse within smaller, more manageable segments of your collection
    • Browsing electronically (not just in person) is desirable
    • People are developing new ways of browsing library collections
      • http://belmont.antarcti.ca/
      • High level browse concept (map LC classes to locally defined disciplines to facilitate browsing new books)

28. http://www.lclark.edu/~dahl/presentations