Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog
description
Transcript of Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog
![Page 1: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/1.jpg)
Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog
Kathryn Lybarger @zemkatOVGTSL 2013 #ovgtsl2013
May 17, 2013
![Page 2: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/2.jpg)
Cataloging ebooks
MARC Catalog
![Page 3: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/3.jpg)
Success!
![Page 4: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/4.jpg)
![Page 5: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/5.jpg)
Except sometimes…
![Page 6: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/6.jpg)
Or even worse…
![Page 7: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/7.jpg)
Zombies?
![Page 8: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/8.jpg)
These ebooks look normal
![Page 9: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/9.jpg)
Until someone looks too closely
requires a subscription
Please login
Currently unavailable
Purchase for $30
errorPage not found
![Page 10: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/10.jpg)
Then the screaming starts
![Page 11: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/11.jpg)
Nobody wants that!
![Page 12: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/12.jpg)
Not just dead?• Dead links not so bad … if they are not
in the catalog
• Our patrons hate LOST books in the catalog
• Zombies are more disappointing
![Page 13: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/13.jpg)
Strategy:• Make sure zombies don’t get into the
catalog in the first place
• Watch for news of recently turned
• Hunt down the ones that are already in there
![Page 14: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/14.jpg)
URLs may be bad initially• May be a typo
• Book not actually on the vendor site yet
• Record may have NO URL
![Page 15: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/15.jpg)
Bad DOI• Not registered yet
• Registered incorrectly
• Maybe points TWO places!
![Page 16: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/16.jpg)
URLs may be modified• May contain proxy
prefix
• May be institution specific
• May have session information
![Page 17: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/17.jpg)
Provider neutral records• Old standard:
– One record per provider
• To catalog:– Use that record
• New standard:– All e-versions on one
record
• To catalog:– Use that record– Delete all URLs that
don’t apply
![Page 18: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/18.jpg)
Ebook links in print books• Some print book
records have URLs
• 856 42 “Related Resource”
• May sneak in through fast copy or batch cataloging
![Page 19: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/19.jpg)
Spot some bad URLs• Query the catalog for
distinct hosts
• In Voyager:
SELECT DISTINCT ELINK_INDEX.URL_HOST
FROM ELINK_INDEXWHERE ELINK_INDEX.RECORD_TYPE="B";
![Page 20: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/20.jpg)
Catch them before they come in• Verify one by one
• Do they have notes indicating they’re bad?
• Run list through a link checker
![Page 21: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/21.jpg)
Just keep new ones out?
• Not sufficient
• Good links may die
• Nobody may tell you
![Page 22: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/22.jpg)
Vendor announcements• E-mail, RSS feeds
• Often interspersed with ads or news
• Do not always mention deletions
![Page 23: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/23.jpg)
Vendor data for deletions• Some vendors
release “deleted” lists
• You may have to check the web site
• Even dig for them
![Page 24: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/24.jpg)
Current status data only• Some vendors will
provide a list of what they currently have
• Changes not highlighted
• Download periodically
![Page 25: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/25.jpg)
Useful tool: vimdiff• Free and open
source (charityware)
• Available on unix, mac
• Available on Windows (Cygwin)
![Page 26: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/26.jpg)
Vimdiff in action
![Page 27: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/27.jpg)
Some vendor data is less accessible• Examples:
– MARC blob– “Whatever’s on the web site”
• Watch for announcements?
• Download / overlay periodically?
![Page 28: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/28.jpg)
Convert data to text• MARC -> .mrk text
(MarcEdit)
• Web site– Find A-Z title list page– Download / extract list
• Compare text (vimdiff)
![Page 29: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/29.jpg)
How to extract?• Different per web site
• Script (gather)– Download A-Z page– Find lines with book titles– Delete everything but the title– Compare to last month’s copy
![Page 30: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/30.jpg)
Unix tools• vim / vimdiff – editor • curl – download web
pages• grep – search file
contents• sed – reformat files
• Available in Windows through Cygwin
![Page 31: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/31.jpg)
Hunting in the catalog• Necessary maintenance
• Links can go bad
• (Sometimes whole platforms!)
![Page 32: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/32.jpg)
Link checking
• Many link checkers available
• They check for codes:– Good?– Forbidden?– Not Found?
![Page 33: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/33.jpg)
Codes aren’t everything• A table of contents
is a good page
• A bad DOI can be fixed
• Effective method differs by vendor
![Page 34: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/34.jpg)
Humans are better at this• Instructions might
be complicated:– Go to the web page– Open up one of the
chapters– Make sure it is a
PDF, not an order form
![Page 35: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/35.jpg)
Normac• MARC Normalizer
and Access Checker
• Free, open source software
• Available from GitHub
![Page 36: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/36.jpg)
Normalize MARC• Only include URLs
for the vendor you want
• Delete URLs with a proxy prefix
![Page 37: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/37.jpg)
Access Check• Zombies look
different on each site – specify
• Load in MARC or list of URLs
• Check access according to rules
![Page 38: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/38.jpg)
Is it really a zombie?
• Or does it just look that way to you?
• Maybe your subscription changed?
![Page 39: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/39.jpg)
If you’re sure…• (Remove them from
your catalog)
• Contact the vendor
• Modify WorldCat master record
![Page 40: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/40.jpg)
Dead links in WorldCat• Leave them in!
• Make 856 second indicator blank
• $z This electronic address not available when searched on [Date]
![Page 41: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/41.jpg)
Then what?OCLC WorldShare
Metadata Collection Manager?
Separate database of dead links?
![Page 42: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/42.jpg)
Any questions?
![Page 43: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/43.jpg)
Contact MeKathryn Lybarger
Problem Catalogerhttp://pc.blog.zemows.org/
GitHub http://github.com/zemkat
![Page 44: Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog](https://reader035.fdocuments.net/reader035/viewer/2022062411/56816767550346895ddc4bee/html5/thumbnails/44.jpg)