Archiving and Preserving Born Digital Government Documents

Gone Today, Here Tomorrow: Archiving and Preserving Born Digital Government Documents. Molly Bragg, Partner Specialist, Internet Archive, [email protected]. Federal Depository Library Conference, Arlington, Virginia, October 20, 2008

Transcript of Archiving and Preserving Born Digital Government Documents

Page 1: Archiving and Preserving Born Digital Government Documents

Gone Today, Here Tomorrow: Archiving and Preserving Born Digital Government Documents

Molly Bragg,

Partner Specialist

Internet Archive

[email protected]

Federal Depository Library Conference

Arlington, Virginia

October 20, 2008

Page 2: Archiving and Preserving Born Digital Government Documents

Internet Archive

• Founded in 1996 by Brewster Kahle

• Largest public web archive in existence

• Designated as a library by the state of California in 2007

• Digitized collections of books, audio, moving images

• www.archive.org

Page 3: Archiving and Preserving Born Digital Government Documents

Partner Needs for Web Capture

• Libraries and archives need web capture beyond the general web archive

• Partners need to create focused collections

• Harvest at specific frequencies

• Reporting Features

• Hosting, Access and full text search
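
One way to picture "harvest at specific frequencies" is a seed list in which each URL carries its own capture interval. The sketch below is illustrative only: the `Seed` class, the frequency names, the day counts, and the example.gov URLs are all assumptions made for this example and do not describe how Archive-It or any Internet Archive crawler is actually configured.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import List

# Hypothetical frequency names and intervals (in days); not taken from any real service.
FREQUENCY_DAYS = {"daily": 1, "weekly": 7, "monthly": 30, "quarterly": 91}


@dataclass
class Seed:
    url: str            # starting point for the crawl
    frequency: str      # key into FREQUENCY_DAYS
    last_harvest: date  # date of the most recent capture


def next_harvest(seed: Seed) -> date:
    """Return the date on which the seed is next due for capture."""
    return seed.last_harvest + timedelta(days=FREQUENCY_DAYS[seed.frequency])


def seeds_due(seeds: List[Seed], today: date) -> List[str]:
    """Return the URLs of seeds whose next harvest date is today or earlier."""
    return [s.url for s in seeds if next_harvest(s) <= today]


# Placeholder collection: two seeds with different capture intervals.
collection = [
    Seed("http://www.example.gov/", "weekly", date(2008, 10, 13)),
    Seed("http://www.example.gov/reports/", "monthly", date(2008, 10, 1)),
]

print(seeds_due(collection, date(2008, 10, 20)))
```

Keeping the interval on each seed, rather than on the collection as a whole, lets a fast-changing home page and a slow-changing reports directory share one focused collection.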

Page 4: Archiving and Preserving Born Digital Government Documents

Archiving Big and Small

• Domain crawls for the most comprehensive collections, e.g. .fr, .au

• Curated crawls for large collections, e.g. Iraq War, election collections

• Archive-It service, for smaller sized collections (automated harvesting)

Page 5: Archiving and Preserving Born Digital Government Documents

Archiving the U.S. Federal Government

Library of Congress

• Congressional Harvests (107th – 110th)

NARA

• End of Presidential term (2004)

• Congressional Election Harvest (2006, 2008)

End of Term 2008 harvest

• Collaborative project (LoC, CDL, UNT, GPO)

Page 6: Archiving and Preserving Born Digital Government Documents

www.loc.gov/minerva/

Page 7: Archiving and Preserving Born Digital Government Documents

www.webharvest.gov

Page 8: Archiving and Preserving Born Digital Government Documents

Archive-It

• Subscription service for smaller collection needs

• Includes collection management, harvesting, full text search, hosting and access

• Collections publicly available at www.archive-it.org

• Over 65 partners (State Archive/Libraries, Universities, Federal institutions, Museums, Public Libraries)

Page 9: Archiving and Preserving Born Digital Government Documents
Page 10: Archiving and Preserving Born Digital Government Documents

Archiving with Archive-It

• Publications in born digital formats only

• Web archiving allows archivists to capture more than just the publications

• At risk content needs to be preserved before it is lost

• Supplement paper collections

• Builds relationships between archives/libraries and government agencies

Page 11: Archiving and Preserving Born Digital Government Documents

Federal Institutions and Archive-It

• National Institutes of Health: capture select NIH websites and records

• Department of Energy, Office of Scientific and Technical Information: archiving the E-Print Network, a web-based library of published papers, research groups, and electronic documents.

• Department of Labor: create an archive of their web presence.

Page 12: Archiving and Preserving Born Digital Government Documents

US State Government: North Carolina

• State Library / State Archive partnership

• 1 main collection for all state agencies

• Websites for the collection are selected using specific appraisal guidelines

• Provide special access portal for the web archives from their own site to brand and market the collection

Page 13: Archiving and Preserving Born Digital Government Documents

http://www.archives.ncdcr.gov/webarchives/index.html

Page 14: Archiving and Preserving Born Digital Government Documents

Local Web Archiving

• San Francisco Public Library, Government Information Center

• Archiving San Francisco city agencies with Archive-It

• Digitizing San Francisco municipal reports: http://www.archive.org/details/sfpl

Page 15: Archiving and Preserving Born Digital Government Documents

Global web archiving: Latin America

• Latin American Network Information Center, at the University of Texas, Austin

• Archive ministry and elected officials' websites for countries in Latin America and the Caribbean

• Comprehensive coverage of Latin American government information

Page 16: Archiving and Preserving Born Digital Government Documents

http://lanic.utexas.edu/project/archives/lagda/

Page 17: Archiving and Preserving Born Digital Government Documents

Global Web Archiving: Asia, Pacific Region

• National Library of Australia

• Thailand, Laos, Papua New Guinea, East Timor, Burma / Myanmar and Cambodia

• Election coverage, spontaneous events and government websites

• Example collections:

-Lao PDR Government and NGO Websites

-Post Thaksin politics in Thailand

-Cambodian National Election 2008

-Burmese Uprising 2007

Page 18: Archiving and Preserving Born Digital Government Documents

Contact Information

Molly Bragg

Partner Specialist

[email protected]

415.561.6799 ext 6

http://www.slideshare.net/event/dlcfall08