Lessons learned from twelve years’ operation of the Web Archiving Project (WARP)

dc.audience: Audience::Audience::Preservation and Conservation Section
dc.audience: Audience::Audience::Information Technology Section
dc.conference.sessionType: Preservation and Conservation with Information Technology
dc.conference.venue: Cape Town International Convention Centre
dc.contributor.author: Murakami, Kosuke
dc.date.accessioned: 2025-09-24T08:22:20Z
dc.date.available: 2025-09-24T08:22:20Z
dc.date.issued: 2015
dc.description.abstract: The National Diet Library (NDL) has been operating the Web ARchiving Project (WARP) since 2002 to collect websites published in Japan and keep them available for future access. This paper describes the purpose of, history behind, and system used for this project, and introduces actual case studies to demonstrate the challenges faced in fulfilling the potential of this project. WARP has been attempting to create a comprehensive archive of websites published by public agencies in Japan, as prescribed in the 2010 revision of the NDL Law. It also archives, with permission of the publishers, the websites of private universities, websites promoting cultural or international events held in Japan, and websites related to the Great East Japan Earthquake. As of March 2015, the archived content had reached 85,764 items, comprising 533 TB of data and 3.1 billion files. WARP was created using Open Source Software (OSS), such as Heritrix, Wayback and Solr, with some original software and user interfaces. Publications significant for public use that are included in the collected websites are cataloged individually and made accessible together with other digitized materials. WARP metadata is also searchable via other integrated search services. Some public agencies even guide their users to WARP in order to ensure access to older information that is no longer available on their own websites. Since it does not seem practicable for individual public libraries in Japan to conduct web archiving on their own, the NDL will take a step further in promoting WARP within the framework of digital resource sharing programs. We consider this an important part of the NDL’s mission as a national library responsible for disseminating cultural heritage through configuration of platforms and networks for digital resource sharing.
dc.identifier.citation:
Akiyama, T. (2014). Struggles of the National Diet Library in Collecting Online Publications in Japan. Paper presented at: IFLA WLIC 2014 - Lyon - Libraries, Citizens, Societies: Confluence for Knowledge in Session 87 - Information Technology with Preservation and Conservation and National Libraries. In: IFLA WLIC 2014, 16-22 August 2014, Lyon, France. http://library.ifla.org/id/eprint/886 (accessed 2015-05-01).
Maeda, N. (2013). 10 Years of Web Archiving Project (WARP). Paper presented at: Monthly meeting of the Information Organization Research Group, Nippon Association for Librarianship. In: Monthly meeting of the Information Organization Research Group, Nippon Association for Librarianship, 18 May 2013, Osaka, Japan. http://warp.ndl.go.jp/warp10years.pdf (accessed 2015-05-01).
Sato, T. (2009). Archiving of web information at the National Diet Library. Paper presented at: The 28th mutual visit program between the National Diet Library and National Library of China. In: The 28th mutual visit program between the National Diet Library and National Library of China, 24 November - 1 December 2009, Tokyo, Japan. http://www.ndl.go.jp/jp/aboutus/cooperation/pdf/theme1_sato.pdf (accessed 2015-05-01).
Shimura, T. (2013). Current status of Web archiving of the National Diet Library, Japan. Paper presented at: 2013 General Assembly of the International Internet Preservation Consortium. In: 2013 General Assembly of the International Internet Preservation Consortium, 22-26 April 2013, Ljubljana, Slovenia. http://netpreserve.org/resources/current-status-web-archiving-national-diet-library-japan (accessed 2015-05-01).
dc.identifier.relatedurl: http://conference.ifla.org/ifla81
dc.identifier.uri: https://repository.ifla.org/handle/20.500.14598/5449
dc.language.iso: en
dc.rights: Attribution 3.0 Unported
dc.rights.accessRights: open access
dc.rights.uri: https://creativecommons.org/licenses/by/3.0/
dc.subject.keyword: Web archiving
dc.subject.keyword: Open Source Software
dc.subject.keyword: digital resource sharing
dc.subject.keyword: national library
dc.subject.keyword: Japan
dc.title: Lessons learned from twelve years’ operation of the Web Archiving Project (WARP)
dc.type: Article
ifla.Unit: Preservation and Conservation Section
ifla.Unit: Information Technology Section
ifla.oPubId: https://library.ifla.org/id/eprint/1089/

Files

Original bundle

Name: 090-murakami-en.pdf
Size: 434.56 KB
Format: Adobe Portable Document Format
