Logo image
InterPlanetary Wayback: Peer-To-Peer Permanence of Web Archives
Conference proceeding   Peer reviewed

InterPlanetary Wayback: Peer-To-Peer Permanence of Web Archives

Mat Kelly, Sawood Alam, Michael L. Nelson and Michele C. Weigle
RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, TPDL 2016, v 9819, pp 411-416
01 Jan 2016

Abstract

Computer Science Computer Science, Information Systems Computer Science, Theory & Methods Information Science & Library Science Science & Technology Technology
We have integrated Web ARChive (WARC) files with the peer-to-peer content addressable InterPlanetary File System (IPFS) to allow the payload content of web archives to be easily propagated. We also provide an archival replay system extended from pywb to fetch the WARC content from IPFS and re-assemble the originally archived HTTP responses for replay. From a 1.0GB sample Archive-It collection of WARCs containing 21,994 mementos, we show that extracting and indexing the HTTP response content of WARCs containing IPFS lookup hashes takes 66.6min inclusive of dissemination into IPFS.

Metrics

14 Record Views
18 citations in Scopus

Details

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Web of Science research areas
Computer Science, Information Systems
Computer Science, Theory & Methods
Information Science & Library Science
Logo image