Analyzing web archives through topic and event ...

URL: http://dl.acm.org/citation.cfm?doid=2908131.2908175

Web archives capture the history of the Web and are therefore an important source for studying how societal developments have been reflected on the Web. However, the large size of Web archives and their temporal nature pose many challenges to researchers interested in working with these collections. In this work, we describe the challenges of working with Web archives and propose the research methodology of extracting and studying sub-collections of the archive focused on specific topics and events. We discuss the opportunities and challenges of this approach and suggest a framework for creating sub-collections.

Additional Information

Field Value
Last updated March 15, 2017
Created March 15, 2017
Format PDF
License Apache Software License 1.1
id 3751d8ef-58dc-4a93-b986-357ccc77c3d9
package id d0fbd773-2af4-4d56-b212-4de1181ca4eb
revision id 0e3eade1-1b91-4cf5-8689-599187bc8659
state active