Loading Events

Working with Web Archive Data – Sep. 29, 2020

Session Description

September 29, 2020 @ 2:00 pm - 3:00 pm EDT

Web archives, like the Internet Archive’s Wayback Machine, have existed for almost as long as the World Wide Web. This presentation will introduce tapping into this rich but complex data source. Participants will get an overview of major web archive sources, the WARC file format, methods of accessing web archived data, as well as a demonstration of tools for analytical tasks like extracting network graphs of links, extracting images, and filtering web page text for further analysis.

A link to join the event online will be sent to registrants.

This talk is part of Snacking on Bits and Bytes: Learning on all things Data, an online webinar series hosted by the Map & Data Library that features presentations and demonstrations on data-related topics and tools, such as web archives, visualization, GIS and statistics. See the full schedule and descriptions of sessions to learn more.

Go to Top