CNI: Coalition for Networked Information

Web Archives Analysis at Scale with the Archives Unleashed Cloud

April 5, 2019

Nick Ruest
Associate Librarian
York University

Ian Milligan
Associate Professor
University of Waterloo

Web archives, repositories of born-digital information dating back to the Internet Archive and national libraries in the mid-1990s, are rich resources covering topics of interest to humanities and social sciences scholars. Imagine a political historian studying elections, a historian studying youth culture in the late 1990s, or a scholar of the military or policy exploring how wars were reflected online. Yet although this information has been collected for over two decades, access has lagged: most scholars are limited to working with web archives one page at a time through portals such as the Wayback Machine. With the rise of the digital humanities, the computational social sciences, and web science more generally, scholars increasingly have the ability and desire to work with data at scale. In this presentation, we introduce the Archives Unleashed Cloud, currently supported through a grant from The Andrew W. Mellon Foundation. This service facilitates (a) the transfer of web archival data to the Cloud; (b) the analysis of that data and its transformation into standard scholarly derivatives; and (c) the building of a community around it via in-person events and learning guides. Our presentation begins by introducing the Cloud and its motivation, then discusses its technical underpinnings, and finally explores our current sustainability plan to keep the Archives Unleashed Cloud running after our foundation funding ends in 2020.

https://archivesunleashed.org/
https://cloud.archivesunleashed.org

Filed Under: CNI Spring 2019 Project Briefing, Digital Preservation, Emerging Technologies, Information Access & Retrieval, Project Briefing Pages, Repositories
Tagged With: cni2019spring, Project Briefings & Plenary Sessions

Last updated: Sunday, November 30th, 2025

 
