CNI: Coalition for Networked Information

Reactive and Proactive Archiving of Crisis

November 16, 2022

Sawood Alam
Web and Data Scientist
Internet Archive

Quinn Dombrowski
Academic Technology Specialist
Stanford University

The Internet Archive (IA) strives to archive more of the web and to archive it better. It values the quality of web archival collections produced from well-crafted scopes and curated seed lists (e.g., the End of Term Crawls). At times, however, the target web resources become so volatile and vulnerable that the IA cannot afford to crawl sequentially, after rigorous planning and seed collection. In such cases, the IA puts extraordinary effort into capturing the affected section of the web as quickly as possible, allocating a significant portion of its finite compute and human resources. As a recent example, when the Russia-Ukraine war took the world by surprise, the IA immediately began running crawls of relevant domains in a reactive mode, and its video archiving pipeline started to collect more videos on the topic.
The Saving Ukrainian Cultural Heritage Online (SUCHO) project arose independently of the IA. Its volunteers crowdsourced the work of identifying relevant resources, archiving them with tools such as the Wayback Machine’s Save Page Now service and Webrecorder, and performing quality assurance. Many members of the Wayback Machine team were actively involved in supporting the project and addressing infrastructure issues.
This session will include a discussion of some of the IA’s ongoing web archiving efforts related to crisis events, as well as an update on the SUCHO project. A presentation on SUCHO was included in CNI’s July 2022 Pre-Recorded Project Briefing Series: https://www.cni.org/topics/digital-humanities/saving-ukrainian-cultural-heritage-online-rapid-response-digital-humanities
Presentation

Filed Under: CNI Fall 2022 Project Briefings, Digital Curation, Digital Humanities, Digital Libraries, Digital Preservation, Information Access & Retrieval, Metadata, Personal Archives, Project Briefing Pages, Repositories, Social Media, Special Collections
Tagged With: cni2022fall, Project Briefings & Plenary Sessions, Videos

Last updated:  Wednesday, January 4th, 2023

 
