CNI: Coalition for Networked Information

  • About CNI
    • Membership
    • CNI Collaborations
    • Staff
    • Steering Committee
    • CNI Awards
    • History
    • CNI News
  • Program Plan
    • Current Program Plan
    • Program Plan Archive
  • Topics
  • Events & Projects
    • Membership Meetings
    • Workshops & Projects
    • Other Events
    • Event Calendar
  • Resources
    • Publications by CNI Staff
    • Program Plan
    • Pre-Recorded Project Briefing Series
    • Videos & Podcasts
    • Follow CNI
    • Historical Resources
  • Contact Us

Collaborative Software Archiving for Institutions (CoSAI)

Home / Project Briefing Pages / CNI Fall 2022 Project Briefings / Collaborative Software Archiving for Institutions (CoSAI)

November 16, 2022

Martin Klein
Scientist
Los Alamos National Laboratory

Lightning Round

Increasingly, scholarly artifacts created throughout the research lifecycle such as source code, datasets, audio/visual recordings, technical reports, presentation slides, and blog posts are considered necessary components of the scholarly record. Our previous research confirmed this claim and revealed the widespread use of version control systems among academic researchers writing source code. However, we also noticed a lack of comprehensive institutional programs to preserve their scholars’ source code. In addition, there are very few technical systems and tools available for the high-quality capture of source code artifacts at scale.

In response to this observation, and with generous support from the Alfred P. Sloan Foundation, we started the “Collaborative Software Archiving for Institutions (CoSAI)” project as a collaboration between New York University, Los Alamos National Laboratory, the University of Pittsburgh, and Old Dominion University. The CoSAI project focuses on developing and evaluating institutional approaches to archive research software hosted on public code sharing platforms on the web. This includes the creation (and sharing) of institutional collection policy plans as well as the technical development of systems that do not just crawl the code hosted on sites like GitHub, but also reliably capture the ephemeral record of supplementary material related to the code, for example, discussion threads and issues. We consider these materials just as important as the code itself, specifically for the purpose of reproducibility and replicability. By leveraging existing open source tools like Memento Tracer and building on workflow engines such as OCCAM, CoSAI will contribute to capturing web resources from code repositories at high quality, at scale, and in a reproducible manner. The briefing will include a project plan outline and report on progress to date.

See also “Who Versions Scholarly Code” (https://youtu.be/hmhMaEGQA6o) from CNI’s Pre-Recorded Project Briefing Series, Oct. ’22.

 

Share this:

  • Click to share on Facebook (Opens in new window)
  • Click to share on Twitter (Opens in new window)
  • Click to share on LinkedIn (Opens in new window)

Filed Under: CNI Fall 2022 Project Briefings, Digital Curation, Digital Libraries, Digital Preservation, Emerging Technologies, Information Access & Retrieval, Metadata, Project Briefing Pages, Repositories, Scholarly Communication
Tagged With: cni2022fall, Project Briefings & Plenary Sessions, Videos

Last updated:  Wednesday, January 4th, 2023

 

Contact Us

21 Dupont Circle
Suite 800
Washington, DC, 20036
202.296.5098

Contact us
Copyright © 2023 CNI

  • Copyright Policy
  • Privacy Policy
  • Site map

Keeping up with CNI

CNI-ANNOUNCE is a low-volume electronic forum used for information about the activities and programs of CNI, and events and documents of interest to the CNI community.
Sign up

Follow CNI

  • View cni.org’s profile on Facebook
  • View cni_org’s profile on Twitter
  • LinkedIn
  • YouTube
  • Vimeo

A joint project