CNI: Coalition for Networked Information

  • About CNI
    • Membership
    • Staff
    • Steering Committee
    • CNI Awards
    • History
    • CNI News
  • Membership Meetings
    • Next Meeting
    • Past Meetings
    • Future Meetings
  • Topics
  • Events & Projects
    • Membership Meetings
    • Workshops & Projects
    • Other Events
    • Event Calendar
  • Resources
    • CNI Publications
    • Program Plan
    • Pre-Recorded Project Briefing Series
    • Videos & Podcasts
    • Executive Roundtables
    • Follow CNI
    • Historical Resources
  • Contact Us

A Radical New Approach to Data Citation: Cook the Carrots, Burn the Sticks

Home / Project Briefing Pages / CNI Fall 2023 Project Briefings / A Radical New Approach to Data Citation: Cook the Carrots, Burn the Sticks

November 15, 2023

Jamie Wittenberg
Assistant Dean for Research & Innovation Strategy
University of Colorado Boulder

John Chodacki
University of California Curation Center (UC3) Director
California Digital Library

Kristi Holmes
Associate Dean for Knowledge Management and Strategy and Director, Feinberg School of Medicine
Northwestern University

Librarians and other stakeholders, through the Make Data Count initiative, have worked to advance data citation adoption among researchers and publishers by leveraging a variety of incentives (carrots) and regulations (sticks). Though much progress has been made, DataCite and Crossref systems show that across millions of scholarly outputs, structured ‘data citations’ are only present in tens of thousands of records. Often, this is because researchers mention underlying data without creating a structured citation or because publishers do not support structured citations for datasets in a paper’s references. The Make Data Count initiative devised a new strategy that does not rely on researchers or on publishers to assert the relationship between a paper and its underlying datasets. With funding from Wellcome Trust, DataCite has worked with the Chan Zuckerberg Initiative to develop a machine-learning algorithm that extracts references to underlying data from full journal articles and preprints without the inclusion of structured data citations. This model has been applied to the full text of hundreds of millions of articles, resulting in the Open Global Data Citation Corpus—a trusted central aggregate of all references to research data across articles, preprints, government documents, and other outputs. This corpus will fundamentally change the way libraries, bibliometricians, research administrators, software systems, and funders measure the impact of scholarly research.

https://makedatacount.org/data-citation/

Presentation Slides

Share this:

  • Click to share on Facebook (Opens in new window) Facebook
  • Click to share on LinkedIn (Opens in new window) LinkedIn
  • Click to share on Mastodon (Opens in new window) Mastodon
  • Click to share on Bluesky (Opens in new window) Bluesky
  • Click to share on X (Opens in new window) X

Filed Under: CNI Fall 2023 Project Briefings, Project Briefing Pages, Publishing, Repositories, Research Data Management, Scholarly Communication, Spaces
Tagged With: cni2023fall, Project Briefings & Plenary Sessions

Last updated:  Friday, January 5th, 2024

 

Contact Us

1025 Connecticut Ave, NW #1200
Washington, DC 20036
202.296.5098

Contact us
Copyright © 2026 CNI

  • Copyright Policy
  • Privacy Policy
  • Site map

Keeping up with CNI

CNI-ANNOUNCE is a low-volume electronic forum used for information about the activities and programs of CNI, and events and documents of interest to the CNI community.
Sign up

Follow CNI

LinkedInBlueSkyFacebookTwitterYouTubeVimeoMastodon

A joint project