CNI: Coalition for Networked Information

Text and Data-Mining on Licensed Collections

March 17, 2016

Peter Leonard
Director, Digital Humanities Lab
Yale University

Many academic research libraries now spend more on licensed electronic content than they do on print materials. These commercial databases promise easy access to vast quantities of digitized material, but they are restricted by both copyright and licensing agreements. Robots Reading Vogue, a project of the Yale University Library DHLab, is an effort to build DH tools on top of such an archive, allowing exploration and experimentation on 400,000 pages of Vogue magazine while still respecting copyright. Yale Library worked with ProQuest, the company that digitized Vogue for Condé Nast, to secure full access to the raw data underlying the commercial product. Any user of the site can explore patterns in the data, using affordances such as an n-gram search tool, but the hyperlinks to individual articles resolve only if the user has access privileges to the ProQuest content. We hope this project serves as one possible model for ensuring researchers’ rights to fully explore the contents of vendor-digitized, licensed content.
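To make the idea of an n-gram search tool concrete, the following is a minimal sketch of how the relative frequency of a phrase could be tracked by year across a corpus. It is an illustration only and not the project's actual implementation: the function names, the (year, text) input format, and the sample sentences are all assumptions invented for this example. Note that it returns only aggregate statistics, which is the kind of output that can be shown publicly without exposing the licensed article text.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return all n-grams in a token list as space-joined strings."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngram_frequency_by_year(documents, phrase):
    """Return {year: relative frequency of `phrase`}.

    `documents` is an iterable of (year, text) pairs. This input format is
    hypothetical; a real archive would need its own loader and tokenizer.
    """
    target = phrase.lower()
    n = len(target.split())
    hits = Counter()    # occurrences of the phrase per year
    totals = Counter()  # total number of n-grams per year
    for year, text in documents:
        grams = ngrams(text.lower().split(), n)
        totals[year] += len(grams)
        hits[year] += grams.count(target)
    # Only aggregate ratios are returned, never the underlying text.
    return {year: hits[year] / totals[year] for year in totals if totals[year]}


# Invented sample data, used only to exercise the function.
sample = [
    (1950, "the new look in evening gowns"),
    (1950, "evening gowns for the season"),
    (1990, "denim and leather for the street"),
]
freq = ngram_frequency_by_year(sample, "evening gowns")
```

In a design along these lines, a public interface would serve only values such as those in `freq`, while links to the source articles would be left to the vendor's own access controls.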

http://dh.library.yale.edu/projects/vogue/

Presentation (PDF)

Filed Under: CNI Spring 2016 Project Briefings, Digital Humanities, Information Access & Retrieval, Project Briefing Pages
Tagged With: cni2016spring, Project Briefings & Plenary Sessions, Videos

Last updated:  Thursday, May 12th, 2016

 
