CNI: Coalition for Networked Information

What To Do with All of those Hard Drives: Data Mining at Duke

December 4, 2012

Joel Herndon
Head, Data and GIS Services
Duke University

Molly Tamarkin
Associate University Librarian for Information Technology
Duke University

Research libraries face growing demand for collections and services that support text mining, yet most digital text and e-journal collections are licensed and hosted by vendors in ways that prevent data mining. A few publishers, however, have supplied hard drives as “backup” copies of these licensed databases. Unsure what to do with its growing collection of hard drives, and recognizing that copies of this data could easily be obtained again should a “backup” fail, Duke University Library decided to create a text mining collection within its Center for Data & GIS Services. Researchers at Duke can now access large-volume text collections either in a lab designed for big data research or on their own machines, through a system that provides working copies of large-scale text collections. The library has also launched a series of workshops on research strategies for text mining, covering topics that range from managing text data structures to latent Dirichlet allocation. This presentation will describe the new services and data analytic methodologies and explore continuing issues in text mining, from licensing to access to research support.

Presentation

Filed Under: CNI Fall 2012 Project Briefings, Digital Libraries, Information Access & Retrieval, Project Briefing Pages, Repositories
Tagged With: cni2012fall, Project Briefings & Plenary Sessions

Last updated:  Thursday, August 4th, 2022

 
