CNI: Coalition for Networked Information

  • About CNI
    • Membership
    • Staff
    • Steering Committee
    • CNI Awards
    • History
    • CNI News
  • Membership Meetings
    • Next Meeting
    • Past Meetings
    • Future Meetings
  • Topics
  • Events & Projects
    • Membership Meetings
    • Workshops & Projects
    • Other Events
    • Event Calendar
  • Resources
    • CNI Publications
    • Program Plan
    • Pre-Recorded Project Briefing Series
    • Videos & Podcasts
    • Executive Roundtables
    • Follow CNI
    • Historical Resources
  • Contact Us

Why Google Scholar Has Trouble Indexing Institutional Repositories

Home / Project Briefing Pages / CNI Spring 2012 Project Briefings / Why Google Scholar Has Trouble Indexing Institutional Repositories

March 30, 2012

Kenning Arlitsch
Associate Dean for IT Services
University of Utah
Patrick O’Brien
Search Engine Optimization Manager
University of Utah

Google Scholar (GS) has difficulty indexing the contents of institutional repositories (IRs) because most IRs use Dublin Core metadata, which cannot express bibliographic citation information adequately for academic papers. GS’s Webmaster Inclusion Guidelines site cautions to “use Dublin Core only as a last resort,” and recommends other metadata schemas instead. It also recommends specific guidelines to facilitate crawlers, including writing metadata from the repository database to HTML headers. Surveys of institutional and disciplinary repositories across the United States were conducted and the inquiries revealed indexing ratios to support the hypothesis that IRs that do not follow these metadata and crawl guidelines suffer from a low indexing ratio. Survey results also demonstrate that the low indexing ratio problem cuts across institutions and repository software. Three pilot projects were conducted that transformed the metadata of a subset of papers from USpace, the University of Utah’s institutional repository, and examined the results of Google Scholar’s harvest. The pilot projects were successful, achieving a 90% indexing ratio.

This presentation will cover the highlights of a paper that is being published in March in Library Hi Tech. The broader research initiative emphasizes search engine optimization for all digital repositories, including general digital library collections, and has recently been funded by a 3-year National Leadership Grant from the Institute of Museum and Library Services.

 

Share this:

  • Click to share on Facebook (Opens in new window) Facebook
  • Click to share on LinkedIn (Opens in new window) LinkedIn
  • Click to share on Mastodon (Opens in new window) Mastodon
  • Click to share on Bluesky (Opens in new window) Bluesky
  • Click to share on X (Opens in new window) X

Filed Under: CNI Spring 2012 Project Briefings, Information Access & Retrieval, Repositories
Tagged With: CNI2012spring, Project Briefings & Plenary Sessions

Last updated:  Friday, March 30th, 2012

 

Contact Us

1025 Connecticut Ave, NW #1200
Washington, DC 20036
202.296.5098

Contact us
Copyright © 2025 CNI

  • Copyright Policy
  • Privacy Policy
  • Site map

Keeping up with CNI

CNI-ANNOUNCE is a low-volume electronic forum used for information about the activities and programs of CNI, and events and documents of interest to the CNI community.
Sign up

Follow CNI

  • View cni.org’s profile on Facebook
  • View cni_org’s profile on Twitter
  • LinkedIn
  • YouTube
  • Vimeo
  • Flickr
  • Tumblr

A joint project