CNI Fall 2020 Membership Meeting: ReCiter: an Open Source Author Disambiguation System for Academic Medical Institutions

Paul Albert
Identity & Access Management Architect / Associate Professor
Cornell University

Sarbajit Dutta
Senior Web Application Developer
Cornell University

Staff at medical institutions are regularly called upon to produce and maintain lists of scholarly publications authored by individuals ranging from NIH-funded principal investigators to people affiliated with other institutions, such as alumni and residents. This work tends to be done on an ad hoc basis and is time-consuming, especially when profiled individuals have common names. Often, feedback from the authors themselves is not captured in a central location where it can be reused for future requests.

ReCiter is a highly accurate, rule-based system for inferring which publications in PubMed a given person has authored. ReCiter includes a Java application, a DynamoDB-hosted database, and a set of RESTful microservices which collectively allow institutions to maintain accurate and up-to-date author publication lists for thousands of people. This software is optimized for disambiguating authorship in PubMed and, optionally, Scopus.

ReCiter rapidly and accurately identifies articles authored by a given person, including those written at previous affiliations. It does this by leveraging institutionally maintained identity data (e.g., departments, relationships, email addresses, and year of degree). With the more complete and efficient searches that result from combining these types of data, individuals at institutions can save time and be more productive. By running ReCiter daily, an institution can ensure that the desired users are the first to learn when a new publication appears in PubMed.
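
To make the idea of combining identity data concrete, the following is a minimal sketch of rule-based evidence scoring. It is not ReCiter's actual implementation: the `Identity` and `Article` fields, the weights, the threshold, and the function names are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Identity:
    """Institutionally maintained data about one person (hypothetical fields)."""
    last_name: str
    first_initial: str
    emails: set = field(default_factory=set)
    departments: set = field(default_factory=set)
    degree_year: Optional[int] = None


@dataclass
class Article:
    """A simplified candidate article record (hypothetical fields)."""
    pmid: str
    author_names: list       # e.g. ["Smith J", "Lee K"]
    author_emails: set
    affiliation_text: str
    year: int


# Illustrative weights only; these are assumptions, not ReCiter's rules or values.
WEIGHTS = {"name": 1.0, "email": 5.0, "department": 2.0, "degree_year": 0.5}


def score(identity: Identity, article: Article) -> float:
    """Sum the weights of every piece of identity evidence the article matches."""
    total = 0.0
    target = f"{identity.last_name} {identity.first_initial}".lower()
    if any(name.lower() == target for name in article.author_names):
        total += WEIGHTS["name"]
    known_emails = {e.lower() for e in identity.emails}
    if known_emails & {e.lower() for e in article.author_emails}:
        total += WEIGHTS["email"]
    affiliation = article.affiliation_text.lower()
    if any(dept.lower() in affiliation for dept in identity.departments):
        total += WEIGHTS["department"]
    # An article published before the person's degree year earns no credit here.
    if identity.degree_year is not None and article.year >= identity.degree_year:
        total += WEIGHTS["degree_year"]
    return total


def likely_articles(identity: Identity, candidates: list, threshold: float = 3.0) -> list:
    """Return PMIDs scoring at or above the threshold, highest score first."""
    scored = [(score(identity, a), a.pmid) for a in candidates]
    return [pmid for s, pmid in sorted(scored, reverse=True) if s >= threshold]
```

In this sketch a name match alone stays below the threshold, so a common name such as "Smith J" is accepted only when corroborated by other evidence, for example a known email address or department.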

ReCiter is freely available and open source under the Apache 2.0 license. https://github.com/wcmc-its/ReCiter

For our presentation, we will demonstrate:
– How to run ReCiter
– ReCiter’s set of APIs for retrieving data
– How ReCiter integrates with a third-party interface for capturing feedback, which is fed back into ReCiter to further improve accuracy
– How to use data collected by ReCiter to produce reports, including Altmetric activity

Code – https://github.com/wcmc-its/ReCiter/
Demo – https://www.youtube.com/watch?v=MwaIXHc4PzI&feature=youtu.be

Presentation

ReCiter: an Open Source Author Disambiguation System for Academic Medical Institutions
