Dynamic OpenURL lookup with document delivery

Even though I’ve switched to a different area of work, I have still been handling many projects and tasks related to my old job these past several weeks. One project involved the integration of our OpenURL resolver (SFX) into our document delivery service.  A few days ago this project work completed and was successfully implemented. I am pretty thrilled with the result!

Basically a team of people met earlier this year to figure out how we could address the following issues:

  • Make better use of our SFX linking
  • Help our users know, earlier in the process, when we have something available in full text
  • Help our users save time and money by avoiding unnecessary document delivery charges

My library has a heavily-used web form that our customers use for inputting document delivery requests, 99% of which are for journal articles.  We charge for fulfilling their requests (average charge across all orders including regular charges mixed with higher priced rush orders is between $20-30 per article request).  We also know that a fairly significant number of article orders received via this web form on our site are for articles that we already have available in full text, at no additional cost to our users. Our library averages about 100 such requests per day.  The typical workflow is that a user inputs citation information into the form, clicks on a Continue button, is presented with a confirmation screen (with the ability for the user to modify or change information in the form), and then a Submit button for completing the order process.

Our idea was to add new functionality between the initial order form input screen and the confirmation screen, such that the article citation information would be used to dynamically look up our holdings in SFX and, if a valid match was found, a new SFX full text link would be presented in the confirmation screen telling the user that full text was available online.

This new functionality sounds simple but involves a lot of complex stuff behind-the-scenes.  In particular we were concerned that the SFX link presented to the user needs to work as close to 100% of the time as possible.  Anyone who uses any kind of OpenURL service knows that full text links are not as stable and successful as users wish.  The last thing we wanted was to present this new option and then give the user a bad experience and turn them off if the link, when clicked on, doesn’t work.

As already stated, I am thrilled with the results. Extensive testing has shown that this new functionality works well. The end result will be significant cost savings for our users. A very conservative estimate puts savings at more than $20,000 per year.  One of the things we built into the project is a method for specifically tracking use of the new functionality so we’ll be able to have exact figures rather than estimates over time.  Below is a screenshot of the new functionality.

image-0017.jpg

And the really cool thing is that with the coding we’ve done behind-the-scenes, this project is only the start of what we are able to do.

Roadblock to full OpenURLness [Updated]

This week I encountered a significant roadblock when trying to use OpenURL in a situation where it is a natural fit. Let me explain the scenario. A scientific researcher at the company where I work built an extensive bibliography of journal articles on a particular subject, and wants to publish that bibliography on the company intranet, complete with hyp[er]text links to the full text. This person initially thought it’d be ok to simply mount the full text articles that he had downloaded in the same webspace as the bibliography, and simply link to the files. Of course, that ideas was quickly shot down. Instead, we thought, why can’t we take this bibliography, check it against our SFX KnowledgeBase to see what articles we have available in full text, and then output the complete OpenURL for each of those articles for this researcher to use when marking up and publishing his bibliography?

The use case sounds straightforward, right? Turns out that it is anything but. I was provided with a text file of citations and was asked to come up with appropriate SFX links for each. Of course I could have manually rekeyed the citations one by one into a search form querying our SFX KB, but that would take quite a long time and quite a bit of effort. I tried to think of how this whole process could be automated.

On the advice of Dan Chudnov I downloaded an open source application written in Perl called Biblio-Citation-Parser, which on the face of it seemed to be exactly what I needed. I need a way to automatically parse the whole list of citations into the necessary chunks of metadata, and then automatically generate an OpenURL for each citation. After trying unsuccessfully to get Biblio-Citation-Parser to work (this isn’t a limitation of the software but of my Perl expertise), I sent queries out to other SFX users as well as to the Code4Lib discussion list. There were several responses from members of the Code4Lib discussion list, some of whom mentioned the application that I already knew about. But it turns out that pretty much nobody in that community [at least among those who responded] had ever used it, and also, that nobody in that community had come up with a good solution to this parsing problem themselves.

Since the original citations were stored in Reference Manager, one of the more common citation management software applications, I wrote back to the colleague who first asked me to help with this situation, asking him if he could provide me with the Reference Manager files. He did, and I downloaded a free trial version of the software, imported the references, then exported them in RIS format. Next, I imported the RIS output file into Zotero, and then exported the whole bibliography from Zotero into a readymade HTML bibliography. Because of Zotero’s built-in COinS functionality, the readymade HTML bibliography is automatically populated with OpenURLs. But I wasn’t done yet. I had to go through each citation by hand and test whether we did indeed have the article in full text, and also, to edit the HTML coding to substitute our company’s specific SFX base URL in each link.

In the end, I achieved what the user wanted — a list of bibliographic references with SFX links as the hypertext links. But it was a huge amount of work, and I kept asking myself, surely there is a better, easier way to do this?! Surely, someone, somewhere has already solved this problem of how to readily parse bibliographic citations in a text file and run them through a process to check for which articles are available in full text?

Maybe there is a much simpler solution and if you know of it, please comment on this post to let me know. I’m left thinking that this whole OpenURL stuff still has a ways to go in terms of ease of implementation for situations like I described.

SFX and convenience for users

One of the things I am responsible for is my library’s SFX service. I am a huge SFX fan and one of the reasons for my enthusiasm is that there are so many interesting things one can do with SFX. Today I stumbled across an example SFX services window customized for the Max-Planck-Gesellschaft. Two aspects to their customizations that caught my attention are highlighted in yellow and green in the following screenshot: a message that directs non-authenticated users to a local link resolver via WorldCat Registry, and a link to generate a TinyURL for the services window. Pretty cool! These additions go that little bit further to create convenience for users.

Merger fallout

Last week about 45 employees from Endeavor lost their jobs as a result of the merger with Ex Libris. Among them are friends and former co-workers. I am not surprised that some were laid off; what surprises me is why some were laid off and others kept their jobs. Some who kept their jobs don’t deserve it, in my opinion.

What else is clear from the merger is that not much aside from Voyager will be retained in terms of products from the Endeavor side of the equation. It appears that for example, Meridian customers will be migrated to Verde; Discovery: Resolver customers will be migrated to SFX; and Discovery: Search customers will be migrated to MetaLib. At least that is my understanding at this stage. Oh, I forgot: There is one other Endeavor product that appears to have staying power: Journals Onsite. I suppose the installed user community for that product as well as for Voyager was just too strong to ignore or alienate.

It is a weird situation. I’m still getting used to saying ‘Ex Libris’ instead of ‘Endeavor.’ I wonder how many customer libraries will actually migrate to new products at a time when many are already overstretched in terms of time and resources devoted to complex information management systems. I wonder about those employees from Endeavor who remain and how they will manage the transition and uncertainty, and the same thing from the Ex Libris side. Time will tell.

Selfishly I am thankful I was able to leave Endeavor when I did, and I am thankful to have a job, period.

Ross Atkinson dies

Yesterday I was saddened by the news that Ross Atkinson, Associate University Librarian at Cornell, had died. I never met him but I’ve read many of his writings, so my view of him is definitely second hand at best. My opinion is that Ross was one of the most important thinkers regarding libraries, collections, and acquisitions of the past several decades. I thoroughly enjoyed his writing style, his deep and sometimes provocative thoughts, and his perspectives on important issues. One of his articles (Toward a Redefinition of Library Services” (In Virtually Yours. Chicago, ALA, 1999; p. 3-21)Find in My Library) is a required introductory reading to the course I teach and it invariably sparks a lot of discussion. Other articles or writings of his are also part of the course. If you don’t know much about him or are interested in his writings, I highly recommend that you find a copy of Community, Collaboration, and Collections: the Writings of Ross Atkinson published in 2005 by the American Library Association. Here is the complete citation in Chicago Manual of Style format:

Atkinson, Ross, Robert Alan, and Bonnie MacEwan. Community, Collaboration, and Collections : The Writings of Ross Atkinson. Chicago: Association for Library Collections & Technical Services, 2005.Find in My Library

RSS and Aleph online catalogs

A lot of people in the library blogosphere get excited when an ILS vendor announces some kind of RSS capability for their online catalogs. I wanted to mention here some excitement of my own when I recently discovered some interesting RSS functionality for the ILS I maintain (Ex Libris Aleph 500), developed by Peter Corrigan of the National University of Ireland, Galway, James Hardiman Library. Peter has implemented this in relation to A9.com’s OpenSearch technology.

See his entry at A9.com and also click here to see a sample search. (Note the orange icons for RSS and Permalink in the upper lefthand side.) Yes, this is cool!

I also read on the North American Aleph Users Group discussion list that the new product manager for Aleph, Katriel Reichman, is actively tracking and investigating the use of RSS in Ex Libris products, including SFX and MetaLib.

Lilly Fund loses 20% of value

An article in the Indianapolis Star reports that the Lilly Endowment lost 20% of its value last year. This is big news because this is the largest source of charitable funds in the whole state, and the Lilly Endowment is particularly generous to educational institutions like the one where I work. In fact, Lilly gave us millions for faculty development recently. This is only the latest grant they have given us. They also gave the money that allowed our library consortium to purchase a new integrated library system (Aleph 500), OpenURL resolver service (SFX), and metasearching tool (MetaLib). Part of the grant is also paying for software (ContentDM) and hardware for member libraries to build digital library collections.

Lecture @ UIUC GSLIS

Yesterday I was able to go to UIUC GSLIS at the invitation of Kathie and Bill Henderson to speak to their Technical Services Functions class. (This is the same class that I teach in online form in UIUC GSLIS’s LEEP curriculum.) They’ve invited me every year for the past 14 years. Each time I go, I thoroughly enjoy the time with their students.

The topic of my talk was “The Times, They Are a Changin…or, How to Thrive in an Age of Chaos and Opportunity in Technical Services Librarianship.” What I try to do is give a quick overview of my career path thus far, and then focus on electronic resources as the “hot” area. We usually spend a lot of time talking about user expectations in this era of the Google Mindset, and how libraries are coping with the demand for online full-text while simultaneously struggling to manage the “traditional” collection. I demo’d the ERM system that we developed at Taylor, known as the Taylor Periodical Administration System (TPAS), and also talked quite a bit about OpenURL and metasearching technologies.

There are about 14 students in this class. Some years the students really get into a good discussion, while other times, they seem content just to listen to a lecture. I prefer interaction and was pleased that this year there were a lot of questions. Especially in recent years, students have expressed some apprehension about the future of technical services librarianship, or uncertainty about job prospects in this specialty. This topic came up tangentially in yesterday’s session. One student asked, given what we had been talking about in terms of the predominance of e-resources and vendor tools that libraries are stuck with, what is stopping vendors from selling or marketing directly to researchers? (We had also been talking quite a bit about Google Scholar.) I thought this was an excellent question. It really asks the question, are libraries relevant anymore? Why should libraries assume they are even part of the equation for information consumers? Have libraries given up way too much ground, ground that they can never expect to recover? Very interesting questions. We spoke about the decision made by libraries about 100 years ago to stop trying to do article-level analytics in their cataloging practice, and how that resulted in the whole development of vendor solutions in the form of abstracting and indexing tools. Those tools have then developed into online form in the last 20 years or so, and then have been developed further to provide aggregated full-text content. We have lost control (if we ever had it) of the means of access to our resources.

Overall it was a very stimulating time and I ran out of time trying to cover everything. I wish we had had all day!

The Henderson’s had made lunch and invited Linda Smith, Associate Dean and one of my former professors, to eat with us. It was nice to chat with them about various things. One of the things I noticed this time in my campus visit was that development of various areas of the campus continues at a rapid pace. Some areas are hardly recognizable anymore, they have changed so much since I was a student there.

Catching my breath and other ruminations

I have a lot of guilt and I’m not even Catholic…One of the things I feel guilty about is not posting more often and more regularly on this blog. It’s not for lack of things to write about, that’s for sure. I am just trying to catch my breath most days. There is so much to do, so little time (how unoriginal of me to write that, but it’s true). I have been preoccupied with many interesting yet stressful things. One of them is MetaLib training. For those who aren’t familiar with this product, let me just say that it is an example of a metasearching tool, sometimes also known as federated search. My library is part of a consortium that purchased this software in a package along with an integrated library system called Aleph 500, and a context-sensitive linking software called SFX. MetaLib is really cool and this kind of tool can help libraries like mine to offer a much more user-friendly way to find information for our users.While thinking of MetaLib and the issues of metasearching (or federated searching), I was reminded of the fact that I created a pretty barebones webliography of articles, reports, etc. on this topic here. I originally created this webliography as a companion tool for a presentation at last year’s NASIG conference, and have had several requests from people who want to share it with others, link to it, whatever. This a.m. as I was watching the kids while Michele went to her MOPS meeting, I was thinking that I desperately need to update that webliography. I have saved over 100 additional article citations in RefWorks (another neat new e-resource that I have purchased for our library users) and I need to figure out a time to look through them all and select those that I want to add into the webliography. Then, lo and behold, one thought led to another, and I had a “Doh!” kind of moment: Why don’t I reformat the webliography so that I can offer it as an RSS feed? That way, anyone who wants to do so can subscribe to the feed and automatically keep informed about any additions or changes I make to it. Why on earth hadn’t I thought of this before? How stupid of me…So, here goes with another thing to do…Anyone who thinks life as a librarian is boring doesn’t know their head from a hole in the ground.

Periodical Management Systems

A while ago I mentioned that I had written an article for Serials Review on periodical management systems, and that I had been working for quite some time on such a system at my library. This topic continues to be of interest to a lot of folks. Recently there have been two or three threads on SERIALST related to this issue in which the predominant opinion seems to be that Serials Solutions walks on water. I haven’t used their services — the whole point of developing something locally is/was to do what they or competing services do for cheaper and do it better — but they seem to get consistently good press!

I have a meeting on Friday with a subset of PALNI libraries to demonstrate what we’ve been doing locally with this homegrown system. My hope is that at least a few other libraries within our consortium will join in using it. One good reason is that we all share SFX and the Taylor Periodical Administration System integrates SFX data, making it somewhat distinctive from other setups I’ve read about. My ultimate hope is to make the system freely available as open source because I believe that there are many libraries out there who simply can’t afford services from Serials Solutions, TDNet, Ebsco, and others. I think this is particularly true of many small college and university libraries, as well as some public libraries. A huge amount of credit for the whole system goes to one of my student workers, Matt Wissman, who has worked on it for over two years now. He is really gifted in database design, advanced web development, and other areas, and I am really going to miss him when he graduates next year.