A Glimir of the Future
I spoke at VALA 2008, Australia’s biennial library conference, this past week. My participation was spurred initially by a request to deliver a paper on behalf of OCLC's Robin Murray, who could not attend. Soon after I agreed, the organizers invited me to stand in for a keynote speaker who had to cancel due to a family emergency. My first ever double-substitution conference.
The topic of the conference (Libraries: Changing Spaces, Virtual Places) gave me a perfect opportunity to combine two areas of interest – social networking and canonical identifiers – to present a case for how library systems might bring their assets into sharper Web focus.
OCLC has been exploring an important facet of this problem (canonical manifestation identifiers), and the VALA conference afforded a timely opportunity to announce this exploration. The tentative name for these identifiers: Global Library Manifestation Identifiers, or Glimirs for short.
The community at large is increasingly aware of the importance of canonical identifiers for FRBR entities, especially Group I entities (Works, Expressions, Manifestations, and Items). Existing OCLC numbers approximate manifestation identifiers, but ironically, as the database grows in scope, this rough correspondence is reduced through the loading of records in various languages. These are not duplicates, but rather alternative institutional, regional, or language representations that point to a given resource.
The need for explicit manifestation identifiers thus becomes more evident. We need identifiers that are globally scoped, business neutral, usable by all, and managed in either a centralized or federated manner. To the extent that such identifiers are canonical – that is, become the dominant identifier for a given asset, they increase the “URI equity” for library assets and will strengthen the library presence on the Web.
Interesting and challenging issues arise in the design of such identifiers and their supporting infrastructure. Broad adoption will require a careful balance of use-cases, business issues, and community participation in meeting the need. All of this in an environment already crowded with myriad special purpose identifiers. OCLC is launching a pilot to explore these issues. An early proposal has been shared with a number of technical and policy leaders and their valuable feedback will be used to strengthen the effort and move it to the next stage.
Stay tuned.
-----
A surfer at Torquay beach on the coast south of Melbourne.
Jonathan, you seem to have answered with your comment a question you brought up in this blog some months ago about the value of 'pure identifiers' (those without canonical resolvability).
I think we see some very good reasons for them right here. Every time when an identifier is resolvable, someone has to see to it, be it OCLC or some other entity (especially if such an identifier is global in scope).
The architecture enabling the resolution might or might not support a business case for that specific entity, but, to make it persistent, this entity should have made a commitment to continue offering this architecture by some means even when the current use or business case goes away.
My point is: Even if something akin to DNS resolution would be desirable for identifiers in our field, how likely is such a thing to emerge? (Even for something as important as ISBNs that hasn't happened yet.)
URNs vs. URLs are another good example: CDNL came up with the NBN namespace for National Bibliography Numbers, but who is using them, and who is providing the resolvers?
Resolvable identifier are always 'unpure' for a number of reasons. The mud of their initial purpose, business case, or of location (vs. identification) always clings to their boots. But we put up with those drawback because of their immediate usefulness.
Posted by:Michael Panzer | February 14, 2008 at 12:56 PM
You are probably aware that library customers and ESPECIALLY other vendors are suspicious that OCLC identifiers will be "business neutral". If you mean by "business neutral" what I think you mean, then I'm gratified you included it. You may not have.
But a hurdle OCLC will have to surmount if it wants a new form of OCLC-supplied identifiers (let alone 'canonical' identifiers) is convincing everyone that use of these identifiers will NOT be tied to OCLC products and services. We will need to be able to load these identifiers in whatever dbs we want without paying OCLC anything. We will need a way to discover, confirm, and assign these identifiers without paying OCLC anything.
Otherwise, library customers will be suspicious, and competing vendors will be outright refusniks.
Posted by:Jonathan Rochkind | February 13, 2008 at 02:07 PM