-
UMR – Unified Metadata Resources
Posted on April 12th, 2009 13 commentsOne single web page as the single identifier of every book, author or subject

I like the concept of “the web as common publication platform for libraries“, and “every book its own url“, as described by Owen Stephens in two blog posts:
“Its time to change library systems ”I’d suggest what we really need to think about is a common ‘publication’ platform – a way of all of our systems outputting records in a way that can then be easily accessed by a variety of search products – whether our own local ones, remote union ones, or even ones run by individual users. I’d go further and argue that platform already exists – it is the web!
and “The Future is Analogue ”
If every book in your catalogue had it’s own URL – essentially it’s own address on your web, you would have, in a single step, enabled anyone in the world to add metadata to the book – without making any changes to the record in your catalogue.
This concept of identifying objects by URL:Unified Resource Locator (or maybe better URI: Unified Resource Identifier) is central to the Semantic Web, that uses RDF (resource Description Framework) as a metadata model.
As a matter of fact at ELAG 2008 I saw Jeroen Hoppenbrouwers (“Rethinking Subject Access “) explaining his idea of doing the same for Subject Headings using the Semantic Web concept of triplets. Every subject its own URL or web page. He said: “It is very easy. You can start doing this right away“.

© Jeroen Hoppenbrouwers
To make the picture complete we only need the third essential component: every author his or her or its own URL!
This ideal situation would have to conform to the Open Access guidelines of course. One single web page serving as the single identifier of every book, author or subject, available for everyone to link their own holdings, subscriptions, local keywords and circulation data to.
In real life we see a number of current initiatives on the web by commercial organisations and non commercial groups, mainly in the area of “books” (or rather “publications”) and “authors”. “Subjects” apparently is a less appealing area to start something like this, because obviously stand-alone “subjects” without anything to link them to are nothing at all, whereas you always have “publications” and “authors”, even without “subjects”. The only project I know of is MACS (Multilingual Acces to Subjects), which is hosted on Jeroen Hoppenbrouwers’ domain.
For publications we have OCLC’s WorldCat, Librarything, Open Library, to name just a few. And of course these global initiatives have had their regional and local counterparts for many years already (Union Catalogues, Consortia models). But this is again a typical example of multiple parallel data stores of the same type of entities. The idea apparently is that you want to store everything in one single database aiming to be complete, instead of the ideal situation of single individual URI’s floating around anywhere on the web.
Ex Libris’ new Unified Resource Management development (URM, and yes: the title of this blog post is an ironic allusion to that acronym), although it promotes sharing of metadata, it does this within another separate system into which metadata from other systems can be copied.The same goes for authors. We have WorldCat Identities, VIAF, local authority schemes like DAI, etc. Again, we see parallel silos instead of free floating entities.
Of course, the ideal picture sketched above is much too simple. We have to be sure which version of a publication, which author and which translation of a subject for instance we are dealing with. For publications this means that we need to implement FRBR (in short: an original publication/work and all of its manifestations), for authors we need author names thesauri, for subjects multilingual access.
I have tried to illustrate this in this simplified and incomplete diagram:
In this model libraries can use their local URI-objects representing holdings and copies for their acquisitions and circulation management, while the bibliographic metadata stay out there in the global, open area. Libraries (and individuals of course) can also attach local keywords to the global metadata, which in turn can become available globally (“social tagging”).
It is obvious that the current initiatives have dealt with these issues with various levels of success. Some examples to illustrate this:
- Work: Desiderius Erasmus – Encomium Moriae (Greek), Laus Stultitiae (Latin), Lof der Zotheid (Dutch), Praise of Folly (English)
- Author: David Mitchell
Authors
Good:- Erasmus in WorldCat Identities (one ID, many forms)
- David Mitchell in WorldCat Identities (one id per author)
- David Mitchell in VIAF (one id per author)
Medium:
- Erasmus in OpenLibrary (one id, one incomplete form)
- Erasmus in VIAF (one id, although from The Netherlands, preferred forms are Swedish, French and German)
Bad:
- Erasmus in Librarything (no identifier, numerous forms and occurrences)
- David Mitchell in Librarything (one form, “David Mitchell is composed of at least 12 distinct authors“, no way to distinguish)
- David Mitchell in OpenLibrary (one id for multiple authors)
Publications
Good:Bad:
- Erasmus “Praise of folly” in Librarything (numerous entries for all different title variations)
- Erasmus “Praise of folly” in OpenLibrary (numerous entries for all different title variations)
These findings seem to indicate that some level of coordination (which the commercial initiatives apparently have implemented better than the non-commercial ones) is necessary in order to achieve the goal of “one URI for each object”.
Who wants to start?
-
Unique authors
Posted on February 4th, 2009 1 commentJonathan Rochkind, in his post “How do name authorities work anyway?“, wonders if catalogers will confuse him with another writer of the same name that has an LC authority record, whereas he does not have one.
I guess the relevance of this problem depends entirely on the question: do you think it’s important to know that an author of a specific work is the same as the author of another work? A former colleague of mine whom I respect very much, used to say that it does not matter, as long as the correct name appears with the work in question. This was only six years ago, before the emergence of web 2.0 and library 2.0 type services. It is just like looking at a printed book: you read the author’s name, and if there is no further information on the back cover, or a list of publications by the author inside, then that’s all there is to it. In normal life, if you read a book or an article for pleasure, or even for business, study or research, that is no problem. No need for author authority records at all.
However, the picture is completely different from the point of view of the authors, especially in the case of professional scientific and research staff, where the exact number of publications and citations is crucial. For these authors it is vital that the correct authority record is used for their publications. Here we definitely need authority records with unique identifiers. But of course there are so many different systems in use: LC authority records , WorldCat Identities , national systems etc., they all use their own identifiers.
There is the proposal to develop the UAI, Universal Author Identifier . This system depends on authors registering and maintaining their own personal information in a freely accessible web based database. There was a pilot system for a while, but it is not clear if any results were reached.
In The Netherlands a similar project on a national scale has led to a live implementation: the DAI, Digital Author Identifier . The DAI is based on the identifier used for authors in the OCLC-PICA Dutch National Union Catalog /Common Catalog system “PPN”, and is assigned to every author who has been appointed to a position at a Dutch university or research institute or has some other relevant connection with one of these organisations. The DAI is used in the Dutch university repositories, the Dutch national Research Database and in the national integrated portal NARCIS .
The difference with UAI is that DAI is assigned by catalogers in one of the participating organisations, whereas UAI depends on voluntary cooperation of the authors themselves.Of course a “universal author identifier” still does not solve Jonathan’s initial question: confusion is still possible if the authors do not have a clear interest in maintaining their information themselves.
Another issue here, about which something more can be said in a future post, is that for a real universal system we should use URI’s, as for unique works (see Owen Stephens’ post “The Future is Analogue “) and subject headings.








Recent Comments