<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>CommonPlace.Net</title>
	<atom:link href="http://commonplace.net/feed/" rel="self" type="application/rss+xml" />
	<link>http://commonplace.net</link>
	<description>Library2.0 and beyond</description>
	<lastBuildDate>Fri, 22 Jan 2010 16:45:48 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Mobile reading</title>
		<link>http://commonplace.net/2010/01/mobile-reading/</link>
		<comments>http://commonplace.net/2010/01/mobile-reading/#comments</comments>
		<pubDate>Fri, 22 Jan 2010 16:42:54 +0000</pubDate>
		<dc:creator>Lukas Koster</dc:creator>
				<category><![CDATA[Library2.0]]></category>
		<category><![CDATA[e-books]]></category>
		<category><![CDATA[e-papers]]></category>
		<category><![CDATA[e-readers]]></category>
		<category><![CDATA[epub]]></category>
		<category><![CDATA[frbr]]></category>
		<category><![CDATA[hardware]]></category>
		<category><![CDATA[marc]]></category>
		<category><![CDATA[mobi]]></category>
		<category><![CDATA[mobile]]></category>
		<category><![CDATA[newspapers]]></category>
		<category><![CDATA[pdf]]></category>

		<guid isPermaLink="false">http://commonplace.net/?p=941</guid>
		<description><![CDATA[New models, new formats
Recently I have been experimenting a bit with reading newspapers on my mobile phone (a G1 android device), or maybe I should say &#8220;reading news on my mobile&#8221;. I looked at two Dutch newspapers that adopt two completely different approaches.
&#8220;NRC Handelsblad&#8221; publishes it&#8217;s daily print newspaper as a daily &#8220;e-paper&#8221; in PDF, [...]]]></description>
			<content:encoded><![CDATA[<p><em><strong>New models, new formats</strong></em></p>
<div id="attachment_965" class="wp-caption alignright" style="width: 298px"><img class="size-large wp-image-965   " title="paroolmobile2" src="http://commonplace.net/wp-content/uploads/2010/01/paroolmobile2-768x1024.jpg" alt="" width="288" height="384" /><p class="wp-caption-text">© Lukas Koster</p></div>
<p>Recently I have been experimenting a bit with reading newspapers on my mobile phone (a <a id="so-5" title="G1 android device" href="http://www.htc.com/www/product/g1/overview.html" target="_blank">G1 android device</a>), or maybe I should say &#8220;reading <em>news</em> on my mobile&#8221;. I looked at two Dutch newspapers that adopt two completely different approaches.</p>
<p>&#8220;<em><a id="bcqi" title="NRC Handelsblad" href="http://nrc.nl/" target="_blank">NRC Handelsblad</a></em>&#8221; publishes it&#8217;s daily print newspaper as a daily &#8220;<a id="tid0" title="e-paper" href="http://epaper.nrc.nl/" target="_blank">e-paper</a>&#8221; in <a id="nip4" title="PDF" href="http://en.wikipedia.org/wiki/Pdf" target="_blank">PDF</a>, <a id="nhch" title="Mobi" href="http://en.wikipedia.org/wiki/Mobipocket" target="_blank">Mobi</a> and <a id="hwpm" title="ePub" href="http://en.wikipedia.org/wiki/EPUB" target="_blank">ePub</a> format, to be downloaded every day to the platform of your choice. In order to read the e-paper you need a physical device plus software (mobile phone, PC, e-reader, etc.) that can handle one of the available formats. On my G1 I use the <a id="krpd" title="Aldiko e-reader app for android" href="http://www.aldiko.com/" target="_blank">Aldiko e-reader app for android</a> with the ePub format. The e-paper is treated as an e-book file, with touch screen operation for browsing tables of content, paging through chapters or articles, zooming, etc. Access to the e-paper files is on a subscription basis.</p>
<p>&#8220;<em><a id="twe." title="Het Parool" href="http://www.parool.nl/" target="_blank">Het Parool</a></em>&#8221; on the other hand offers a free <em>app </em>to be downloaded from the Android Market that serves as a front end to all recent articles available from their news server on the web. There is no need for a daily download of a file in a specific format that has to be supported by the physical platform of your choice. There is also an iPhone app. The app and access to the news articles are free of charge.</p>
<div id="attachment_958" class="wp-caption alignleft" style="width: 304px"><img class="size-large wp-image-958  " title="paroolmobile" src="http://commonplace.net/wp-content/uploads/2010/01/paroolmobile-768x1024.jpg" alt="" width="294" height="389" /><p class="wp-caption-text">© Lukas Koster</p></div>
<p>Besides the difference in access (free vs paid), the most important contrast between these two mobile newspapers is the form in which the printed news is transformed to the digital and mobile environment. &#8220;<em>NRC Handelsblad</em>&#8221; takes the physical form the newspaper has had since it&#8217;s origin in the 17the century, dictated by physical, logistical and economical conditions, and transforms this 1 to 1 to the digital world: the e-paper still is one big monolithic bundle of articles that can&#8217;t be retrieved individually, completely ignoring the fact that the centuries old limitations don&#8217;t apply anymore. It is basically exactly the same as most manifestations of e-books.</p>
<p>&#8220;<em>Het Parool</em>&#8221; does completely the opposite. It treats individual news articles as units of content in their own right, &#8220;stories&#8221; as I call them in my post &#8220;<a id="p_89" title="Is an e-book a book?" href="../2009/11/is-an-e-book-a-book/">Is an e-book a book?</a>&#8220;. And this is how it should be in the digital mobile world. This is similar to the way that e-journals offer direct access to individual articles already.<br />
Readers should be able to apply their own selection of &#8220;stories&#8221; to read in a specific, virtual, on the fly bundle, using the front end of their choice.<br />
However, the &#8220;<em>Parool</em>&#8221; app functions as a predefined filter: it presents the reader with the most recent (24 hour max) articles from it&#8217;s own source of news. Of course this is fine as long as the readers choose to use the &#8220;<em>Parool</em>&#8221; app, but they may also choose to read news stories from different sources. This could be achieved with a different mobile, PC or web application that gathers content from a variety of sources.</p>
<p>Another drawback of the &#8216;<em>Parool</em>&#8221; implementation is that it does not offer a &#8220;save&#8221; option. There is no way to read old articles, other than to go to the official newspaper website, either through mobile browsing or by using a PC web browser. The &#8220;<em>NRC Handelsblad</em>&#8221; implementation on the other hand does offer this option, because it is based on a download model to begin with.</p>
<p>This brings me to the matter of mobile web browsing. Reading and navigating a web page designed for the PC screen on a mobile device is annoying at least, not to mention the time it takes to load complete web pages into the mobile browser. Common practice is to create a simplified version of full fledged web pages for mobile use only. Of course this means doubling the website maintenance effort.<br />
An alternative could be the adoption of <a id="fq6v" title="HTML5" href="http://html5.org/" target="_blank">HTML 5</a> and <a id="cpt5" title="CSS3" href="http://www.css3.info/about/" target="_blank">CSS 3</a>, as was stated at a <a id="v6t." title="Top Tech Trends Panel session at ALA Midwinter 2010" href="http://www.libraryjournal.com/article/CA6715484.html" target="_blank">Top Tech Trends Panel session at ALA Midwinter 2010</a>, where a university library official said: &#8220;<em>2010 is the year that the app dies</em>&#8220;, because &#8220;<em>developers can leverage a single well-designed service to serve both browser-based and mobile users</em>&#8220;. But this view completely misses the point: &#8220;<em>Apps are not about technology, they are about a business model</em>&#8221; as <a id="xndz" title="Owen Stephens" href="http://twitter.com/ostephens" target="_blank">Owen Stephens</a> <a id="ak26" title="pointed out" href="http://twitter.com/ostephens/status/7982515485" target="_blank">pointed out</a>. This business model implies the separation of content and presentation in a much broader sense then that of database back end &#8211; website front end only. This was an innovative concept until a couple of years ago.</p>
<p>As I briefly described above, we need units of content being accessible by all kinds of platforms and applications through universal APIs. This model not only applies to <em>reading </em>texts, but also to <em>finding </em>these texts. Especially libraries should be aware of that.</p>
<p>Although the ALA Top Trends Panel stated that libraries&#8217; focus should be on content rather than hardware, they did not touch upon the changing concept of what books are in the e-book era, as again <a id="dug5" title="Owen Stephens pointed out" href="http://twitter.com/ostephens/status/7982599207" target="_blank">Owen Stephens pointed out</a>. New models and formats will have all kinds of consequences for the way we handle information. For instance: pages. A PDF file, which is a 1 to 1 translation of the print unit to a digital unit, as I explained, still has fixed pages and page numbers. An ePub file however has a flexible format that allows &#8220;pages&#8221; to be automatically adapted to the size of the device&#8217;s screen (thanks to <a id="c6oh" title="@rsnijders" href="http://twitter.com/rsnijders" target="_blank">@rsnijders</a> and <a id="sk:s" title="@Wowter" href="http://twitter.com/wowter" target="_blank">@Wowter</a> for discussing this). There are no fixed pages or page numbers anymore. HTML pages containing full articles don&#8217;t have page numbers either, by the way. This will change the way we refer to texts online, without page numbers, which is one of the subject of the <a id="f3s1" title="Telstar project" href="http://www.open.ac.uk/blogs/telstar/" target="_blank">Telstar project</a>, again with Owen Stephens involved (watch that guy).</p>
<p>The flexible page is another reason to have a critical look at <a id="d83:" title="MARC" href="http://www.loc.gov/marc/" target="_blank">MARC</a>. There is no use anymore for tags like <a id="nc7." title="300" href="http://www.loc.gov/marc/bibliographic/bd300.html" target="_blank">300</a>,a &#8220;<em>Extent (Number of physical pages, etc.</em>)&#8221;, <a id="hmmh" title="773" href="http://www.loc.gov/marc/bibliographic/bd773.html" target="_blank">773</a>,g (&#8220;<em>Vol. 2, no. 2 (Feb.         1976), p. 195-230</em>&#8220;).</p>
<p>The inevitable conclusion of all this is that all innovative developments on the end user interface presentation front end need to be supported by corresponding developments on the content back end, and vice versa.</p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fcommonplace.net%2F2010%2F01%2Fmobile-reading%2F&amp;linkname=Mobile%20reading"><img src="http://commonplace.net/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://commonplace.net/2010/01/mobile-reading/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Old library, new library</title>
		<link>http://commonplace.net/2009/12/old-library-new-library/</link>
		<comments>http://commonplace.net/2009/12/old-library-new-library/#comments</comments>
		<pubDate>Tue, 29 Dec 2009 14:12:39 +0000</pubDate>
		<dc:creator>Lukas Koster</dc:creator>
				<category><![CDATA[Library2.0]]></category>
		<category><![CDATA[catalog]]></category>
		<category><![CDATA[collection]]></category>
		<category><![CDATA[haarlem]]></category>
		<category><![CDATA[social networking]]></category>
		<category><![CDATA[teylersmuseum]]></category>

		<guid isPermaLink="false">http://commonplace.net/?p=907</guid>
		<description><![CDATA[On Sunday December 27, 2009 I was in the opportunity to visit the, otherwise closed, library of The Netherlands&#8217; oldest museum Teylers museum in my home town Haarlem, together with a small group of Dutch library twitter people. We were very kindly shown around by librarian Marijn van Hoorn, who explained to us the library&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_916" class="wp-caption alignright" style="width: 470px"><a href="http://www.flickr.com/photos/88526197@N00/4218544261/" target="_blank"><img class="size-full wp-image-916 " title="Teylers museum library" src="http://commonplace.net/wp-content/uploads/2009/12/teylerslib.jpg" alt="" width="460" height="225" /></a><p class="wp-caption-text">Teylers museum library © Dymphie</p></div>
<p>On Sunday December 27, 2009 I was in the opportunity to visit the, otherwise closed, library of The Netherlands&#8217; oldest museum <a id="zb75" title="Teylers museum" href="http://www.teylersmuseum.eu/" target="_blank">Teylers museum</a> in my home town Haarlem, together with a small group of Dutch library twitter people. We were very kindly shown around by librarian <a id="cv9y" title="Marijn van Hoorn" href="http://teylersmuseum.ning.com/profile/marijnvanhoorn" target="_blank">Marijn van Hoorn</a>, who explained to us the library&#8217;s history and collection.</p>
<p>Now I&#8217;m not going to say something about the pleasant real life consequences of getting to know people in the virtual world (that has been done by <a id="vtat" title="@PeterMEvers" href="http://bibliotheek20.ning.com/profiles/blogs/koningin-en-de-bibliotheek-van" target="_blank">@PeterMEvers</a> already, in Dutch), or about the guided tour (already described very well by <a id="amyp" title="@underdutchskies" href="http://underdutchskies.com/?p=945" target="_blank">@underdutchskies</a> in English and by <a id="r1dn" title="@festinaatje" href="http://theeheeftzoveelcharme.blogspot.com/2009/12/bezoek-aan-het-teylersmuseum.html" target="_blank">@festinaatje</a> and <a id="kswe" title="@ecobibl" href="http://ecobibl.blogspot.com/2009/12/bezoek-teylers-museum-bibliotheek.html" target="_blank">@ecobibl</a> in Dutch). Also, none of the photos I made with my G1 phone are presentable; but you can have a look at the photos made by <a id="d8:-" title="@Dymphie" href="http://www.flickr.com/photos/88526197@N00/tags/teyler/" target="_blank">@Dymphie</a>, <a id="gwwo" title="@underdutchskies" href="http://www.flickr.com/photos/underdutchskies/sets/72157623080145880/" target="_blank">@underdutchskies</a> and <a id="wxk9" title="@wbk500" href="http://picasaweb.google.nl/wham.vd.brink/TeylermuseumTweetmeet2009#" target="_blank">@wbk500</a>).</p>
<p>Instead, I will try to make a comparison between the old library&#8217;s course of life and the developments that modern libraries are going through, because I see some parallels there.</p>
<p>The museum was built in 1784 with money from the legacy of the wealthy banker and merchant Pieter Teyler van der Hulst, to preserve his collections and advance the arts and sciences. The museum&#8217;s library was established in 1826 to house a separate collection of books and journals in the field of natural history (botany, zoology, paleontology and geology).</p>
<p>One of the objectives for the library was to have a complete collection of all journals in the area of natural history. In the beginning the library was only accessible by invitation, and the honoured guests were welcomed and assisted by the &#8220;caretaker&#8221; or &#8220;landlord&#8221; of the museum.<br />
By the middle of the 19th century the library opened up to a more general public, that is to say teaching and research staff members of the emerging universities.<br />
But from 1870 the importance of the Teylers library for university staff declined drastically, because the universities in The Netherlands started to organise academic libraries of their own. So the library closed its doors for regular visitors. The collection continued to be maintained and expanded until 1987, when it was no longer realistic to pursue completeness.<br />
During the 1970&#8217;s the privately funded museum and library faced the threat of closing down because of the cost of preserving the historical buildings and collections. In the written library catalog (created over time by a large number of volunteers and employees) all items were annotated with an estimated value in case of forced sale of the collection.</p>
<div id="attachment_933" class="wp-caption alignleft" style="width: 230px"><a href="http://www.flickr.com/photos/88526197@N00/4218548045/" target="_blank"><img class="size-full wp-image-933 " title="Teylers library catalog" src="http://commonplace.net/wp-content/uploads/2009/12/teylerscat.jpg" alt="" width="220" height="344" /></a><p class="wp-caption-text">Teylers library catalog © Dymphie</p></div>
<p>Fortunately the Dutch state decided to subsidise the historically and culturally valuable museum, and now Teylers is a very popular place, with a new wing with a large hall for temporary exhibitions, an educational section and a cafe.<br />
The museum library is only open for visitors on request and on special occasions. The collection is not expanded anymore, but it is a very complete and valuable historical natural history collection, which is, among other things, used to organise temporary thematic exhibitions in the museum. Besides the natural history items there are also old maps and atlases and travel journals, like the <a id="dj78" title="James Cook journals by Sydney Parkinson" href="http://www.teylersmuseum.eu/index.php?item=69&amp;i_id=4398&amp;kw=cook" target="_blank">James Cook journals by Sydney Parkinson</a> that <a id="rw8v" title="@jaapvandegeer" href="http://twitter.com/jaapvandegeer" target="_blank">@jaapvandegeer</a> drew my attention to.<br />
The museum and the library are also looking to the future. Both museum objects and library items are being <a href="http://instrumentenzaal.teylersmuseum.nl/index.php?lang=english" target="_blank">digitised</a>, there is a European project for creating a website on ornithology that uses the library&#8217;s birds images, there is a <a href="http://teylersuniversum.nl/" target="_blank">new thematic website</a> that combines documents, images, metadata from the museum, the library and external sources, the library catalog has been migrated to an <a id="kasd" title="Adlib" href="http://www.adlibsoft.com/" target="_blank">Adlib</a> system, and there is a <a id="xqf-" title="Ning social network" href="http://teylersmuseum.ning.com/" target="_blank">Ning social network</a>.</p>
<p>So, what are the parallels with modern libraries? First of all, it is clear that the influence of external developments on libraries is not something that is limited to the modern digital web age of Google. Just like Teylers library, modern public, academic and special libraries were at first targeted at a limited, well defined audience, and only accessible on location on specific times, after which their target audience and accessibility widened substantially. Catalogs and varying parts of the collection are available online to a global audience.<br />
The external influence from competing university libraries is currently mirrored by the world wide web itself, with Google as one of the main external threats. I have written about this in my post &#8220;<a id="qfzo" title="No future for libraries?" href="../2009/05/no-future-for-libraries/">No future for libraries?</a>&#8220;.<br />
The two important issues here are: the effects on modern library collections and audience. Teylers library decided to stop building its own collection, but they keep using it in a number of ways: temporary physical thematic exhibitions, but also in new digital &#8220;mashed up&#8221; ways. This might be a good example for modern libraries to follow: make use of modern technologies to reuse existing collections to create virtual online thematic aggregations of data, texts, images, etc. See also my post &#8220;<a id="pdv1" title="Collection 2.0" href="../2009/02/collection-20/">Collection 2.0</a>&#8220;.<br />
As for modern libraries&#8217; response to changing audiences: proceeding with new ways of using their collections will draw new customers anyway. But it is equally important to find other ways to &#8220;go where your users are&#8221;, like being on social networks like the Teylers Ning site. One of the most important moves in the near future will be mobile presence.</p>
<p>Teylers library shows us that there may be a new life for old libraries.</p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fcommonplace.net%2F2009%2F12%2Fold-library-new-library%2F&amp;linkname=Old%20library%2C%20new%20library"><img src="http://commonplace.net/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://commonplace.net/2009/12/old-library-new-library/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Is an e-book a book?</title>
		<link>http://commonplace.net/2009/11/is-an-e-book-a-book/</link>
		<comments>http://commonplace.net/2009/11/is-an-e-book-a-book/#comments</comments>
		<pubDate>Thu, 19 Nov 2009 14:24:13 +0000</pubDate>
		<dc:creator>Lukas Koster</dc:creator>
				<category><![CDATA[Library2.0]]></category>
		<category><![CDATA[books]]></category>
		<category><![CDATA[doi]]></category>
		<category><![CDATA[e-books]]></category>
		<category><![CDATA[frbr]]></category>
		<category><![CDATA[marc]]></category>

		<guid isPermaLink="false">http://commonplace.net/?p=863</guid>
		<description><![CDATA[About cataloging physical items or units of content
2009 is the year of the e-book, or perhaps better: of the e-book reader. This is an important distinction that I will explain below. E-books are becoming more popular because of the increasing availability of various cheap e-book readers.
But what is an e-book? Is it the same as [...]]]></description>
			<content:encoded><![CDATA[<div><em><strong>About cataloging physical items or units of content</strong></em></p>
<div id="attachment_870" class="wp-caption alignleft" style="width: 310px"><a href="http://www.flickr.com/photos/hjvanderklis/2141212272/" target="_blank"><img class="size-medium wp-image-870  " title="Book scroll" src="http://commonplace.net/wp-content/uploads/2009/11/bookscroll-300x199.jpg" alt="Book scroll" width="300" height="199" /></a><p class="wp-caption-text">Book scroll © Henk-Jan van der Klis</p></div>
<p>2009 is the year of the e-book, or perhaps better: of the e-book reader. This is an important distinction that I will explain below. E-books are becoming more popular because of the increasing availability of various cheap e-book readers.<br />
But what is an e-book? Is it the same as a book? Some people say yes, some people say no. This question shouldn&#8217;t be so hard to answer, should it? We just have to define what a book is first. So, what is a book?</p>
<p>When people think of a book, they picture something like the archetypal book: printed, <img class="alignright size-full wp-image-876" title="book-icon" src="http://commonplace.net/wp-content/uploads/2009/11/book-icon.gif" alt="book-icon" width="150" height="150" />medium sized, hardcover, no illustrations on the front. The thing that you can actually hold in your hands and read.<br />
But if they say: &#8220;This book was written by that author&#8221;, they don&#8217;t think that the author actually wrote that particular item they are holding in their hands. Now we already have two different meanings of the concept &#8220;book&#8221;: one is a tangible object, the other is the content that is made available in this tangible object by means of printed text.</p>
<p>Besides these <em><strong>conceptual levels</strong></em>, there are more ways by which books can be described, as shown by this incomplete list of examples:</p>
<p><strong><em>Physical form</em></strong>: Historically there have been clay tablets, inscribed stones, handwritten scrolls, handwritten bound pages, printed pages. We also know different formats targeted at specific uses or audiences: audio books, braille books, pop up books.</p>
<div id="attachment_886" class="wp-caption alignright" style="width: 310px"><a href="http://www.flickr.com/photos/doegox/988934975/" target="_blank"><img class="size-medium wp-image-886    " title="popupbook" src="http://commonplace.net/wp-content/uploads/2009/11/popupbook-300x200.jpg" alt="Popup book © doegox" width="300" height="200" /></a><p class="wp-caption-text">Popup book © doegox</p></div>
<p><em><strong>Content</strong></em>: A book can contain text only, or images only (for instance a children&#8217;s picture book, or a book of photographs), or a combination of both.</p>
<p><em><strong>Units</strong></em>: A book can consist of one &#8220;story&#8221; ( for instance a novel), optionally subdivided in chapters, or be made up of several stories, or articles (like a text book about a certain subject). Chapters and stories can be written by the same or by several authors. A book can also contain two or more other books by the same author (&#8220;collected works&#8221;), etc.</p>
<p><em><strong>Content type</strong></em>: A book can contain fiction, aimed at entertaining readers. Books can be purely administrative, like accounting books. There are religious books to be used in religious ceremonies (sometimes these are referred to as &#8220;<em>THE book</em>&#8220;). Some books are for studying and learning (&#8220;text books&#8221;, which may also contain images by the way). There are scientific books and instructional books (travel guides, cook books, manuals).</p>
<p>First, we need see how all this fits together before we can answer the question &#8220;<em>Is an e-book a book?</em>&#8221; or more precise: &#8220;<em>In which sense is an e-book a book?</em>&#8220;. Fortunately there is already a conceptual model for bibliographic entities and the relationships between them that describes this: <a id="kh5o" title="FRBR" href="http://en.wikipedia.org/wiki/FRBR" target="_blank">FRBR</a> (<em>Functional Requirements for Bibliographic Records</em>), published by <a id="g74l" title="IFLA" href="http://www.ifla.org/" target="_blank">IFLA</a>. The <a id="nthc" title="IFLA Final Report" href="http://www.ifla.org/files/cataloguing/frbr/frbr_2008.pdf" target="_blank">IFLA Final Report</a> (2009 version) says it all, but there are also a couple of short summaries: <a id="xhzy" title="Barbara Tillet's (LoC) &quot;What is FRBR?&quot;" href="http://www.loc.gov/cds/downloads/FRBR.PDF" target="_blank">Barbara Tillet&#8217;s (LoC) &#8220;What is FRBR?&#8221;</a>, <a id="s_g8" title="Jenn Riley's &quot;FRBR&quot; blog post" href="http://techessence.info/frbr" target="_blank">Jenn Riley&#8217;s &#8220;FRBR&#8221; blog post</a>, and there is <a id="f:-s" title="William Denton" href="http://twitter.com/wdenton" target="_blank">William Denton</a>&#8217;s <a id="fv88" title="FRBR Blog" href="http://www.frbr.org/" target="_blank">FRBR Blog</a> for more information.<br />
The FRBR model is targeted at libraries, maybe even at publishers and booksellers too.</p>
<p>I will not go into the FRBR &#8220;<em>Group 2</em>&#8221; (persons and corporate bodies) and &#8220;<em>Group 3</em>&#8221; (subjects) entities here, but focus on the &#8220;<em>Group 1</em>&#8221; entities.</p>
<p>The FRBR &#8220;<em>Group 1 entities</em>&#8221; consist of <em>Work</em>, <em>Expression</em>, <em>Manifestation </em>and <em>Item </em>(also referred to as <em><strong>WEMI</strong></em>). FRBR entities not only apply to books or textual works, but also to movies, theater plays, music, etc.<em><strong> </strong></em></div>
<div>
<ul>
<li><em><strong>Work </strong>- a distinct intellectual or artistic creation</em></li>
<li><em><strong>Expression </strong>- the intellectual or artistic realization of a work</em></li>
<li><em><strong>Manifestation </strong>- the physical embodiment of an expression of a work</em></li>
<li><em><strong>Item </strong>- a single exemplar (or copy) of a manifestation</em></li>
</ul>
<div id="attachment_879" class="wp-caption alignleft" style="width: 624px"><a href="http://www.loc.gov/cds/downloads/FRBR.PDF" target="_blank"><img class="size-large wp-image-879 " title="frbrerd" src="http://commonplace.net/wp-content/uploads/2009/11/frbrerd-1024x770.jpg" alt="frbrerd" width="614" height="462" /></a><p class="wp-caption-text">FRBR Model © Library of Congress/Barbara Tillett</p></div>
</div>
<div>There are hierarchical relationships between the entities:</div>
<div>
<ul>
<li>A work (for instance a book) can have (&#8220;<em>is realized through</em>&#8220;) one or more expressions (for instance the original English text and the Dutch translation).</li>
<li>Each expression can have (&#8220;<em>is embodied in</em>&#8220;) one or more manifestations (for instance a specific edition with an ISBN, or one of more works/expressions in a &#8220;collected works&#8221; edition).</li>
<li>Each manifestation has (&#8220;<em>is exemplified by</em>&#8220;) one or more items, the things you can actually hold in your hands.</li>
<li>A manifestation can also consist of several expressions, as in the &#8220;collected works&#8221; example.</li>
</ul>
<p>Besides these hierarchical relationships between different entity types there are also recursive relationships between entities of the same type: hierarchical and other. Some examples:</p>
<ul>
<li>A work is part of another work (hierarchical), as in a series like Harry Potter</li>
<li>A work is an adaptation of another work</li>
<li>An expression is a sequel to another expression</li>
<li>A manifestation is a facsimile of another manifestation</li>
</ul>
<p>So far so good. The FRBR conceptual model describes (or aims to describe) real world things and relationships on an abstract level. The model can be implemented in actual systems (both computerised and manual!). In these systems you are free to refer to the conceptual model entities (&#8220;work&#8221;, &#8220;expression&#8221;, &#8220;manifestation&#8221;, &#8220;item&#8221;) by names that are actually used in daily life. This is what <a id="ivkl" title="Rob Styles" href="http://twitter.com/mmmmmrob" target="_blank">Rob Styles</a> is trying to do when he talks about &#8220;<em>stories</em>&#8221; and &#8220;<em>editions</em>&#8221; in his recent blog post &#8220;<a id="e24w" title="Bringing FRBR Down to Earth…" href="http://dynamicorange.com/2009/11/11/bringing-frbr-down-to-earth/" target="_blank">Bringing FRBR Down to Earth…</a>&#8221; I think. I will define the &#8220;story&#8221; concept in a  different way below.</p>
<p>Until now, catalogers and library systems have been targeted at describing the thing they have in their hands (or better the items that make up the library&#8217;s collection).  In FRBR terms this means that catalogs describe manifestations and items, not works and expressions (or implicitly at best). In short, a bottom up approach. This is understandable, because in the past there was nothing else to go by than the explicit manifestation information available on the physical item (author, title, ISBN, edition, publisher, etc.) .<br />
Of course, <a id="pc70" title="MARC21" href="http://www.loc.gov/marc/" target="_blank">MARC21</a> provides some options to describe relationships with expressions and works and other manifestations, like the <a id="cq8g" title="250 - Edition Statement" href="http://www.loc.gov/marc/bibliographic/bd250.html" target="_blank">250 &#8211; Edition Statement</a>, the <a id="mpsr" title="490 - Series Statement" href="http://www.loc.gov/marc/bibliographic/bd490.html" target="_blank">490 &#8211; Series Statement</a> and the <a id="odki" title="76X-78X - Linking Entries-General Information" href="http://www.loc.gov/marc/bibliographic/bd76x78x.html" target="_blank">76X-78X &#8211; Linking Entries-General Information</a>. But these fields can only be used if the information is known to the cataloger.<br />
Also, in traditional catalogs, works that are distinct expressions in one manifestation (like articles, chapters, stories, poems) are not described separately, because of the same reason: you only catalog the item you have before you. In the ideal world, or better in the new digital world, the unit to be cataloged or described should always be the work, which we may call &#8220;story&#8221;. In other words: we should catalog units of content (&#8220;stories&#8221;) instead of, or supplementary to, physical items.<br />
Current library practice is that we catalog books and journals in the catalog and offer article descriptions through subscribed article metadata databases separately.</p>
<p>So, back to the e-book. Where does that fit in? An e-book could be considered nothing more than a manifestation and/or an item belonging to a certain work/expression, because an e-book can be everything a printed book is. As such it is equivalent to a braille or audio book. Some libraries treat e-books as something different, as works/expressions as such. They catalog e-books separately, just like all other items/manifestations are treated as separate works. There are even separate e-book overviews.</p>
<div id="attachment_901" class="wp-caption alignleft" style="width: 310px"><a href="http://www.flickr.com/photos/alfie/3665788260/" target="_blank"><img class="size-medium wp-image-901" title="ebookreader" src="http://commonplace.net/wp-content/uploads/2009/11/ebookreader-300x225.jpg" alt="E-book reader © alfi" width="300" height="225" /></a><p class="wp-caption-text">E-book reader © alfi</p></div>
<p>But there is more to it than that. The big difference with books until now is that an e-book is not inseparably linked to the physical carrier. A printed book can only be read if the reader has a physical copy (a FRBR <em>item</em>) consisting of bound paper pages containing the text printed on them with ink. The same applies to handwritten texts, scrolls, clay tablets, etc.<br />
Even more so, the physical form, together with economical conditions and possibilities for distribution, often determines the actual manifestation of a book and a journal. A book (or volume) can only contain a certain number of pages in order to be manageable. There is also a cost consideration in the size and distribution of the items.</p>
<p>What we call an e-book is actually only a digital, abstract manifestation of a work/expression. In order to be able to read it you have to download it in a specific format (PDF, epub, etc.) onto a physical carrier (USB-stick, computer disk, etc.), and then you need a physical reading device with dedicated software (dedicated e-book readers like Kindle, a computer, a mobile phone, etc.).<br />
Libraries do not have e-books as items, only as manifestations. These e-book manifestations can be available on an online server somewhere in whatever form, and can be made into an item on-the-fly, using a specific format on-the-fly, choosing a physical carrier on-the-fly. What&#8217;s more, the content of e-books can also be selected out of several works/expressions on-the-fly, this way creating manifestations or even expressions on demand.</p>
<p>Now, is the FRBR conceptual model suited for describing e-books? If we treat e-books as manifestations without items (like we handle e-journals in our catalogs), how do we proceed? The FRBR Manifestation item among others has these attributes:</p></div>
<ul>
<li>form of carrier</li>
<li>extent of the carrier</li>
<li>physical medium</li>
<li>system requirements (electronic resource)</li>
<li>file characteristics (electronic resource)</li>
<li>mode of access (remote access electronic resource)</li>
<li>access address (remote access electronic resource)</li>
</ul>
<p>But we have just seen that in the case of e-books these are features of the items generated on-the-fly, which are not known before. Does this mean that we have to describe as manifestations all possible physical forms that one e-book can take? This would also mean that an e-book as such should be described on the level of a FRBR Expression. This may be correct in some cases (the creation of aggregated content on-the-fly), but not in all: where an e-book is similar to manifestations like braille, audio book, etc.</p>
<p>Does FRBR need an extra level? I am not sure. Let&#8217;s look briefly at how e-journals are handled. As far as I can see, journal and e-journal issues are described as separate manifestations of journals and e-journals (with a &#8220;part-of&#8221; relationship to the higher level). These issue manifestations are treated as aggregates that contain articles, that are also described as manifestations with a &#8220;part-of&#8221; relationship to the issue. In <em>MARC21 </em>this handled by the <a title="773 Host Item Entry tag" href="http://www.loc.gov/marc/bibliographic/bd773.html" target="_blank">773 Host Item Entry tag</a>.<br />
I am not sure if and how different physical formats (PDF, HTML) for articles in e-journals are handled. The obvious difference with e-books is that the described unit is the article (or &#8220;story&#8221; as definition of unit of content), which can be downloaded as separate items. The e-journal articles are ideally also identified by unique identifiers (<a id="ihde" title="DOI" href="http://www.doi.org/" target="_blank">DOI</a>&#8217;s).</p>
<p>What does this mean for e-books? I think we can treat an e-book as either an expression or a manifestation, depending on the nature of the specific e-book in question. For the e-book manifestation we would only need to register the <em>mode of acces</em>, <em>access address</em> and <em>manifestation identifier</em> attributes, preferably in the form of a <a id="g565" title="URI" href="http://en.wikipedia.org/wiki/Uniform_Resource_Identifier" target="_blank">URI</a>.<br />
I also think we should use the possibilities of the FRBR model to start describing, cataloging and identifying the &#8220;stories&#8221; (chapters, articles, etc.) that make up books and e-books separately, as units of content in their own right. People are interested in the content, the &#8220;stories&#8221;, not the physical items or artificial digital aggregate units like e-books or e-journals.<br />
In this sense, the &#8220;e-journal&#8221; is an archaic concept, where the limitations of the physical journal are translated as such to the digital world. There is no real need to bundle articles in electronic form into one electronic issue of an e-journal that is published at regular intervals in time. Electronic articles can be published individually immediately after peer review and approval. Published articles can be aggregated in one nor more virtual online serials.</p>
<p>Like ISBN&#8217;s and ISSN&#8217;s we need an identifier for the units of content other than journal articles. As a matter of fact, there already is one, the DOI:<br />
&#8220;<em>A DOI name can be used to identify any resource involved in an intellectual property transaction. Intellectual property includes both physical and digital manifestations, performances and abstract works. An entity can be identified at any arbitrary level of granularity</em>.&#8221; (see <a id="edau" title="http://www.doi.org/faq.html#2" href="http://www.doi.org/faq.html#2" target="_blank">http://www.doi.org/faq.html#2</a>). Thanks to <a id="p0p0" title="Owen Stephens" href="http://twitter.com/ostephens" target="_blank">Owen Stephens</a> for pointing this out to me in a twitter discussion with <a id="ncls" title="Inga Overkamp" href="http://twitter.com/grumpf" target="_blank">Inga Overkamp</a>.</p>
<p>I may be wrong about all this. I am open for comments and suggestions.</p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fcommonplace.net%2F2009%2F11%2Fis-an-e-book-a-book%2F&amp;linkname=Is%20an%20e-book%20a%20book%3F"><img src="http://commonplace.net/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://commonplace.net/2009/11/is-an-e-book-a-book/feed/</wfw:commentRss>
		<slash:comments>28</slash:comments>
		</item>
		<item>
		<title>Just in time or just in case?</title>
		<link>http://commonplace.net/2009/10/just-in-time-or-just-in-case/</link>
		<comments>http://commonplace.net/2009/10/just-in-time-or-just-in-case/#comments</comments>
		<pubDate>Fri, 16 Oct 2009 10:56:32 +0000</pubDate>
		<dc:creator>Lukas Koster</dc:creator>
				<category><![CDATA[Library2.0]]></category>
		<category><![CDATA[catalog]]></category>
		<category><![CDATA[discovery tools]]></category>
		<category><![CDATA[exlibris]]></category>
		<category><![CDATA[federated search]]></category>
		<category><![CDATA[harvesting & indexing]]></category>
		<category><![CDATA[innovation]]></category>
		<category><![CDATA[integration]]></category>
		<category><![CDATA[library systems]]></category>
		<category><![CDATA[metasearch]]></category>
		<category><![CDATA[next generation]]></category>

		<guid isPermaLink="false">http://commonplace.net/?p=825</guid>
		<description><![CDATA[Metasearch vs. harvesting &#38; indexing
 The other day I gave a presentation for the Assembly of members of the local Amsterdam Libraries Association &#8220;Adamnet&#8220;, about the Amsterdam Digital Library search portal that we host at the Library of the University of Amsterdam. This portal is built with our MetaLib metasearch tool and offers simultaneous access [...]]]></description>
			<content:encoded><![CDATA[<p><em><strong>Metasearch vs. harvesting &amp; indexing</strong></em></p>
<p><em> </em>The other day I gave a presentation for the Assembly of members of the local Amsterdam Libraries Association &#8220;<a id="zg9l" title="Adamnet" href="http://www.adamnet.nl/" target="_blank">Adamnet</a>&#8220;, about the <a id="rw:1" title="Amsterdam Digital Library search portal" href="http://digitaal.uba.uva.nl/V/?func=change-portal-1&amp;portal-name=DBA" target="_blank">Amsterdam Digital Library search portal</a> that we host at the <a id="k:70" title="Library of the University of Amsterdam" href="http://www.uba.uva.nl/" target="_blank">Library of the University of Amsterdam</a>. This portal is built with our <a id="z913" title="MetaLib" href="http://www.exlibrisgroup.com/category/MetaLibOverview" target="_blank">MetaLib</a> metasearch tool and offers simultaneous access to, at the moment, 20 local library catalogues.</p>
<p><object style="margin:0px" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="425" height="355" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=metasearchindexing-091016052945-phpapp01&amp;rel=0&amp;stripped_title=metasearch-vs-harvesting-andindexing" /><param name="allowfullscreen" value="true" /><embed style="margin:0px" type="application/x-shockwave-flash" width="425" height="355" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=metasearchindexing-091016052945-phpapp01&amp;rel=0&amp;stripped_title=metasearch-vs-harvesting-andindexing" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p>A large part of this presentation was dedicated to all possible (and very real) technical bottlenecks of this set-up, with the objective of improving coordination and communication between the remote system administrators at the participating libraries and the central portal administration. All MetaLib database connectors/configurations are &#8220;home-made&#8221;, and the portal highly depends on the availability of the remote cataloging systems.</p>
<p>I took the opportunity to explain to my audience also the &#8220;issues&#8221; inherent in the concept of <em>metasearch </em>(or &#8220;<em>federated search</em>&#8220;, &#8220;<em>distributed search</em>&#8220;, etc.), and compare that to the <em>harvesting &amp; indexing</em> scenario.</p>
<p>Because it was not the first (nor last) time that I had to explain the peculiarities of metasearch, I decided to take the <em>Metasearch vs. Harvesting &amp; Indexing</em> part of the presentation and extend it to a dedicated slideshow. You can see it here, and you are free to use it. Examples/screenshots are taken from our MetaLib Amsterdam Digital Library portal. But everything said applies to other metasearch tools as well, like <a id="z:7h" title="Webfeat" href="http://www.webfeat.org/" target="_blank">Webfeat</a>, <a id="tr9x" title="Muse Global" href="http://www.museglobal.com/solutions/technology.html#integrationEngine" target="_blank">Muse Global</a>, <a id="k9_m" title="360-Search" href="http://www.serialssolutions.com/360-search/" target="_blank">360-Search</a>, etc.</p>
<p>The slideshow is meant to be an objective comparison of the two search concepts. I am not saying that Metasearch is bad, and H&amp;I is good, that would be too easy. Some five years ago Metasearch was the best we had, it was a tremendous progress beyond searching numerous individual databases separately. Since then we have seen the emergence of harvesting &amp; indexing tools, combined with &#8220;uniform discovery interfaces&#8221;, such as <a id="wojo" title="Aquabrowser" href="http://www.aquabrowser.com/" target="_blank">Aquabrowser</a>, <a id="zavy" title="Primo" href="http://www.exlibrisgroup.com/category/PrimoOverview" target="_blank">Primo</a>, <a id="lgbd" title="Encore" href="http://encoreforlibraries.com/" target="_blank">Encore</a>, and the OpenSource tools <a id="h0go" title="VuFind" href="http://www.vufind.org/" target="_blank">VuFind</a>, <a id="dye0" title="SUMMA" href="http://www.statsbiblioteket.dk/summa/" target="_blank">SUMMA</a>, <a id="clvi" title="Meresco" href="http://meresco.org/" target="_blank">Meresco</a>, to name a few.</p>
<p>Anyway,  we can compare the main difference between Metasearch and H&amp;I to the concepts &#8220;<em>Just in time</em>&#8221; and &#8220;<em>Just in case</em>&#8220;, used in logistics and inventory management.</p>
<p>With Metasearch, records are fetched on request (<em>Just in time</em>), with the risk of running into logistics and delivery problems. With H&amp;I, all available records are already there (<em>Just in case</em>), but maybe not the most recent ones.</p>
<p>Objectively of course, H&amp;I can solve the problems inherent in Metasearch, and therefore is a superior solution. However, a number of institutions, mainly general academic libraries, will for some time depend on databases that can&#8217;t be harvested because of technical, legal or commercial reasons.</p>
<p>In other cases, H&amp;I is the best option, for instance in the case of cooperating local or regional libraries, such as Adamnet, or dedicated academic or research libraries that only depend on a limited number of important databases and catalogs.</p>
<p>But I also believe that the real power of H&amp;I can only be taken advantage of, if institutions cooperate and maintain shared central indexes, instead of building each their own redundant metadata stores. This already happens, for instance in Denmark, where the Royal Library uses <a href="http://primo-7.kb.dk/primo_library/libweb/action/search.do?mode=Basic&amp;vid=KGL&amp;tab=artikler" target="_blank">Primo</a> to access the national <a href="http://dads.dtv.dk/" target="_blank">DADS</a> database.</p>
<p>We also see commercial hosted H&amp;I initiatives implemented as <em>SaaS </em>(Software as a Service) by both tool vendors and database suppliers, like <a id="t4es" title="Ex Libris' PrimoCentral" href="http://www.exlibrisgroup.com/category/PrimoCentral" target="_blank">Ex Libris&#8217; PrimoCentral</a>, <a id="zl-." title="Summon" href="http://www.serialssolutions.com/summon/" target="_blank">SerialSolutions&#8217; Summon</a> and <a id="z36d" title="EBSCOhost Integrated Search" href="http://www.ebscohost.com/thisTopic.php?topicID=1177&amp;marketID=1" target="_blank">EBSCOhost Integrated Search</a>.</p>
<p>The funny thing is, that if you want to take advantage of all these hosted harvested indexes, you are likely to end up with a hybrid kind of metasearch situation where you distribute searches to a number of remote H&amp;I databases.</p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fcommonplace.net%2F2009%2F10%2Fjust-in-time-or-just-in-case%2F&amp;linkname=Just%20in%20time%20or%20just%20in%20case%3F"><img src="http://commonplace.net/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://commonplace.net/2009/10/just-in-time-or-just-in-case/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Roadmaps to uncertainty</title>
		<link>http://commonplace.net/2009/10/roadmaps-to-uncertainty/</link>
		<comments>http://commonplace.net/2009/10/roadmaps-to-uncertainty/#comments</comments>
		<pubDate>Tue, 06 Oct 2009 20:36:11 +0000</pubDate>
		<dc:creator>Lukas Koster</dc:creator>
				<category><![CDATA[Library2.0]]></category>
		<category><![CDATA[exlibris]]></category>
		<category><![CDATA[igelu]]></category>
		<category><![CDATA[igelu09]]></category>
		<category><![CDATA[library systems]]></category>
		<category><![CDATA[meetings]]></category>
		<category><![CDATA[next generation]]></category>
		<category><![CDATA[roadmap]]></category>
		<category><![CDATA[saas]]></category>
		<category><![CDATA[system librarians]]></category>
		<category><![CDATA[urm]]></category>
		<category><![CDATA[workflows]]></category>

		<guid isPermaLink="false">http://commonplace.net/?p=795</guid>
		<description><![CDATA[What will library staff do 5 years from now?
I attended the IGeLU 2009 annual conference in Helsinki September 6-9. IGeLU is the International Group of Ex Libris Users, an independent organisation that represents Ex Libris customers. Just to state my position clearly I would like to add that I am a member of the IGeLU [...]]]></description>
			<content:encoded><![CDATA[<p><em><strong>What will library staff do 5 years from now?</strong></em></p>
<div id="attachment_813" class="wp-caption alignleft" style="width: 388px"><img class="size-medium wp-image-813" title="Road closed" src="http://commonplace.net/wp-content/uploads/2009/10/DSCF5544-300x225.jpg" alt="Road closed" width="378" height="282" /><p class="wp-caption-text">© Lukas Koster</p></div>
<p>I attended the <a id="cr:l" title="IGeLU 2009 annual conference" href="http://igelu2009.org/" target="_blank">IGeLU 2009 annual conference</a> in Helsinki September 6-9. <strong>IGeLU </strong>is the <a id="fx.q" title="International Group of Ex Libris Users" href="http://igelu.org/" target="_blank">International Group of Ex Libris Users</a>, an independent organisation that represents <a id="qon5" title="Ex Libris" href="http://www.exlibrisgroup.com/" target="_blank">Ex Libris</a> customers. Just to state my position clearly I would like to add that I am a member of the <a id="ib9m" title="IGeLU Steering Committee" href="http://igelu.org/about-us/steering-committee" target="_blank">IGeLU Steering Committee</a>.<br />
These annual user group meetings typically have three types of sessions: internal organisational sessions (product working groups and steering committee business meetings, elections), Ex Libris sessions (product updates, Q&amp;A, strategic visions), and customer sessions (presentations of local solutions, addons, developments).</p>
<p>Not surprisingly, the main overall theme of this conference was the future of library systems and libraries. The word that characterises the conference best in my mind (besides &#8220;<em>next generation</em>&#8220;and &#8220;<em>metaphor</em>&#8220;) is &#8220;<strong><em>roadmap</em></strong>&#8220;. All Ex Libris products but also all attending libraries are on their way to something new, which strangely enough is still largely uncertain.</p>
<div id="attachment_820" class="wp-caption alignright" style="width: 310px"><img class="size-medium wp-image-820" title="Sunrise" src="http://commonplace.net/wp-content/uploads/2009/10/DSCF5514-300x225.jpg" alt="Sunrise" width="300" height="225" /><p class="wp-caption-text">© Lukas Koster</p></div>
<p><strong>Library paradise</strong>?<br />
Ex Libris presented the latest state of design and development of their <a id="wwwt" title="URM" href="http://www.exlibrisgroup.com/?catid=%7B86D863D1-734A-4CBC-9E1F-4AD08536B07D%7D" target="_blank">URM</a> (Unified Resource Management) project, &#8216;A New Model for Next-generation Library Services&#8217;. In the final URM environment all back end functionality of all current Ex Libris products will be integrated into one big modular system, implemented in a <a id="zoze" title="SaaS" href="http://en.wikipedia.org/wiki/Software_as_a_service" target="_blank">SaaS</a> (&#8220;<em>Software as a Service</em>&#8220;) architecture. In the Ex Libris vision the front end to this model will be their <a id="xcud" title="Primo" href="http://www.exlibrisgroup.com/category/PrimoOverview" target="_blank">Primo</a> Indexing and Discovery interface, but all <em>URM </em>modules will have open API&#8217;s to enable using them with other tools.<br />
The goal of this roadmap apparently is efficiency in the areas of technical and functional system administration for libraries.</p>
<p><strong>Intermediate generation</strong><br />
In the mean time development of existing products is geared towards final inclusion in <em>URM</em>. All future upgrades will result in what I would like to call &#8220;<em>intermediate</em>&#8221; instead of &#8220;<em>next generation</em>&#8221; products . <em>MetaLib</em>, the metasearch or federated search tool, will be replaced by <em>MetaLib Next Generation</em>, with a re-designed metasearch engine and a <em>Primo </em>front end. The digital collection management tool <em>DigiTool</em> will be merged into its new and bigger nephew <em>Rosetta</em>, the digital preservation system. The database of the OpenUrl resolver <em>SFX</em> will be restructured to accommodate the <em>URM </em>datamodel.  The next version of <em>Verde</em> (electronic resource management) will effectively be <em>URM </em>version 1, which will also be usable as an alternative for both ILS&#8217;es <em>Voyager</em> and <em>Aleph</em>.<br />
Here we see a kind of &#8220;intermediate&#8221; roadmap to different &#8220;base camps&#8221; from where the travelers can try to reach their final destination.</p>
<div id="attachment_816" class="wp-caption alignright" style="width: 310px"><img class="size-medium wp-image-816" title="Accessibility" src="http://commonplace.net/wp-content/uploads/2009/10/DSCF5547-300x225.jpg" alt="Accessibility" width="300" height="225" /><p class="wp-caption-text">© Lukas Koster</p></div>
<p><strong>Holy cows!</strong><br />
From the perspective of library staff we see another panorama appearing.<br />
In one of the customer presentations Janet Lute of <a id="bwqb" title="Princeton University Library" href="http://www.princeton.edu/main/library/" target="_blank">Princeton University Library</a>, one of the three (<a id="ax9q" title="now four" href="http://initiatives.exlibrisgroup.com/2009/09/continuing-collaboration-our-first-urm.html" target="_blank">now four</a>) URM development partners, mentioned a couple of &#8220;holy cows&#8221; or library tasks they might consider stopping doing while on their way to the new horizon:</p>
<ul>
<li>managing prediction patterns for journal issues</li>
<li>checking in print serials</li>
<li>maintaining lots of circulation matrices and policies</li>
<li>collecting fines</li>
<li>cataloging over 80% of bibliographic records</li>
</ul>
<p>I would like to add my own holy cow <em><strong>MARC </strong></em>to this list, about which I have written a previous post <a id="oxb3" title="Who needs MARC?" href="../2009/05/who-needs-marc/">Who needs MARC?</a>. (Some other developments in this area are self service, approval plans, shared cataloging, digitisation, etc.)<br />
This roadmap is supposed to lead to more efficient work and less pressure for acquisitions, cataloging and circulation staff.</p>
<p><strong>Eldorado or Brave New World?</strong><br />
To summarise: we see a sketchy roadmap leading us via all kinds of optional intermediate stations to an as yet still vague and unclear Eldorado of scholarly information disclosure and discovery.<br />
The majority of public and professional attention is focused on discovery: modern web 2.0 front ends to library collections, and the benefits for the libraries&#8217; end users. But it is probably even more important to look at the other side, disclosure: the library back end, and the consequences of all these developments for library staff, both technically oriented system administrators and professionally oriented librarians.</p>
<p>Future efficient integrated and modular library systems will no doubt eliminate a lot of tasks performed by library staff, but does this mean there will be no more library jobs?<br />
Will the university library of the future be &#8220;<em>sparsely staffed, highly decentralized, and have a physical plant consisting of little more than special collections and study areas</em>&#8220;, as was stated recently in an <a id="x:1k" title="article in &quot;Inside Higher Education&quot;" href="http://www.insidehighered.com/news/2009/09/24/libraries" target="_blank">article in &#8220;Inside Higher Education&#8221;</a>? I mentioned similar options in &#8220;<a id="ssim" title="No future for libraries?" href="../2009/05/no-future-for-libraries/">No future for libraries?</a>&#8220;.</p>
<p>Personally I expect that the two far ends of the library jobs spectrum will merge into a single generic job type which we can truly call &#8220;<em>system librarian</em>&#8220;, as I stated in my post &#8220;<a id="gz:c" title="System librarians 2.0" href="../2008/10/system-librarians-20/">System librarians 2.0</a>&#8220;. But what will these professionals do? Will they catalog? Will they  configure systems? Will they serve the public? Will they develop system add-ons?</p>
<p>This largely depends on how the new integrated systems will be designed and implemented, how systems and databases from different vendors and providers will be able to interact, how much libraries/information management organisations will outsource and crowdsource, how much library staff is prepared to rethink existing workflows, how much libraries want to distinguish themselves from other organisations, how much end users are interested in differences between information management organisations; in brief: how much these new platforms will allow us to do ourselves.</p>
<p>We have come up with a realistic image of ourselves for the next couple of decades soon, otherwise our publishers and system vendors will be doing it for us.</p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fcommonplace.net%2F2009%2F10%2Froadmaps-to-uncertainty%2F&amp;linkname=Roadmaps%20to%20uncertainty"><img src="http://commonplace.net/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://commonplace.net/2009/10/roadmaps-to-uncertainty/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Explicit and implicit metadata</title>
		<link>http://commonplace.net/2009/08/explicit-and-implicit-metadata/</link>
		<comments>http://commonplace.net/2009/08/explicit-and-implicit-metadata/#comments</comments>
		<pubDate>Thu, 20 Aug 2009 13:06:00 +0000</pubDate>
		<dc:creator>Lukas Koster</dc:creator>
				<category><![CDATA[Library2.0]]></category>
		<category><![CDATA[catalog]]></category>
		<category><![CDATA[controlled vocabulary]]></category>
		<category><![CDATA[fulltext searching]]></category>
		<category><![CDATA[implicit metadata]]></category>
		<category><![CDATA[library systems]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[metadata generation]]></category>
		<category><![CDATA[social tagging]]></category>

		<guid isPermaLink="false">http://commonplace.net/?p=764</guid>
		<description><![CDATA[On August 17, after I tested a search in our new Aleph OPAC and mentioned my surprise on Twitter, the following discussion unfolded between me (lukask), Ed Summers of the Library of Congress and Till Kinstler of GBV (German Union Library Network):

lukask: Just found out we only have one item about RDF in our catalogue: [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_788" class="wp-caption alignright" style="width: 321px"><a href="http://www.flickr.com/photos/phunk/204366856/" target="_blank"><img class="size-full wp-image-788" title="tagged" src="http://commonplace.net/wp-content/uploads/2009/08/tagged.jpg" alt="tagged" width="311" height="353" /></a><p class="wp-caption-text">Tagged © funkandjazz</p></div>
<p>On August 17, after I tested a search in our new Aleph OPAC and mentioned my surprise on Twitter, the following discussion unfolded between <a id="eua7" title="me" href="http://twitter.com/lukask" target="_blank">me (lukask)</a>, <a id="rbn7" title="Ed Summers" href="http://twitter.com/edsu" target="_blank">Ed Summers</a> of the Library of Congress and <a id="d4rf" title="Till Kinstler" href="http://twitter.com/tillk" target="_blank">Till Kinstler</a> of GBV (German Union Library Network):</p>
<ul>
<li><span style="font-size: small;"><em><span style="font-family: Georgia;">lukask: Just found out we only have one item about RDF in our catalogue: <a href="http://tinyurl.com/lz75c4" target="_blank">http://tinyurl.com/lz75c4</a></span></em></span></li>
<li><span style="font-size: small;"><em> <span style="font-family: Georgia;">edsu: @lukask broaden that search <img src='http://commonplace.net/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  <a href="http://is.gd/2l6vB" target="_blank">http://is.gd/2l6vB</a></span></em></span></li>
<li><span style="font-size: small;"><em> <span style="font-family: Georgia;">lukask: @edsu Ha! Thanks! But I&#8217;m sure that RDF will be mentioned in these 29 titles! A case for social tagging!</span></em></span></li>
<li><span style="font-size: small;"><em> <span style="font-family: Georgia;">edsu: @lukask or better cataloging <img src='http://commonplace.net/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </span></em></span></li>
<li><span style="font-size: small;"><em> <span style="font-family: Georgia;">edsu: @lukask i guess they both amount to the same thing eh?</span></em></span></li>
<li><span style="font-size: small;"><em> <span style="font-family: Georgia;">lukask: @edsu That&#8217;s an interesting position&#8230;&#8221;social tagging=better cataloging&#8221;. I will ask my cataloguing co-workers about this specific example</span></em></span></li>
<li><span style="font-size: small;"><em> <span style="font-family: Georgia;">edsu: @lukask make sure to wear body-armor</span></em></span></li>
<li><span style="font-size: small;"><em> <span style="font-family: Georgia;">lukask: @edsu Yes I know! I will bring it up at tomorrow&#8217;s party for the celebration of our ALEPH STP (after some drinks&#8230;)</span></em></span></li>
<li><span style="font-size: small;"><em> <span style="font-family: Georgia;">tillk: </span><span style="font-family: Georgia;"><span>@edsu @lukask or fulltext search&#8230; <img src='http://commonplace.net/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  SCNR&#8230;</span></span></em></span></li>
<li><span style="font-size: small;"><em> <span style="font-family: Georgia;"><span>edsu: @tillk yeah, totally &#8212; with projects like @<a href="http://twitter.com/googlebooks" target="_blank">googlebooks</a> and @<a href="http://twitter.com/hathitrust" target="_blank">hathitrust</a> we may look back on the age of cataloging with different eyes &#8230;</span></span></em></span></li>
<li><span style="font-size: small;"><em> <span style="font-family: Georgia;">lukask: @tillk @edsu Fulltext search yes, or &#8220;implicit automatic metadata generation&#8221;?</span></em></span></li>
</ul>
<p>What happened here was:</p>
<ul>
<li><strong><em>A problem with findability of specific bibliographic items was observed</em></strong>: although it is highly unlikely that books about the <em>Semantic Web</em> will not cover <em>RDF-Resource Description Framework</em>, none of the 29 titles found with &#8220;<em>Semantic Web</em>&#8221; could be found with the search term &#8220;<em>Resource Description Framework</em>&#8220;; on the other side, the only item found with &#8220;<em>Resource Description Framework</em>&#8221; was NOT found with &#8220;<em>Semantic Web</em>&#8220;. I must add that the &#8220;<em>Semantic we</em>b&#8221; search was an &#8220;All words&#8221; search. Only 20 of the results were indexed with the Dutch subject heading &#8220;<em>Semantisch web</em>&#8221; (which term is never used in real life as far as I know; the English term is an international concept). Some results were off topic, they just happened to have &#8220;<em>semantic</em>&#8221; and &#8220;<em>web</em>&#8221; somewhere in their metadata. A better search would have been a phrase search (adjacent) with &#8220;<em>semantic web</em>&#8221; in actual quotes, which gives 26 items. But of these, a small number were not indexed with subject heading &#8220;<em>Semantisch web</em>&#8220;. Another note: searching with &#8220;<em>RDF</em>&#8221; gets you all kinds of results. Read more on the issue of searching and relevance in my post <a id="k5tu" title="Relevance in context" href="../2009/08/relevance-in-context/">Relevance in context</a>.</li>
</ul>
<ul>
<li><strong><em>Four possible solutions were suggested</em></strong>:</li>
</ul>
<ol style="margin-left: 40px;">
<li><em>social tagging</em></li>
<li><em>better cataloging</em></li>
<li><em>fulltext searching</em></li>
<li><em>automatic metadata generation</em></li>
</ol>
<p><strong>Social tagging</strong><br />
Clearly, the 26 items found with the search &#8220;<em>Semantic web</em>&#8221; are not indexed by the &#8220;<em>Resource description framework</em>&#8221; or &#8220;<em>RDF</em>&#8221; subject heading. There is not even a subject heading for &#8220;<em>Resource description framework</em>&#8221; or &#8220;<em>RDF</em>&#8220;. In my personal view, from my personal context, this is an omission. Mind you, this is not only an issue in the catalogue of the Library of the University of Amsterdam, it is quite common. I tried it in the <a id="t9nm" title="British Library Integrated Catalogue" href="http://catalogue.bl.uk/" target="_blank">British Library Integrated Catalogue</a> with similar results. Try it in your own OPAC!<br />
I presume that our professional cataloging colleagues can&#8217;t know everything about all subjects. That is completely understandable. I would not know how to catalog a book about a medical subject myself either! But this is exactly the point. If you allow end users to add their own tags to your bibliographic records, you enhance the findability of these records for specific groups of end users.<br />
I am not saying that cataloguing and indexing by library specialists using controlled vocabularies should be replaced by social tagging! No, not at all. I am just saying that both types of tagging/indexing are complementary. Sure, some of the tags added by end users may not follow cataloging standards, but who cares? Very often the end users adding tags of their own will be professional experts in their field. In any case, items with social tags will be found more often because specific end user groups can find them searching with their own terms.</p>
<p><strong>Better cataloging<br />
</strong>I suppose Ed Summers was trying to say the same thing as I just did above, when he commented &#8220;<span style="font-size: small;"><em><span style="font-size: small;"><span style="font-family: Georgia;">or better cataloging</span></span></em></span>, <span style="font-size: small;"><em><span style="font-size: small;"><span style="font-family: Georgia;">I guess they both amount to the same thing eh?</span></span></em></span>&#8220;, which I summarised as &#8220;<span style="font-size: small;"><em><span style="font-size: small;"><span style="font-family: Georgia;">social tagging=better cataloging</span></span></em></span>&#8220;, but he can correct me if I&#8217;m wrong.<br />
Anyway, I hope I made it clear that I would not say &#8220;<span style="font-size: small;"><em><span style="font-size: small;"><span style="font-family: Georgia;">social tagging=better cataloging</span></span></em></span>&#8220;, but rather &#8220;<span style="font-size: small;"><em><span style="font-size: small;"><span style="font-family: Georgia;">controlled vocabularies+social tagging=better cataloging</span></span></em></span>&#8220;.<br />
Or alternatively, could we improve cataloging by professional library catalogers? I must admit I do not know enough about library training and practice to say something about that. I am not a trained librarian. Don&#8217;t hesitate to comment!</p>
<p><strong>Fulltext searching</strong><br />
Is fulltext searching the miracle cure for findability problems, as Till Kinstler seems to suggest? Maybe.<br />
Suppose all our print material was completely digitised and available for fulltext search, I have no doubt that all 26 items mentioned above (the results of the &#8220;<em>semantic web</em>&#8221; all words search) would be found with the &#8220;<em>resource description framework</em>&#8221; or &#8220;<em>rdf</em>&#8221; search as well. But because fulltext search is by its very nature an &#8220;all words&#8221; search, the &#8220;<em>rdf</em>&#8221; fulltext search would also give a lot of &#8220;noise&#8221;, or items not having any relation to &#8220;semantic web&#8221; at all (author&#8217;s initials &#8220;R.D.F&#8221;, other acronyms &#8220;R.D.F.&#8221;, just see <a id="x304" title="RDF in the BL catalogue" href="http://catalogue.bl.uk/F/?func=find-b&amp;request=rdf&amp;find_code=WRD" target="_blank">RDF in the BL catalogue</a>). Again, see my post <a id="vz8e" title="Relevance in context" href="../2009/08/relevance-in-context/">Relevance in context</a> for an explanation of searching without context.<br />
Also, there will be books or articles about a subject that will not contain the actual subject term at all. With fulltext search these items will not be found.<br />
Moreover, fulltext searching actually limits the findable items to text, excluding other types, like images, maps, video, audio etc.<br />
This brings me to the &#8220;final solution&#8221;:</p>
<p><strong>Automatic metadata generation</strong><br />
Of course this is mostly still wishful thinking. But there are a number of interesting implementations already.<br />
What I mean when I say &#8220;(implicit) automatic metadata generation&#8221; is: metadata that is NOT created deliberately by humans, but either generated and assigned as static metadata, or generated on the fly, by software, applying intelligent analysis to objects, of all types (text, images, audio, video, etc.).<br />
In the case of our &#8220;rdf&#8221; example, such a tool would analyse a text and assign &#8220;<em>rdf</em>&#8221; as a subject heading based on the content and context of this text, even if the term &#8220;<em>rdf</em>&#8221; does not appear in the text at all. It would also discard texts containing the string &#8220;<em>rdf</em>&#8221; that refer to something completely different. Of course for this to succeed there should be some kind of contextual environment with links to other records or even other systems to be able to determine if certain terminology is linked to frequently used terms not mentioned in the text itself (here the Linked Data developments could play a major role).<br />
The same principle should also apply to non-textual objects, so that images, audio, video etc. about the same subject can be found in one go. Google has some interesting implementations in this field already: image search by colour and content type: see for example the search for <a id="lig-" title="&quot;rdf&quot; in Google Images with colour &quot;red&quot;and content type &quot;clip art&quot;" href="http://images.google.com/images?as_q=rdf&amp;imgcolor=red&amp;imgtbs=lt&amp;hl=en&amp;btnG=Google+Search&amp;as_epq=&amp;as_oq=&amp;as_eq=&amp;imgtype=clipart&amp;imgsz=&amp;imgw=&amp;imgh=&amp;imgar=&amp;as_filetype=&amp;imgc=&amp;as_sitesearch=&amp;as_rights=&amp;safe=images&amp;as_st=y" target="_blank">&#8220;rdf&#8221; in Google Images with colour &#8220;red&#8221;and content type &#8220;clip art&#8221;</a>.<br />
But of course there still needs a lot to be done.</p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fcommonplace.net%2F2009%2F08%2Fexplicit-and-implicit-metadata%2F&amp;linkname=Explicit%20and%20implicit%20metadata"><img src="http://commonplace.net/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://commonplace.net/2009/08/explicit-and-implicit-metadata/feed/</wfw:commentRss>
		<slash:comments>15</slash:comments>
		</item>
		<item>
		<title>Relevance in context</title>
		<link>http://commonplace.net/2009/08/relevance-in-context/</link>
		<comments>http://commonplace.net/2009/08/relevance-in-context/#comments</comments>
		<pubDate>Mon, 10 Aug 2009 23:58:53 +0000</pubDate>
		<dc:creator>Lukas Koster</dc:creator>
				<category><![CDATA[Library2.0]]></category>
		<category><![CDATA[contextual searching]]></category>
		<category><![CDATA[federated searching]]></category>
		<category><![CDATA[library systems]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[metasearching]]></category>
		<category><![CDATA[relevance ranking]]></category>
		<category><![CDATA[search systems]]></category>

		<guid isPermaLink="false">http://commonplace.net/?p=709</guid>
		<description><![CDATA[If you do a search in a bibliographic database, you should find what you need, not just what you are looking for, or what the database &#8220;thinks&#8221; you are looking for. If you find what you are looking for, then you will not be surprised and you will not discover anything new. And that&#8217;s not [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_715" class="wp-caption alignleft" style="width: 310px"><a href="http://www.flickr.com/photos/denverjeffrey/2561885967/" target="_blank"><img class="size-medium wp-image-715" title="searching" src="http://commonplace.net/wp-content/uploads/2009/08/searching-300x234.jpg" alt="searching" width="300" height="234" /></a><p class="wp-caption-text">Search! © Jeffrey Beall</p></div>
<p>If you do a search in a bibliographic database, you should find what you need, not just what you are looking for, or what the database &#8220;thinks&#8221; you are looking for. If you find what you are looking for, then you will not be surprised and you will not discover anything new. And that&#8217;s not what you want, is it? But if you find things you did not look for but also do not need, you&#8217;re not just surprised, you are confused! And that&#8217;s not what you want either.</p>
<p><em>You want the results that are the most relevant for your search, with your specific objectives, at that specific point in time time, for your specific circumstances, and you want them immediately.</em></p>
<p>So, how should search systems behave to make you find what you need? There are two conditions that need to be met:</p>
<ul>
<li><em>The search terms must be interpreted correctly</em></li>
<li><em>The most relevant search results must be presented</em></li>
</ul>
<p><strong>The Problem</strong><br />
First of all, let&#8217;s take a look at current practice.</p>
<p>Search systems cannot cope with ambiguous search terms. My favorite example and test search term is &#8220;<em><strong>voc</strong></em>&#8220;. This can stand for a number of things in various disciplines: <em><strong>V.O.C.</strong></em> (Dutch: &#8220;Verenigde Oostindische Compagnie&#8221;  or &#8220;Dutch United East Indies Company&#8221;) in historical databases; &#8220;<em><strong>vocals</strong></em>&#8221; in musical databases; &#8220;<em><strong>volatile organic compounds</strong></em>&#8221; in physics databases. So if you do a search for &#8220;<em>voc</em>&#8221; in a standard library catalogue, you get all kinds of results. Even more so if you use a metasearch or federated search tool for searching several databases simultaneously.</p>
<div id="attachment_722" class="wp-caption alignnone" style="width: 579px"><a href="http://catalogue.bl.uk/F/?func=find-b&amp;request=voc&amp;find_code=WRD" target="_blank"><img class="size-full wp-image-722" title="blvoc" src="http://commonplace.net/wp-content/uploads/2009/08/blvoc.jpg" alt="Search for &quot;voc&quot; in British Library catalogue" width="569" height="624" /></a><p class="wp-caption-text">Search for &quot;voc&quot; in British Library Integrated Catalogue</p></div>
<p>You are confused. You would like the system to &#8220;understand&#8221; which one of these concepts you are referring to instead of just using the literal string. You would like the system to take into account your <em><strong>context</strong></em>.</p>
<p>In most databases search results can be sorted or filtered by a number of fields, most commonly by year, title, author, and also by more specific fields in dedicated databases.  But unless you are interested in a specific year, author or title, this will not do. Recently many systems have implemented &#8220;faceted&#8221; and &#8220;clustered&#8221; browsing of results, enabling &#8220;drilling down&#8221; on specific terms or subjects. This basically comes down to setting the context after the fact.</p>
<p>But after the system has interpreted your search terms, the  results should also be ordered in a specific way, the ones you need most should be on top. This is where &#8220;<em><strong>relevance ranking</strong></em>&#8221; of search results comes in. Most catalogues and databases use a system specific default relevance ranking algorithm. Search results are assigned a rank, based on a number of criteria, that can differ between databases, depending on the nature of the database.<br />
Some databases just present the most recent results on top. For medical and physical sciences this may be right, but for history and literature databases this may just be wrong.<br />
Sometimes the search terms are taken into account: the number of times the given search terms are present in the result fields is important, but also the specific fields in which search terms appear. The appearance of search terms in &#8220;Title&#8221; and &#8220;Subject&#8221; may rank higher than in &#8220;Abstract&#8221; or &#8220;Publisher&#8221;. Moreover, the search indexes used can have a major influence on rank: if you search for <em>&#8220;Subject&#8221; = &#8220;flu&#8221;</em>, then results with <em>&#8220;flu&#8221;</em> as subject will be ranked higher than results with <em>&#8220;flu&#8221;</em> in the title only.<br />
To come back to my example, with ambiguous search terms like &#8220;<em>voc</em>&#8221; this type of relevance ranking will definitely not be enough, because results from the three different conceptual areas will be completely mixed up.</p>
<div id="attachment_746" class="wp-caption alignnone" style="width: 668px"><a href="http://digitaal.uba.uva.nl/" target="_blank"><img class="size-full wp-image-746" title="uvafacets" src="http://commonplace.net/wp-content/uploads/2009/08/uvafacets.jpg" alt="Faceted/clustered search results in Amsterdam Univerity Digital Library " width="658" height="400" /></a><p class="wp-caption-text">Clustered search results in Amsterdam University Library MetaLib portal</p></div>
<p>When searching with a metasearch or federated search tool things get even more complicated. Each of the remote databases that you search in has its own default way of ranking. Usually the metasearch tool fetches the first 30 or so results from each remote database (one set sorted by date, the other by internal rank, the next by title), merges these into one list and then applies its own local ranking mechanism to this subset only. Confusion! And I did not even mention searching databases with metadata in multiple languages. Moreover, databases containing only metadata will produce different results and relevance than databases with full text articles. There is absolutely no way of telling if you actually have the most relevant results for your situation.</p>
<p>Again, with relevance ranking search systems do not take into account the context either. You could say it is an introverted, internally focused way of ranking, the confusing results of which are multiplied in case of metasearching.</p>
<p>Most metasearch tools give users the option of searching in sets of pre-selected databases, based on subject or type. This way you can limit your search to those databases that are known to have data about that specific subject. You more or less set the context in advance. But this mechanism only eliminates results from databases that probably do not have data on your subject at all, so they would not have shown up in the results anyway. Moreover, the same issues that were discussed above apply to this limited set of databases.</p>
<p>The metasearch tool that I know best (MetaLib) offers the option of setting a relative rank per database, so results from databases with a higher rank will have a higher relevance in merged result sets. But this is a system wide option, set by system administration, so it is not taking into account any context at all. It would be better if you could make the relative database rank dependent on the set or subject the search is done from (for instance: <em>if a history database is searched in the context of a &#8220;History&#8221; set, the results get a higher rank than in a search from a &#8220;Music&#8221; set</em>).</p>
<p>The best solution for this &#8220;internal&#8221; relevance problem regarding distributed databases is a central database of harvested indexes. In this case all harvested metadata is normalised and ranked in a uniform way, and users do not have to select databases in advance. But these systems still do not take into account &#8220;external&#8221; relevance: there is no context!</p>
<p>A very interesting and  intelligent solution for the problem of pre-selecting databases is provided in <a id="lphx" title="PurpleSearch" href="http://purplesearch.ub.rug.nl/" target="_blank">PurpleSearch</a>, the integrated front end to MetaLib (among other things), developed by the <a id="xuc4" title="Library of the University of Groningen" href="http://www.rug.nl/bibliotheek" target="_blank">Library of the University of Groningen</a>. The system records which databases actually produce results for specific search terms. As soon as the user enters search terms in the single search box, the system knows which databases will have results, and the search is automatically carried out in these databases, without asking the user to select the databases or subject area he or she wants to search in. Simultaneously a background search in all other databases is performed in order to check additional new results, and the information about results in databases is updated.<br />
Of course, all other usual options are available as well, like pre-selecting databases (setting context in advance) and faceted results drilling down (setting context after the fact). But again, no external contextual settings.</p>
<div id="attachment_730" class="wp-caption alignnone" style="width: 655px"><img class="size-full wp-image-730" title="purplesearchvoc" src="http://commonplace.net/wp-content/uploads/2009/08/purplesearchvoc.jpg" alt="Search &quot;voc&quot; in PurpleSearch" width="645" height="376" /><p class="wp-caption-text">Search &quot;voc&quot; in PurpleSearch</p></div>
<ul>
<li><em>Conclusion: the only way to find what you need, is to make search systems take into account the context in which the search is done, both for searching and for relevance ranking.</em></li>
</ul>
<p><strong>Solutions</strong><br />
Now, let&#8217;s have a look at a couple of conditions that would make contextual searches possible.</p>
<p><em><strong>Personal context</strong></em>: a system should &#8220;know&#8221; about your personal interests, field of study, job situation, age, etc. so it can &#8220;decide&#8221; which databases to search in and which results are the most relevant for you. Some systems, like university systems, have access to information about their users. Once you log in, the system potentially knows which subjects you are studying or teaching and could use this information for setting the context for searching and ranking.<br />
But what if you are a student in Law <em>AND </em>Social Siences, which subject area should the system choose? Or: if you are a History teacher, and you have a personal interest in Ecology, which the system does not know about, what then? Somehow you still need to set context yourself.</p>
<p>Some systems also offer the opportunity of setting personal preferences, like: area of interest, specific databases, type of material (only digital or print), only recent material, etc. Again: you must be able to deviate from these preferences, depending on your situation, which means setting context manually.</p>
<p>Different search systems will have different user profiles (user data and preferences). It would be nice if search systems could take advantage of universal external personal profiles (like Google Profiles for instance) using some kind of universal API.</p>
<p><em><strong>Situational context</strong></em>: a system should also &#8220;know&#8221; about the situation you are in, both in a functional sense and in a physical sense.</p>
<p><em><strong>Functional context </strong></em>means: wich role are you playing? Are you in your law student role or in your social sciences student role? Are you in your professional role or in your private role? But also: to which resources do you have access?<br />
An interesting idea: if you work Monday to Friday during office hours, study in the evenings and spend time on your personal interests on the weekends, it would be nice if you could link times of day and days of the week to your different roles, so search systems could use the correct context for your searches depending on time and date: &#8220;<em>if it&#8217;s Tuesday evening then use study profile and search in &#8216;History&#8217;; if it&#8217;s Sunday, use private profile and search in &#8216;Ecology&#8217;</em>&#8220;.</p>
<p><img class="size-full wp-image-755 alignright" title="It's the Great Pumpkin Charlie Brown" src="http://commonplace.net/wp-content/uploads/2009/08/Its_the_Great_Pumpkin_Charlie_Brown.jpg" alt="It's the Great Pumpkin Charlie Brown" width="162" height="220" /></p>
<p>This <strong><em>temporal context</em></strong> was also referred to by <a id="p4nc" title="Till Kinstler" href="http://twitter.com/tillk" target="_blank">Till Kinstler</a> in a (German) <a id="oo1j" title="blog post" href="http://blog.nationallizenz.de/2009/02/07/wo-ist-denn-die-information/" target="_blank">blog post</a> about the new &#8220;<a id="npd1" title="Suchkisten" href="https://finden.nationallizenz.de/prototyp/Search/Home?s=charlie+brown&amp;what=suchkiste" target="_blank">Suchkiste</a>&#8221; search system prototype of the <a id="hzii" title="German     Union Library Network (GBV)" href="http://www.gbv.de/vgm/info/info_0198" target="_blank">German     Union Library Network (GBV)</a>: &#8216;<em>the search for “Charlie Brown” in October should result in “It’s the Great Pumpkin, Charlie Brown” at number 1, and in December in “A Charlie Brown Christmas“</em>&#8216;.</p>
<p><em><strong>Physical context </strong></em>means: where are you? It would be nice if a library catalogue search system would take into account your actual location, so it could show you the records of the copies of the FRBR-ized results available in the library locations nearest to you (<em>this idea came up in a recent Twitter discussion between <a id="y.oh" title="@librarianbe" href="http://twitter.com/librarianbe" target="_blank">@librarianbe</a> and <a id="n-ok" title="@gbierens" href="http://twitter.com/gbierens" target="_blank">@gbierens</a></em>). This is what Worldcat does when you supply it with your location manually. In Worldcat this is a static preference. But it would be nice if it would respond to your actual location, for instance by using the GPS coordinates of your mobile phone. Alternatively, search systems could derive your location from the IP address you are sending your search from.<br />
This information could also be used to determine if records for digital or physical copies should be ranked the most relevant in this case. If you are inside the library building and you have a preference for physical books and journals, then records for available print copies should be on top of the results list. If you are at home, then records for digital copies that you have access to should come first.</p>
<p>Contextual searching and ranking should always be a combination of all possible conditions, personal, situational and internal system ones.</p>
<p>Of course it goes without saying that it would be great if metasearch tools were able to convey the search context to the remote databases and get contextual results back, using some kind of universal serach context API!</p>
<p>Last but not least, each search system should show the context of the search, and explain how it got to the results in the presented order. Something like: <em>based on your personal preferences, the time of day and day of the week, and your location, the search was done in these databases, with this subject area, and the physical copies of the nearest location are shown on top</em>.<br />
This context area on the results screen could then be used as a kind of inverted faceted search, drilling &#8220;up&#8221; to a broader level or &#8220;sideways&#8221; to another context.</p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fcommonplace.net%2F2009%2F08%2Frelevance-in-context%2F&amp;linkname=Relevance%20in%20context"><img src="http://commonplace.net/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://commonplace.net/2009/08/relevance-in-context/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Linked Data for Libraries</title>
		<link>http://commonplace.net/2009/06/linked-data-for-libraries/</link>
		<comments>http://commonplace.net/2009/06/linked-data-for-libraries/#comments</comments>
		<pubDate>Fri, 19 Jun 2009 12:29:48 +0000</pubDate>
		<dc:creator>Lukas Koster</dc:creator>
				<category><![CDATA[Library2.0]]></category>
		<category><![CDATA[aacr2]]></category>
		<category><![CDATA[api's]]></category>
		<category><![CDATA[connectivity]]></category>
		<category><![CDATA[dublin core]]></category>
		<category><![CDATA[foaf]]></category>
		<category><![CDATA[frbr]]></category>
		<category><![CDATA[identifiers]]></category>
		<category><![CDATA[integration]]></category>
		<category><![CDATA[library systems]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[marc]]></category>
		<category><![CDATA[mash-ups]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[next generation]]></category>
		<category><![CDATA[ontologies]]></category>
		<category><![CDATA[owl]]></category>
		<category><![CDATA[rda]]></category>
		<category><![CDATA[rdf]]></category>
		<category><![CDATA[RDFa]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[skos]]></category>
		<category><![CDATA[sparql]]></category>
		<category><![CDATA[vocabularies]]></category>
		<category><![CDATA[web2.0]]></category>
		<category><![CDATA[web3.0]]></category>

		<guid isPermaLink="false">http://commonplace.net/?p=607</guid>
		<description><![CDATA[Linked Data and bibliographic metadata models
Some time after I wrote &#8220;UMR &#8211; Unified Metadata Resources&#8220;, I came across Chris Keene&#8217;s post &#8220;Linked data &#38; RDF : draft notes for comment&#8220;, &#8220;just a list of links and notes&#8221; about Linked Data, RDF and the Semantic Web, put together to start collecting information about &#8220;a topic that [...]]]></description>
			<content:encoded><![CDATA[<div><em><strong>Linked Data and bibliographic metadata models</strong></em></p>
<div id="attachment_597" class="wp-caption alignnone" style="width: 510px"><a href="http://www.flickr.com/photos/photonquantique/3272712288/" target="_blank"><img class="size-full wp-image-597" title="ted" src="http://commonplace.net/wp-content/uploads/2009/06/ted1.jpg" alt="ted" width="500" height="375" /></a><p class="wp-caption-text">© PhOtOnQuAnTiQuE</p></div>
<p>Some time after I wrote &#8220;<a id="sc83" title="UMR - Unified Metadata Resources -	One single web page as the single identifier of every book, author or subject" href="../2009/04/umr-unified-metadata-resources/">UMR &#8211; Unified Metadata Resources</a>&#8220;, I came across <a id="zv_v" title="Chris Keene" href="http://twitter.com/chriskeene" target="_blank">Chris Keene&#8217;s</a> post &#8220;<a title="Linked data &amp; RDF : draft notes for comment" rel="bookmark" href="http://www.nostuff.org/words/2009/linked-data-rdf-draft-notes-for-comment/" target="_blank">Linked data &amp; RDF : draft notes for comment</a>&#8220;, &#8220;<em>just a list of links and notes</em>&#8221; about Linked Data, RDF and the Semantic Web, put together to start collecting information about &#8220;<em>a topic that will greatly impact on the Library / Information management world</em>&#8220;.</p>
<p>While reading this post and working my way through the links on that page, I started realising that <a id="icvx" title="Linked Data" href="http://linkeddata.org/" target="_blank">Linked Data</a> is exactly what I tried to describe as <em>One single web page as the single identifier of every book, author or subject.</em> I did mention <a id="qdq1" title="Semantic Web" href="http://en.wikipedia.org/wiki/Semantic_web" target="_blank">Semantic Web</a>, <a id="vgqt" title="URI's" href="http://en.wikipedia.org/wiki/URI" target="_blank">URI&#8217;s</a> and <a id="n81m" title="RDF" href="http://en.wikipedia.org/wiki/Resource_Description_Framework" target="_blank">RDF</a>, but the term &#8220;<em>Linked Data</em>&#8221; as a separate protocol had escaped me.</p>
<p>The <a id="a01y" title="concept of Linked Data" href="http://www.w3.org/DesignIssues/LinkedData.html" target="_blank">concept of Linked Data</a> was described by <a id="acb7" title="Tim Berners Lee" href="http://www.w3.org/People/Berners-Lee/" target="_blank">Tim Berners Lee</a>, the inventor of the World Wide Web. Whereas the World Wide Web links documents (pages, files, images), which are basically resources <em><strong>about </strong></em>things, (&#8220;<em>Information Resources</em>&#8221; in Semantic Web terms), Linked Data (or the Semantic Web) links raw data and real life things (&#8220;<em>Non-Information Resource</em>s&#8221;).</p>
<p>There are several definitions of Linked Data on the web, but here is my attempt to give a simple definition of it (loosely based on the definition in <a id="p4h6" title="Structured Dynamics' Linked Data FAQ" href="http://structureddynamics.com/linked_data.html" target="_blank">Structured Dynamics&#8217; Linked Data FAQ</a>):</p>
<div style="margin-left: 40px;"><em>Linked Data is a methodology for providing relationships </em><em>between things (data, concepts and documents) anywhere on the web, using URI&#8217;s for identifying</em><em>, RDF for describing</em> <em>and HTTP for publishing </em><em>these things and relationships, in a way that they can be interpreted and used by humans and software.</em></div>
<div style="margin-left: 40px;"><em><br />
</em></div>
<p>I will try to illustrate the different aspects using some examples from the library world. The article is rather long, because of the nature of the subject, then again the individual sections are a bit short. But I do supply a lot of links for further reading.</p>
<p><strong>Data is relationships</strong><br />
The important thing is that &#8220;<em>data is relationships</em>&#8220;, as Tim Berners Lee says in his <a id="xk9y" title="recent presentation for TED" href="http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html" target="_blank">recent presentation for TED</a>.<br />
Before going into relationships between things, I have to point out the important distinction between abstract concepts and real life things, which are &#8220;<em>manifestations</em>&#8221; of the concepts. In Object modeling these are called &#8220;<em>classes</em>&#8221; (abstract concepts, types of things) and &#8220;<em>objects</em>&#8221; (real life things, or &#8220;<em>instances</em>&#8221; of &#8220;<em>classes</em>&#8220;).</p>
<p>Examples:</p>
<ul>
<li>the class <em>book </em>can have the instances/objects &#8220;<em>Cloud Atlas</em>&#8220;, &#8220;<em>Moby Dick</em>&#8220;, etc.</li>
<li>the class <em>person<strong> </strong></em>can have the instances/objects &#8220;<em>David Mitchell</em>&#8220;, &#8220;<em>Herman Melville</em>&#8220;, etc.</li>
</ul>
<p>In the Semantic Web/RDF model the concept of triples is used to describe a relationship between two things: <em>subject &#8211; predicate &#8211; object,</em> meaning: a thing has a relation to another thing, in the broadest sense:</p>
<ul>
<li>a book (<em>subject</em>) is written by (<em>predicate</em>) a person<strong> </strong>(<em>object</em>)</li>
</ul>
<p>You can also reverse this relationship:</p>
<ul>
<li>a person<strong> </strong>(<em>subject</em>) is the author of (<em>predicate</em>) a book (<em>object</em>)</li>
</ul>
<div id="attachment_614" class="wp-caption alignleft" style="width: 187px"><img class="size-full wp-image-614" title="bookstriple" src="http://commonplace.net/wp-content/uploads/2009/06/bookstriple.jpg" alt="Triple" width="177" height="185" /><p class="wp-caption-text">Triple</p></div>
<p>The person in question is only an author because of his or her relationship to the book. The same person can also be <em>a mother of three children</em>, <em>an employee of a library</em>, and <em>a speaker at a conference</em>.<br />
Moreover, and this is important: there can be more than one relationship between the same two classes or types of things. A <em>book </em>(<em>subject</em>) can also be <em>about </em> (<em>predicate</em>) <em>a person<strong> </strong></em>(<em>object</em>). In this case the person is a &#8220;subject&#8221; of the book, that can be described by a &#8220;keyword&#8221;, &#8220;subject heading&#8221;, or whatever term is used. A special case would be a book, written by someone about himself (an autobiography).</p>
<p>The problem with most legacy systems, and library catalogues as an example of these, is that a record for let&#8217;s say a book contains one or more fields for the author (or at best a link to an entry in an authority file or thesaurus), and separately one or more fields for subjects. This way it is not possible to see books written by an author and books about the same author in one view, without using all kinds of workarounds, link resolvers or mash-ups.<br />
Using two different relationships that link to the same thing would provide for an actual view or representation of the real world situation.</p>
<p>Another important option of Linked Data/RDF: a certain thing can have as a property a link to a concept (or &#8220;class&#8221;) , describing the nature of the thing: &#8220;<em>object Cloud Atlas</em>&#8221; has type &#8220;<em>book</em>&#8220;; &#8220;<em>object David Mitchell</em>&#8221; has type &#8220;<em>person</em>&#8220;; &#8220;<em>object Cloud Atlas</em>&#8221; is written by &#8220;<em>object David Mitchell</em>&#8220;.</p>
<p>And of course, the property/relationship/predicate can also link to a concept describing the nature of the link.</p>
<p><strong>Anywhere on the web</strong></p>
<div id="attachment_618" class="wp-caption alignleft" style="width: 210px"><img class="size-full wp-image-618" title="bookserd" src="http://commonplace.net/wp-content/uploads/2009/06/bookserd1.jpg" alt="ERD" width="200" height="239" /><p class="wp-caption-text">ERD</p></div>
<p>So far so good. But you may argue that this relationship theory is not very new. Absolutely right, but up until now this data-relationship concept has mainly been used with a view to the inside, focused on the area of the specific information system in question, because of the nature and the limitations of the available technology and infrastructure.</p>
<p>The &#8220;triple&#8221; model is of course exactly the same as the long standing methodology of <em>Entity Relationship Diagrams</em> (<em>ERD</em>), with which <em>relationships </em>between <em>entities </em>(=&#8221;<em>classes</em>&#8220;) are described. An ERD is typically used to generate a database that contains data in a specific information system. But ERD&#8217;s could just as well be used to describe Linked Data on the web.</p>
<p>Information systems, such as library catalogs, have been, and still are, for the greatest part closed containers of data, or &#8220;silos&#8221; without connections between them, as Tim Berners Lee also mentions in his TED presentation.<br />
<img alt="" /><br />
<img alt="" />Lots of these silo systems are accessible with web interfaces, but this does not mean that items in these closed systems with dedicated web front ends can be linked to items in other databases or web pages. Of course these systems can have <a id="mpip" title="API" href="http://en.wikipedia.org/wiki/Api" target="_blank">API</a>&#8217;s that allow system developers to create scripts to get related information from other systems and incorporate that external information in the search results of the calling system. This is what is being done in <em>web 2.0</em> with so-called <em>mash-ups</em>.<br />
But in this situation you need developers who know how to make scripts using specific scripting languages for all the different proprietary API&#8217;s that are being supported for all the individual systems.<br />
If Linked Data was a global standard and all open and closed systems and websites supported RDF, then all these links would be available automatically to RDF enabled browser and client software, using <a id="xb-0" title="SPARQL" href="http://en.wikipedia.org/wiki/SPARQL" target="_blank">SPARQL</a>, the RDF Query Language.</p>
<ul>
<li><em>Linked Data/RDF</em> can be regarded as a <em>universal API</em>.</li>
</ul>
<p>The good thing about Linked Data is, that it is possible to use Linked Data mechanisms to link to legacy data in silo databases. You just need to provide an RDF wrapper for the legacy system, like has been done with the <a id="sx8-" title="Library of Congress Subject Headings" href="http://id.loc.gov/authorities" target="_blank">Library of Congress Subject Headings</a>.</p>
<p>Some examples of available tools for exposing legacy data as RDF:</p>
<ul>
<li><a id="fg.2" title="Triplify" href="http://triplify.org/" target="_blank">Triplify</a> &#8211; a web applications plugin that converts relational database structures into RDF triples</li>
<li><a href="http://www4.wiwiss.fu-berlin.de/bizer/d2r-server/" target="_blank">D2R Server</a> &#8211; a tool for publishing relational databases on the Semantic Web</li>
<li><a id="e2._" title="wp-rdfa" href="http://dev.squarecows.com/projects/wp-rdfa/" target="_blank">wp-RDFa</a> &#8211; a wordpress plugin that adds some RDF information about Author and Title to Wordpress blog posts</li>
</ul>
<p>Of course, RDF that is generated like this will very probably only expose objects to link TO, not links to RDF objects external to the system.</p>
<p>Also, Linked Data can be used within legacy systems, for mixing legacy and RDF data, open and closed access data, etc. In this case we have RDF triples that have a subject URI from one data source and an object URI from another data source. In a situation with interlinked systems it would for instance be possible to see that the author of a specific book (data from a library catalog) is also speaking at a specific conference (data from a conference website). Objects linked together on the web using RDF triples are also known as an &#8220;RDF graph&#8221;. With RDF-aware client software it is possible to navigate through all the links to retrieve additional information about an object.</p>
<div id="attachment_677" class="wp-caption alignnone" style="width: 631px"><img class="size-large wp-image-677" title="linkeddata" src="http://commonplace.net/wp-content/uploads/2009/06/linkeddata-1024x459.jpg" alt="Linked Data" width="621" height="278" /><p class="wp-caption-text">Linked Data</p></div>
<p><strong>URI&#8217;s</strong><br />
URI&#8217;s (&#8220;Uniform Resource Identifiers&#8221;) are necessary for uniquely identifying and linking to resources on the web. A URI is basically a string that identifies a thing or resource on the web. All &#8220;Information Resources&#8221;, or WWW pages, documents, etc. have a URI, which is commonly known as a URL (Uniform Resource Locator).</p>
<p>With Linked Data we are looking at identifying &#8220;Non-information Resources&#8221; or &#8220;real world objects&#8221; (people, concepts, things, even imaginary things), not web pages that contain information <em>about </em>these real world objects. But it is a little more complicated than that. In order to honour the requirement that a thing and its relations <em>can be interpreted and used by humans and software</em><em>,</em> we need at least 3 different representations of one resource (see: <a id="wn7l" title="How to publish Linked Data on the web" href="http://www4.wiwiss.fu-berlin.de/bizer/pub/LinkedDataTutorial/" target="_blank">How to publish Linked Data on the web</a>):</p>
<ul>
<li>Resource identifier URI (identifies the real world object, the concept, as such)</li>
<li>RDF document URI (a document readable for semantic web applications, containing the real world object&#8217;s RDF data and relationships with other objects)</li>
<li>HTML document URI (a document readable for humans, with information about the real world object)</li>
</ul>
<div id="attachment_676" class="wp-caption alignnone" style="width: 557px"><img class="size-full wp-image-676" title="rdfredir2" src="http://commonplace.net/wp-content/uploads/2009/06/rdfredir.jpg" alt="rdfredir2" width="547" height="275" /><p class="wp-caption-text">Redirection</p></div>
<p>For instance, there could be a Resource Identifier URI for a book called &#8220;<em>Cloud Atlas</em>&#8220;. The web resource at that URI can redirect an RDF enabled browser to the RDF document URI, which contains RDF data describing the book and its properties and relationships. A normal HTML web browser would be redirected to the HTML document URI, for instance a web page about the book at the publisher&#8217;s website.</p>
<p>There are several methods of redirecting browsers and application to the required representation of the resource. See <a id="fl7j" title="Cool URIs for the Semantic Web" href="http://www.w3.org/TR/2008/WD-cooluris-20080321/" target="_blank">Cool URIs for the Semantic Web</a> for technical details.</p>
<p>There are also RDF enabled browsers that transform RDF into web pages readable by humans, like the FireFox addon &#8220;<a id="akol" title="Tabulator" href="https://addons.mozilla.org/en-US/firefox/addon/5596" target="_blank">Tabulator</a>&#8220;, or the web based <a id="h6k1" title="Disco" href="http://www4.wiwiss.fu-berlin.de/bizer/ng4j/disco/" target="_blank">Disco</a> and <a id="fxjf" title="Marbles" href="http://www5.wiwiss.fu-berlin.de/marbles/" target="_blank">Marbles</a> browsers, both hosted at the Free University Berlin.</p>
<p><strong>RDF, vocabularies, ontologies</strong><br />
RDF or Resource Description Framework, is, like the name suggests, just a framework. It uses XML (or a simpler non-XML method <a href="http://en.wikipedia.org/wiki/Notation_3" target="_blank">N3</a>) to describe resources by means of relationships. RDF can be implemented in vocabularies or ontologies, which are sets of RDF classes describing objects and relationships for a given field.<br />
Basically, anybody can create an RDF vocabulary by publishing an RDF document defining the classes and properties of the vocabulary, at a URI on the web. The vocabulary can then be used in a resource by referring to the namespace (the URI) and the classes in that RDF document.</p>
<p>A nice and useful feature of RDF is that more than one vocabularies can be mixed and used in one resource.<br />
Also, a vocabulary itself can reference other vocabularies and thereby inherit well established classes and properties from other RDF documents.<br />
Another very useful feature of RDF is that objects can be linked to similar object resources describing the same real world thing. This way confusion about which object we are talking about, can be avoided.</p>
<p>A couple of existing and well used RDF vocabularies/ontologies:</p>
<ul>
<li><a id="ex1e" title="RDF" href="http://www.w3.org/1999/02/22-rdf-syntax-ns#" target="_blank">RDF</a> &#8211; the base RDF vocabulary</li>
<li><a id="g531" title="RDFS" href="http://www.w3.org/2000/01/rdf-schema#" target="_blank">RDFS</a> (for <a href="http://en.wikipedia.org/wiki/RDFS" target="_blank">RDF Schema</a>)</li>
<li><a href="http://purl.org/dc/elements/1.1/" target="_blank">DC</a> (for <a href="http://dublincore.org/" target="_blank">Dublin Core</a>)</li>
<li><a id="eaah" title="FOAF" href="http://xmlns.com/foaf/0.1/" target="_blank">FOAF</a> (for <a href="http://www.foaf-project.org/" target="_blank">FOAF- Friend of a Friend</a>) &#8211; online identities and social networks</li>
<li><a id="htt6" title="SKOS" href="http://www.w3.org/2004/02/skos/core#" target="_blank">SKOS</a> (for <a href="http://www.w3.org/2004/02/skos/" target="_blank">SKOS &#8211; Simple Knowledge Organisation System</a>) &#8211; thesauri, classification schemes, subject heading systems and  	taxonomies</li>
<li><a id="xitg" title="OWL" href="http://www.w3.org/2002/07/owl#" target="_blank">OWL</a> (for<a href="http://en.wikipedia.org/wiki/Web_Ontology_Language" target="_blank"> OWL -Ontology Web Language</a>)</li>
</ul>
<p>(<em>By the way,  the links in the first column (to the RDF files themselves) may act as an illustration of the redirection mechanism described before. Some of them may link to either the RDF file with the vocabulary definition itself, or to a page <strong>about </strong>the vocabulary, depending on the type of browser you use: rdf-aware or not</em>.)</p>
<p>A special case is:</p>
<ul>
<li><a id="zitw" title="RDFa" href="http://www.w3.org/TR/xhtml-rdfa-primer/" target="_blank">RDFa</a> &#8211; a sort of microformat without a vocabulary of its own, which relies on other vocabularies for turning XHTML page attributes into RDF</li>
</ul>
<p><em><strong>Example</strong></em><br />
A shortened example for &#8220;<a href="http://www.worldcat.org/oclc/53919721" target="_blank">Cloud Atlas</a>&#8221; by David Mitchell from the <a id="im:j" title="RDF BookMashUp" href="http://www4.wiwiss.fu-berlin.de/bizer/bookmashup/index.html" target="_blank">RDF BookMashup</a> at the Free University Berlin, which uses a number of different vocabularies:</p>
<div style="margin-left: 40px;"><span style="color: #0000ff;">&lt;?xml version=&#8221;1.0&#8243; encoding=&#8221;UTF-8&#8243; ?&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;rdf:RDF</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> xmlns:rdf=&#8221;http://www.w3.org/1999/02/22-rdf-syntax-ns#&#8221;</span><br style="color: #0000ff;" /><span style="color: #0000ff;">&#8230;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> xmlns:skos=&#8221;http://www.w3.org/2004/02/skos/core#&#8221;&gt;</span><br style="color: #0000ff;" /><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;rdf:Description rdf:about=&#8221;http://www4.wiwiss.fu-berlin.de/bookmashup/books/0375507256&#8243;&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;rev:hasReview rdf:resource=&#8221;http://www4.wiwiss.fu-berlin.de/bookmashup/reviews/0375507256_EditorialReview1&#8243;/&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;dc:creator rdf:resource=&#8221;http://www4.wiwiss.fu-berlin.de/bookmashup/persons/David+Mitchell&#8221;/&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;dc:format&gt;Paperback&lt;/dc:format&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;dc:identifier rdf:resource=&#8221;urn:ISBN:0375507256&#8243;/&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;dc:publisher&gt;Random House Trade Paperbacks&lt;/dc:publisher&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;dc:title&gt;Cloud Atlas: A Novel&lt;/dc:title&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;/rdf:Description&gt;</span><br style="color: #0000ff;" /><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;scom:Book rdf:about=&#8221;http://www4.wiwiss.fu-berlin.de/bookmashup/books/0375507256&#8243;&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;rdfs:label&gt;Cloud Atlas: A Novel&lt;/rdfs:label&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;skos:subject rdf:resource=&#8221;http://www4.wiwiss.fu-berlin.de/bookmashup/subject/Fantasy+fiction&#8221;/&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;skos:subject rdf:resource=&#8221;http://www4.wiwiss.fu-berlin.de/bookmashup/subject/Fate+and+fatalism&#8221;/&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;">&#8230;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;foaf:depiction rdf:resource=&#8221;http://ecx.images-amazon.com/images/I/51MIVHgJP%2BL.jpg&#8221;/&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;foaf:thumbnail rdf:resource=&#8221;http://ecx.images-amazon.com/images/I/51MIVHgJP%2BL._SL75_.jpg&#8221;/&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;/scom:Book&gt;</span><br style="color: #0000ff;" /><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;rdf:Description rdf:about=&#8221;http://www4.wiwiss.fu-berlin.de/bookmashup/doc/books/0375507256&#8243;&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;dc:license rdf:resource=&#8221;http://www.amazon.com/AWS-License-home-page-Money/b/ref=sc_fe_c_0_12738641_12/102-8791790-9885755?ie=UTF8&amp;amp;node=3440661&amp;amp;no=12738641&amp;amp;me=A36L942TSJ2AJA&#8221;/&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;dc:license rdf:resource=&#8221;http://www.google.com/terms_of_service.html&#8221;/&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;/rdf:Description&gt;</span><br style="color: #0000ff;" /><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;foaf:Document rdf:about=&#8221;http://www4.wiwiss.fu-berlin.de/bookmashup/doc/books/0375507256&#8243;&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;rdfs:label&gt;RDF document about the book: Cloud Atlas: A Novel&lt;/rdfs:label&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;foaf:maker rdf:resource=&#8221;http://www4.wiwiss.fu-berlin.de/is-group/resource/projects/Project10&#8243;/&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;foaf:primaryTopic rdf:resource=&#8221;http://www4.wiwiss.fu-berlin.de/bookmashup/books/0375507256&#8243;/&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;/foaf:Document&gt;</span><br style="color: #0000ff;" /><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;rdf:Description rdf:about=&#8221;http://www4.wiwiss.fu-berlin.de/bookmashup/persons/David+Mitchell&#8221;&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;rdfs:label&gt;David Mitchell&lt;/rdfs:label&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;/rdf:Description&gt;</span><br style="color: #0000ff;" /><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;rdf:Description rdf:about=&#8221;http://www4.wiwiss.fu-berlin.de/bookmashup/reviews/0375507256_EditorialReview1&#8243;&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;rdfs:label&gt;Review number 1 about: Cloud Atlas: A Novel&lt;/rdfs:label&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;/rdf:Description&gt;</span><br style="color: #0000ff;" /><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;rdf:Description rdf:about=&#8221;http://www4.wiwiss.fu-berlin.de/is-group/resource/projects/Project10&#8243;&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;"> &lt;rdfs:label&gt;RDF Book Mashup&lt;/rdfs:label&gt;</span><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;/rdf:Description&gt;</span><br style="color: #0000ff;" /><br style="color: #0000ff;" /><span style="color: #0000ff;">&lt;/rdf:RDF&gt;</span></div>
<div style="margin-left: 40px;"><span style="color: #0000ff;"><br />
</span></div>
<p><em>A partial view on this RDF file with the Marbles browser:<br />
</em></p>
<div id="attachment_637" class="wp-caption alignnone" style="width: 646px"><a href="http://www5.wiwiss.fu-berlin.de/marbles/?uri=http%3A%2F%2Fwww4.wiwiss.fu-berlin.de%2Fbookmashup%2Fbooks%2F0375507256" target="_blank"><img class="size-full wp-image-637" title="cloudatlas" src="http://commonplace.net/wp-content/uploads/2009/06/cloudatlas.jpg" alt="RDF browser view" width="636" height="395" /></a><p class="wp-caption-text">RDF browser view</p></div>
<p>See also <a href="http://www4.wiwiss.fu-berlin.de/rdf_browser/?browse_uri=http%3A%2F%2Fwww4.wiwiss.fu-berlin.de%2Fbookmashup%2Fbooks%2F0375507256" target="_blank">the same example in the Disco RDF browser</a>.</p>
<p><strong> Library implementations</strong><br />
It seems obvious that Linked Data can be very useful in providing a generic infrastructure for linking data, metadata and objects, available in numerous types of data stores, in the online library world. With such a networked online data structure, it would be fairly easy to create all kinds of discovery interfaces for bibliographic data and objects. Moreover, it would also be possible to link to non-bibliographic data that might interest the users of these interfaces.</p>
<p>A brief and incomplete list of some library related Linked Data projects, some of which already mentioned above:</p>
<ul>
<li><a id="o2bg" title="RDF BookMashup" href="http://www4.wiwiss.fu-berlin.de/bizer/bookmashup" target="_blank">RDF BookMashup</a> &#8211; Integration of Web 2.0 data sources like Amazon, Google  or Yahoo into the Semantic Web.</li>
<li><a id="eea6" title="Library of Congress Authorities" href="http://id.loc.gov/authorities" target="_blank">Library of Congress Authorities</a> &#8211; Exposing LoC Autorities and Vocabularies to the web using URI&#8217;s</li>
<li><a id="dn58" title="DBPedia" href="http://dbpedia.org/" target="_blank">DBPedia</a> &#8211; Exposing structured data from WikiPedia to the web</li>
<li><a id="ld7z" title="LIBRIS" href="http://blog.libris.kb.se/semweb/?p=7" target="_blank">LIBRIS</a> &#8211; Linked Data interface to Swedish LIBRIS Union catalog</li>
<li><a id="z5o1" title="Scriblio+Wordpress+Triplify" href="http://mashedlibrary.ning.com/forum/topics/scriblio-wordpressmu" target="_blank">Scriblio+Wordpress+Triplify</a> &#8211; &#8220;A social, semantic OPAC Union Catalogue&#8221;</li>
</ul>
<p>And what about <em><a id="ump4" title="MARC" href="http://www.loc.gov/marc/" target="_blank">MARC</a></em>, <em><a id="gfvr" title="AACR2" href="http://www.aacr2.org/" target="_blank">AACR2</a> </em>and <em><a id="y6n1" title="RDA" href="http://www.rdaonline.org/" target="_blank">RDA</a></em>? Is there a role for them in the <em>Linked Data</em> environment? <em>RDA </em>is supposed to be the successor of <em>AACR2 </em>as a content standard that can be used with <em>MARC</em>, but also with other encoding standards like <em><a id="gfym" title="MODS" href="http://www.loc.gov/standards/mods/" target="_blank">MODS</a></em> or <em><a id="f-0q" title="Dublin Core" href="http://dublincore.org/" target="_blank">Dublin Core</a></em>.<br />
The <a id="mzgj" title="RDA Entity Relationship Diagram" href="http://www.rdaonline.org/ERDiagramRDA_24June2008.pdf" target="_blank">RDA Entity Relationship Diagram</a>, that incorporates <a id="toc5" title="FRBR" href="http://www.frbr.org/" target="_blank">FRBR</a> as well, can of course easily be implemented as an RDF vocabulary, that could be used to create a universal Linked Data library network. It really does not matter what kind of internal data format the connected systems use.</div>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fcommonplace.net%2F2009%2F06%2Flinked-data-for-libraries%2F&amp;linkname=Linked%20Data%20for%20Libraries"><img src="http://commonplace.net/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://commonplace.net/2009/06/linked-data-for-libraries/feed/</wfw:commentRss>
		<slash:comments>22</slash:comments>
		</item>
		<item>
		<title>No future for libraries?</title>
		<link>http://commonplace.net/2009/05/no-future-for-libraries/</link>
		<comments>http://commonplace.net/2009/05/no-future-for-libraries/#comments</comments>
		<pubDate>Sun, 24 May 2009 22:02:36 +0000</pubDate>
		<dc:creator>Lukas Koster</dc:creator>
				<category><![CDATA[Library2.0]]></category>
		<category><![CDATA[catalog]]></category>
		<category><![CDATA[collection]]></category>
		<category><![CDATA[communities]]></category>
		<category><![CDATA[future]]></category>
		<category><![CDATA[library]]></category>
		<category><![CDATA[next generation]]></category>
		<category><![CDATA[open stack]]></category>
		<category><![CDATA[people]]></category>

		<guid isPermaLink="false">http://commonplace.net/?p=536</guid>
		<description><![CDATA[Will library buildings and library catalogs survive the web?

Some weeks ago a couple of issues appeared in the twitter/blogosphere (or at least MY twitter/blogoshere) related to the future of the library in this digital era.

There was the Espresso book machine that prints books on demand on location, which led to questions like: &#8220;apart from influencing [...]]]></description>
			<content:encoded><![CDATA[<p><em><strong>Will library buildings and library catalogs survive the web?</strong></em></p>
<p><em><strong></strong></em></p>
<div id="attachment_547" class="wp-caption alignright" style="width: 375px"><a href="http://www.flickr.com/photos/moqub/3473256400/" target="_blank"><img class="size-medium wp-image-547" title="creatingfuturelibraries" src="http://commonplace.net/wp-content/uploads/2009/05/creatingfuturelibraries-300x199.jpg" alt="© Moqub" width="365" height="242" /></a><p class="wp-caption-text">© Moqub</p></div>
<p>Some weeks ago a couple of issues appeared in the twitter/blogosphere (or at least <em><strong>MY </strong></em>twitter/blogoshere) related to the future of the library in this digital era.</p>
<ul>
<li>There was the <a id="pv0." title="Espresso book machine" href="http://www.guardian.co.uk/books/2009/apr/24/espresso-book-machine-launches" target="_blank">Espresso book machine</a> that prints books on demand on location, which led to questions like: &#8220;<em>apart from influencing publishing and book shops, what does this mean for libraries?</em>&#8220;.</li>
<li>There was a Twitter discussion about &#8220;<em>will we still need library buildings?</em>&#8220;.</li>
<li>There was another <a id="cn2m" title="blog post about the future of library catalogs" href="http://www.zbdigitaal.nl/2009/04/lastige-materie-de-toekomst-van.html" target="_blank">blog post about the future of library catalogs</a> by Edwin Mijnsbergen (in Dutch) that asked the question of the value of library catalogs in relation to web2.0 and the new emerging semantic web.</li>
</ul>
<p>This made me start thinking about a question that concerns us all: <em>is there a future for the library as we know it?</em></p>
<p>To begin with, what is a library anyway?</p>
<p>For ages, since the beginning of history, up until some 15 years ago, a library was an institution characterised by:</p>
<div id="attachment_545" class="wp-caption alignright" style="width: 211px"><a href="http://www.flickr.com/photos/mihaibojin/3472311814/" target="_blank"><img class="size-medium wp-image-545" title="melklibrarystairs" src="http://commonplace.net/wp-content/uploads/2009/05/melklibrarystairs-201x300.jpg" alt="© Mihai Bojin" width="201" height="300" /></a><p class="wp-caption-text">© Mihai Bojin</p></div>
<ul>
<li>a physical <em><strong>collection </strong></em>of printed and handwritten material</li>
<li>a physical <em><strong>location</strong></em>, a building, to store the collection</li>
<li>a physical printed or handwritten on site <em><strong>catalog</strong></em></li>
<li>on location <em><strong>searching </strong></em>and finding of information sources using the catalog</li>
<li>on site requesting, <em><strong>delivery</strong></em>, reading, lending and returning of material</li>
<li>a staff of trained <em><strong>librarians </strong></em>to catalog the collection and assist patrons</li>
</ul>
<p>The central concept here is of course the <strong>collection</strong>. That is the &#8220;<em>raison d&#8217;être</em>&#8221; of a library. The purpose of library building, catalog and librarians is to give people access to the collection, and provide them with the information they need.</p>
<p>Clearly, because of the physical nature of the collection and the information transmission process the library needed to be a building with collection and catalog inside it. People had to go there to find and get the publications they needed.</p>
<p>If collections and the transmission of information were completely digital, then the reason for a physical location to go to for finding and getting publications would not exist anymore. Currently one of these conditions has been met fully and the other one partly. The transmission of information can take place in a completely digital way. Most new scientific publications are born digital (e-Journals, e-Books), and a large number of digitisation projects are taking care of making digital copies of existing print material.<br />
Searching for items in a library&#8217;s collection is already taking place remotely through OPACs and other online tools almost everywhere. A large part of these collections can be accessed digitally. Only in case a patron wants to read or borrow a printed book or journal, he or she has to go the library building to fetch it.</p>
<p>All this seems to lead to the conclusion that the library may be slowly moving away from a physical presence to a digital one.</p>
<p>But there is something else to be considered here, that reaches beyond the limits of one library. In my view the crucial notion here is again the <strong><em>collection</em></strong>.<br />
In my post <a id="tb:q" title="Collection 2.0" href="../2009/02/collection-20/">Collection 2.0</a> I argue that in this digital information age <strong><em>a library&#8217;s collection is everything a library has access to</em></strong> as opposed to the old concept of <em><strong>everything a library owns</strong></em>. This means in theory that every library could have access to the same digital objects of information available on the web, but also to each other&#8217;s print objects through ILL. There will be no physically limited collection only available in one library anymore, just one large global collection.</p>
<p>In this case, there is not only no need for people to go to a specific library for an item in its collection, but also there is no need to search for items using a specific library&#8217;s catalog.</p>
<p>Now you may say that people like going to a library building and browse through the stacks. That may still be true for some, but in general, as I argue in my post &#8220;<a id="nfix" title="Open Stack 2.0" href="../2008/11/open-stack-20/">Open Stack 2.0</a>&#8220;, the new Open Stack is the Web.</p>
<div id="attachment_558" class="wp-caption alignleft" style="width: 397px"><a href="http://www.flickr.com/photos/nengard/464992353/" target="_blank"><img class="size-large wp-image-558" title="catalognofuturelarge" src="http://commonplace.net/wp-content/uploads/2009/05/catalognofuturelarge-1024x768.jpg" alt="© Nicole C. Engard" width="387" height="290" /></a><p class="wp-caption-text">© Nicole C. Engard</p></div>
<p>In the future there will be collections, but not physical ones (except of course for the existing ones with items that are not allowed to leave the library location). We will see virtual subject collections, determined by classifications and keywords assigned both by professionals and non-professionals.</p>
<p>On a parallel level there will be virtual catalogs, which are views on virtual collections defined by subjects on different levels and in different locations: global, local, subject-oriented, etc. These virtual collections and catalogs will be determined and maintained by a great number of different groups of people and institutions (commercial and non-commercial). One of these groups can still be a library. As <a id="rysi" title="Patrick Vanhoucke observed on Twitter" href="http://twitter.com/librarianbe/status/1642391366" target="_blank">Patrick Vanhoucke observed on Twitter</a> (in Dutch): &#8220;<em>We have to let go of the idea of the library as a building; the &#8216;library&#8217; is the network of librarians</em>&#8220;. These virtual groups of people may be identical to what is getting known more and more as &#8220;<em><a id="sdfq" title="tribes" href="http://tametheweb.com/2009/05/17/ten-ways-to-encourage-the-tribe/" target="_blank">tribes</a></em>&#8220;.</p>
<p>Having said all this, of course there will still be occurrences of libraries as buildings and as physical locations for collections. Institutions like the Library of Congress will not just vanish into thin air. Even if all print items have been digitised, print items will still be wanted for a number of reasons: research, art, among others. Libraries can have different functions, like archives, museums, etc. and still be named &#8220;libraries&#8221; too.<br />
Library buildings can transform into other types of locations: in universities they can become meeting places and study facilities, including free wifi and Starbucks coffee. Public libraries can shift focus to becoming centres of discovery and (educational) gaming. Anything is possible.</p>
<p>It&#8217;s obvious that libraries obey the same laws of historical development as any other social institution or phenomenon. The way that information is found and processed is determined, or at least influenced, by the status of technological development. And I am not saying that all development is technology driven! This is not the place for a philosophy on history, economics and society.</p>
<p>Some historical parallels to illustrate the situation that libraries are facing:</p>
<ul>
<li><em><strong>writing</strong></em>:<em> inscribing clay tablets &gt; scratching ink on paper &gt; printing (multiplication, re-usability) &gt; typewriter &gt; computer/printer (digital multiplication and re-usability!) &gt; digital only (computer files, blogs, e-journal, e-books)</em></li>
<li><em><strong>consumption of music</strong></em>: <em>attending live performance on location &gt; listening to radio broadcast &gt; playing purchased recordings (vinyl, cassettes, cd, dvd) &gt;  make home recordings &gt;  play digital music with mp3/personal audio &gt;  listen to digital music online</em></li>
</ul>
<p>From these examples it&#8217;s perfectly clear that new developments do not automatically make the old ways disappear! Prevailing practices can coexist with &#8220;outdated&#8221; ways of doing things. Libraries may still have a future.</p>
<p>In the end it comes down to these questions:</p>
<ul>
<li>Will libraries cease to exist, simply because they no longer serve the purpose of providing access to information?</li>
<li>Are libraries engaged in a rear guard fight?</li>
<li>Will libraries become tourist attractions?</li>
<li>Will libraries adapt to the changing world and shift focus to serve other, related purposes?</li>
<li>Are professional librarian skills useful in a digital information world?</li>
</ul>
<p>I do not know what will happen with libraries. What do you think?</p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fcommonplace.net%2F2009%2F05%2Fno-future-for-libraries%2F&amp;linkname=No%20future%20for%20libraries%3F"><img src="http://commonplace.net/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://commonplace.net/2009/05/no-future-for-libraries/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>Who needs MARC?</title>
		<link>http://commonplace.net/2009/05/who-needs-marc/</link>
		<comments>http://commonplace.net/2009/05/who-needs-marc/#comments</comments>
		<pubDate>Fri, 15 May 2009 11:09:15 +0000</pubDate>
		<dc:creator>Lukas Koster</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[aarc2]]></category>
		<category><![CDATA[authority files]]></category>
		<category><![CDATA[bibliographic formats]]></category>
		<category><![CDATA[dublin core]]></category>
		<category><![CDATA[exlibris]]></category>
		<category><![CDATA[formats]]></category>
		<category><![CDATA[library systems]]></category>
		<category><![CDATA[mab]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[metalib]]></category>
		<category><![CDATA[openurl]]></category>
		<category><![CDATA[pica+]]></category>
		<category><![CDATA[rda]]></category>
		<category><![CDATA[sfx]]></category>
		<category><![CDATA[sru]]></category>
		<category><![CDATA[srw]]></category>

		<guid isPermaLink="false">http://commonplace.net/?p=502</guid>
		<description><![CDATA[Why use a non-normalised metadata exchange format for suboptimal data storage?
 This week I had a nice chat with André Keyzer of Groningen University library and Peter van Boheemen of Wageningen University Library who attended OCLC&#8217;s Amsterdam Mashathon 2009. As can be expected from library technology geeks, we got talking about bibliographic metadata formats, very [...]]]></description>
			<content:encoded><![CDATA[<p><em><strong>Why use a non-normalised metadata exchange format for suboptimal data storage?</strong></em></p>
<div id="attachment_515" class="wp-caption alignright" style="width: 375px"><a href="http://www.flickr.com/photos/yolaleah/2243898496/" target="_blank"><img class="size-full wp-image-515" title="Catalog card" src="http://commonplace.net/wp-content/uploads/2009/05/catalogcard.jpg" alt="Catalog card" width="365" height="274" /></a><p class="wp-caption-text">© leah the librarian</p></div>
<p><em><strong> </strong></em>This week I had a nice chat with <a id="btrc" title="André Keyzer" href="http://twitter.com/AndreKeyzer" target="_blank">André Keyzer</a> of <a id="cdvh" title="Groningen University library" href="http://www.rug.nl/bibliotheek" target="_blank">Groningen University library</a> and <a id="sprl" title="Peter van Boheemen" href="http://twitter.com/PetervBoheemen" target="_blank">Peter van Boheemen</a> of <a id="v.06" title="Wageningen University Library" href="http://library.wur.nl/" target="_blank">Wageningen University Library</a> who attended <a id="zzks" title="OCLC's Amsterdam Mashathon 2009" href="http://worldcat.org/devnet/wiki/2009EUMashathon" target="_blank">OCLC&#8217;s Amsterdam Mashathon 2009</a>. As can be expected from library technology geeks, we got talking about bibliographic metadata formats, very exciting of course. The question came up: what on earth could be the reason for storing bibliographic metadata in exchange formats like <a id="w7l3" title="MARC" href="http://www.loc.gov/marc/" target="_blank">MARC</a>?</p>
<p>Being asked once at an <a id="zkk0" title="ELAG" href="http://elag.org/" target="_blank">ELAG</a> conference about the bibliographic format Wageningen University was using in their <a id="ag2e" title="home grown catalog system" href="http://library.wur.nl/desktop/catalog/" target="_blank">home grown catalog system</a>, Peter answered: &#8220;<em><strong>WDC</strong></em>&#8221; &#8230;.&#8221;<em>we don&#8217;t care</em>&#8220;.</p>
<p>Exactly my idea! As a matter of fact I think I may have used the same words a couple of times in recent years, probably even at <a id="kw4h" title="ELAG2008" href="http://library.wur.nl/elag2008/" target="_blank">ELAG2008</a>. The thing is: it really does not matter how you store bibliographic metadata in your database, as long as you can present and exchange the data in any format requested, be it <em>MARC </em>or <a id="ce0g" title="Dublin Core" href="http://dublincore.org/" target="_blank">Dublin Core</a> or anything else.</p>
<p>Of course the importance of using internationally accepted standards is beyond doubt, but there clearly exists widespread misunderstanding of the functions of certain standards, like for instance <em>MARC</em>. <em>MARC </em>is NOT a data storage format. In my opinion <em>MARC </em>is not even an exchange format, but merely a presentation format.</p>
<div id="attachment_523" class="wp-caption alignleft" style="width: 150px"><img class="size-medium wp-image-523" title="St. Marc Express" src="http://commonplace.net/wp-content/uploads/2009/05/afb101-225x300.jpg" alt="St. Marc Express" width="140" height="188" /><p class="wp-caption-text">St. Marc Express</p></div>
<p>With a background and experience in data modeling, database and systems design (among others), I was quite amazed about bibliographic metadata formats when I started working with library systems in libraries, not having a librarian training at all. Of course, <em>MARC </em>(&#8220;<em><strong>MA</strong>chine <strong>R</strong>eadable <strong>C</strong>ataloging record</em>&#8220;) was invented as a standard in order to facilitate exchange of library catalog records in a digital era.<br />
But I think MARC was invented by old school cataloguers who did not have a clue about data normalisation at all. A MARC record, especially if it corresponds to an official set of cataloging rules like <a id="nhba" title="AARC2" href="http://www.aacr2.org/" target="_blank">AARC2</a>, is nothing more than a digitised printed catalog card.</p>
<p>In pre-computer times it made perfect sense to have a standardised uniform way of registering bibliographic metadata on a printed card in this way. The catalog card was simultaneously used as a medium for presenting AND storing metadata. This is where the confusion originates from!</p>
<div id="attachment_526" class="wp-caption aligncenter" style="width: 616px"><img class="size-full wp-image-526" title="MARC record" src="http://commonplace.net/wp-content/uploads/2009/05/marc.jpg" alt="MARC record" width="606" height="204" /><p class="wp-caption-text">MARC record</p></div>
<p>But when the <a id="w-2l" title="Library of Congress says" href="http://www.loc.gov/marc/umb/um01to06.html#part2" target="_blank">Library of Congress says</a> &#8220;<em>If a library were to develop a &#8220;home-grown&#8221;   system that did not use MARC records, it would not be taking advantage   of an industry-wide standard whose primary purpose is to foster communication   of information</em>&#8221; it is saying just plain nonsense.<br />
Actually it is better NOT to use something like <em>MARC </em>for other purposes than exchanging, or better, presenting data. To illustrate this I will give two examples of MARC tags that have been annoying me since my first day as a library employee:</p>
<ul>
<li><em><a id="l4ri" title="100 - Main Entry-Personal Name (NR) - subfield $a - Personal name (NR)" href="http://www.loc.gov/marc/bibliographic/bd100.html" target="_blank">100 &#8211; Main Entry-Personal Name (NR) &#8211; subfield $a &#8211; Personal name (NR)</a> </em></li>
<li><em><a id="ax75" title="773 - Host Item Entry (R) - subfield $g - Relationship information (R)" href="http://www.loc.gov/marc/bibliographic/bd773.html" target="_blank">773 &#8211; Host Item Entry (R) &#8211; subfield $g &#8211; Relationship information (R)</a> </em></li>
</ul>
<p><strong>100 &#8211; Main Entry-Personal Name</strong><br />
Besides storing an author&#8217;s name as a string in each individual bibliographic record instead of using a code, linking to a central authority table (&#8220;foreign key&#8221; in relational database terms), it is also a mistake to use a person&#8217;s name as one complete string in one field. Examples on the Library of Congress MARC website use forms like &#8220;Adams, Henry&#8221;, &#8220;Fowler, T. M.&#8221; and &#8220;Blackbeard, Author of&#8221;. To take only the simple first example, this author could also be registered as &#8220;Henry Adams&#8221;, &#8220;Adams, H.&#8221;, &#8220;H. Adams&#8221;. And don&#8217;t say that these forms are not according to the rules! They are out there! There is no way to match these variations as being actually one and the same.<br />
In a normalised relational database, this <em>subfield $a</em> would be stored something like this (simplified!):</p>
<ul>
<li><em><strong>P</strong><strong>erson </strong></em>
<ul>
<li><em><strong>Surname=Adams</strong></em></li>
<li><em><strong>First name=Henry</strong></em></li>
<li><em><strong>Prefix=</strong></em></li>
<li><em><strong>&#8230;</strong></em></li>
</ul>
</li>
</ul>
<p><strong>773 &#8211; Host Item Entry<br />
</strong>Subfield $g of this MARC tag is used for storing citation information for a journal article, volume, issue, year, start page, end page, all in one string, like: &#8220;<em>Vol. 2, no. 2 (Feb. 1976), p. 195-230</em>&#8220;. Again I have seen this used in many different ways. In a normalised format this would look something like this, using only the actual values:</p>
<ul>
<li><em><strong>Journal </strong></em>
<ul>
<li><em><strong>Volume=2</strong></em></li>
<li><em><strong>Issue=2</strong></em></li>
<li><em><strong>Year=1976</strong></em></li>
<li><em><strong>Month=2</strong></em></li>
<li><em><strong>Day=</strong></em></li>
<li><em><strong>Start page=195</strong></em></li>
<li><em><strong>End page=230</strong></em></li>
</ul>
</li>
</ul>
<p>In a presentation of this normalised data record extra text can be added like &#8220;<em>Vol.</em>&#8221; or &#8220;<em>Volume</em>&#8220;, &#8220;<em>Issue</em>&#8221; or &#8220;<em>No.</em>&#8220;, brackets, replacing codes by descriptions (Month 2 = Feb.)  etc., according to the format required. So the stored values could be used to generate the text &#8220;<em>Vol. 2, no. 2 (Feb. 1976), p. 195-230</em>&#8221; on the fly, but also for instance &#8220;<em>Volume 2, Issue 2, dated February 1976, pages 195-230</em>&#8220;.</p>
<p>The strange thing with this bibliographic format aimed at exchanging metadata is that it actually makes metadata exchange terribly complicated, especially with these two tags <em><strong>Author</strong> </em>and <em><strong>Host Item</strong></em>. I can illustrate this with describing the way this exchange is handled between two digital library tools we use at the Library of the University of Amsterdam, <a id="vk8x" title="MetaLib" href="http://www.exlibrisgroup.com/category/MetaLibOverview" target="_blank">MetaLib</a> and <a id="l_3p" title="SFX" href="http://www.exlibrisgroup.com/category/SFXOverview" target="_blank">SFX</a> , both from the same vendor, <a id="p_oj" title="Ex Libris" href="http://exlibrisgroup.com/" target="_blank">Ex Libris</a>.</p>
<p>The metasearch tool <em>MetaLib </em>is using the described and preferred mechanism of on the fly conversion of received external metadata from any format to MARC for the purpose of presentation.<br />
But if we want to use the retrieved record to link to for instance a full text article using the <em>SFX</em> link resolver, the generated MARC data is used as a source and the non-normalised data in the 100 and 773 MARC tags has to be converted to the <a id="ju_e" title="OpenURL" href="http://en.wikipedia.org/wiki/OpenURL" target="_blank">OpenURL</a> format, which is actually normalised (example in simple <em>OpenUrl </em>0.1):</p>
<pre id="line11"><strong><em><span class="comment">isbn=;issn=0927-3255;date=1976;
volume=2;issue=2;spage=195;epage=230;
aulast=Adams;aufirst=Henry;</span><span class="comment">auinit=;</span></em></strong></pre>
<p>In order to do this all kinds of regular expressions and scripting functions are needed to extract the correct values from the MARC author and citation strings. Wouldn&#8217;t it be convenient, if the record in <em>MetaLib </em>would already have been in <em>OpenURL </em>or any other normalised format?</p>
<p>The point I am trying to make is of course that<em><strong> it does not matter how metadata is stored</strong></em>, as long as it is possible to get the data out of the database in any format appropriate for the occasion. The <a id="nnzr" title="SRU/SRW" href="http://www.loc.gov/standards/sru/" target="_blank">SRU/SRW</a> protocol is particularly aimed at precisely this: getting data out of a database in the required format, like MARC, Dublin Core, or anything else. An SRU server is a piece of middleware that receives requests, gets the requested data, converts the data and then returns the data in the requested format.</p>
<p>Currently at the Library of the University of Amsterdam we are <a id="ngmc" title="migrating our ILS" href="../2009/04/replacing-our-ils-business-as-usual/">migrating our ILS</a> which also involves converting our data from one bibliographic metadata format (<a href="http://www.gbv.de/wikis/cls/PICAplus_in_XML" target="_blank"><em>PICA+</em></a>) to another (<em>MARC</em>). This is extremely complicated, especially because of the non-normalised structure of both formats. And I must say that in my opinion<em> PICA+</em> is even the better one.<br />
Also all German and Austrian libraries are meant to <a id="qz:8" title="migrate from the MAB format to MARC" href="http://www.d-nb.de/eng/standardisierung/formate/marc21.htm" target="_blank">migrate from the MAB format to MARC</a>, which also seems to be a move away from a superior format.<br />
All because of the need to adhere to international standards, but with the wrong solution.</p>
<p>Maybe the projected <span class="style14">new standard for resource description and access <a id="b9b7" title="RDA" href="http://www.rdaonline.org/ERDiagramRDA_24June2008.pdf" target="_blank">RDA</a> will be the solution, but that may take a while yet.</span></p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fcommonplace.net%2F2009%2F05%2Fwho-needs-marc%2F&amp;linkname=Who%20needs%20MARC%3F"><img src="http://commonplace.net/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://commonplace.net/2009/05/who-needs-marc/feed/</wfw:commentRss>
		<slash:comments>26</slash:comments>
		</item>
	</channel>
</rss>
