'Knowledge organization' seems a slightly quaint term now, but we don't have a better in general use. Take the catalogue. This has been a knowledge organization tool. When an item is added, the goal is that it is related to the network of knowledge that is represented in the catalogue. In theory, this is achieved through 'adjacency' and cross reference, notably with reference to authors, subjects and works. In practice this has worked variably well.

In parallel with bibliographic data, the library community, notably national libraries, have developed 'authorities' for authors and subjects to facilitate this structure. From our current vantage point, I think we can see three stages in the development of these tools.

1. Label. In the first, subject and name authorities provide lists from which values for relevant fields were chosen. Effectively, they constrain the range and format of subject or name data, providing an agreed text label for a concept or name. Examples are LCSH, Dewey, and the Library of Congress Name Authority File. These provide some structuring devices for local catalogues, but those systems do not exploit the full structure of the authority systems from which the values are taken. Think of what is done, or not done, with classification for example. The classification system may not be used to provide interesting navigation options in the local system, and more than likely is not connected back to the fuller structure of the source scheme. That said, having a consistent label is an advantage, and facilitates matching within and between systems.

2. Data. The second stage is that these authority systems are being considered as resources in themselves, and not just as sources of controlled values for bibliographic description. So, we are seeing the Library of Congress, for example, making LCSH and the Name Authority File available as linked data. OCLC is working with a group of national libraries to synthesize name authority files and make them available as an integrated resource in the VIAF service. FAST has recently been made available in this way. The Digital Author Identifier, a national Dutch system for identifying researchers, is interesting in this context. In this arrangement, there is collaboration between the apparatus for uniquely identifying researchers and the national authority file.

3. Network. In a third stage, as these network-level resources become more richly linkable and as local environments exploit that linking ability it becomes possible to do more. This type of linking has only just begun though, and it will be interesting to see how it develops. In this context, a URI is added to the label, making it actionable and globally unique. As an example, think again of the catalogue. The structuring devices we employ are about structuring relationships *within* the catalogue. This would be turned inside out if we not only imported values, but also linked those labels to those external resources. In this way, the item represented could be re-placed in the broad network of knowledge established by the authority file from which it comes.

Of course, alongside this, they may also link to, or draw data from, other navigational, contextual, identifying or structuring resources such as DBpedia, MusicBrainz or Geonames. These and other reference points are likely to be important webscale identity and knowledge organization services. In a sense, more generally, this has already happened, as people orient themselves by links to Wikipedia, MusicBrainz, IMDB and other network level resources.

As in other areas of our activity, we need to think about how activities whose natural level was once local are now moving up to the network level. And once they are at the network level, they have to live alongside other approaches.

If this were to become more common, there are some implications ...

From records to entities ... we ship data around in 'records', bundles about individual items, and our systems are structured around managing these records. We do not tend to manage data about other things of interest to us to the same extent: authors, places, people, concepts, works, and so on, the types of things we have in authority files. What would happen if we more clearly described an item by linking it to these files? More generally, we can see stronger interest emerging in some of these other entities, personal names especially. Think for example of how Amazon has created people pages or the growing interest in researcher identification. Or of places, as geolocation services take hold. Freebase is creating an 'entity graph' giving IDs to millions of entities (people, places and things).

Much of the library linked data discussion has been about making that local record-based data available in different ways. As interesting is the discussion about what key resources libraries will want to link to, and how they might be sustained. An important question for national libraries and others who manage some of the schemes mentioned above is how to move into this third phase. What would this mean for library systems or for library data of this type? What resources are important? How should they be sustained? To make this concrete, are the name authority files maintained by national libraries fit for purpose in a network world? Does it make sense to limit their scope to authors identified in a particular library workflow, cataloging, and exclude other authors (of articles, for example)? Does it make sense to limit their creation to a restricted group of specialist librarians? And so on ...

Finally, as knowledge organization moves to the network level how do library resource relate to others. Can other services leverage the accumulated investment of the library community, or does it fade. The organized relationship between the Deutsche Nationalbibliothek and Wikipedia in Germany is an interesting example here, where the German Wikipedia explicitly takes advantage of the structuring work done by the DNB. Wikipedia itself is very interesting in this regard, as it has effectively become an 'addressible knowledgebase'. If I want to tell you about a new concept or movement, or refer you to a place, or mention a person, I can send you a Wikipedia link. What would be required for Wikipedia to take advantage of 'knowledge organization' approaches developed in the library community?


Related entry:

  • End of the digest .... October 13, 2011 – For almost as long as this blog has been going we have had an associated digest. This has gone out to over 800 people. The frequency of the digest has changed as the frequency of posting has gone down. We have decided that it is now time to turn off the digest. While the blog continues, it has become more a venue for occasional comment than a steady stream. Thank you to all those who have subscribed to the digest, and I hope you continue to read entries in the future. And, as a reminder, I am on Twitter at... more
  • Collections are library assets August 31, 2011 – I quite like using the word 'assets' with reference to library collections. We tend to think of assets in positive terms, as things that are valuable. More of that later. I was interested to see Rick Anderson remark on the vocabulary used by my colleague Constance Malpas a while ago. This was in the context of a generous note about Constance's "Cloud-sourcing Research Collections: Managing Print in the Mass-digitized Library Environment." [pdf]. I confess that I giggle and shudder simultaneously at the thought of referring publicly to books in our collection as "inventory that is increasingly devalued as an institutional... more
  • The ILS, the digital library and the research library August 17, 2011 – Job adverts are interesting for a variety of reasons. They give a sense of skills and attributes in demand. They say something about how the hiring institution wants to present itself. And they can indicate trends. I have been interested to see three research libraries look for senior digital library posts in recent months. Associate Director for Digital Library Programmes and Information Technologies, Bodleian Libraries, University of Oxford. Associate Vice President for Digital Programs and Technology Services, Columbia University Libraries/Information Services. Head of Digital Library, Information Services, The University of Edinburgh. Note: given the nature of these resources, the links... more
  • Preserving musical heritage ... August 14, 2011 – One of the casualties of the London riots last week was a Sony distribution warehouse. The building, owned by Sony DADC, was also the main HQ for the UK's biggest distributor of independent music, Pias. [More than 1.5m CDs destroyed in Sony warehouse fire] Interestingly, Sony looked after the stock of more than 150 record labels at the warehouse. According to the BBC story quoted above "As well as CDs, the 20,000 sq m (215,000 sq ft) centre was used to store DVDs, Blu-ray discs and discs used for PlayStation Portable games." It was depressing reading about the impact on... more
  • Nostalgia, the Dublin Irish Festival, and variant forms of names August 05, 2011 – The Dublin Irish Festival is on this weekend - Dublin, Ohio, that is. I notice that Moya Brennan is performing. As some folks will know, Moya Brennan is an Irish singer who was a member of the well-known family group, Clannad. They emerged in the 70s, playing very much in a traditional irish music idiom. As they evolved, they developed a style that was influential in the emergence of the sort of new age, 'celtic' music that became popular and has some well known practitioners. One of the most vivid memories I have of secondary school is when they put... more
  • Worldcat Identities Network: a 'mashup' July 30, 2011 – There has been some nice reaction to the Worldcat Identities Network. The initial motivation for this was to put a graphic display of related Identities into an Identities page. This did not work out and we decided to make it available as a standalone app. The aim is to show how something could be built on top of the Worldcat API and the Worldcat Identities Web Services. It is a 'mashup'. Which prompted me to think that the peak of the mashup has passed? Or at least we do not hear as much about mashups as before? It is interesting... more
  • Gamification: services and libraries July 24, 2011 – I have been interested to see more notes in my tweetstream about peoples' exercise or diet regimes. They are typically generated by network services as a by-product of some activity, running or cycling, for example, and are part of a motivating framework. The Withings bathroom scale is connected to the network, allows you to set goals and to record and access weight and other data, as well as optionally communicating progress by tweeting your weight. In a similar vein, I recently came across stickK.com - "the smartest way to set and achieve your goals". If you're ready to turn that... more
  • Spotify and Klout: fungible influence July 19, 2011 – Popular music streaming service Spotify has just launched in the US. For some background see the Ars Technica story. One of the interesting aspects of the launch was the tie-in with Klout. Klout is one of several services which provide analytics around social media activity. It aims to be the 'standard for influence', tracking social media impact. I have spoken about Klout before (Analysing influence .. the personal reputational hamsterwheel). The business model includes the matching of 'influencers' in a particular area with providers of products and services relevant to that area. The providers may provide the influencers with 'perks'... more
  • Hamster wheeling .... June 14, 2011 – I have found the expression 'hamster wheeling' useful over the last few years. I tend to use it in the context of any frantic effort where the participants have to keep several things going at the same time, and where it seems that slowing down might cause something to fall off. More specifically, it is an appropriate description for those operations, common in a digital library environment, where a set of grants are used to keep a set of people working on a set of projects. Many of us are familiar with such environments, either as participants or observers. In... more
  • Books, artifactually and anecdotally speaking ... June 08, 2011 – Anecdotally speaking ....... it seems to me that the artifactual and design elements of print books are coming more to the fore, albeit in different ways. As the use of ebooks grows this is not really surprising, as this change creates niche opportunities for experiences focused specifically on the possibilities of the print medium. Some related trends are interesting. First, we are seeing physical books repurposed, recycled, or 'upcycled' to perform other functions. Second, we are seeing a focus on the aesthetics of print books, and on book arts in general. And third, we are seeing - I think? -... more