Seeing the whole archive

Yesterday I went to Dr Mitchell Whitelaw’s impressive presentation at the National Archives, about his Visible Archive project.

First, he gave a great introduction to why visualisations are important and how they can help you get a handle on a collection. In brief, search excels when you know which small piece you’re looking for. But if you want to explore the whole, you need another way in. Visualisations are great because by looking, we can find patterns and therefore intrinsic structures, which help us to make sense of and thereby navigate within large data sets.

Look at this beautiful visualisation of all the series in the Archives:

Every series – big square means

65k archival series – a big border => physically large; a big interior square => a lot of registered items

In the interactive version, you can click on any series and see the agencies that created or controlled it, and the other series to which it relates – eg an index to the series, or a successive series:

Series A432, which agencies created and controlled it, and its related series

Series A432, which agencies created and controlled it, and its related series

When you highlight one of the agencies, in this case CA5, the orange squares also indicate all the other series that that agency created.

Ah, the beauty of the series system! As Ross Gibbs, Director-General of the National Archives, said at the end of the presentation, Peter Scott would be elated.

But wait, there was more. Mitchell then showed us a deceptively simple visualisation of a single series, A1. It started with a tag cloud of the 150 most common words in the titles if items. ‘Naturalisation’ and ‘certificate’ were huge, and there were a lot of names, of places like Norfolk and Papua but also of people.

On hover you could see the spread of each term in items over time, and on click you could see a list of items. Then, if the item was digitised you could also have a look at each folio. Nice!

But the zing was yet to come. You can also combine two terms, or exclude one (eg, what is there in that series apart from all those naturalisation files? And in this way you could start to make discoveries, just by playing around with the tag cloud – for example, that there was a major cyclone in Darwin in 1937.

In so many ways, visualisation works as a way in to the records. We can’t predict all the ways that it works until we see them working. But sure as eggs there will be ways, not least because the national archives data has an in-built structure.

Thanks Mitchell ! for doing this great work and for making it look effortless. (I know it’s not!)

Push for pull

In the spirit of yesterday’s post, I’m sharing some diagrams.

Last year I gave a conference paper about how user-generated description can improve archival findability. I began with the idea that every use of an archival record can generate a description, which (if it is captured) can make the record more findable in future.

f2-circuit

A visual elaboration of this concept is below. It begins in the middle. (And ‘RecordSearch’ is the collection database of the National Archives.)

Circuit of findability and enrichment

Ultimately, my point was also that archives should value users as much as users value archives. Their relationship is interdependent – archives engage users, users enrich archives.

 width=

And really, if they are not used and interpreted and shared, archives are irrelevant.

The paper was well received, and I was especially pleased because Eric Ketelaar asked me if he could use the main diagram with his students.

Innovative ideas

A few comments on Friday’s Innovative Ideas Forum at the National Library of Australia…

A bit of faith in humanity goes a long way

From my POV, the most interesting aspect of the forum was that one key theme was so not-new – trust in and respect for the public / consumers / audiences / users – whatever we want to call them. Four of the presenters suggested in some way that trust as an essential element of contemporary cultural work – Marcus Gillezeau, Mark Scott, Rose Holley and Darren Sharp. This is an absolutely critical point. But it has been said many a time before, especially in relation to the library sector. It was the issue that was most discussed in 2006 when I posted a paper about Web 2.0. And it’s all over the web now – just try googling ‘radical trust’.

As government-funded cultural workers – as people in positions of cultural authority – we need to lay aside our fears, withhold our judgement, and actively resist our will-to-control – and trust and respect our audiences, radically and fundamentally. If we assume the best of people, and build systems based on radical trust (which can include transparency features and safeguards such as version tracking and rollback functions), then it’s possible to get the brilliant results that the Library is getting through its Newspapers Digitisation Program – thousands of people correcting millions of words, because they want to help. As Rose Holley reported, people are motivated by the trust and respect the Library is showing them.

The opposite is also true, and I bet we all have this experience: lack of trust is a powerful demotivator.

But clearly, hearing the words and seeing the success stories is not sufficient to engender the cultural shift we need in order to build trust-based systems. Every single time I hear (or talk) about a project involving user-generated content, someone invariably asks the question about the vandals.

Well, yes it happens that some people do dodgy things, by accident and by design. But it’s better to build a system that enables public participation for public benefit than to preclude that participation and benefit on the assumption of ill-will. Only then can we allow and benefit from user-led innovation  – thanks to Darren Sharp for bringing this notion to the forum (and hear, hear to the recently-released Venturous Australia report, which pointed out that governments have been pretty good at fostering top-down innovation but fare badly when it comes to innovating from the bottom up).

Talking the talk but baulking at the walk

Despite lugging my huuuuge Mac laptop with the idea of swimming along in the tweetstream while I listened… I was one of the who-knows-how-many who couldn’t connect, even after more IP addresses were made available. Well – I did manage it at 4.45pm from the foyer, after everyone had gone home :-(

So as someone willing but unable to participate in that way, I was disappointed that the social media channels that had been set up for the event were not integrated in any way into the forum itself – rather, there was an unfortunate (and ironic, given the subject) disconnect between the presenters and the audience.

Perhaps the next forum could be a more radical experiment in the form of the forum – perhaps we could collaborate to create some innovative ideas.

A phenomenon that passed me by…

I admit I had never once heard a jot about Scorched, the ambitious and fascinating all-media creation of Marcus Gillezeau and co. I’m not a watcher of commercial tv, so no surprise there. But I’m a user of social media, and did not hear about it that way either. Would have been interesting to see a show of hands as to how many people at the forum had heard of the project, watched the telemovie, participated in the community. Lots about it is interesting and worthy of further discussion – in particular, the relationships between fact / fiction, and commercial / non-commercial culture. And what happens to the community now that there’s no more funding?

Reading the textbook is not enough

Here’s an idea I like:

Students should not read textbooks; they should write them.

Bruce Tognazzini said it in the 1990s, and David Weinberger considers it in a story in The Filter, published by the Berkman Center for Internet and Society at Harvard Law School.

Weinberger was initially ambivalent, but since the advent of wikis he has warmed to the notion. He describes how the collective, collaborative – and no doubt contentious – act of crafting a coherent, accurate wiki on the subject of study is itself educational:

Let them argue about how to organize it. Keep the discussion pages up. Keep the differences visible. Let them fill it with links. Let them connect with other students in other schools creating related wikis.

A class’s wiki is not going to be as complete, well-grounded or well-written as a good textbook. But students will learn more by writing one than by cribbing and cramming from a professional textbook.

In my (by now, predictable) view, the same principle applies to museum exhibitions and websites. If you’re only ever engaged in a passive way, as a consumer, it’s hard to remain interested. But if you have the chance to think through the issues of what to put on display, how to arrange and describe the items, and what they mean, it’s a faaaar more interesting experience. A journey, rather than a sushi train of neatly prepackaged ideas. At 5 to midnight, my metaphors are failing me, but I hope you know what I mean.

I’d like to see more programs that work on that principle. I’d love to hear about yours.

(Thanks to Mal for the pointer.)

How Web 2.0 will change history

As an editor of archival websites, I’m interested in the tools available for historical publishing, research and interpretation. And the advent of Web 2.0 means that such tools are proliferating and becoming easier and more fun to use. Social software is making search interfaces more intuitive and clever; it is making publishing dialogic – readers can also be writers; and it is enabling many new kinds of collaborations to occur in interpreting collections.

Last month I addressed a small group at the Australian Historical Association conference in Canberra on this topic of How Web 2.0 will change history – possible futures for websites of the National Archives of Australia (PDF 312kb). The paper was framed by this mindmap I made

Mindmap of Web 2.0 and the life cycle of historiography

(inspired by other mindmaps on Web 2.0, like the one on Wikipedia).

There are plenty of exciting things the National Archives of Australia could do with these technologies, and it is starting to happen, but the path is long, resources are limited, and in some ways a cultural shift is necessary – it does not come naturally for a cultural institution to radically trust its audience.* So the paper is a bit imagin-ary. But didn’t Einstein say that imagination is more important than knowledge?

* Deep bow here to the Powerhouse Museum and its new collection interface, which you can read more about on the fresh + new blog.

Folksonomic findability

It is a pet peeve of mine that museums so rarely draw on the knowledge and understanding of visitors to help interpret their collections. Ever since I read Museums and Communities: The Politics of Public Culture,1 I’ve sought out initiatives that facilitate community interaction. In global terms, there are plenty of examples, especially in terms of public programs. But in terms of the core business of museums – collection management, and exhibitions, it remains rare for a museum to involve audiences in the process of making collections meangingful. (Your examples are welcome!)

In the world of the web, though, the story is different. Social software enables visitors to the site to help make the site. And the cultural heritage sector is starting to explore the possibilities.

a tagged imageSteve is a project of a group of seven museums.2 It emerged out of the mismatch between the classification systems of museums and the way users tend to think about collection items. A museum might describe an artwork in terms of the artist’s proper family name. Whereas a visitor might search for an artwork according to how they remember it – its shape, or the fact that a painting had some nice clouds in it.

The Steve people are researching and developing a tool that will enable website visitors to add descriptive tags to any item they are viewing. The tags then join in with the official description of the item, so that the collection takes on a hybrid official and vernacular classification system. And henceforth each item becomes more findable for more people. And more collectively meaningful!

Have a look at, and join in, this wonderful experiment. (You need to register if you want to do some cataloguing.)

circular fabric designorange fabric designBut wait! Here’s another example, closer to home. Our very own Powerhouse Museum invites users to help describe swatches of fabric, dating from the 1890s to the 1920s. You can enter your thoughts on their colour, pattern, mood and/or ‘other facts’.

1. Ivan Karp, Christine Mullen Kreamer, and Steven D. Lavine (eds), Museums and Communities: The Politics of Public Culture, Washington: Smithsonian Institution Press, 1992.
2. Steve stands for Social Terminology Enhancement through Vernacular Engagement.