OHSWG 1997.04.23
OHSWG logo

Open Hypermedia Systems Working Group

A Southampton scenario for OHS


A Southampton Scenario for OHS

Wendy Hall and Hugh Davis


Extracted from the video script of "Making Links", inaugural lecture given
by Wendy Hall on 28th January 1997

The scenario begins with the film 2001: A Space Odessey. It is the vision
of building a real-life version of HAL: the ship-board computer in 2001. In
fact, according to the book of the film, HAL became operational on 12th
January 1997, which is pretty timely as far as this scenario is concerned.
In today's terminology HAL would be called an "intelligent agent". At
Southampton our work on open hypermedia systems is evolving into the
development of  communities of "intelligent" agents that can do some of the
things HAL could do, in particular to help us interact with digital
information. This scenario is illustrated by an imaginary dialogue with AL,
a cut-down version of HAL!

Imagine that AL has access to vast banks of digital information.

Our Motivation - the digital version of the Mountbatten archive

As we described in our ECHT'90 paper, one of the motivations that led us to
developing Microcosm was the acquisition of the archive of the Earl
Mountbatten of Burma by the University of Southampton in 1987.

The archive contains about 250,000 text documents, 50,000 photographs, many
of his speeches recorded on 78 rpm records, and a large collection of film
and video. It spans his entire lifetime (1900-1978) and essentially mirrors
British history in the twentieth century.

The multimedia nature of the archive makes publication impossible by
traditional methods. We became very interested in the prospect of building
a digital version of the archive. This was not something that would fit
neatly into a database. There was no linear sequence to the material other
than that of chronological date, and users would want to move seamlessly
from one document to other related documents. Every user would approach the
material from a different perspective, so there was a need to create
different 'views' for different users. To us this meant different sets of
hypertext links, and the integration of hypermedia technology with database
management and information retrieval systems to even begin to tackle the
issues.

Here is an example of a dialogue with AL to illustrate the type of
environment we were trying to build.

WH: Al, lets see some examples from the period 1947/48 when Mountbatten was
Viceroy of India

AL: Could you be a little more specific?

WH: How about something typical of the period - a picture of Mountbatten in
dress uniform

AL: (slight pause) Will this do?

SLIDE 9 - Mountbatten in dress uniform

Thats fine. The database and information retrieval queries that AL will
have had to perform to retrieve that photograph from the archive could be
extremely complex. He will be searching the picture catalogue for pictures
taken between 1947 and 48 and either using key-words that describe the
picture, or some sort of image processing techniques, to find the ones of
Mountbatten in uniform. Anyway, lets look at some more information in the
archive.

Through working with the historians, we know that when Mountbatten arrived
in India, one of the many problems he had to deal with was that of the
Indian National Army. AL,  can you find us some text documents about the
Indian Army?

AL: This one looks like it might be of interest

SLIDE 10 - Viceroy's interviews No.22, April 1947

WH: Ah yes, this is interesting. Here Mountbatten is describing his utter
despair at ever being able to achieve independence for India and mentions
his interview with Claude Auchinleck about the Army. AL will have found
this by doing a text search of the catagolue or the documents.

AL Do we have any other documents that refer to the Indian National Army
Problem and Auchinleck?

AL: Yes, here are the minutes of a meeting at which the problem was discussed

SLIDE 11 - Minutes of 2nd Miscellaneuos Meeting

WH: There AL could have followed a pre-defined hypertext link or again
undertaken a text search of the documents.  Auchinleck is clearly a
significant person in this debate. Can you tell us a bit about him, Al?

AL: Field Marshal Sir Claude Auchinleck was the Commander in Chief of the
Indian Army during 1947 after a long distinguished career serving in the
British Army. I have some pictures of him if you would like to see one?

WH: Yes please, one with Auchinleck and Mountbatten would be good

SLIDE 12 - Auchinleck and Mountbatten

So again, AL has made links, that is associations, between relevant pieces
of information using a mix of database, information retrieval and hypertext
techniques.

What We Learnt

We started with the period when Mountbatten was in India, and we quickly
learnt that the biggest issue was that of copyright, but we'll  leave that
to the lawyers to resolve. Working with the archive and trying to build a
system that would support these types of queries led us to build an open
hypermedia system, Microcosm, in which there was a minimum of three levels
of access to information.

 Structured access to the information through the use of document and
database management systems to answer questions like
	"List photographs taken by ...."

Information retrieval techniques to do similarity matching across indexed
sets of documents to answer questions like
	"Find photographs similar to ........"

Browsing/hypermedia techniques that could be used whatever the format of
the information to answer questions like
	"Tell me about ......"

Voyager 2001

Microcosm is in essence a set of communicating processes (filters). As we
move beyond today's version of the system, our work is naturally evolving
to incorporate the use of intelligent communicating agents to provide
access to distributed information in ways that are completely transparent
to the user.

The vision is epitomised in science fiction by HAL, but other examples
well-known to televison viewers would be ORAC and ZEN from Blake Seven and
HOLLY from Red Dwarf.  The following dialogue is intended to represent the
way we might interact with computers in the future to find the information
we are looking for.

We start with Cell Biology since that was one of the first topics we used
at Southampton for developing a hypermedia application.

Suppose we want to  learn a little bit about how cells move. AL, tell us
something about Cell Motility

AL: Here's a tutorial on the subject used by biology students at Southampton

SLIDE 12a - Cell motility image

WH: OK, Al lets see the Introduction

SLIDE  13-  Intro to Cell Biology

WH: Thats good. Some links have been highlighted by the biology tutor to
tell the student that there is more information in the system about this
subject that they should look at. Lets look at Tissue Cell Movement AL

SLIDE 14 - Tissue Cell Movement

WH: Now the word embryo is not highlighted but I'd still like to know more
about what an embryo is. AL can you tell me about embryo's

AL: How about a dictionary definition?

WH: That will do fine

SLIDE 15 -  Embryo definition

WH: OK. Now lets switch contexts to computer science. Here's a challenge
for you AL. Can you find me some relationship between embryo's and computer
science?

AL: (Pause) Hard but not impossible. I have a paper by Alan Turing, one of
the pioneers of computer science, which mentions embryo's.

WH: Yes lets see that

SLIDE  16- Turing Paper

WH: Interesting.  Now lets stay with Computer Science and think about the
Department here at Southampton, since thats where we are. AL what
information do you have about the Department?

AL: Here's the information that's available on the Internet

SLIDE 17 - ECS info + map

WH: And almost the first thing you see is that the Department of
Electronics and Computer Science is housed in the "Mountbatten" building.
There's a familiar name. But what's the association between Mountbatten and
Computer Science AL?

AL: Well, Lord Mountbatten was once the President of the British Computer
Society.

WH: So that could be it. Now lets change context again to history and World
War II. What information do you have about Mountbatten's role in World War
Two AL?

AL: Would you like to start with some pictures. Here's a typical group
photo taken in Casablanca in 1943.

LEFT SLIDE  18 - Mountbatten and Churchill group

WH: OK, thanks AL, we won't continue down that trail now. But it is
interesting that we've ended by making a link to Churchill, whose archive
is also a major feature of research work at Southampton.  Lets finish with
the film of Churchill's  "Iron Curtain" speech where he talks about the
need for making links, or special relationships as he calls them. Can you
find that clip AL?

VIDEO 8 - Churchill's speech

So maybe that's how we'll be finding information in 2001. What's going on
behind the scenes?  Well, there's a complicated mix of database queries,
pattern matching, hypermedia linking and knowledge-based techniques that is
all transparent to the user, as is the location of the information that is
being made available. Al is creating associations between different pieces
information to answer my queries - its all making links!

There's also an awful lot of natural language processing and speech
recognition, and in order to do all this Al is a supercomputer because the
processing power required for him to perform these tasks is quite
phenomenal.

This is all rather far-fetched considering the state-of-the-art today but
whatever proposals we might come up with for an open hypermedia protocol,
they should take us another step forward to achieving this vision.


Wendy Hall
U Southampton, England
wh@ecs.soton.ac.uk
http://diana.ecs.soton.ac.uk/~wh/