Augmented Books, Knowledge, and Culture

Kim H. VELTMAN <k.veltman@mmi.unimaas.nl>
Maastricht McLuhan Institute
Netherlands

Contents

1. Introduction

Some inventions are very important. Things that change the world are usually more than an isolated invention. They bring about systemic changes in society. The invention of printing around the year 805 in Korea was very important. [1] But it was not until Gutenberg in the 1440's had the idea of using that invention for spreading knowledge that something systemic changed. With Gutenberg's approach, printing became one aspect of a new way of organizing knowledge, a new way of communicating practice. But Gutenberg went broke. It took over a century and a half for the vision to manifest itself and yet the world changed.

The invention of the computer has been compared to the invention of printing. This is both true and misleading. It is true because computers entail a systemic change in society, which will change the world even more than the advent of printing. It is misleading because the computer revolution is not about one invention. This paper explores some dimensions of these claims.

Computers are an important invention. Connected computers through the Internet are a very important invention. The systemic changes they are bringing with them are changing the world. One of these changes is miniaturization. A second is the development of mobile equipment. Wireless connections mean (a) that we shall soon be able to access equipment literally anytime, anywhere and partly as a consequence thereof, (b) most of the neat distinctions between different kinds of devices are disappearing. A third and fourth innovation are Geographic Information Systems (GISs) and Global Positioning Systems (GPSs). A fifth invention is the concept of agents. This means among other things that routine tasks of librarians and scholars can increasingly be relegated to software. A sixth change is that optical character recognition (OCR) is reaching a mature state. This means that digital versions of the Xerox machine are emerging, that we scan anything we wish. It also means, as we shall see, that hypertext takes on a new meaning.

Connected computers in combination with these new technologies are bringing about a systemic change. Most persons expect that this should take the form of some unexpected killer application, which will invariably remove or replace all competition. In our view something more subtle is happening. The systemic change will inspire us to use what we already have in new ways. We shall begin with a brief survey of these new technologies, which at first sight have no connection with one another. This will lead to a review of different meanings of hypertext before exploring how augmented books and augmented knowledge will change our conceptions of knowledge.

2. New technologies

A number of new technologies, which began as independent inventions, are coming together through the phenomenon typically called convergence. These include electronic books, mobile devices, GISs, GPSs, agents, and advances in OCR.

Miniaturization and electronic books

Computers were traditionally mainframes, which took up entire rooms. The rapid development of miniaturization brought computers to the desktop, then to portables, to the laptop, the notebook, and notepad and more recently to handheld devices such as the Palm Pilot. Among these new handheld devices is the electronic book. [2] Books are heavy. Important books such as the Bible often weighed ten pounds or more. The complete Oxford English Dictionary in 26 volumes is so heavy that no ordinary individual can carry all of it around. Electronic versions thereof in electronic books are portable.

The process of miniaturization is continuing. Today's transistors measure about 180 nanometers. In November 1999, Lucent's Bell Labs announced "the world's smallest" transistor gates measuring "just 50 nanometers, which is about 2,000 times smaller than a human hair," the company said. Future refinements should shrink transistor gates to less than 30 nanometers. [3]

Within two decades, we are told, an object the size of a fly will be sufficient space for a computer that is 100 million times more powerful than a contemporary Pentium computer. [4] Technically speaking this means that the electronic equivalent of millions of books can potentially be carried around easily. Wearable computing, a trendy fantasy even five years ago, will probably be an everyday reality within a decade, thanks partly to developments in a new field called molecular electronics or molectronics. [5] A project at Notre Dame University, for instance, has as its goal the production of a "logic device less than 50 nm on a side, with associated input and output structures." As Dr. Marya Lieberman explains:

Quantum-dot cellular automata (QCA) are a completely new architecture for computation. Information is transmitted between QCA cells through Coulomb interactions; depending on the arrangement of the cells, bits can be transmitted, inverted, or processed with logic operations (AND, OR). Information processing does not require the flow of current, so QCA has the potential for extremely low power dissipation.

Switching the cell depends not on voltage gating of a current (as transistors do) but on this tunneling process; thus, the device performance is expected to improve the smaller the QCA cells can be made [sic], with the ultimate size limit being molecular QCA cells.

Classical inorganic coordination chemistry is our main tool for the synthetic work. Both ultra-high vacuum scanning tunneling microscopy and X-ray photoelectron spectroscopy will be employed to characterize model compounds and molecular QCA cells on the surface of silicon wafers. [6]

In 1967, the film Fantastic Voyage described an imaginary trip through the bloodstream of a human being as a brilliant example of science fiction. Given the trends in miniaturization this could become reality within two decades. What possibilities does the future hold for the science fiction of today in novels such as Michael Crichton's Timeline? [7]

Ubiquity and mobiles

Connected with such developments is the rise of ubiquity and mobiles. The late Mark Weiser at Xerox Park developed the idea of ubiquitous computing: [8] that computing was not just about a gadget on a desk but could potentially involve miniature gadgets spread everywhere in our work and home environment, a vision for which he decided wireless communications were necessary. Parallel with this, Leonard Kleinrock developed the idea of nomadic computing for the military. [9]

In 1998 a number of new consortia [10] moved discussions of wireless technologies from futuristic scenarios into the front line. This has had two basic consequences. First, there are new prospects of interoperability among a whole range of electronic devices and appliances beginning in the home and soon spreading elsewhere especially after the introduction of Universal Mobile Telecommunications Services (UMTS) in 2002 when it will effectively be possible to link computers via satellite from anywhere at any time.

Second, there is a so-called convergence of devices whereby the traditional notion of devices having a specific function is fast disappearing. Traditionally, for example, a telephone was for making telephone calls. A camera was for making photographs. A fax machine was for faxes and a computer was for e-mail, word processing, and Internet connections.

In the past two years, there are new mobile devices which perform all of these functions in a single instrument. Some of these instruments have grown out of hand-held Personal Digital Assistants (PDAs) such as the Palm Pilot VII. [11] A number of these devices resemble telephones with miniature screen displays. Perhaps the most striking examples are those of Nokia, [12] a company which is consciously working towards what they term "personal information bubbles." [13] Other important players in the field include Ericsson [14] , Kenwood [15] and Motorola. [16] (figure 1). Qualcomm has a telephone which incorporates Palm Pilot functionality directly. Some players believe that there may well be a new wave of diversification of devices with one crucial difference: such future telephones or PDAs will be completely interoperable.


Figure 1. Examples of new technologies which combine wireless cell-(tele-)phones with Personal Assistants (PDAs) with fax, e-mail, and Internet capabilities from Kenwood and Nokia.

Meanwhile, another trend being explored by companies such as Telecom Italia is the idea of virtual interfaces. In this approach an electronic notepad-like device would have a simple screen which could be reconfigured to resemble the interface for a telephone, a fax machine, a computer terminal, etc. Hence a change of interface permits a new functionality.

With respect to our story these developments are of great significance because they mean that access to the Internet, which today is still largely dependent on a fixed connection, will within five years be accessible in many ways at any time from anywhere on earth. The linking of computers to an orbiting satellite will no longer be limited to the equivalents of James Bond. It will become an everyday affair. The price of such devices also continues to decrease. A new variant of Palm Pilot called the Handspring Visor sells for only $179. [17] Once embedded into everyday devices such as telephones, it is quite feasible that simplified versions of computers could be made available for as little as $25. Persons such as John Gage [18] at the frontiers of the field assure us that the actual cost could well sink to less than $10, at which point the Internet Society's motto of "Internet for all" will become more than rhetoric. Universal accessibility, now only thinkable in the most advanced countries, could potentially be applied around the world to the entire six billion persons of our population.

Geographic information systems and global positioning systems

GISs have been developing steadily since the 1970's. They allow a precise linking between maps and enormous amounts of other information, which can be arranged in layers. Hence there can be one layer indicating where electrical lines exist. A second layer can trace the position of all the sewers. A third can show the position of all the railway lines on a map. A city such as Toronto has over sixty such layers in its GIS. In the past decade there has been an increasing trend to link this information with Facilities Management Systems (FMSs), such that one can also trace where all the telephones, computer terminals, printers, and fax machines are in a skyscraper of offices. Such innovations introduce new interface challenges in moving from two-dimensional to three-dimensional information spaces -- a field where the augmented reality work of pioneers such as Steve Feiner is particularly interesting. [19]

In parallel with the rise of GIS has been a development of GPSs. These were initially developed in the military and continue to be developed in this context, where they have achieved an accuracy of less than a centimeter. Civilian versions are much less accurate but can readily determine one's position anywhere on earth, via a satellite link, to within a hundred meters. In cases where there are clearly defined roads and landmarks this accuracy can be increased to within a few feet. Hence the recent rise of navigation aids in automobiles which show us how to reach a destination via a map. If all this sounds futuristic, a combination of cell phone, PDA, and GPS is already commercially available. [20] As we shall note below, a combination of GIS and GPS has considerable consequences for our interests.

Agents

The introduction of object-oriented programming set out from a notion that individual bits of code could be treated as objects, and thus made re-usable. The amount of code potentially contained within such an object soon grew to include simple characteristics, instructions, routines, and the like. [21] From this emerged the idea that individual agents, each linked with specific instructions, could perform simple routines autonomously. This evolved into a more complex vision of a multi-agent system, and subsequently agent societies, whereby hundreds or even thousands of agents cooperate among themselves to perform tasks, which are cumulatively much more complex.

These developments are potentially of the greatest significance for the world of museums, libraries, and archives. Such institutions typically have great numbers of catalogues. The Vatican Library, for instance, had over eighty catalogues for its secret archive alone. As a result, every time a person wished to search for one new title he or she had to consult eighty books. Assuming that all the catalogues are in electronic form, agents can potentially collate all these names and titles such that a future user will need to consult only a single catalogue in searching for a title. [22] More wide-ranging uses for agents will be considered below.

Optical character recognition

The quest to turn printed documents into digital form was outlined clearly in Buckminster Fuller's Education Automation (1962), where he proposed a universally accessible digital library that would enable anyone, anywhere to study, learn, and grow. He believed that this intellectual freedom of the masses would bring humanity's best ideas to reality. This vision led authors such as Tony McKinley to explore OCR devices over the past fifteen years. [23] Throughout the eighties and early nineties OCR devices were fairly accurate but invariably not fully reliable. In the past few years their accuracy has improved to such an extent that they are near perfect.

One of the unexpected consequences of such advances has been the advent of new technologies such as the Quicktionary. [24] Using this device which resembles a fat pen with a small viewing screen, a user can touch any word which is then scanned by the device and a definition for this term is immediately provided. The present limitation of the Quicktionary is that the number of terms is limited to the internal memory of the pen-like device, which was around 285,000 definitions last year and is presently 430,000 definitions.

A combination of the innovations outlined above introduces considerably more dramatic possibilities. Given wireless connections, a penlike device can scan in a word, which is conveyed to an electronic book or notepad. As the term enters the electronic book the user is offered options resembling items one to five in figure 3. Let us say the user chooses definitions. They are then offered a choice of standard dictionaries such as the Oxford English or Webster's Dictionary. Their choice of dictionary is then coupled with the word identified through the pen scanner and relayed via the Internet to a Virtual Reference Room and the corresponding definition is then sent back to the user's electronic book, notepad or Personal Intelligent Assistant (PIA; by Philips). Whereas the Quicktionary was limited (a) by the memory of the pen device and (b) by screen space to providing a single word or very short definition, our solution will provide as full a definition as desired. This same principle applies to all kinds of reference works including encyclopaedias, book catalogues, and specialized bibliographies.

This can be complemented by a series of elementary add-ons. With voice recognition software, users will have an alternative of making their choices voice-activated, which is an obvious convenience except in cases where a user is working in a foreign language in which they are unfamiliar with pronunciation patterns.

As miniaturization continues apace, an increasing number of standard reference works can simply be downloaded onto the electronic book or PIA. Such regularly used works can thus come under an annual subscription scheme or be part of basic services to a citizen in the way that use of public libraries has traditionally been a free right. Meanwhile, less frequently used works would continue to be used via an Internet connection to on-line virtual reference rooms. Before considering further possibilities a short excursus on hypertext will be helpful.

3. Hypertext

Hypertext is intimately connected with the rise of the Internet as we know it today. The idea was implicitly described by Vannevar Bush in his pioneering article, "As we may think" (1945), [25] and developed by Douglas Engelbart, [26] who coined the term "text link" and linked the concepts of hypertext and multimedia to create the term "hypermedia" and later "open hyperdocument systems." The term "hypertext" [27] was coined by Ted Nelson, who defined it in Literary Machines as "non-sequential writing." [28] Nelson also drew attention to what he termed the framing problem of ordinary hypertext and in his Xanadu system accordingly called for a universal hypermedia environment. As to the origins of the concept Nelson claimed: "The hypertext is a fundamental document of Jewish religion and culture and the Talmudic scholar is one who knows many of its pathways." [29]

In Nelson's vision everything could be linked with everything though he acknowledged that this could bring problems. On the positive side, it introduces enormous freedom because everything can potentially be linked. On the negative side, it creates a complete spaghetti effect qua connections (figure 2). Everything leads to everything else. One cannot see the big picture or the frameworks for understanding due to the unbelievable complexity of links, which Nelson terms the intertwingledness of things. Particularly attractive in Nelson's vision is the potential for all users effectively to become authors, something which the World Wide Web as subsequently developed by Tim Berners-Lee has effectively made into a reality.



Figure 2. Two examples of the spaghetti effect when everything is connected with everything else from E. Noik. [30].

Attractive as all this seems, it also introduces some unexpected problems. A good author must be a good organizer of knowledge. Everyone becoming an author does not yet mean that everyone is necessarily a good organizer of knowledge. In practice many individuals with no experience of organizing knowledge are now presenting their materials on the web. This is a major reason why websites and the Internet as a whole pose such problems qua navigation. [31]

Fortunately these problems of a disorganized web are so universally recognized that a number of possible improvements are being explored. Robert Horn has provided a useful brief history of the Internet with respect to hypertext approaches, which is summarized in figure 3.

Date Project Individual
1945 Invention of concept in "As we may think" Vannevar Bush
1962-75 First operational hypertext: Hypermedia Douglas Engelbart
1965 Coined term "hypertext" Ted Nelson
1968 First university instruction Van Dam and Brown
1972 Menu interfaces Zog Group, CMU [32]
1976 Spatial Dataland Negroponte and Bolt
1986 Hypertext for PC and Macintosh Brown and Guide
1987 Vision of the Knowledge Navigator Sculley
1987 First Commercial Hypertext "Hit" Atkinson

Figure 3. Brief history of hypertext according to Robert Horn (1989)

Robert Horn has also developed the idea of information mapping. [33] Horn outlined a method for navigating through structured hypertrails, which he divided into nine kinds: prerequisite, classification, chronological, geographical, project, structural, decision, definition, and example. This offered a first helpful glimpse of a more systematic approach to the potentially infinite connections, a theme to which we shall return below (sections 4-7).

In Horn's hypertrails the chronological approach is one of nine alternatives. In our view the chronological dimension applies potentially to every hypertrail: i.e., maps have a history, structures have a history, definitions have a history as etymology, etc. Similarly the geographical dimension can apply to all the hypertrails. Structures in China are frequently different than structures in the United States. In our view, it is also useful to identify different levels of hypertrails: those involving single terms as in classifications; a few terms as in definitions, or examples, which can be seen as a branch of definitions (cf. figure 4).

At WWW7 (Brisbane, 1998), Tim Berners-Lee outlined his vision of a global reasoning web, a theme which he developed at WWW8 (Toronto, 1999), where he spoke of the development of a semantic web. The web, he noted, had begun as a method for individuals to communicate with each other. It is rapidly becoming also an instrument whereby machines can "communicate" with each other without human intervention. For this to become a reality requires separating rhyme from reason: separating those aspects which are subjective expressions of individuals (poetry, theatre, literature) from objective logical claims (logical principles, mathematical formulae, scientific laws, etc.). How could this possibly be achieved? Everyone putting things on the web could effectively be invited to identify the truth standard of their materials. Those making false claims would then be subject to legal challenges. Those wishing to avoid stating their status could by their omission implicitly be suggesting that their claims are not as transparent as they would have us believe.

This vision of a global semantic web is a noble goal and has our full support. Essentially it addresses the challenge of how we deal with new knowledge as it is put on the web. Our concern lies in combining the enormous amounts of enduring knowledge in libraries, museums, and archives with this vision. Again a brief excursus on every-word hypertext is needed.

Every-word hypertext

Jeffrey Harrow, in a recent discussion of e-paper in his electronic bulletin on The Rapidly Changing Face of Computing (RCFoC), drew attention to a new approach to hypertext:

For example, imagine if you could do a special "tap" on a word displayed on your Epaper and immediately be whisked to relevant information about it. "No big deal," you're probably thinking, "that's just another hyperlink, which already drives the Web." But in this case, I'm talking about tapping on words that have NOT been previously hyperlinked. For example, I just ALT-clicked on the word "Cambridge" a few paragraphs above (OK, I had to click, but I'm using a PC, not a book of Epaper) and a window popped up telling me that it was settled in 1630, its first name was New Towne, and it has a population of about 96,000. Where did this come from?

Brought to our attention by RCFoC reader Jack Gormley, a new free service on the Web called GuruNet [34] makes a stab at turning EVERY word on your screen, not just those in a Web browser, into a "live" link to further information. With their software installed, ALT-clicking on any word quickly evokes a response from their server, which has licensed numerous reference works to provide more focused, deeper insight than you might get from a search engine.

I've only just begun to explore GuruNet, and it is in beta, but it's an interesting concept. And hey, while it didn't directly know what to do with "RCFoC," when it drew a blank it offered to feed the mystery word to a search engine, which in this case came back with 487 references. Not too bad.

Now try THAT with an Industrial Age paper book -- which gives us an idea of why the day might really come when we can't imagine a "non-active" book. [35]

4. Augmented books

The approach outlined above by Harrow takes any given term, sends it to an Internet search engine, and finds what corresponding sites are available. Our notion of augmented books uses this every-word hypertext in combination with the new technologies outlined above as a point of departure to arrive at new possibilities.

A great advantage of a traditional book is that it makes handily accessible a certain amount of knowledge within its two covers. A limitation of a book is that the rules for organizing this knowledge will frequently vary from one book to another: namely, conventions for names of authors, subjects, or places. For instance, one book refers to Eric Blair, another refers to George Orwell and yet both are referring to the same individual. The rise of electronic library catalogues and more recently digital library projects has led to increasing attention to authority lists qua authors, names, subjects, places, etc. in library catalogues. This tendency towards standardization needs to be extended throughout the entire reference section of libraries such that the names, subjects, and places in classification systems, dictionaries, encyclopaedias, national book catalogues, abstracts, reviews, etc. are all correlated.

The librarian, information scientist, or classification specialist may agree that this is much to be desired, but will almost invariably dismiss the task as impossible because of difficulties in correlating terms. How can one be sure, they will ask, that one can know precisely what the equivalent for a given word (caption, descriptor, term) in Dewey should or could be in the Library of Congress or some other classification system? Abstractly asked, the question is daunting and sometimes near impossible to answer.

Fortunately a more empirical and pragmatic solution is possible. Instead of musing about possible equivalents, one can study what equivalents practitioners have established. Major books have been classed under most of the major classification schemes at least once. So we could start with a book such as the Bible or Dürer's Instruction in Measurement (Underweysung der Messung), examine under what words it is classed in a Dewey library, under what descriptors it is classed in a Library of Congress library, and so on. [36] At the end of this process one has one title with a series of equivalent words in a number of classification systems. This rather dreary task is sufficiently mechanical that it can quite readily be assigned to a low-level agent. After checking many millions of books against several hundreds of classification systems one will have an empirical list of equivalent terms in the main classification systems.

This same approach can be extended to the subject lists in national book catalogues and earlier bibliographies. Historical bibliographies such as Draud and Murhard frequently listed books under very different subjects than those to which we are accustomed. Agents using our empirical approach will simply use a given title as a point of departure and collect all the subject names under which it has been listed. This is a very long process, which would be overwhelming if it had to be done manually. However, with the help of agents and a serious amount of computing, it is fully feasible. A first result will be a new kind of meta-data whereby any book is linked with a set of subject terms found in classification systems, subject catalogues, and bibliographies, all of which are again co-related to a given date. One can then trace the history of terms with which a book is associated in the course of centuries.

One can use a similar approach for authors. We all know Shakespeare. We check a standard bibliography to determine how many plays Shakespeare wrote and what their titles were. We then have agents check all the earlier bibliographies of Shakespeare in order to trace how the number of plays claimed to be by Shakespeare varies over the centuries as do the precise names of those plays. That which applies to authors applies equally to painters, architects, engineers, etc. Traditionally in art history, there was a quest to achieve a catalogue raisonnée of all the paintings of a given artist. With the help of agents we shall have a dynamic catalogue raisonnée whereby we can trace that the number of paintings attributed to Rembrandt in 1700 was different than in 1800, in 1900, in 1950 or even today (after the Rembrandt committee has finished its research).

Agents can also play a considerable role with respect to place names. They can help us recognize that Liège (in French), Luik (in Dutch), Lüttich (in German), and Liegi (in Italian) are one and the same city. They can also help us develop a new kind of dynamic cartography. Poland today is very different than seventy years ago, 1800, 1600, or 1400. This means that one Polish city in 1400 might well have been a Russian or a German city at other time periods. It also means that the number of towns in a country varies enormously over time. If I am searching for all the towns in Poland in 1400, the number will be much larger than two centuries later when the geographical expanse of Poland was much smaller.

Applied to the whole range of materials available in reference rooms of the major libraries, agents will thus lead to a new level of meta-data which not only associates persons, subjects, objects and places with a number of variant names -- which can serve as alternative search strategies -- but also with an awareness of cultural and historical shifts in those variant names. For we shall be able to see that a book which a French person classes in one way may be classed quite differently by an Italian, a Russian, an Indian, a Chinese, or an Australian. These differences extend to all human activity, especially in the field of interpretation. Hence an Italian archaeologist's view of the Roman Forum is different from a French or a German version. Philosophers tell us that people have different world views (cf. German Weltanschauung). This new approach will make such differences visible with respect to everyday objects.

What, the impatient reader may ask, has this to do with our theme of augmented books? Everything, if we consider the enormous potential that will be unleashed if access to all this cumulative reference material is made available through the web. Some simple scenarios will prove useful by way of illustration.

Let us assume I am reading a traditional printed book. I encounter the name Leonardo da Vinci. I am aware that he was a Renaissance figure but want to know more. My pen-like object scans the name, transmits it to my notepad which offers me a number of options with a default for a basic introductory bibliographical sketch, which I choose. The notepad then issues a wireless command to an Internet connection, goes to the nearest virtual reference room, acquires the relevant bibliographical sketch and transmits this back to the monitor of my notepad along with a new set of options: Do I want to know more about Leonardo's life, the titles of his paintings, a list of his (primary) publications, or a survey of (secondary) literature concerning him or am I content with this sketch and now want to go back to the book which I was reading?

There are also many new possibilities with respect to place names. I am reading a book and come across the name Timbuctoo. I have a vague notion it is probably in Africa -- or is it one of those unlikely place names in the United States such as Kalamazoo? -- and would like to know where it is. I scan the term with my pen-like object or I simply speak the word into my notepad, which then issues me with a number of options. Do I wish to see it on a regular map, a terrain map, a political map, a population map, etc.? The notepad then conveys my choice via a wireless connection to the nearest virtual reference room on the Internet and then relays back to my notepad an appropriate map with Timbuctoo highlighted.

Or I am travelling in Turkey and have with me a Baedecker or some other travel book. I know I am within 100 miles of Uzuncaburc, a former Greek colony with an impressive temple, but am lost. Using the same basic approach I am shown where Uzuncaburc is on the map, with the difference that a GPS now determines the position of my jeep and also shows where this is on the map. An automobile navigation system can then compute how I get from where I am to Uzuncaburc, my destination.

At first sight all this is very much like the hypertext idea of a Ted Nelson. There is, however, a basic difference. While acknowledging that all connections are possible, our approach suggests that if I am reading a book and begin searching for names or places there is a certain sequence that makes more sense than others. It usually makes sense, for example, to determine the name of an author before asking what are the titles of that author's publications and it is useful to know the titles before choosing which of those titles one wishes to consult in full text.

The complexity of these sequences will, of course, vary with different users. A child will need a simpler list than a university student. A general practitioner will have other medical requirements than a cardiologist who is the world expert on the left aorta of the heart. In time, middle-level agents will be able to deduce the level of the user and adjust choices and interfaces accordingly.

We refer to our method as an augmented book because we do not foresee a simple replacement of a physical book by an electronic version. The physical book can remain, while wireless technologies give us the privilege of having all the benefits of a major reference room without requiring that we be there, or even that we leave our seat. However, augmented versions of traditional printed books represent only a first step in the revolution which concerns us here.

A next stage of possibilities arises when the physical text we are reading is entirely available in electronic form elsewhere on the web. Let us say that I am reading a physical copy of the Bible, the Koran, or the Mahabharata. I encounter a name such as Solomon and want to know more about this person. As outlined above, a mere scan by my pen-like device can relay the name to my notepad and offer me a number of options which can then be searched for in a virtual reference room and relayed back to my notepad via the Internet. Since the whole of the Bible is on-line, the same network can provide me with additional information such as how many times and in what contexts Solomon is cited in the Old Testament. These incidents can in turn be checked against other historical contexts where Solomon is discussed, such that we can see on which episodes of Solomon's life the Bible focussed and which episodes it overlooked. Images of Solomon can also be found: Solomon's choice with the two children, Solomon and the Queen of Sheba -- including Brunelleschi's panel on the doors of the Baptistery in Florence. Hence a Biblical story can take me "directly" to Renaissance art.

A further scenario is the following. I am reading a book on the history of physics, which has no index (as is often the case particularly with French books). I come across the name Galileo and would like to know how often this name occurs in the book. The pen-like object scans the name, conveys it to my notepad, which combines it with my request, checks the on-line version of the book, and relays to my notepad all the occasions within the book where the name of Galileo is cited. This is effectively a first step towards retrospective indexing or indexing on demand.

This principle could readily be developed. From the title of the work, my handheld or desktop computer can determine that it is a book about physics. An agent then goes to the authority list of all names and culls a subset of all names of physicists. This list is then used as a master list against which to check every name mentioned in the book. At the end of this exercise the agent has a subset of this master list which represents an index of all the names of physicists within this given book. A similar approach can be applied to subjects, places, etc. In the case of more famous books which exist in various editions, of which some have indexes, agents can then compare their own lists with these existing indexes in order to arrive at more comprehensive indexing.

The professional indexer and library scientist will rightly insist that such techniques are never as profound as the indexing skills of human professionals in the field. On the other hand, they will have to admit that there are never likely to be enough professional indexers retrospectively to index all the tens of millions of existing old books. In such a non-ideal world, even limited methods which produce useful results are considerably better than nothing.

A fundamental claim of professional indexers is that simply making a list of all the words in a text may produce a concordance but not a proper index. The radical form of this claim states that even the author of a text is incapable of indexing properly his or her own work. [37] What is needed, argue these indexers, is a careful analysis of the text to determine the underlying concepts or processes, which are often not directly listed in the text. Say the book is about botany and there is considerable discussion of papaveraceae. At other times there is discussion about poppies. Only an expert indexer who is also familiar with botanical classification will know that poppies belong to the papaveraceae family.

Major reference rooms contain our knowledge about classification systems, thesauri, and other methods of ordering subjects and processes. If these are available digitally through a virtual reference room, other possibilities emerge. I identify the term poppy, which is relayed to the virtual reference room. The word is identified as a botanical term, an appropriate botanical classification is found, and I am shown on my notepad the relation of poppies to the papaveraceae family. Or conversely I am reading about the papaveraceae family and wonder what actual plants or flowers belong to this family. By a similar process the virtual reference room is able to show me that poppies are an example of this family.

Hence, I can use classification systems as an intermediary to show me more abstract concepts (through broader terms) or more concrete examples (through narrower terms) of any word, which are encountered in any book that I am reading. This ability to move seamlessly to broader and narrower terms greatly expands both the domains at my disposal and the potential accuracy of my searches. Implicit decisions about what to pre-structure and what to search for on-demand are discussed below.

With a traditional book the amount I am able to glean from its pages is limited to my own knowledge and experience. With an augmented book the millions of hours of analytical sorting and organizing of knowledge represented by (virtual) reference rooms are at my fingertips. [38] The augmented book thus augments my brain by linking it with the cumulative memory of mankind, and guiding it through the levels of knowledge (figure 4).

In future, numerous further developments are foreseeable. My notepad has context awareness: it is "aware" that my research is focussed on bio-chemistry. It also "knows" which languages I am able to read. Accordingly, through user-modelling, an agent culls the books in print and periodicals to keep me in touch with what is happening. Assuming a semantic web as envisaged by Tim Berners-Lee, [39] these books and articles will have metadata tags identifying which are by major authors in the field, which are by specialists, and so on. Subsets of these lists can therefore be offered on the basis of whether I want just the latest views of authorities in the field or wish to study new views also.

Pointers 1. Concepts and their Terms (Classification Systems)
2. Concepts and their Definitions (Dictionaries)
3. Concepts and their Explanations (Encyclopaedias)
4. Titles (Bibliographies)
5. Partial Contents (Abstracts, Indexes)
Objects 6. Full Contents (Books, Paintings, etc.)
Interpretations 7. Internal Analyses  
8. External Analyses  
9. Restorations  
10. Reconstructions  

Figure 4. Basic levels of knowledge in the System for Universal Multi-Media Access (SUMMA).


Figure 5. Connections between reference levels (a term, a definition, an explanation, a list of authors, titles, partial contents as in abstracts and reviews) and full text, links between primary and secondary literature and different kinds of content, namely, internal analyses, external analyses, restorations, and reconstructions.

5. New reference potentials

Implicit in the above is a new approach to reference materials as a whole. In the past, reference rooms contained a whole range of sources (1-5 as outlined in figure four). In a traditional library I would consult a classification scheme to study related terms and use what I find to check another part of the subject catalogue. Or possibly, I might check the definition of a subject term, and then go back to the subject catalogue. In a virtual reference room all these individual levels are linked and I can go from any of the levels to re-enter my search mode.

Hence, in a traditional library each time I wished to consult a new source, I had to open some other book, often with its own idiosyncratic methods of organization. Given a virtual reference room where all these works are linked through common authority lists, which reflect historical and cultural dimensions of knowledge, I can go from a term to a definition, an explanation, a list of authors, titles, to partial contents (abstracts and reviews) to full text without leaving my desktop or notepad. While some may contend that all this is merely futuristic dreaming it is useful to look at the Japanese Electronic Dictionary Research Institute's (EDR) Electronic Dictionary as an example of what is happening today (figure 6). [40]

These new reference potentials are by no means limited to the reference sections of libraries. In the past, partial contents of books were available through at least three kinds of materials: (1) tables of contents and indexes, usually at the beginning and end of a book; (2) abstracts and reviews, available in periodicals; and (3) learned monographs and commentaries in secondary literature with their own indexes usually established under different principles than those in the primary literature.

In future, these three kinds of materials will be linked. I am interested in Leonardo da Vinci. A master index provides me with a list of all subjects in his manuscripts. These subjects correspond to a standard classification system, such that I can switch to broader or narrower terms. Say I am interested in optics. A list of all references to optics is displayed on my screen. I can ask for subsets relating to a given date or period (e.g., 1505-1508), and/or a given place (e.g., Milan). I choose a folio in the Codice Atlantico (Milan, Ambrosiana). Low-level agents will (have) search(ed) all the secondary literature on Leonardo concerning that folio and the subject optics. This results in a list of all secondary literature relevant to that folio as a whole, to that folio with respect to optics, and to optics elsewhere in the notebooks.

Alternatively, I am reading secondary literature, say the late Dr. Kenneth Keele's basic work, Leonardo da Vinci. Elements of a Science of Man, and encounter a reference to optics in Leonardo's Manuscript C (Paris, Institut de France). The system now allows me to go back to that folio in Leonardo's notebooks and, then, for example, back to a list of all other references to optics in Leonardo's writings.


Figure 6. Structure of the Electronic Dictionary Research (EDR) Institute's Electronic Dictionary.

In future, these connections will not be limited to traditional library materials. Lectures given by Leonardo scholars and university professors will be online. Agents will search these and create appropriate links. The order in which these are searched will vary. Sometimes it will follow a vertical model (as in figure 5). At other times the sequence may be quite different as in the following example (figure 7). Say we have a folio from the Codex Atlanticus with a preparatory drawing for the Last Supper. Agents will prepare alphabetical and chronological lists of all preparatory drawings for the Last Supper, link these such that one can view the corresponding images accordingly, i.e., alphabetically or chronologically, then go to an image of the Last Supper in Milan, and from there (a) to all copies and versions thereof; (b) to a lecture which compares all these drawings and originals, and/or (c) to a lecture which considers other aspects of the Last Supper.

The full power of the approach lies in that the sequence is open; that is, while viewing the lecture in (b) above, I can choose to return to the original folio from which the preparatory drawing comes, follow the figure it represents through various versions and copies, or study the history of restoration of the original fresco with respect to that figure. Quite simply, the potentials of linking things as foreseen by Bush, Nelson, and others remain, with one fundamental addition: there are lists to guide me through the various choices which present themselves.


Figure 7. Connections between different levels of reference levels and interpretation: a folio from a primary text (level 6); alphabetical and chronological lists (level 4); list of preparatory drawings (level 7), internal analysis (level 7), and external analysis (level 8) of same.

In the interests of simplicity we have focussed, in the above examples, on new treatments of textual knowledge. Augmented books in electronic form allow much more, of course. If an original, printed book showed an engraving or black/white photograph of the Acropolis, the corresponding augmented book potentially allows us to call up color photographs, Quick-Time Virtual Reality (VR) sequences, movie clips, and even live video feeds. In short, it unleashes the whole range of multimedia on any book we may happen to be reading. From an administrator's viewpoint, this approach also implies a virtual reorganization of libraries and museums.

6. Virtual reorganization of libraries

In an earlier section we noted that the convergence of new media will bring about a digitization of all reference works as well as a series of new links between these such that one can go seamlessly between classification systems, dictionaries, encyclopaedias, bibliographies, abstracts, and reviews. It is important to stress that these are only the first steps in a more fundamental change, whereby the contents of these reference works will be transformed, as will be illustrated by some examples.

In the case of classification systems, for instance, (low-level) agents can produce a kind of concordance between all the major classifications indicating which are the corresponding broader and narrower terms in other classifications. Hence, to continue with our papaveraceae example, the system will tell me which classifications have narrower terms for this concept and then allow me to view these alternatives either singly or together.

The case of dictionaries is similar. Having scanned these in and created links between equivalent words therein, a next step will be to analyze different kinds of definitions found therein, following, for instance, Dahlberg's fundamental distinction between ostensive, nominal, and real definitions. [41] This can lead to a re-organization of dictionaries such that every word has accompanying it, not only a series of etymological definitions, with full references to the history of a term in various languages, but also a clear indication of the kind of definition entailed. This will provide us with an important new search parameter. Sometimes we may wish to consult only real definitions, in Dahlberg's sense above, and at other times we might accept a more liberal approach and include ostensive and nominal definitions. Meanwhile, authority lists of personal names can be correlated with biographical dictionaries, such that I can see exactly which biographical works (be it the Dictionary of National Biography or Jöcher's Gelehrten Lexikon) are available for any given individual.

With respect to encyclopaedias, agents can determine all the languages in which a given term is treated. This might in turn be linked with classification systems such that one can visualize how one language deals more broadly with one subject area than another, and/or with a greater level of detail than another.

With respect to titles, although library catalogues typically have information concerning the standard title in the original language of writing/or publication, these same catalogues typically display titles without any regard to this standard. Hence, if one looks for Alberti's Della pittura, one needs to also look under De pictura, On Painting, Traité de la peinture, Traktat der Malerei, etc. Agents can collate all these variants and remind us of the language involved in each case. [42] While this could seem redundant in the case of On Painting for someone who is a native English speaker, in the case of O Malarstwie, the Polish equivalent, the language tag can prove very useful. In the case of reviews, agents can do systematic searches beginning with standard reference works such as the Internationale Bibliographie der Rezensionen and present these findings both alphabetically and chronologically.

Physically, libraries have traditionally been divided into two fundamental parts: (1) a reference room, which contains catalogues and reference works and (2) stacks which contain the books: i.e., pointers to content and the contents themselves. Academically, scholars have in turn divided those contents into two further categories: primary literature (original sources) and secondary literature (monographs and articles about those original sources). Physically this distinction is somewhat reflected through separate sections for books and periodicals.

Another way of looking at the primary-secondary distinction is to suggest that primary literature is the equivalent to the original (objective) contents, whereas secondary literature constitutes the (subjective) interpretations of those contents. As such, secondary literature then lends itself to four further distinctions. Interpretations (of secondary literature) can entail (a) internal analyses of the object itself, as brought into focus by close reading; (b) external analyses, whereby the object is compared with similar objects of that class; (c) restorations, whereby the original object has built into it the interpretations of the restorer; and finally (d) reconstructions, whereby the built-in interpretations of the reconstructors are that much greater. These reconstructions pertain to given diagrams, claims, ruins, etc. The system will give systematic access to each of these. [43] Taken together with the pointers to knowledge (reference works) and the full contents of the texts themselves, these interpretations are an important part of the ten basic levels of knowledge (figure 4).

This approach to searching at different levels of knowledge is now typically referred to as granularity. Historically, this process was very long in developing. The idea of searching at the level of books goes back to the earliest known libraries in the third millenium before Christ. The idea of searching at the level of articles came only at the end of the nineteenth century when Otlet and Lafontaine founded the Mundaneum (Brussels now at Mons) and the International Institute of Bibliography (now the FID in the Hague). In a sense, full-text searching began with the introduction of indexes in the latter Middle Ages, but practically speaking full-text searching has only been introduced in the course of the past decades with the advent of more powerful computers.


Figure 8. Examples of three basic levels of granularity in searching for materials. An ability to study the full contents of books at level three goes hand in hand with a need to link this with more abstract searching at levels one and two.

Paradoxically, this ability to search potentially at the level of every word has introduced a concurrent need to search at higher levels of abstraction in order to see the wood for the trees. Hence as one has gone from studying an individual word, phrase, or sentence to all uses in a book, there has been an increasing need to study also partial contents and concepts at a higher level in order to understand larger patterns (figure 8). Here, one approach is to re-organize knowledge in terms of different subsumptive relations such that one can go from a subject down to its properties or up to its whole/part and type/kind relations (figure 9). This is a realm where recent developments in scientific visualization offer promising new possibilities. [44]

Material, Subject Relations
Logical relation Subsumptive Type/Kind Principle/Manifestation
Genus/Species
Species/Individuum
Whole/Part Organism/Organ
Composite/Constituent
Matrix/Particles
Subject/Property Substance/Accident
Possessor/Possession
Accompanance

Figure 9. Levels of abstraction based on subsumptive relations according to Perrault.

On this problem of linking different levels of abstraction, futurists such as Benking [45] have also intuitively sought to link subjective and objective elements in arriving at a more systematic understanding of the world. Benking foresees using a universal classification system such as the ICC as a means of switching between various classification schemes and moving from levels of abstraction to levels of detailed knowledge. Lacking in such outlines is a clear method of how one moves between various levels of abstraction. Even so, his intuitive understanding of the challenges is extremely stimulating.

Our approach to levels of knowledge is an attempt to create a more coherent framework for systematic study at different levels of granularity. In electronic versions of libraries these levels provide important new approaches to orientation within a field of knowledge, contextualization of knowledge, and ultimately the re-organization of knowledge.

7. Virtual reorganization of museums

That which applies to libraries will apply in similar form to museums, archives, and other memory institutions with collections. A simple example will illustrate how all this will lead to a new contextualization of knowledge. The Last Supper is an event in the life of Christ as reported by the four Evangelists (Matthew, Mark, Luke, and John) in their gospels in the New Testament.

Once agents have searched all references in the secondary literature, we can trace all commentaries on the Last Supper as well as the history of images of this event. Hence, we can trace how the Franciscan idea of making a fresco of the Last Supper at Assisi was taken to Santa Croce, the Franciscan church in Florence, adapted by the Dominicans through Ghirlandaio in San Marco (1479-1480), then became the fashion in Florence as Ghirlandaio produced another version in Ognissanti (1480), Castagno another in Sant'Apollonia (c.1480) before Leonardo da Vinci immortalized the theme in Santa Maria delle Grazie (Milan, c. 1495-1498). The copies of Leonardo's version now in Lugano, Tongerloo, and London (Royal Academy, Burlington House) will also be available, as will many other versions as paintings, models, etc.

Internal analyses (level 7 in figures 4 and 5) will allow me to trace all discussions about the contents of a given fresco, including identifications of the various persons in Leonardo's version. External analyses (level 8) will allow me to trace the history of different versions and also different treatments of a given Apostle within a fresco. Restorations (level 9) will make available all information about various interventions concerning Leonardo's painting at Milan. Reconstructions (level 10) will make accessible all attempts to analyze the space within the fresco, ranging from crude drawings to complex perspectival drawings, physical models, and computer simulations.

In the case of paintings, this process of contextualization will provide, to the extent possible, a history of where the object was situated: i.e., a three-dimensional version of the approach in the Getty Provenance Index. As a result I shall be able to see where the painting hangs today along with all the places it hung previously in other galleries including the church or palace for which it was initially commissioned. This same principle can be extended to the near future, such that I am able to see where paintings will travel for special exhibitions in the coming months.

Initially the idea of a virtual museum was seen primarily as a means of helping tourists: providing a guide so that tourists can find more quickly Botticelli's Birth of Venus in the Uffizi, Leonardo's Mona Lisa in the Louvre, Rembrandt's Nightwatch in the Rijksmuseum, etc. Initial examples of this approach such as the Micro-Gallery in London focussed on orienting users qua the public rooms of the National Gallery. A next step would be to make visitors aware that major museums such as the National Gallery have a number, sometimes a great number, of paintings in storage. Classing these by themes and showing how there are (sometimes not so subtle) shifts of what gets shown and what gets stored in the course of decades will provide new insights into the history of taste.

The French concept of an imaginary museum (le musée imaginaire) expands this approach considerably. For now an electronic museum can do things that do not necessarily correspond to the physical world and sometimes would not be possible in the physical world. A dramatic example would be to show electronically all the paintings of Leonardo in a single, fictive space, [46] and combine this with all the restorations and reconstructions of those works, particularly the Last Supper. Such virtual museums can be extended to periods and styles. For example one could have an imaginary museum of impressionism which would show all the famous extant examples ranging from the Jeu de Pommes in Paris, the Metropolitan in New York, the Barnes Collection in Philadelphia, and the Pushkin Museum in Moscow.

Slightly more prosaic but wonderfully useful nonetheless would be a virtual history of exhibitions. Ideally, this would serve as an excellent aide mémoire in helping us to recall historic, so-called blockbuster, exhibitions. At the same time it can help us to imagine what we missed. Many of us know the experience only too well. There is an exhibition by one of those famous artists such as Rembrandt, Monet, Van Gogh, or Picasso in a city such as Paris or Washington, D.C., and we miss it because we just cannot get away during the few weeks that the exhibition is in town. Or even worse, we get there only to find that the exhibition is sold out for the few days that we are there. Sometimes we manage to get a ticket and find that we have fifteen minutes to see a set of paintings because visitors are being herded through as if they were cattle. Alternatively, we get in and find that the herds are so great that we effectively cannot see the painting at all. This even occurs with paintings that are on permanent display such as the Mona Lisa.

The advent of the notepad computer and PIAs described earlier offer many new possibilities. Some of these are being explored in the context of the Intelligent Information Interfaces (I3, Icube) [47] program of the European Commission. For instance, the Hyper Interaction within Physical Space (HIPS) [48] project, which is being tested in the Museo Civico of Siena, allows visitors to listen to information using earphones and make notes on a PDA. A limitation of this project is that information used is limited to that contained within the PDA.

Our approach foresees linking the PDA through the Internet with a much larger array of materials in a virtual reference room as in the following scenario. I am standing in front of a copy of Brueghel's Winter Landcape in the Bonnefanten Museum (Maastricht). My notepad computer is equipped with a small camera, which recognizes the painting via pattern recognition, the title, or simply via the equivalent of a bar code. This information is taken up by a (low-level) agent, which notes that this is a copy of a more famous original now in the Kunsthistorisches Museum (Vienna). The software agent then offers me a possibility of seeing various other copies and versions of the same painting.

While I am still at home and preparing to visit a city such as Florence I could "have a chat" with my PIA. I could point out that I am going to Florence for four days and am interested in seeing museums and churches. The assistant determines which museums and churches are open during those days and displays these as a list. Say I choose the Uffizi and San Marco. The assistant tells me what exhibitions will be on at that time. If I am a specialist, the assistant can give me a list of painters exhibited in the Uffizi and ask me which of these are of particular interest -- assuming that I am a new user and my assistant does not yet know my specialties. The assistant can then list earlier exhibitions about those painters, both in Florence and elsewhere, and more detailed literature, if I choose. Hence, I can effectively do research for my visit before I arrive in Florence and once there I can call up different copies, versions, related preparatory drawings, etc. If I am interested in the history of the painting, I can review various restorations which have been made thereof or study the layers under the surface of the painting as revealed by new techniques such as infrared reflectography. [49] The new media will literally allow me to look into pictures and see things that I could never see as a regular visitor.

Such innovations may prove to be as significant for museum professionals as for tourists and regular visitors. There has traditionally been a clear division of labor whereby curators studied the surfaces of paintings as connoisseurs, whereas conservators studied the layers underneath. Often these two groups went about their work as if they were dealing with completely different objects. In fact, detailed knowledge of the layers of paint under the surface studied by the conservator can often provide vital clues for a connoisseur, who is uncertain whether to attribute a given work to the master himself, to a student, to a member of the workshop, or to a follower. Hence, the new media can bring new interactions between conservators and curators, which can ultimately further enrich the viewing experience of visitors.

As the capacities of high-level virtual reality become more generally available, further scenarios are possible. One could have a history of the Louvre in Paris which shows (a) how the complex of buildings grew in the course of the centuries, (b) how the paintings of the collection are configured differently over the centuries; (c) how these changes and changes in frames reflect shifts in the history of taste, and (d) in certain cases even show reconstructions of how paintings have changed in color over the ages. [50]

Seeing such intelligent guides and virtual exhibitions cannot truly replace the experience of the actual objects. Nonetheless, they serve at least four valuable purposes:

  1. In many cases, the simulations will have the effect of inciting persons to look more attentively at the originals.
  2. They can help us to see a given painting in the context of others by that artist and/or their contemporaries. As a result when we next see that painting on its own in its home gallery, it will evoke a much richer set of connotations.
  3. Even those of us who are very familiar with museums such as the Louvre, the Prado, and Vatican often do not have occasion to see the great collections and monuments of Russia, India, China, and Japan. In fact very few persons are able to see all the great museums. Hence virtual exhibitions can serve to bring understanding of those places which we cannot visit.
  4. Finally, in the case of endangered sites, such as the Caves of Lascaux or the Tomb of Nefertari, which are now closed to the public, such virtual reality reconstructions are a substitute for viewing the original, in the interests of long-term conservation.

8. Reorganization of knowledge

We noted earlier how object-oriented programming led increasingly to a bundling of instructions concerning a given operation. Applied to cultural objects this implies that all the knowledge concerning a given subject such as the Last Supper will be bundled therewith, such that one can trace its development chronologically and spatially throughout a range of media. Since the Renaissance we have used media to separate objects: books went into libraries, paintings into art galleries, drawings into drawing collections (French: cabinets de dessin), engravings into engraving collections (German: Kupferstichkabinett), maps into map rooms, etc. The reorganization of knowledge implies that a theme such as the Last Supper, which entails all of these individual media, will link all these objects which are now spread out through numerous institutions.

This trend is more interesting because companies such as Autodesk have already extended the notion of object-oriented programming to the building blocks of the manmade world through what they term Industry Foundation Classes. [51] Hereby a door is now treated as a dynamic object, which contains all the information pertaining to doors in different contexts. Hence, if one chooses a door for a 50-story skyscraper, the door object will automatically acquire certain characteristics, which are very different from a door for a cottage or for a factory warehouse. This is leading to a revolution in architectural practice because it means that those designing buildings will automatically have at their disposal the "appropriate" dimensions and characteristics of the door, window, or other architectural building block which concerns them. There is a danger that this could lead to stereotyped notions of a door, window, etc., a McWorld effect, whereby buildings in one country are effectively copies of those in other countries, and travel loses its attractions because everywhere appears the same.

Alternatively, one can extend the concept of foundation classes to include cultural and historical dimensions. As a result, an architect in Nepal wishing to build a door, in addition to the universal principles of construction applying to such objects, will be informed about the particular characteristics of Nepalese doors, perhaps even of the distinctions between doors in sections of Khatmandu or near Annapurna. Similarly an Italian restorer will be informed about the particular characteristics of doors in Lucca in the fifteenth century. All this may seem exaggerated. But after some of the key historical houses with elaborate ornamental carvings in Hildesheim were bombed during the Second World War, a small group of carpenters worked for several decades to reconstruct the original, beam by beam, carving by carving. They did so on the basis of detailed records (photographs, drawings etc.). If this knowledge is included in the cultural object-file of Hildesheim doors, windows, and houses, then rebuilding such historical treasures will be much simpler in future.

At stake is something much more than an electronic memory of cultural artifacts which will serve as a new type of insurance against disaster. The richest cultures are not static. They change with time, gradually transforming their local repertoire, often in combination with motifs from other cultures. The Romanesque churches of Northern Germany adopted lions from Italy for their entrances, which were, in turn, based on lions from the Middle East. The church of San Marco in Venice integrated Byzantine, Greek, and local motifs. The architecture of Palermo created a synthesis of Byzantine, Norman, Arabic, and Jewish motifs. The architects in Toledo and at Las Huelgas near Burgos created their own synthesis of Jewish, Arabic, and Christian motifs. A comprehensive corpus of variants in local heritage can thus lead to much more than a glorification of local eccentricities and provincialism. It can prove an inspiration to multi-culturalism in its deepest sense. It can include basic building methods, styles, ornament, and decoration.

These combinations were successful because they were guided by culture and taste. Combinations per se do not guarantee interesting results. If taste and sensibility are lacking, the results are merely hybrid versions of kitsch. So the technology must not be seen as an answer in itself. It offers a magnificent tool, which needs to be used in combination with awareness of the uniqueness and value of local traditions. These new possibilities of using technology to expand the vocabulary of cultural expression apply not only to the physical, built environment of the manmade world. Potentially, they apply to the whole of knowledge. Thus the re-organization of knowledge leads to a re-combination of elements to produce new expressions and new knowledge. Here we shall outline briefly some scholarly and business dimensions of what is entailed.

Scholarly dimensions

Individuals (who?)

An initial challenge is to achieve dynamic knowledge of individuals: (1) changing historical knowledge about an individual, (2) changing perceptions of an individual, and (3) assessing the authority of literature concerning an individual. First, there is a problem that what is known about the writings or paintings of an individual changes over time. Today there are static lists of complete works or of a catalogue raisonnée of paintings. Such lists need to be dynamic. As noted earlier, the list of paintings attributed to Rembrandt changes with time and this phenomenon must be integrated into our search strategies such that there will be a time-sensitive answer to the question: What did Rembrandt paint?

Second, there is a paradox that persons now famous such as Montaigne or Leonardo were judged very differently throughout the centuries, almost forgotten in some generations, particularly praised and often for different reasons in other generations. Most of our present methods of presenting individuals do not take adequate account of such aspects.

Third, with a genius such as Leonardo, thousands of persons feel prompted to write something about the man. The number of persons in any generation who have actually read his notebooks has never been more than a handful. The Internet potentially offers us access to everyone who cites Leonardo, but has almost no mechanisms in place to distinguish between standard works, generally respected works, and non-authoritative lists. In our view, the radical proposal of some to re-introduce censorship is not a reasonable solution. The problem is made the more elusive because the recognized world authority in one decade may well be replaced in another decade.

Needed, therefore, are new kinds of dynamic, weighted bibliographies, which allow us to have subsets on the basis of field-specific acceptance, new ways of expressing and recording in electronic form the well-established traditions of peer review (which is totally different from attempts to do simplistic electronic computations of quality), to arrive at peer review with an historical dimension in electronic form and yet still have access to a wider range of less authoritative or, more precisely, less established by the authorities, sources in a field. In tackling such alternatives between the authority of sources vs. (mere) citations, we would be using technologies in new ways to return to central questions of quality. Having such dynamic biographies will permit new approaches to fundamental questions of how we assess quality and importance. The extent to which agents can be developed to address these challenges needs to be studied.

Objects (what?)

Beyond the initial problem of access to a specific object in a museum, library, or archive, or to free-standing monuments is a larger challenge of how we can access all the related objects which have derived from or have been affected by the object: the copies, replicas, versions, imitations, and sometimes pastiches.

Present-day sources typically focus on objects as static entities. The limitations of print frequently lead us to focus on one example as if it were the whole category. Accordingly we all know about the Coliseum in Rome but most of us are unaware of the dozens of coliseums spread throughout the Roman Empire. Using dynamic maps and chronologies, new kinds of cultural maps can be developed, which allow us to trace the spatial-temporal spread of major cultural forms such as Greek theatres or temples, Roman coliseums, or Christian Romanesque churches. This will allow novel approaches to longstanding problems of central inspiration and regional effects, the interplay between center and periphery, in some cases between center and colonies. Such questions pertaining to original and variants (versions, copies, imitations) are again central to the challenges of a European Union which aims to maintain diversity within a larger unity. At another level, this is effectively a global challenge.

Concepts and their relations (what?)

Presently we have many different classification systems and thesauri. Earlier in this paper I outlined a practical approach for mapping between these systems, by using existing applications in libraries in order to do a retrospective backtracking to link concepts. Theoretical proposals for mapping among these systems also exist (Williamson, [52] McIlwaine [53] ). Systems such as the Universal Decimal Classification (UDC) and developments in terminology allow more systematic treatment of relations among subjects into classes such as subsumptive, determinative, ordinal, etc. (Perrault). [54] A dynamic system that allows us to switch between classifications in different cultures and historical periods can provide new kinds of filters for perceiving and hopefully appreciating subtleties of historical and cultural diversity. The enormous implications for learning range from philosophy (especially epistemology), where we can trace changing relations of concepts dynamically, to the humanities with courses on culture and civilization (a term which again has very different connotations in French, German, and English). Instead of just citing different monuments, works of art, and literature, we shall be able to explore different connections among ideas in different cultural traditions. For example, with respect to the fine arts, Ranganathan's classification (India) is much weaker than Dewey (United States), yet much more subtle than Dewey with respect to metaphysics and religion.

Historical and cultural concepts (what?)

An integration of the methods outlined will lead to new kinds of knowledge maps, which allow us to trace the evolution of concepts both spatially in different countries and temporally in different historical periods. This will lead to dynamic bibliographies. We shall be able to trace how a concept is frequently broached at a conference, then developed in articles and books, and gradually evolves into a recognized discipline, which may subsequently re-combine with another as, for example, the way in which biology and chemistry led to bio-chemistry. Such an approach to the growth of knowledge will allow us to return with new depth to problems broached above of standard/model versus variants/versions, of center versus periphery and the role of continuity in the spread of major forms and styles of expression. [55]

Space (where?)

Current printed maps in atlases are static. Historically the boundaries of maps change. At JRC, the Institute for Systems, Informatics and Safety (ISIS) has already developed very impressive three-dimensional maps. In conjunction with the European Space Agency much work is being done in the area of coordinating GISs and GPSs. The spatial meta-data project would produce dynamically changing atlases and link this with GIS and GPS. This is a prerequisite for visualizing changing political boundaries. Coupled with information concerning cities and events and other historical data, this will permit much more nuanced search strategies. As mentioned earlier, we shall then be able to trace how the boundaries of a country such as Poland shift dramatically over the centuries. The question Where is Poland? will thus shift with time and adjust accordingly. Applied globally this will furnish us with more than simple instructions of how to get there. It will make visible people's misconceptions of geography at various points of history. It will also reveal how politics affects the boundaries of countries such that, for instance, India's maps of India and Pakistan may well be different than Pakistan's maps of the same two countries. To achieve this, global cooperation is needed.

Time (when?)

Connected with this is a challenge of historical temporal meta-data, whereby one has a standard method for correlating the different time scales of various chronological systems and calendars (including the Jewish, Chinese, Indian, Gregorian, Islamic, and Julian calendars). Those of the Hebrew faith had their fourth, not second, millenium problem some time ago. Coupled with historical data this will be a significant step towards studying history from multicultural points of view. For if I am reading am Arabic or Jewish manuscript and come upon the date 380, the system will immediately provide an equivalent in the Christian Gregorian or Julian calendars.

Narratives (how? why?)

Beyond the initial problem of access to a given subject or episode such as the Crucifixion or Diana and Actaeon, there is the challenge of accessing the context of these subjects, the narratives from which they derive. An integration of the above methods will bring new access to the history of narrative, and thereby new approaches to literature, art, and culture as a whole. A culture such as Europe is defined by a relatively small number of major narratives deriving from two traditions: (1) Judeo-Christian (the Bible, Lives of the Saints) and (2) Greco-Roman (Homer, Virgil, Ovid). We belong to the same culture if we know the same narratives, if we have the same stories in common.

Paradoxically, those who have the same stories inevitably develop very different ways of telling those stories. The media differ. For instance, in Italy the lives of the saints most frequently become the great fresco cycles on the walls of churches. In France and the Netherlands, the lives of the saints are more frequently treated in illuminated manuscripts. In Germany, they frequently appear in complex altarpieces. Not only do the media vary but also the ways of telling stories. The Life of Christ in Spain is very different than in the Balkans or within the Orthodox tradition in Russia. Even so the commonality of themes means that Europeans can feel an affinity towards a Russian Orthodox church, which they cannot readily feel with an Indian temple with stories from the Mahabharata or the Ramayana (unless of course they know these stories as well).

In these transformations of the familiar lies at once the fascination of change through continuity, which inspired the studies of Aby Warburg and his school, but also implicitly, a series of important lessons about the keys to diversity. The most diverse narratives are precisely about the most familiar stories. Uniqueness cannot simply be generated by trying to be different, by rejecting others and removing them through ethnic cleansing. Uniqueness comes through sharing common stories, which inspire fundamental values (such as life and truth, e.g., through the ten commandments), and then expressing them differently. To visualize and make visible the complexities of these historical diversities of expression is our best hope for understanding the challenges of future diversity. Inherent in such questions lie the seeds for understanding changing definitions of our world: dynamic phenomena, processes rather than static definitions.

The attentive reader will be aware that the re-organization of knowledge implicitly leads to two related yet different challenges. One is simply: How do we re-arrange what we have so far in order to make new sense thereof? A second challenge is more elusive: How will the new technologies transform the very ways in which we organize, store, and access information? Will our principles of classification change dramatically? Will our definitions of knowledge be changed completely? In this paper we have offered a series of possible, tentative suggestions in the direction of a new vision which embraces both these challenges.

To achieve this, however, as should by now be abundantly evident, requires much more than a simple scanning in of documents. It requires long-term commitment to meta-data and eventually a complete re-organization of all our knowledge. We may continue to store things in libraries, museums, and archives but the sources of insight will come through virtual reference rooms and repositories beyond the walls of any of those institutions. And these electronic repositories may well become systematic links which prove to be larger than the original sources, meta-data banks of authority lists with all the key pointers concerning a given person, object, event, or process, properly reflecting the complexities of cultural and historical views thereof. Moreover, as will be discussed below, this heritage of the contemplative life (scholarship) needs to be integrated with the insights of the active life (business). Some basic references on knowledge organization are provided in Appendix 1.

Business dimensions

Individuals (who?)

Presently, individuals in a company typically have a curriculum vitae (c.v.) which summarizes their education, employment experience, publications, lectures, etc. In the future c.v.'s may well include dynamic aspects, with brief video clips of lectures or presentations, dynamic maps which show where a person traveled and perhaps even indicate the impact of their products, books, etc.

Companies (what?)

In the past decades there has been an increasing fascination with visualizing change in all domains of business. A quest to see dynamic changes in stocks led Hani Rashid to create a Virtual New York Stock Exchange (3DTFV) [56] with which one can see stocks in real time as they rise and fall. This allows one to see the state of a company at any given moment.

Imagine, however, a historical version of the same, whereby we can trace a company qua its financial growth and the number of its employees; whereby we can see the companies which a given corporation buys out, plans to acquire, and actually takes over; the firms through which it is linked in consortia; and the firms with which it has alliances and associations. Using such a tool we could watch how IBM grew from a family business to one of the great corporations of the world. One could also see at any time the relation of a given firm to others in the field. What, for example, is the size of IBM relative to other computer firms such as Fujitsu, Compaq, Dell, Bull, etc.?

Institutions (what?)

How will these converging new media transform institutions as we now know them? Some radical thinkers assume that within two decades banking will be so completely automated that bank employees will be largely redundant and consequently bank buildings will no longer have their present use. Will the same things happen to most large companies? Does this mean that the downtown sections of most large cities which are now dominated by office blocks could conceivably be refitted to provide a new kind of inner-city housing? Will the same thing happen in government? If so, why are countries with futuristic visions such as Malaysia building a whole new government city, Putrajaya? Will we look back in twenty years to the idea of automatic banks and government with the same amount of irony with which we now look back to the idea of the paperless office as it was advertised in the sixties and seventies? In any case, as with companies, there is a challenge of how we can arrive at dynamic visualizations of the growth and/or decline of institutions. Some basic references on institutional aspects and knowledge management are provided in Appendix 3. It is instructive to recall that some authors such as Philip C. Murray define knowledge as "information transformed into capabilities for effective action. In effect, knowledge is action," [57] which is far removed indeed from our notion of knowledge organization. In this interpretation knowledge management is nothing more than management with new tricks.

Products (what?)

The question of what will change applies equally in the domain of products. There are many new methods for networked design and even networked manufacturing and production. More objects than ever are being produced. But how can we be certain that they are better?

Processes (how?)

The quest for efficiency in work processes was led in the early twentieth century by Frederick W. Taylor, who became so famous that his approach is still known formally as the Taylorisation of work. [58] It is implicitly also remembered through Charlie Chaplin's classic movie Modern Times. Nearly 40 years ago Everett Rogers wrote his classic work on Diffusion of Innovations. [59] In the past decades this has led to new activities such as Gantt charts, [60] work flow software, business process engineering, and requirements engineering. Nearly two decades ago thinkers began writing of the Taylorisation of intellectual work. [61]

The realm of processes is particularly interesting for it is precisely in this field that indexers claim to have made the greatest progress in the past decades. Ironically there is practically no cooperation between scientific management experts in the tradition of Taylor and professional indexers. How can we combine the intellectual insights of indexers regarding processes with the advances in visualization of processes from other domains? One of the great problems with early solutions in this domain is that they were almost invariably in-house or proprietary solutions produced by external consultancy firms. A challenge lies in using standards whereby all these disparate systems can become accessible through a common front-end.

In the Mediaeval period simple processes such as mining, smelting, purification of metals such as copper and iron, and the preparation of medicines were largely verbal. The advent of printing in the West during the Renaissance saw an increasing trend to codify these instructions in book form. These early how-to books led gradually to the instruction and repair manuals of the twentieth century. In the past three decades these manuals have become available in electronic form and increasingly with multimedia aids such as virtual reality. How will these new media transform the nature of technical training, instruction, and repair? As will be mentioned below, some believe that many of these activities will also become automated. Radical visions in this context claim that any process which can be described will become automated within the next two generations. Meanwhile, futurists already speak of self-repairing and self-healing products. [62]

Control (how?)

One could usefully see the growth of control systems and management systems as a subset of processes or at least as a closely related domain for, ultimately, all efforts at control and management require a careful understanding of the processes which are being controlled and managed. Here again a new kind of meta-data is implicit because potentially we need to (a) understand the processes as they are intended to function; (b) recognize a series of things which can hinder or disrupt those functions, and (c) know which steps are necessary in order to repair these. All this may seem theoretical and futuristic but actually reflects a trend toward intelligent devices on the Internet:

AT&T Network Services is spearheading an effort to provide managed network services for intelligent devices that would allow remote monitoring and troubleshooting of any piece of equipment that can be linked to a network. AT&T is now testing the service, which it expects to release in early 2000. Managed network services have the potential to significantly alter business operations. For example, companies that send support staff into the field to repair a customer's device would be able to monitor the device's performance remotely and replace parts proactively before a failure occurred. Furthermore, managed network services would allow manufacturers to build only one variation of a product, allowing customers to obtain additional features by downloading firmware onto a chip embedded in the device. Current demands for customization require manufacturers to build many variations of each device. Intelligent device-management software is already available from RapidLogic, which offers RapidControl software to consolidate the different ways of accessing, managing, and controlling intelligent devices. [63]

Transactions (how?)

One can foresee that there will emerge new meta-levels which trace and make visible larger patterns in transactions of firms, not just that x dollars were spent this week, but more concerning the directions of money flows. A scenario follows: firm A is based in Colorado and has annual earnings of $10 million. Traditionally, 80 percent of those earnings and expenditures took place within the state, and the remaining 20 percent within the United States. When 5 percent of those earnings are suddenly transferred to a Swiss account, the system is able to produce the financial equivalent of a flow chart in order that managers can visualize and understand exactly what processes are entailed. This approach can be applied at a personal, company, and institutional level and potentially also at the national level. As e-commerce and Internet trade shift from hype to become central aspects of our lives, such methods may prove to be the only hope for governments to maintain an overview of taxable goods. Indeed they may become the only hope for governments to maintain the very idea of taxes. Some basic references on organizational aspects are provided in Appendix 2.

Learning

Learning is one of the ubiquitous buzzwords in all these developments. Lifelong learning is touted as an emerging phenomenon as if employees in the past were not faced with the phenomenon. Chris Argyris and Peter Senge [64] have convinced many that there are now Learning Organizations, [65] wherein change is a part of everyday experience. Needed are new methods which will help us to visualize this change. As in so many other realms the re-organization of knowledge lies in no small part in trying to measure what was previously not measurable, to visualize what was previously invisible.

Some see a challenge of understanding these hitherto diverse phenomena as part of the emerging knowledge society. For instance a new International Institute of Infonomics (Heerlen), which will work closely with the Maastricht McLuhan Institute, sees a need to link these various business dimensions, namely, individuals, companies, and institutions, and organizational aspects such as products, processes, control, transactions, and learning, and relate all these to shifts in the theory of knowledge and knowledge transfer. [66] Some basic references on learning are provided in Appendix 4.

Closely linked with these developments are new theories and practices as to how students learn best. We are told that a shift in the role of teachers is under way: "From the sage on the stage to the guide on the side." There is an ever greater emphasis on problem-based, project-based, and even product-based learning. There is a fascination with collaborative learning, with electronic notebooks for courses, and with an assumption that students will do their own annotations. Here developments in multivalent documents offer new possibilities. [67]

Levels of truth

For such an approach to work we also need new methods to determine the truth level of any claim, as Tim Berners-Lee has already envisaged in his concept of a semantic web. Such an approach will allow us to distinguish between personal claims; those accepted by groups, national, and international bodies; those which are logically true; and those which have been experimentally demonstrated.

It is important to recognize, moreover, that this truth level is itself subject to historical changes. During the Middle Ages, for instance, the Donation of Constantine -- whereby that Roman emperor theoretically handed over the rights of his empire to the Catholic Church -- was long believed to be true. During the fifteenth century, the Renaissance scholar, Lorenzo Valla, after careful palaeographical and codicological detective work, discovered that the document had been written after the fact and could therefore demonstrate that it was false.


Figure 10. Schematic view of a European trend to use the Internet for new access to enduring knowledge.

Hence, in future the question Was the Donation of Constantine true? will have a nuanced answer: it was false but for a given period it was believed to be true. If all this sounds close to sophistry and too abstract, it is useful to recall that these problems of veracity remain very real in contemporary medicine. Almost every day new drugs are developed. National bodies require that these drugs be carefully examined by experts and undergo rigorous testing. Medical experts are therefore consulted, whose cautious opinions are sometimes quoted out of context to give a better impression concerning the efficacy of the drug. Only through access to the full context of these comments can the level of certainty reasonably be determined.

In the case of scientific laws such as gravity, the level of certainty is, theoretically, beyond discussion -- else it would not be a true law. In many cases, however, there are hypotheses, where there is no single, incontrovertibly accepted answer. In such cases we need access to different views concerning the problem.

As noted earlier, the re-organization of knowledge which is under way is much more than a simple scanning process from analog to digital form. It entails new approaches to all branches of knowledge. It will introduce animated, dynamic aspects into knowledge, both in the sense of science (cf. German Wissenschaft) and know-how (cf. Dutch kennis). Individual facts and claims will be contextualized through links, which Horn would call hypertrails. These hypertrails will, moreover, have their own recognizable structures qua knowledge organization and will have cultural dimensions (e.g., proverbial French logic vs. English reason), historical dimensions (with different chronologies), and geographical dimensions (with different kinds of maps and different boundaries over time).

9. Augmented knowledge

The late Marshall McLuhan spoke of technologies as extensions of man. In the past those technologies were invariably extensions of physical aspects of the human condition. Through a mechanical "extension" of our arms, bulldozers allowed us to push more than we ever could with our bare hands. Similarly cranes allowed us to lift more. Telephones were extensions of our voice, televisions of our eyes. Through web cameras on the Internet we can see news halfway around the world without having to go there. An almost anecdotal example in a recent newsletter gives an idea of the unlikely extensions of technology, which are occurring through these Internet developments:

Connecting your Singer to Cyberspace

Was I shocked when I read this sentence in a sewing discussion board? "Husquavarna also has a new machine coming out in January which changes their software/hardware needs. The new machine will use a regular 3.5" floppy..." All my experiences with Husquavarnas were with chain saws, but beyond that -- sewing machines with USBs, floppies, and their own file format debates? Bernina, Pfaff, and Singer all have developed product lines that connect to the Internet (via a PC) so you can download patterns and embroidery. [68]

The combination of technologies which are the focus of this paper continue this trend but at the same time take it inward. Augmented books and the like are about extending our memory and the capacities of our minds. In the past the amount I knew depended on how much I had learned personally, and then remained largely a function of how good my memory was. If my work involved scholarship and I was fortunate enough to have at my disposal a great library with the resources of a major reference room, then such tools could compensate somewhat for limitations of memory, but I was still limited to what was within reach at any given moment. If I happened to live in Toronto, it was of little help to know that a wonderful reference work stood somewhere on a shelf at the British Library, the Bibliothèque Nationale, or the Vatican. Through augmented books, linked via notepads to virtual reference rooms through wireless connections, all this changes. I can have at my fingertips the cumulative memory concerning enduring knowledge as found in our libraries, museums, and archives.

What is particularly fascinating in these developments is the promise of a new kind of convergence that goes far beyond the technological dimensions, which were the starting point of this essay. Some authors have written about collective [69] intelligence or connected intelligence. [70] Pierre Levy, for instance, claims that we have created three kinds of spaces in the past and that the new technologies and the Internet are introducing a fourth space entailing knowledge and collective intelligence (figure 11). [71] Richard Barbrook notes that Levy's ideas are inspired by Islamic theology, whereby the "collective intelligence" is rather similar to God and that Levy's real interest is in real-time direct democracy. [72]

Kind of Space Products
Earth Language, Technology, Religion
Territory Agriculture, City, State, Writing
Commodities  
Knowledge and Collective Intelligence Virtual Agora, New Democracy

Figure 11. Four kinds of spaces according to Pierre Levy in his book Intelligence Collective (1996).

Others have gone further. Gregory Stock has written of Metaman: The Merging of Humans and Machines into a Global Superorganism. The Fishers have written of The Distributed Mind: Achieving High Performance Through the Collective Intelligence of Knowledge Work Teams. Peter Russell has gone further to write The Global Brain Awakens: Our Next Evolutionary Leap. [73] Meanwhile visionaries in the business world, such as Allee, speak of Fifth Generation Management: Co-Creating Through Virtual Enterprising, Dynamic Teaming, and Knowledge Networking, Halal et al. speak of The Infinite Resource: Creating and Leading the Knowledge Enterprise. [74] On the Internet there are also a series of Conceptions of the (sic!) Global Superbrain. [75] In other words, something much more than a re-organization of old and new knowledge has begun. Implicitly there is a new convergence between these.

On the one hand, as we have noted, the realm of enduring knowledge has focussed ever more on the potentials of systematic knowledge organization. Whereas the world of new knowledge championed by technology and business would have us believe in natural language searches, whereby every term is equal in a complete democracy of words, the champions of enduring knowledge remind us that there are distinctions among words, such as concepts, terms, descriptors, and captions; that there are a number of coherent relations among these concepts; and that there are distinctions to be made between structures and systems. The world of enduring knowledge relies on the traditions of logic, philosophy, grammar, linguistics, and, more recently, classification and information science.

In the world of education, the notion of Computer Supported Collaborative Work (CSCW) has taken the form of Computer Supported Collaborative Learning (CSCL). Here there is an assumption that young children and students can benefit from collaboration even before they have specialized knowledge. Sharing becomes a virtue in itself even before there is content to be shared. It is striking that mobile instruments play an important role in this vision also. For instance, Stanford University's Learning Lab foresees working with collaborative learning tools, interactive spaces, personal learning strategies, and tools. One such example is their Personal Electronic Notebooks with Sharing (PENS). [76]

On the other hand, from the business side this move towards convergence has seen an increasing attention to visualizing the steps/stages/phases in processes and production, a Taylorisation of the whole spectrum of human activity from physical labor to the creative process itself through the use of simulations, workflow schemes, and enterprise integration. One could argue that Engelbart's concept of augmented knowledge at the Bootstrap Institute is effectively another step in this same direction. And as the mastery of processes becomes ever greater, there is a corresponding emphasis on (business) process re-engineering.

Along with these attempts to visualize processes, there have been two other trends: one towards operation at a distance (through tele-conference, tele-collaboration, tele-control, tele-management, tele-operation, and tele-presence), and another toward virtualization (through virtual communities, virtual corporations, schools, and universities). And concurrent with this shift from scientific management of work in the sense of Taylor to knowledge management is an emphasis on learning organizations, on learning applied to entities, companies, and corporations, independent of the individuals and persons working within them. (Hence, theoretically, no single executive "makes a difference" to a company, and hence can be replaced as mergers and takeovers ensue.) These quests to master new knowledge owe much to systems theory, "chaos theory" (a seemingly contradictory combination of terms), complexity, [77] and developments in neural networks, whereby systematic treatments of apparently random forms bring unexpected patterns of order.

What makes these trends the more significant is that thinkers concerned with the systematization of intellect, such as Guilford, have intuitively sought to link units, classes, relations, systems, etc. with products and operations (figure 12).


Figure 12. J. P. Guilford's Structure of Intellect.

Through an explicit integration of old and new knowledge this is possible. We need to combine the logical rigor of the innovations in knowledge organization by the champions of enduring knowledge in memory institutions with the dynamic methods being developed by those using new knowledge in the worlds of technology, business, and finance. The ensuing augmented knowledge will bring about much more than a re-organization of knowledge. It will generate an unparalleled series of new insights. Indeed it may well reveal that what seemed excessive hype, may prove to be more like understatement in the face of the fundamental transformations in how we deal with knowledge in the century to come. Augmented knowledge is a key to our future.

Seen in terms of the history of the Internet, all this has an added fascination. For it will be recalled that the idea of augmented knowledge was also one of the initial inspirations for the work of Douglas Engelbart as he was founding the Internet in 1969. And yet the focus was quite different. Engelbart was concerned with sharing information within a large organization (figure 13) and with how engineers and designers in one large company could share their ideas online in the creation of new knowledge (figure 14). For this reason, Engelbart was so fascinated with the potentials of CSCW, which has emerged as a field in itself. [78] Engelbart's vision of Augmented Knowledge, which forms the basis for his Bootstrap Institute, bears some interesting parallels to the emphasis by Frederick Brooks [79] on Augmented Intelligence as opposed to Artificial Intelligence (IA vs. AI).

Meanwhile, there have been curious shifts in the original vision of the Internet as foreseen by Engelbart and his colleagues over the past 30 years. Nearly a half century ago, Engelbart, one of the most brilliant minds of our century, foresaw the possibility of hypermedia e-mail and CSCW for large corporations and ultimately for everyone. The United States has continued to develop the notion of the Internet as a tool for creating new knowledge. The rhetoric of being collaborative remains, but the realities of the dominant competitive ethic of the United States have led to an ever greater emphasis on e-commerce, whereby short-term commercial gain dominates over long-term intellectual advantage.

Collaborative work tools have become very big business. In 1996, Boeing, Ford, Kodak, MacNeal-Schwendler [80] , and Structural Dynamics founded the Technology for Enterprisewide Engineering Consortium. [81] Recently Boeing has begun licensing its collaborative software. [84] New companies such as Alibre [85] and Aristasoft [86] are joining the ranks of more established firms in the field such as SherpaWorks, which produces Enterprisewide Product Data Management, [87] and Nexprise, [88] which is used by Lockheed Martin.


Figure 13. From Douglas Engelbart: Each functional domain is a candidate for working interchange with all others. [82]


Figure 14. From Douglas Engelbart: Close cooperation between large organizations puts new demands on knowledge-work interchange. [83]


Figure 15. Vision of a new approach to augmented knowledge, which combines enduring knowledge (through libraries, museums, and archives via virtual reference rooms) and new knowledge (through collaborative environments) with filters for various cultural viewpoints.

Some idea of the scale of this enterprise emerges if one looks to Lockheed Martin, a firm which is known to many as a missiles and space company. The Lockheed Martin Palo Alto Lab is working with the University of Southern California on Virtual Environments for Training (VET) [89] . It also has a division called Lockheed Martin Integrated Business Solutions which describes itself as "the world's preeminent systems engineering and integration [90] company."

For instance, we enabled one of the country's largest residential home lenders to realize its potential for financing over $1 trillion in mortgages. We delivered a new multitiered data warehouse and client/server architecture to one of the nation's largest telecommunications companies, enabling it to maximize network services and decrease the threat of telephone fraud. [91]

Such figures make the predictions about e-commerce reaching a number of billion dollars in the next few years sound trivial. Meanwhile, as Europe becomes increasingly aware of the Internet, it has focussed increasingly on the potentials of the Internet as a means of gaining new access to enduring knowledge [92] and has seen this also as a way of augmenting knowledge.

It is not a question here of determining whether the American goal is better or worse than the European. What interests us is that both are concerned with augmenting knowledge and both are complementary. Having access to the cumulative memory of civilization is a wonderful thing. Having access to this as we work together collaboratively is even more powerful. What we need to do is combine the American approach qua new knowledge with the European approach qua enduring knowledge to arrive at fundamental new levels of insight for the whole of humanity.

A next step will be to include in this vision a South American, an African, and an Asian point of view as is emerging in ASEAN through pioneering work in Japan, Malaysia, Singapore, Hong Kong, and, increasingly, China. [93] This approach with respect to continents will then need to be fine-tuned such that it can reflect accurately the diversity of individual countries, provinces, cities, and ultimately individuals (figure 15).

Explicit Knowledge Implicit Knowledge
Codified Tacit
Information, Theories, Formulae, Procedures Experience, Abilities, Attitude
Handbooks, Manuals, Drawings, Schemas Being able and willing
Transfer by Instruction Sharing through Demonstration
Obtainable by Study Copying and Imitation
Explicit Knowledge is Power Implicit Knowledge can be Power

Figure 16. Basic differences between explicit and implicit knowledge.

The domain of enduring knowledge is focussed primarily on what can be termed explicit knowledge. On the other hand, there is also a challenge of capturing implicit knowledge (figure 16), which Polanyi termed personal or tacit knowledge [94] and which is closely related to the challenge of filters for cultural dimensions. How one captures the essential characteristics is more elusive. Here virtual reality environments such as VET can prove useful in the future, but this cannot solve the problem of retrospective treatment of these themes. Meanwhile there are other more immediate challenges.

10. Challenges

Various stumbling blocks stand in the way of such a re-organization of knowledge as outlined above. Two major challenges are metadata and the privatization of knowledge. Others include questions of knowledge management: the extent to which indexes should be pre-structured or made available on-demand, the potentials of cooperation, and the role of filtering.

Metadata

The scenarios outlined above concerning virtual reference rooms set out from a simple, but elusive assumption, namely, that the names, subjects, places, and chronologies in the various reference works are all standardized such that one can move seamlessly from one list to another. This is the domain of metadata, which establishes rules of information about information. Many important advances in metadata have been and are being made, notably, through the Dublin Core [95] and the Resource Description Format (RDF). [96] These pragmatic solutions are important because they promise universal access to at least some basic aspects of collections. [97] Nonetheless, a profound danger remains that an ability subsequently to perform deeper queries of sources could disappear.

Moreover, such solutions focus entirely on static knowledge, on contemporary knowledge, without attention to cultural or historical changes in knowledge. As noted earlier, new methods are needed which enable access to these complexities of historical and cultural diversity, such that we can develop new kinds of dynamic knowledge maps, some of which are evolutionary. To this end, it has been recommended that the European Commission, in the context of its long-term research program, promote the creation of a new kind of metadata. This could then become a starting point for further collaboration with other countries around the world.

Because such a long-term approach to metadata will entail very high level and large-scale technologies in areas where industry will reasonably not see immediate gain, it has also been recommended that the underlying technologies be developed by the Joint Research Centre (JRC), probably in conjunction with national research institutes in countries such as Germany (GMD), France (INRIA), and Italy (CNR). Second, given the complexity of the challenge, a modular approach has been suggested.

Privatization of knowledge

Trends toward privatization of knowledge have come mainly from publishers and multi-national corporations, although recently universities have become more active in this domain.

Publishers

Already during the Renaissance various city-states and cities created city libraries for their citizens. In some university towns (e.g., Bologna, Padua, Heidelberg, Oxford) the university library informally served a public function. Such examples evolved into the concept of public libraries paid for by taxpayers and readily accessible to all citizens.

During the Renaissance and until recently publishers played an essential role in publishing the works of authors, which were then bought by libraries and thus made available to members of the public who could not afford to have their own collections of books. While this basic pattern has continued, other trends in the past 50 years have begun to undermine this long, positive tradition of publishers.

With the enormous rise of universities during the 1960's and 1970's, spawned partly by the "baby boom" after the Second World War and also by a new philosophy which held that potentially everyone had a right to a university education, publishers realized that there was a new market in providing reference books and classics for these new institutions. This inspired the rise of a reprint industry. The good news was that new universities now had their own copies of standard reference works. The bad news was these reprint companies increasingly assumed the copyright of works which they had merely reprinted. As these materials became increasingly connected with electronic databases through companies such as Dialog, new deals between reprint companies and those offering electronic services (e.g., Reed and Elsevier, later Reed-Elsevier) meant that reference materials, which had largely begun in the public domain, were now in the hands of private companies, themselves subsidiaries of multi-national conglomerates.

These reference materials entail two things: the actual contents and the names or subjects which those contents describe. If those names and subjects become viewed as proprietary, then the hopes of combining these names with others to make universally accessible authority lists becomes difficult if not impossible. This is not to say of course that publishers no longer have a role to play, only that with respect to the metadata which they use, qua names, subjects, places, etc., more cooperation with public institutions is needed. Else the concept of universal access will no longer be feasible.

Industry

A second stumbling block toward this vision comes from an unexpected quarter: industry and business. [98] The business world assures us that the knowledge society is the emerging paradigm for the twenty-first century. One might have expected that business would do everything in its power to facilitate this. To understand why this is not the case, some history is needed. Traditionally there has been a single institution which was responsible for coordinating the whole of known knowledge. In the West, from about the fourth to the eleventh century, this task was almost exclusively in the hands of the Catholic Church. The eleventh and twelfth centuries saw the rise of universities, and although certain orders of the church, notably, the Franciscans, Dominicans, and later the Jesuits, continued to play a considerable role, as the universities gradually became secularized they effectively became the central repositories of learning until the nineteenth century. Since then six major changes have occurred:

  1. There has been the rise of polytechnics (in France the Ecole Polytechnique) and technical colleges (in Germany the Technische Hochschule as at Braunschweig, Berlin, Darmstadt, and Munich and the Eidgenössische Technische Hochschule in Switzerland). As a result technical knowledge came increasingly in the purview of these new institutions.
  2. The rise of nation states led to the idea of national libraries. Here Panini's vision for the British Museum soon became a model for national libraries in Europe and all around the world. By the end of the nineteenth century, the enormous public support of these national libraries made them the new central repositories of knowledge.
  3. As the idea of the nation state flowered, the notion of national research laboratories became an increasingly natural prospect. One had to do specialized research in order to protect the competitive advantage of one's country. In countries such as Italy, this national research council (Consiglio Nazionale delle Ricerche) typically maintained close contact with the great universities (Rome, Bologna, Pisa), but at the same time produced major bodies of knowledge not readily accessible to those within the university. In the United States, these institutes grew into a vast network, often wrapped in secrecy. For instance, the Department of Energy alone has 20 major laboratories including Ames, Argonne, Lawrence Livermore, and Los Alamos. [99] It is significant that the names of these institutions are familiar while their activities are not.
  4. In other countries such as Britain, another development emerged in the 1960's and 1970's. Under a rhetoric that government must focus on its "core business" there was a trend to privatize its holdings, with a result that major research laboratories were sold, often to private enterprise.
  5. The major corporations found it useful to develop their own research labs. These grew steadily in scale. Here a few examples will suffice to give an indication of the scope of the phenomenon. IBM, for instance, has major laboratories such as Watson and Almaden, plus an Advanced Semiconductor Research and Development Center (in East Fishkill, NY), [100] as well as laboratories in Winchester (England), Zürich (Switzerland), and Naples (Italy). Similarly, the Japanese firm NEC has a number of laboratories in Japan, namely Kawasaki, Osaka, Ohtsu, Sagamihara, and Tsukuba, as well as others in Europe (Bonn and Berlin) and North America (Princeton [101] and San Jose). Philips has its main laboratory in Eindhoven with others in England (Redhill), France (Limeil-Brévannes), Germany (Aachen), and the United States (Briarcliff Manor). In 1997, Hitachi had 17,000 researchers in 35 labs with an annual budget of $4.9 billion. The University of Bologna, one of the oldest universities of the world and also one of the largest with 103,000 students, has an annual budget approaching $1 billion, of which a large part goes to day-to-day teaching and administration rather than to research.
  6. There has been a rise of private universities, often specifically connected with a specific corporation. There are over 1000 corporate universities in the United States alone. Some of these corporate universities are larger than most