An Agent-Based Intelligent Distributed Information Management System for Internet Resources

Stuart SOLTYSIAK <stuart.soltysiak@bt.com>
BT Advanced Communications Technology Centre
United Kingdom

Takeshi OHTANI <ohtani@flab.fujitsu.co.jp>
Fujitsu Laboratories Ltd.
Japan

Marcus THINT <marcus.thint@concert.com>
Concert
USA

Yuji TAKADA <yuji@flab.fujitsu.co.jp>
Fujitsu Laboratories Ltd.
Japan

Abstract

Finding and managing information on the Internet is a complex and time-consuming process. This paper describes an Intelligent Distributed Information Management System (IDIoMS) that supports the sharing, managing, searching, and presentation of information that is widely distributed over a network such as the Internet. IDIoMS provides a comprehensive set of tools for personalized information delivery, and a platform that will enable straightforward management of distributed information systems using Internet technologies. IDIoMS benefits users through the automatic provision of timely and relevant information with minimal need for users to search for that information. The system also benefits service providers through the plug-and-play provision of information services that minimizes the overhead needed to make their services widely available.

1 Introduction
- 1.1 The IDIoMS vision
2 The technologies behind IDIoMS
- 2.1 Open Agent Middleware
- 2.2 Personal Agent Framework
3 IDIoMS
4 Summary and conclusions
5 References

1 Introduction

Although the Internet provides access to a vast repository of information, the search and retrieval of specific, useful information and the management of heterogeneous information sources on the Internet is becoming increasingly difficult. This paper presents an Intelligent Distributed Information Management System (IDIoMS) that supports the sharing, managing, searching, and delivery of widely distributed information.

IDIoMS, developed jointly by Fujitsu Laboratories and British Telecommunications (BT), makes significant use of software agent technology, integrated using Internet technologies. Advanced management of distributed information resources is provided by Fujitsu Laboratories' Open Agent Middleware platform, while personalized information management and delivery is performed by BT's Personal Agent Framework.

IDIoMS will benefit users through the automatic provision of timely and relevant information with minimal need for users to search for that information. The system will also benefit service providers through the plug-and-play provision of information services, which minimizes the effort providers need to make their services widely available.

1.1 The IDIoMS vision

IDIoMS is intended to facilitate the sharing, managing, searching, and presentation of information that is widely distributed over a network. As depicted in Figure 1, the IDIoMS end user (subscriber) would benefit from an environment in which he/she can receive relevant and timely information from distributed, heterogeneous content sources in a seamless manner.

Figure 1: IDIoMS general architecture and vision

To realize this vision, IDIoMS combines personalized information management applications (provided by the Personal Agent Framework layer) with an advanced information resource management system (provided by the Open Agent Middleware [OAM] layer). The user need not know or specify particular sources of data: mediator agents within the OAM layer take responsibility for automatically locating the appropriate information sources and available services across the network, and the personal agents can proactively request and filter relevant information on behalf of the user.

This paper is organized as follows. Section 2 introduces the two technologies that make up IDIoMS. Section 3 describes the architecture and operation of IDIoMS and outlines the trials under way within Fujitsu and BT. Section 4 summarizes the benefits of such a system and the issues that remain.

2 The technologies behind IDIoMS

Having described the overall vision and arrangement of IDIoMS, we now explore the two components in more detail -- the Open Agent Middleware for management of information resources, and the Personal Agent Framework for personalized information management.

2.1 Open Agent Middleware

The OAM, developed by Fujitsu Laboratories, is a middleware platform that runs on the Java virtual machine (JVM) and is used to construct flexible, dynamic, scalable, and robust distributed systems over the Internet as multi-agent systems. OAM supports collaboration among agents in a seamless and transparent way, and it enables agents to be connected in a plug-and-play manner over a network. When a new agent is plugged into the network, OAM arranges the organization of agents, adapts the agents in the organization to each other, and manages their collaboration. OAM provides three main functions to enable such dynamic behavior: distributed mediation, reflective adaptation, and communication via field.

2.1.1 Distributed mediation

The OAM provides multiple mediator agents, federated and distributed over the network, that collaborate in order to service requests [1]. Mediators offer a service request brokerage to other agents over the network; that is, they have meta-information databases that they use to select resources that satisfy the request. Mediators enable access to resources, and perform collation and summarization of the resources' responses.

When a service provider plugs a service agent into OAM, information about the service is advertised to a mediator. The mediator stores the advertised information in its own condition table, which contains condition/destination pairs, and then forwards the advertisement to neighboring mediators (gray arrows in Figure 2).

Mediation is based on matchmaking of a requested service with the set of services (advertisements) known to the mediator. When a mediator receives a service request it forwards the request to the correct service agent or another mediator agent according to the search condition. Results from the service agent are then returned to the sender (black arrows in Figure 2).
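
The advertisement and matchmaking flow described above can be sketched in a few lines. This is an illustrative Python sketch only, not the OAM implementation (which is Java-based): the Mediator class, its method names, the use of a set of category strings as a "condition," and the use of a plain callable as a service agent are all assumptions made for this example.

```python
class Mediator:
    """Toy mediator holding a condition table of (condition, destination) pairs."""

    def __init__(self, name):
        self.name = name
        self.neighbors = []          # neighboring mediators in the federation
        self.condition_table = []    # list of (condition, destination) pairs

    def connect(self, other):
        # Federation links are bidirectional in this sketch (an assumption).
        self.neighbors.append(other)
        other.neighbors.append(self)

    def advertise(self, condition, destination, seen=None):
        """Store an advertisement, then forward it to neighboring mediators."""
        seen = seen if seen is not None else set()
        if self.name in seen:
            return                   # already processed; avoid forwarding loops
        seen.add(self.name)
        self.condition_table.append((frozenset(condition), destination))
        for neighbor in self.neighbors:
            # Neighbors learn to route matching requests back through this mediator.
            neighbor.advertise(condition, self, seen)

    def request(self, wanted):
        """Forward a request to the first destination whose condition matches."""
        for condition, destination in self.condition_table:
            if wanted in condition:
                if isinstance(destination, Mediator):
                    return destination.request(wanted)
                return destination(wanted)   # a service agent (a callable here)
        return None                  # no known advertisement satisfies the request
```

In this sketch a request issued at any mediator in a chain is passed hop by hop toward the mediator where the service was first advertised, and the result travels back along the same path as the return value.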

The distributed architecture of the mediators brings several advantages. Because many mediator agents are distributed over the network, mediation work can be load-balanced across machines. The condition table is kept within the mediator agents and is not exposed to any other agent. Redundant routes can be set up between agents to improve robustness. Our distributed architecture is therefore more scalable, secure, and robust than one built around a single mediator.

Figure 2: Distributed matchmaking by federated mediators

2.1.2 Reflective adaptation

Each agent has its own data format and protocol. An organization of agents must therefore agree on data formats and protocols before its members can start to collaborate. Within the OAM, agents can update their interfaces and protocols at run time to support new interactions and collaborations.

Reflective adaptation is based on the agent/action programming model. Agents are realized as Pathwalker agent components [2]. In a Pathwalker agent component, interfaces and protocols are realized as actions, which are replaceable and extensible modules of the component. Once the organization of agents has agreed on interfaces and protocols, the actions of each agent are replaced or added dynamically. These actions are supplied from repositories, with or without the collaboration of mediators. A module can also be obtained directly from the agent that speaks the new protocol (Figure 3).

Reflective adaptation involves changing the Java servlet code dynamically. This can be used to add new features to the interface, or even update existing code (adding an extra parameter to a method call, for example).
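
The idea of replacing an action at run time can be illustrated with a small sketch. It is written in Python for brevity and does not reproduce the Pathwalker or servlet mechanics: the Agent class, the install and handle methods, and the repository dictionary are hypothetical names introduced for this example.

```python
class Agent:
    """Toy agent whose behavior lives in replaceable, named actions."""

    def __init__(self):
        self.actions = {}            # action name -> callable

    def install(self, name, action):
        # Adding or replacing an action changes the agent's interface at run time.
        self.actions[name] = action

    def handle(self, name, *args, **kwargs):
        if name not in self.actions:
            raise LookupError(f"no action installed for {name!r}")
        return self.actions[name](*args, **kwargs)


# Hypothetical repository from which agreed actions are supplied.
repository = {
    "search_v1": lambda query: f"results for {query}",
    "search_v2": lambda query, limit=10: f"top {limit} results for {query}",
}

agent = Agent()
agent.install("search", repository["search_v1"])
first = agent.handle("search", "agents")

# Replace the action with a version that accepts an extra parameter.
agent.install("search", repository["search_v2"])
second = agent.handle("search", "agents", limit=3)
```

Callers keep addressing the same action name throughout; only the module bound to that name changes, which is what allows an interface to be extended without restarting the agent.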

Figure 3: Reflective adaptation

2.1.3 Communication via field

Collaboration among an organization of agents requires a common communication medium. This medium should be flexible, dynamic, and scalable enough to enable open communication for collaboration over the Internet. The field provides such a medium for the organization of agents [3]. As a communication medium, the field lets agents communicate with one another in either a peer-to-peer or a multicast manner. As an information-sharing medium, it lets agents share information among themselves, acting as a message blackboard. The field thus works as a logical network and, at the same time, as a shared memory.

Communication via the field is event-driven and multicast-based. All agents on the field listen to all messages on the field, and each reacts to messages according to its own criteria, called patterns. Every agent therefore becomes aware of every event, and whether to react to an event is up to the agent. If a set of agents always reacts to certain types of messages, these agents constitute a weakly bound group within the organization. This kind of pre-communication and pre-grouping is required for collaboration in an open environment to be flexible and dynamic.

Moreover, agents can be added to and removed from the field at any time, independently of other agents. The patterns of agents can also be changed dynamically. Collaboration on the field is thus flexible and dynamic.
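
A minimal sketch of pattern-directed communication on a field follows. It is an illustration in Python under stated assumptions, not the field reactor model of [3]: the Field and FieldAgent classes are hypothetical, messages are plain dictionaries, and a pattern is modeled as a predicate function.

```python
class Field:
    """Toy field: every message is offered to every agent and kept on a blackboard."""

    def __init__(self):
        self.agents = []
        self.blackboard = []         # shared record of all messages

    def join(self, agent):
        self.agents.append(agent)

    def leave(self, agent):
        self.agents.remove(agent)

    def publish(self, message):
        self.blackboard.append(message)
        # Iterate over a copy so agents may join or leave while reacting.
        for agent in list(self.agents):
            agent.receive(message)


class FieldAgent:
    def __init__(self, name, pattern):
        self.name = name
        self.pattern = pattern       # predicate deciding which messages to react to
        self.reactions = []

    def receive(self, message):
        # Every agent sees every message; reacting is the agent's own decision.
        if self.pattern(message):
            self.reactions.append(message)


field = Field()
news = FieldAgent("news", lambda m: m.get("type") == "news")
audit = FieldAgent("audit", lambda m: True)
field.join(news)
field.join(audit)
field.publish({"type": "news", "body": "x"})
field.publish({"type": "profile", "body": "y"})

# Patterns can be changed while the agent remains on the field.
news.pattern = lambda m: m.get("type") == "profile"
field.publish({"type": "profile", "body": "z"})
```

Agents whose patterns accept the same message types end up reacting together, which corresponds to the informal grouping described above; no agent needs to know which others are listening.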

Figure 4: Communication via Field

2.2 Personal Agent Framework

The Personal Agent Framework (PAF) is a unified environment in which several personal agents are integrated. The framework maintains a dynamic user profile that can be shared among agents in the framework and external applications. Key benefits of the framework include:

- personalized services with minimal burden on the user;
- integrated services of multiple personal agents for information search and retrieval, interest-based networking, and just-in-time information delivery; and
- a secure environment that ensures the privacy of personal profiles for registered users.

The user owns and is able to review and control the privacy level of each category of his/her profile, which is available to a suite of personal agent applications through a common application programming interface (API) (see Figure 5). Each agent can operate independently of other agents, but agents can and do share information to augment individual capabilities. Hence, the collaborative agent environment with a secure, common profile in the framework enables efficient personalized assistance for the user.

Figure 5: Components of the PAF

These components are now briefly described in turn; further description of the PAF and its agent applications can be found in [4].

2.2.1 Profile manager

The profile manager is a core component of the PAF. It enables other applications to interact with the user profile, either by retrieving or by updating profile information. Additionally, the profile manager is responsible for managing the complete interest hierarchy.

PAF user profiles contain two main components: an interest profile and contact details. These are regarded as separate components, which enables contact details to be stored separately from interests, perhaps using alternative storage media (e.g., contact details might reside in a corporate directory, and personal interests in an Oracle database).

Contact details: The minimum essential contact details consist of name, mailing address, telephone number, fax number, e-mail address, and web homepage information. Other information may be provided, along with appropriate access and API methods.
Interest profiles: The interest profile is the core of the PAF profiles and provides the information necessary for the agent applications to perform their tasks. Interest profile information consists of a number of categories with associated information. The information provided by the interest profile API:
- Includes a list of the interest categories contained within the profile
- Can be ordered by importance, expertise, age, or alphabetically
- Lists membership according to privacy specifications
- Includes the information associated with each interest category:
  - privacy setting (public/restricted/private)
  - expertise rating (curious to expert)
  - importance rating (low/medium/high)
  - keywords and phrases
- Is available for both reading and writing
Interest hierarchy: This component of the profile manager looks after the "systemwide" interests. It provides the ability to review the entire set of interests for this particular profile manager (i.e., domain). It is from this set of interests that users' interest profiles are derived (but users can customize their individual profiles as much as they desire).
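
The interest profile described above maps naturally onto a small data structure. The following Python sketch is an assumption-laden illustration, not the PAF API: the class and method names are hypothetical, expertise is modeled as an integer from 1 (curious) to 5 (expert), and ordering by age is omitted.

```python
from dataclasses import dataclass, field

PRIVACY_LEVELS = ("public", "restricted", "private")
IMPORTANCE_RANK = {"low": 0, "medium": 1, "high": 2}


@dataclass
class InterestCategory:
    name: str
    privacy: str = "private"        # public / restricted / private
    expertise: int = 1              # 1 (curious) to 5 (expert); the scale is assumed
    importance: str = "medium"      # low / medium / high
    keywords: list = field(default_factory=list)


@dataclass
class InterestProfile:
    categories: dict = field(default_factory=dict)

    def put(self, category):
        # Categories are both readable and writable through the profile.
        self.categories[category.name] = category

    def names(self, order="alphabetical"):
        cats = list(self.categories.values())
        if order == "importance":
            cats.sort(key=lambda c: IMPORTANCE_RANK[c.importance], reverse=True)
        elif order == "expertise":
            cats.sort(key=lambda c: c.expertise, reverse=True)
        else:
            cats.sort(key=lambda c: c.name.lower())
        return [c.name for c in cats]

    def visible(self, max_privacy="public"):
        """Names of categories no more private than max_privacy."""
        limit = PRIVACY_LEVELS.index(max_privacy)
        return sorted(
            c.name for c in self.categories.values()
            if PRIVACY_LEVELS.index(c.privacy) <= limit
        )
```

Defaulting new categories to "private" is a design choice made for this sketch, so that nothing is disclosed until the user relaxes the setting.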

2.2.2 Bugle

The Bugle is a personalized newspaper that uses the user's interest profile to find relevant news articles. It performs more than a simple search of news feeds, though, as it uses the importance and expertise information associated with each interest to modify the search criteria. The Bugle attempts to find more news articles for important interests, as these are more relevant to the user. Moreover, for any interest the Bugle filters news articles according to the expertise information: an expert in the field will receive more detailed items than a novice who is learning about the area.
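
One way to picture how importance and expertise could shape a newspaper is sketched below in Python. The selection rules are assumptions made purely for illustration and are not the Bugle's actual algorithm: the QUOTA table, the 1 to 5 "detail" rating on articles, and the rule of keeping items within one level of the reader's expertise are all hypothetical.

```python
QUOTA = {"low": 1, "medium": 2, "high": 4}   # assumed articles per importance level


def build_newspaper(interests, articles):
    """Select article titles for each interest.

    interests: list of dicts with 'name', 'importance', 'expertise' (1-5).
    articles:  list of dicts with 'topic', 'title', 'detail' (1-5).
    """
    paper = {}
    for interest in interests:
        matching = [
            a for a in articles
            if a["topic"] == interest["name"]
            # Assumed rule: keep items within one level of the reader's expertise.
            and abs(a["detail"] - interest["expertise"]) <= 1
        ]
        # Prefer the items closest to the reader's level, then cap by importance.
        matching.sort(key=lambda a: abs(a["detail"] - interest["expertise"]))
        paper[interest["name"]] = [
            a["title"] for a in matching[: QUOTA[interest["importance"]]]
        ]
    return paper
```

Under these assumptions a high-importance interest is allowed more articles than a low-importance one, and an expert reader is not shown introductory items on the same topic.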

2.2.3 Ivine

Ivine is an agent designed to locate people with common interests to facilitate networking. It informs its user about other people with similar interests, skills, and expertise within a particular user community, through the dynamic user profiles maintained by the framework. Each user's Ivine agent operates autonomously and contacts other registered agents periodically to compare interest profiles. During this exchange, the information disclosed to other agents is strictly controlled by the user's privacy settings, so that information is not divulged to parties with whom the user does not wish to share it.
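
The privacy-controlled comparison can be sketched as follows. This Python fragment is an illustration only: the function names are hypothetical, a profile is reduced to a mapping from interest name to privacy setting, and the reading of "restricted" as "disclosed only to trusted requesters" is an assumption, since the text does not define it.

```python
def disclose(profile, trusted=False):
    """Interests a user's agent is willing to reveal to a requesting agent."""
    allowed = {"public", "restricted"} if trusted else {"public"}
    return {name for name, privacy in profile.items() if privacy in allowed}


def matches(my_profile, other_profile, trusted=False):
    """Interests of mine that the other user's agent is willing to reveal."""
    # My own agent knows all of my interests; it only ever sees the
    # subset of the other profile that the other side chooses to disclose.
    return sorted(set(my_profile) & disclose(other_profile, trusted))
```

The key property is that filtering happens on the disclosing side, so a private interest never leaves its owner's agent regardless of who asks.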

2.2.4 Radar

Radar (named for Radar O'Reilly of the television series M*A*S*H) is not an information-finding agent in the way that the Bugle and Ivine are; it is a just-in-time information delivery agent. Radar makes use of a number of information sources, many of which are other personal agents, to present appropriate information to the user at the right time. Radar monitors the user's current activity (at present within Microsoft Word) and uses the current position in the document to find material that might be relevant. The user therefore no longer needs to search actively for information, and Radar brings information to the user's attention when it is most relevant and useful.
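
How the current position in a document might be turned into a search context is sketched below. This is a hypothetical Python illustration, not Radar's method: the fixed-size text window before the cursor, the simple keyword-overlap score, and both function names are assumptions made for this example.

```python
import re


def context_terms(text, cursor, window=200):
    """Lower-cased words found in the window of text just before the cursor."""
    start = max(0, cursor - window)
    return set(re.findall(r"[a-z]+", text[start:cursor].lower()))


def relevant_interests(text, cursor, interests, window=200):
    """Rank interest categories by keyword overlap with the current context.

    interests maps a category name to a list of lower-case keywords.
    """
    terms = context_terms(text, cursor, window)
    scored = [(len(terms & set(kws)), name) for name, kws in interests.items()]
    ranked = sorted(scored, key=lambda s: (-s[0], s[1]))
    return [name for score, name in ranked if score > 0]
```

The categories returned for the current cursor position could then be used to build requests for related material, without the user stating the context explicitly.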

3 IDIoMS

IDIoMS is an integration of the two technologies described in the previous section -- the OAM manages the information resources that the PAF applications use to perform personalized information management functions. Users can make use of IDIoMS either statically -- viewing the results through the IDIoMS "portal" -- or dynamically via the use of Radar to provide relevant information just-in-time. A more detailed diagram of IDIoMS' architecture is shown in Figure 6 below.

Figure 6: Detailed view of the IDIoMS architecture

3.1 Using IDIoMS

From a user perspective, IDIoMS provides unified access to relevant and useful information. IDIoMS is built upon the notion of functional domains. A functional domain is a categorization of the community served by IDIoMS services. Typically each domain would appear as in Figure 6, with a PAF server containing user profiles, and associated agent services and information resources. IDIoMS is intended, however, to support multiple domains; indeed, a major driving factor behind the design of IDIoMS has been to facilitate inter-domain information and knowledge sharing. IDIoMS thus provides a distributed information system that will enable users from any domain to access information from any domain provided it is relevant to their interest profile and current context.

IDIoMS can be used statically or dynamically -- these modes of operation characterize the user's interaction according to information flow from user to IDIoMS:

Static use essentially treats IDIoMS as a personalized information portal, where users can read/access all of the relevant information found by their agents. Typically this covers reading the Bugle's newspaper, or Ivine's list of contacts with a particular interest or skill. In this usage pattern information tends to flow from IDIoMS to the user.
Dynamic use treats IDIoMS as a personalized information portal with real-time search capabilities. In this case users can access all of the functionality of the static usage pattern but, more importantly, IDIoMS becomes the users' own information servant, continually locating information that is relevant to their current context. In this usage pattern information flows from user to IDIoMS and back again. It must be stressed, however, that the user does not need to inform IDIoMS of the context explicitly; IDIoMS can determine this for itself.

IDIoMS therefore provides a powerful assistant for the user, leaving him/her free to concentrate upon the important tasks. We shall now describe how IDIoMS supports the user, using Figure 6 as a reference point.

3.2 IDIoMS operation -- an example resource lifecycle

IDIoMS is built on Java technology, and all services are integrated with IDIoMS using Java servlets. Servlets enable all services to communicate over the Hypertext Transfer Protocol (HTTP), which provides a scalable and robust connection medium. Furthermore, because HTTP is so widely used as a communications medium, IDIoMS is able to capitalize on it to provide inter-domain facilitation.

Adding a resource: Integrating a service or information resource with IDIoMS is straightforward. IDIoMS supports the integration of any type of resource; the only requirement IDIoMS places is that a Servlet "wrapper" be used to interface the resource to IDIoMS. In this manner IDIoMS enables heterogeneous resources to be integrated within the system, providing a powerful distributed information system. The servlet wrapper acts as a translator, accepting requests from mediator and other client agents and invoking the appropriate process(es) in the resource to service the request. Similarly the servlet translates the results from the resource for the requesting agent, enabling IDIoMS to make use of the resource.
Advertising the resource: Creating a servlet wrapper for a resource is not sufficient to enable IDIoMS to make full use of it. Once the resource has been added it must be advertised to a mediator. The advertisement of a resource provides mediators with the information they need to determine how to satisfy service requests. Briefly, advertisements contain information about the name of the resource and what that resource can provide (service description/information categorization) -- see Section 2.1 for further details. Mediators themselves propagate advertisements to other mediators; thus the presence of a new resource can quickly be notified to the rest of IDIoMS. The use of servlets and advertisements enables resources to be plugged into IDIoMS in such a way that they can be used immediately through the dynamic reconfiguration capabilities of the OAM. Advertisements within IDIoMS can be specified using extensible markup language (XML) -- this is translated into the logical conditions used internally by mediators.
Using the resource: Once a resource has been advertised to a mediator it is capable of being used by IDIoMS. As described in Section 2.1, mediators make use of advertisements to determine how to satisfy a request from a client agent. Requests from agents comprise two separate components -- the service request specifying the actual service type required and the search request that specifies what the service resource should do (e.g., a search condition for querying the service's database). Client agents are responsible for generating these requests, and once a mediator has received such a request it is parsed and compared with known advertisements. When a resource has been selected to service the request, the client's search request is forwarded to the service for processing. The mediator receives a response from the resource and then sends this back to the client that originated the request.
Removing the resource: The resource can be removed at any time without affecting the performance of the system. If a mediator fails to gain access to a resource, it will remove that resource's advertisement from its local cache and attempt to satisfy the request using alternative resources. Once an advertisement has been removed, a mediator will not subsequently attempt to forward requests to that resource. Should the resource reappear, it can be used once mediators have received its advertisement.
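
The lifecycle above (advertise, use, remove) can be sketched end to end. The Python fragment below is an illustration under stated assumptions, not the IDIoMS implementation: the XML element names are invented because no advertisement schema is given here, the MediatorCache class is hypothetical, a resource is modeled as a callable rather than a servlet reached over HTTP, and an unreachable resource is signaled with ConnectionError.

```python
import xml.etree.ElementTree as ET

# Hypothetical advertisement format; the element names are assumptions.
AD_XML = """<advertisement>
  <name>newsfeed-1</name>
  <category>news</category>
  <category>technology</category>
</advertisement>"""


def parse_advertisement(xml_text):
    """Return (resource name, set of categories) from an XML advertisement."""
    root = ET.fromstring(xml_text)
    return root.findtext("name"), {c.text for c in root.findall("category")}


class MediatorCache:
    def __init__(self):
        self.ads = {}                # resource name -> (categories, handler)

    def advertise(self, xml_text, handler):
        name, categories = parse_advertisement(xml_text)
        self.ads[name] = (categories, handler)

    def request(self, service_type, search):
        """Try matching resources in turn, dropping any that cannot be reached."""
        for name in list(self.ads):
            categories, handler = self.ads[name]
            if service_type not in categories:
                continue
            try:
                return handler(search)
            except ConnectionError:
                # Resource unreachable: drop its advertisement, try alternatives.
                del self.ads[name]
        return None
```

A resource that disappears is dropped the first time a request to it fails, and it becomes usable again simply by re-advertising, which mirrors the removal and reappearance behavior described above.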

3.3 IDIoMS operation -- how the services work

The personalized information services that are provided by the PAF are integrated within IDIoMS with no change to their functionality as seen by the user. The approach to locating information using the OAM's mediators does differ from the existing PAF. Fundamentally each agent obtains information as described in the previous section: by submitting service/search requests to mediators. The construction of these requests does, however, differ slightly between agents.

Creating the service request: Each agent service can utilize particular types of resource. The Bugle, for example, can make use of any resource that provides news-type information. Ivine makes exclusive use of PAF profile manager resources for providing user profile information. Radar, on the other hand, can make use of any resource that can provide relevant information. Thus creation of a service request is dependent upon the particular agent application making the request.
Creating the search request: This contains the specific search criteria to be used by resources for processing. Since IDIoMS' agent applications are centered around the use of personal profiles, user interests form a central part of the search request. The interests within the search request are specified by the agent application. In the case of the Bugle and Ivine, these are the interest categories that are currently of interest. When the Bugle generates a newspaper, it requests information for all of the interest categories from the user profile according to user preference settings (e.g., importance, expertise). Similarly, Ivine requests PAF profile information for categories from a user's profile to build a list of contacts. With Radar, the interest category is augmented by the text currently being typed within Word.
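
The two-part request built by each agent application can be sketched as follows. This Python fragment is illustrative only: the build_request function, the dictionary layout of the requests, and the service-type labels ("news", "profile", "any") are assumptions introduced for this example rather than the IDIoMS request format.

```python
def build_request(agent, profile, typed_text=""):
    """Return (service_request, search_request) for an agent application.

    profile maps each interest category to a list of keywords.
    """
    # Assumed mapping from agent application to the type of resource it uses.
    service_types = {"bugle": "news", "ivine": "profile", "radar": "any"}
    search = {"interests": sorted(profile)}
    if agent == "radar":
        # Radar augments the interest categories with the text being typed.
        search["context"] = typed_text
    return {"service": service_types[agent]}, search
```

Keeping the service request separate from the search request lets a mediator choose a resource from the former alone and pass the latter through to that resource untouched.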

The final issue to discuss with respect to IDIoMS' operation is cross-domain facilitation. Mediator agents can act at a "meta-level," serving as bridges between domains. In effect, each IDIoMS domain is treated as a separate resource that is advertised to the mediators representing other domains. In practice, advertisements for a resource in one domain can be forwarded to other domains, making that resource available from more than one domain. Of course, a resource may not be suitable for use by another domain (for example, when it provides information for a domain whose interests are mutually exclusive with those of the other domain), but this will not affect system operation, since that resource will simply never be selected to service a client agent in the other domain.

3.4 IDIoMS trials

As an initial application domain, the IDIoMS project is being trialled as a support system for system engineers. This case study is designed to demonstrate and assess the potential assistance the IDIoMS project can provide. At the time of writing (January 2000) the trials have just commenced in both companies, and as such we can describe only the scope of each trial at this point.

Within Fujitsu, IDIoMS is being trialled with the SolutionNET Promotion Section, which supports all of Fujitsu's systems engineers by developing and managing information systems for them. Within BT, software engineers from the Belfast Engineering Centre are taking part in the trial to assess IDIoMS in support of their work focusing on Internet and intranet services and applications.

4 Summary and conclusions

The overriding vision of this project is to develop a generic, agent-based platform capable of providing intelligent support for the management of distributed information systems and associated applications. IDIoMS has taken a significant step toward this end, combining a powerful assistant giving the user timely access to the right information and the right people, with an effective approach to managing distributed information systems. The resultant system being trialled in a systems engineering domain will help assess contributions to personal productivity and efficiency, and highlight additional research issues. As IDIoMS is built upon Internet technologies there is large scope for application of the system -- Internet, intranet, and extranet solutions can all be envisaged with IDIoMS. A toolset for distributed information systems provides powerful mechanisms for the management and dissemination of electronic information in a number of areas:

- Applying IDIoMS to support systems and software engineers
- Exploring the research issues surrounding the system
- Assessing productivity and efficiency improvements and cost reduction
- Enhancing the generic IDIoMS toolset for specific application areas within organizations
- Developing organizational intranet and extranet solutions
- Building future agent-based systems (e.g., [5])

Many areas of IDIoMS can be investigated more fully, not only by expanding the range of agent services provided but also in the management of IDIoMS itself. Perhaps most important is the one issue that plagues the Internet: ontology management. IDIoMS needs to address ontological issues to become a more capable system, but such issues remain difficult to resolve. The use of XML and other standards can mitigate the problems to a certain extent, but they do not provide a solution. Nonetheless, we believe that the start made with IDIoMS demonstrates that an advanced agent-based information management system is a very real and powerful application in the complex, distributed Internet society.

5 References

[1] T. Mohri and Y. Takada. Virtual Integration of Distributed Database by Multiple Agents. In Proc. of the First International Conference, DS'98, LNAI 1532, pp. 413-414, 1998.

[2] I. Iida, N. Fujino, T. Nishigaya, and T. Iwao. Multi-agent Platform for Seamless Personal Communications. Telecom 99 Forum Interactive Summit, Int.7, Oct. 1999.

[3] T. Iwao, M. Okada, Y. Takada, and M. Amamiya. Flexible Multi-Agent Collaboration using Pattern Directed Message Collaboration of Field Reactor Model. In Proc. of the Second Pacific Rim International Workshop on Multi-Agents, PRIMA'99, LNAI 1733, pp. 1-15, 1999.

[4] I.B. Crabtree, S.J. Soltysiak and M. Thint. Adaptive Personal Agents. Personal Technologies Journal, Vol. 2, No. 3, pp. 141-151, 1998.

[5] J.C. Collis, S.J. Soltysiak, D.T. Ndumu, and N. Azarmi. Living with Agents. BT Technology Journal, Vol. 18, No. 1, pp. 66-7, 2000.
