http://www.spinellis.gr/pubs/conf/2001-EPY-Mitos/html/mitos.html This is an HTML rendering of a working paper draft that led to a publication. The publication should always be cited in preference to this draft using the following reference:
|
The wide availability and accessibility of information have made its management and deployment even more difficult. To this end, remarkable effort has been made for the development of information systems that handle the processing, analysis and management of information. However, the success of these systems does not only depend on the quality of information handling, but also on the appropriate presentation of information to the end-user. MITOS 1 system analyses financial news by employing techniques from the areas of Natural Language Processing, Information Filtering and Information Extraction. Moreover, by acknowledging the importance of the presentation of information, MITOS has also incorporated User Modelling techniques, which enable the provision of personalized content adapted to each user’s profile.
Aiming to address the problem of information overload, MITOS project developed an integrated system for information management able to filter and extract information from electronic news, derived from the Greek financial market. MITOS system combined techniques from the areas of Information Filtering, Natural Language Processing and Information Extraction. Moreover, by acknowledging the importance of the presentation of information, MITOS has also incorporated User Modelling techniques, which enable the provision of personalized content adapted to each user’s profile. The co-operation of the aforementioned technologies provided an integrated system of information processing.
In order to better understand and satisfy user needs, a 1 st prototype was created and evaluated. The feedback that we got enabled us to improve as well as add new functionalities to the system. Based on the users’ remarks, a 2 nd prototype of the integrated system has been developed and evaluated leading to the final system prototype.
In this paper we do not only present the technological features of MITOS project, but we also demonstrate the way the analysed information is presented to the end user. In Section 2 we present an overview of MITOS project, its objectives and initial user requirements, and we outline the system’s architecture. In Section 3, we demonstrate the functionality provided to the end user, whereas in Section 4 we illustrate how the evaluation led us to better capture and satisfy users’ needs. Finally, in Section 5 we present some concluding remarks.
In order to achieve the above objectives, MITOS project exploited and integrated the following technologies:
MITOS is a distributed system. Its subsystems can reside in different servers and communicate via the TCP/IP protocol [8] (see Fig.1). In its current implementation, MITOS front-end, i.e. the user interface, resides in the main server under the Solaris Operating System. The user interface module (developed mainly using PHP, JavaScript and HTML) works under an Apache Web Server, installed on the same server. The MITOS database (PostgreSQL) containing the results of the information filtering and extraction sub-systems is also on the Solaris server. The MITOS back-end, i.e. information filtering, information extraction and user modelling sub-systems, have been installed on a Windows NT Server. The User Modelling module works under Microsoft’s Internet Information Server and communicates with the user interface module via HTTP GET actions.
Figure 1. MITOS Architecture
The Information Filtering sub-system consists of a set of tools that acquire thematic domains’ models from a news database (training corpus), and classify new texts into their corresponding domains [10]. More specifically, the Information Filtering Module consists of:
MITOS' User Modelling sub-system [6] keeps track of the user's selections each time he/she posts a search query, in order to determine the user's preferences and adjust some of the user interface's options according to these preferences. MITOS uses a long-term model approach, based on the assumption that user preferences are expected to appear after a period of system usage. Apart from creating a personal user profile, the user modelling sub-system also creates group profiles, the user communities [12]. A community corresponds to a group of users who exhibit common behaviour in their interaction with the system. Model creation for personal profiles is performed on-line whereas the creation of user communities is performed as a batch process.
Aiming to provide an intelligent data repository of analysed financial news, MITOS offers an efficient way of searching and retrieving the appropriate information. It allows users to exploit the categorization of news, as well as their presentation in a summarized form. Three different search modes are provided: simple search, advanced search and search based on the selection of companies’ names.
The `Simple Search' mode enables users to look for news related to the categories of their choice (see Fig. 2). This way of searching is based on the classification of financial news, conducted by the use of Information Filtering techniques. Fourteen categories have been identified as the most representative thematic domains. Users can select the categories of their interests (e.g. mergers and acquisitions) and retrieve all financial news belonging to them. In order to limit the amount of information retrieved, users can specify additional terms of search such as a date or a period of time that they prefer. Moreover, users can further increase the accuracy of their search by forming complex boolean queries of keywords, using AND/OR/NOT/NEAR operators. This additional feature can be provided, since the analysis and processing of information enabled the determination of keywords characterizing the financial news. The simple search results are presented in a list that includes the date and title of financial news in a chronological order. The latter helps users to quickly identify and select to read the full text of news that interest them the most.
The `Topic Search by Company Name' mode enables users to select the thematic domains of news they prefer as well as one or more company names from a multi-selection list. In order to further limit their request users can also specify a date or period of time (see Fig. 3).
The `Advanced Search' mode enables users to search news based on specific event types, such as Acquisitions, Mergers, etc. (see Fig. 4). For each event type, they can define specific search terms and retrieve financial news in a summarized format. This operation exploits the results of Information Extraction, which analyses news, belonging to the aforementioned events, and extracts from their content the most significant elements of information. The analysis and identification of the most important data enabled not only the presentation of news in a summarised format, but also the use of multiple criteria for the provision of a more efficient way of search.
Figure 2. Simple Search mode
Figure 3. Topic Search by Company Name mode
Figure 4. Advanced Search mode
In order to use the `Advanced Search', users need first to select a specific category (e.g. Acquisition, Merger). According to the event type selected, the search form is dynamically modified in order to present a set of search terms fit to the selected event. These terms of search are distinct to each event type. For example, when users search for news involving a company’s acquisition (see Fig. 5), they can specify the names of the companies involved, the percentage of acquisition, and the period of time that interests them, whereas when they are looking for a merger, they can only specify the involved companies and the period of merger. The news retrieved, are presented in a summarized format, which incorporates the most significant information (e.g. Date, Company name, amount of money exchanged, percentage of acquisition and time of acquisition). The users can select to view only the parts of information that interests them. They can also select the ordering they prefer (e.g. by date or by percentage of acquisition). Finally, users can access the full text of news by simply selecting their date from the list.
All these functionalities were enhanced adopting user-modelling techniques to provide personalized content to the users. In particular, we focus our work on the concept of a user community, which seems to apply best to a public Web site. The system monitors and understands users’ preferences regarding specific companies and thematic domains. By maintaining the users’ needs the system can provide personalized user interfaces. This feature facilitates the use of both “simple” and “advanced” search, by pre-selecting the thematic domains that interest the users.
Another feature offered by MITOS system is the "Personal Newsletter", which presents the most recent news according to the user's profile. The user-modelling module provides information regarding the user's most preferred thematic domains and the user's most preferred companies. It then displays the full text of all recent news related to the preferred domains and companies.
Fig. 5 Advanced Search mode (company acquisition event)
The first level of system evaluation is described briefly, since it is beyond the scope of this paper. For the testing and further improvement of the Information Filtering, Natural Language Processing, Information Extraction and User Modelling subsystems, multiple prototypes were created and evaluated during the course of MITOS project [9, 10, 11, 6]. The development of prototypes facilitated early testing and provided a common reference point for all members of the design team. It improved the quality and completeness of the subsystem’s functional specification and led to the identification and solving of problems at an early stage of the system’s development.
Having realized that user requirements frequently do not become apparent until a system is in use, we tried to provide, from an early stage, a common base of discussion between developers and users. We, thus, developed and tested multiple prototypes of MITOS front-end. The evaluation of the user interface and its provided functionality was conducted in the following three stages: creation of a mockup, development and evaluation of a first prototype, development and evaluation of the second prototype
The mockup consisted of HTML pages, which presented the main functions of the system. In that early stage MITOS back end had not been developed. Our goal was to present a preliminary design of the site’s functionality. Therefore the data presented in the site were just examples especially created for the needs of the mockup. The usefulness of the mockup evaluation was rather limited. Instead of looking at the presented functionalities, most users claimed that they could not give valuable feedback since the data provided was limited and the supported functions incomplete. Therefore we had to wait the development of the first prototype, in order to examine the level of system’ s acceptance as well as to better understand the users’ needs.
The first prototype enabled us to show something tangible to the users, before committing them to the final system. Prototypes usually represent an incomplete version of the final system consisting of inputs, intermediary stages and outputs [1]. In our case, we were able to provide some level of functionality, since preliminary versions of back-end systems were released. We, thus, incorporated a large amount of financial news, which were processed off-line by the Information Filtering and Information Extraction sub-systems. The analysed information was stored in the system’s database and thus became available to the end user. The first prototype supported only two types of search: Simple and Advanced. It also incorporated user-modelling techniques for the provision of personalized content. We, thus, presented to users a prototype supporting the main MITOS functions. The evaluation was conducted by persons interested in financial business as well as by persons experienced in the development of information processing systems. A brief description of the system’s functionality along with some suggested usage scenarios were distributed to the evaluators. Since our aim was to assess whether MITOS system satisfied the users’ needs the data collected was mainly qualitative.
The users commented that the functionalities provided by MITOS system were useful. By being able to test the system using real data they could better appreciate its utility. However, they requested major improvements to the user interface, which seemed to be poor and unattractive. The limited attention that was given to the design of the user interface prohibited users of appropriately assessing the system’s functionalities. They also mentioned, the lack of a detailed online help. Besides the improvements in the MITOS web site appearance, the users also suggested to incorporate some additional functionality. Although they were satisfied from the provision of two ways of search, they recommended adding the feature of searching by using company names. They also proposed the provision of an extra Web page, which would include the most recent news of interest, in the form of a personal newsletter.
Following users' suggestions, the third way of search, called “Topic Search by Company Name” was incorporated. We also offered the “Personal Newsletter” service to present the most recent financial news related to thematic domains and companies of users’ interests. Besides adding these features, we have also made major improvements to the overall appearance of the user interface. In order to provide a friendly and attractive environment we designed some Web pages asking for feedback from the users. The agreed web pages were incorporated to the final version of the system. We finally enhanced our user interface by including a more detailed online help.
In the second prototype emphasis was not only given to the evaluation of its operations but also to the quality and accuracy of the processed information. A beta version of the Information Filtering and Information Extraction sub-systems was installed in the MITOS server. We then started to feed our database with financial news, which were analysed and processed online by the back end’s systems. During the evaluation of the 2 nd prototype, users reported only minor weaknesses and mentioned limited problems related to the systems functionality. Most improvements suggested were implemented in the final version of the system.
Besides these limited remarks the overall system was found to be useful. According to users’ opinion, the second version of the user interface is easy, attractive and friendly to use. This enabled them to pay more attention to the evaluation of the content and functionality of the site. Most users were satisfied by the results of the analysis and processing of information and argued that MITOS system facilitates the retrieval of financial news. They also said that it enables them to easily and quickly identify valuable information.