http://www.spinellis.gr/pubs/jrnl/2003-PC-GTWeb/html/gtweb.html This is an HTML rendering of a working paper draft that led to a publication. The publication should always be cited in preference to this draft using the following reference:
|
Diomidis D. Spinellis
Department Management Science and Technology
Athens University of Economics and Business
Patision 76, GR-104 34 Athens, Greece
email: dds@aueb.gr
The purpose behind the realization of the GTWeb is to examine how trip diaries can be created and presented with minimal effort by exploiting the synergies of integrating different consumer-grade information appliances and publicly accessible databases. An information appliance [22] can be defined as one designed to perform a specific activity, able to share information with other appliances [15]. Many researchers have already examined personal navigation systems and location-based applications; as an example [21] proposes the WorldBoard system for associating information with places on a global scale, [4] describes the use of a personal digital assistant to aid the navigation in exhibition areas and fairs, while [2] describes how sentient computing systems can change their behavior based on their environment. Although the combination of positional data with visual media has been examined as a way to augment existing geographical information systems (GIS) [13], modern research, often performed within the wearable computing community [14], explores how the integration of data from digital cameras and GPS devices can be used to empower human cognition and intelligence. Thus, [20] examine the use of georeferenced photographs in an educational setting as a way to investigate community change, and [24] theorize how tourist photos could be annotated based on GPS-acquired information; an application also proposed by [5] who in addition suggest using the positional data to assist the user in photograph composition and the reconstruction of 3D images. In parallel, more recent research has examined the use of positional data to infer significant locations in a person's life [3], and the automatic summarization of continuously acquired personal multimedia content [10].
The contribution of this article is the concrete demonstration of following ideas:
In the following sections we present the GTWeb's design and implementation, present examples of its actual use, and discuss the lessons we learned from its implementation.
A GTWeb site consists of the trip overview, timelines, maps, photographs, and a technical summary of the processing details.1 A UML diagram of the GTWeb content tree appears in Figure 2.
The trip overview provides a textual narrative of the whole trip
like the following:2
``From 2.08 km S of Kastraki (hill) (topological, street map) (Sun Aug 19, 2001 10:48:55) to
1.74 km W of Metokhion Konstamonitou (populated place) (topological, street map) (Sat Aug
25, 2001 09:14:29) covering a travel distance of 898.02 km at an average speed of 60 km/h
over an area of 45909 sq km. Duration 5 day(s), travel time 14:45 (travel map).''
We use timelines to order trip events-like approaches to geographical features and photographs-based
on the time of their occurrence thus generating a trip diary:
Wed Aug 22, 2001
[ ... ]
[ ... ]
Maps and photographs are also ordered in a chronological order and divided into separate pages based on the day the trip was made. GTWeb contains a separate overview map for each trip leg, and detailed maps covering smaller areas. Each detailed map shows the route traveled and geographic features (populated places, streams, hills, etc.) annotated with the time they were approached (figures 3 and 4). Each map is prefixed by a description of the trip part it illustrates.
Photographs are indexed in a chronological order using thumbprints and annotated with a description
of the time and place they were taken:
Wed Aug 22, 2001 13:42:56
About (most recent fix taken 2 seconds from the picture time) (topological, street map) 0.49 km SE of Moni Xenofondos (monastery) (topological, street map)
The same description, together with links to the corresponding trip leg map and the detailed trip part map, also appears under the full-sized image of each photograph. All descriptions contain links leading to dynamically generated topological and street maps available on public Web sites.
You can see the data-flow diagram of the GTWeb creation process in Figure 6. The GTWeb software first processes the GPS track log together with the gazetteer database to annotate the track log with the nearest-in Euclidean distance-geographical features for each track point. Topography (a grid of altitude points on the earth globe) and coastline data (closed polygons) is then used to create the various maps. During this phase, the trip track and geographical features are superimposed on the maps drawn by matching the respective longitude and latitude coordinates. Finally, the pictures are allocated into different maps and textually annotated based on the time assigned by the respective appliance to each track log point and each digital picture. The availability of time information for both track log points and the pictures was the crucial factor that allowed us to integrate the two different data sets.
The data model used to construct the GTWeb is depicted as a UML diagram in Figure 7. The primary types of data objects are:
To create the GTWeb the three data objects are extended by combining features of their parent classes. Thus
The time and location of the traveler's ``visit'' to the vicinity of a given geographical feature is determined by the track log point that has the smallest Euclidean distance to the given feature. This can be formalized as follows:
|
|
Most data is stored in its native format, apart from picture metadata where an intermediate program layer transforms filesystem-resident information into XML that is used for further processing. Thus a photograph's details will appear as follows:
<photo> <name>DSC00007.JPG</name> <time>998474606</time> <caption>Ouranoupoli</caption> <localtime>Wed Aug 22 13:03:26 2001</localtime> <gmtime>Wed Aug 22 10:03:26 2001</gmtime> </photo>
In the future standardized schemas based on XML should probably be used for interfacing and accessing all data, thus avoiding incompatibilities between different cameras and GPS devices. Similarly, at the physical level the serial NMEA-based interface that we used for GPS data capture and the compact flash filesystem we used for transferring the photographs could probably be standardized through uniform USB or Bluetooth device profiles.
For the selection of the GTWeb presentation and implementation technologies we had to choose between three different alternatives. A query-based interface would present results (maps, photographs) based on conditions specified by a user (show me where I was on August 17th, 2001). Such an approach however is unsuitable for casual browsing, which we felt was a highly desirable feature. If the above approach was supplanted by a dynamic browsing interface, it would allow users to create content on the fly based on their actions; using this approach users would be able to zoom and pan on the maps and photographs. However, the drawback of both approaches is that they would depend at run-time on a number of large software applications such as a relational database, an application server, and a GIS. The complexity of these applications might be an inhibiting factor for the adoption of a system we designed mainly for personal use. In addition, the platform's software and hardware requirements would introduce maintenance problems and would create a significant preservation risk for material that would typically be archived for decades-who hasn't nostalgically browsed photo albums or diaries recorded 20 or 50 years ago? We feel that the static HTML content we selected as our GTWeb presentation format is a lot more likely to survive a series of system upgrades over a period of 10-50 years, than a perhaps more versatile system that would create content dynamically.
The system has been implemented in the form described in the previous section and has been used to illustrate a number of trips. In practice it works well for summarizing relatively long (100km) trips; shorter distances are less effectively presented due to the lack of publicly available low-scale digital geographic data. This is also the reason we are currently not providing picture hyperlinks from the maps. A large number of photographs taken at the same location will still be correctly categorized and ordered by the time they were taken, but their geographic annotation will not be very informative. In the future, publicly available coordinate positions for elements such as monuments, town areas, road names, and other notable features could be used to address this shortcoming. A database of such feature details can be created through community cooperative efforts by harvesting elements such as photograph captions. In the form the system is currently realized it is more useful for presenting car, plane, or boat trips, than e.g. hiking or bicycle excursions.
A small, but irritating, problem we encountered concerned the time synchronization of the two appliances. The correct setting of an appliance's clock is a task notoriously neglected, thus when correct time stamps are needed for synchronizing the data of the digital camera with that of the GPS they may be unavailable. Furthermore, timezones and the daylight savings time create additional challenges. Obviously both appliances need to be synchronized to use the same timeframe as a common reference. However cameras typically operate on local time, GPS devices on UTC time, and different PC operating systems on one or the other. In addition, meaningful captions and timelines have to be generated, not based on the local time of the processing computer, but on the local time that was in effect in the place and on the date where the trip was made. Correctly handling and documenting this behavior is a problem that we never handled to our complete satisfaction.
One other important issue that will emerge once GTWebs are published on the web, concerns the creators' privacy. A GTWeb may reveal more data and to more people than its publisher realizes. As an example, a speeding violation can be deduced by examining a trip's timeline. Appropriate measures have to be taken to distinguish between an individual's or a family's intranet hosting personal experiences from the public Internet. The former can be perhaps hosted on CD-ROM media never to be shared on the Internet. However, keep in mind that publishing and sharing the details of a trip is a time honored tradition that we humans seem to enjoy from the early antiquity.
To investigate end-user views of the GTWeb's presentation format we conducted a small informal study by directing around 200 members of our academic community to view a sample trip report and fill-in an online questionnaire. We had 30 responses; a 15% response rate. The (self-selected) population of our survey's respondents can be considered young with ages ranging from 18 to 35 years and an average age of 22 years. Their sex was roughly balanced (46% female, 53% male), as were their perceived IT skills: 13% considered themselves beginners, 66% reported they used computers with confidence, and 20% considered themselves experts. When asked to compare GTWeb with other ways traditionally used to present photographs 63% found GTWeb better than a paper-based album, while 83% found GTWeb better than a plain on-line photograph collection (26% and 13% respectively preferred the alternative presentation). Photograph captions and online maps were found interesting by 80% and 86% of the respondents, while 16% and 6% found them useless, and 3% and 6% found them irritating. Finally, 83% answered in two separate questions that the would like to use GTWeb to present information about their trips and also use it as the only way to present information about their trips. This figure dropped to 63% when asked whether they would like to use GTWeb to present their photograph collection, and to a meager 13% who said they were likely to use GTWeb as the only way to present their photograph collection.
The above figures, although not an outcome of a statistically rigorous study, indicate that the strongest advantage of GTWeb is its presentation of spatial data in the form of annotated maps. The organization of photographs, although not criticized, was mostly considered a ``nice to have'' feature that would probably be supplemented by additional dissemination forms such as photo albums, email, and (increasingly) multimedia messaging (MMS) exchanges. In retrospect, we should have been expecting this finding, since digital photographs, offering most affordances of their paper-based relatives (and some additional ones), live and compete in an already rich ecosystem that has evolved over a period of more than 150 years. In contrast, detailed spatial data of the form generated by GPS devices is a relatively new phenomenon. GPS receivers are gradually being added into consumer mobile electronics [8], but most people have no experience with using, presenting, and disseminating the data these devices generate.
The design and implementation of the GTWeb taught us a number of important lessons regarding the presentation of trip diaries and the integration of information appliances.
The informal survey we conducted showed that end-users, although generally positive towards new ways for organizing, displaying, and disseminating digital photographs, are mostly interested in tools that allow them to effectively use the new data types generated by their appliances, in our case the trip log's coordinates. We believe that designers of other applications dealing with novel data types, such as e.g. RFID tag data streams captured from consumer goods, will face in the future similar opportunities.
Standardization played a vital part in our endeavor. All topography elements, gazetteer information, coastlines, and the GPS track log were based the same standard geodesic system (WGS-84) making it possible to superimpose and link elements acquired from the end-user device and different agencies on the same map. In addition, track log information could be downloaded from the GPS device using a common hardware interface, and the photographs could be accessed on the camera's storage device in a filesystem format recognized by our workstation's operating system. Furthermore, modular open-source software and public databases that could be downloaded in their entirety provided the facilities for annotating the photographs and displaying the track logs in a meaningful context.
One other important lesson from building the GTWeb system concerns the importance of what was in effect ancillary data for integrating the two appliances. Both the GPS track log and the photographs were tagged with date and time information that we exploited to link them together. Most of the value-added GTWeb content was derived by joining (in the database sense of the word) data using as a join key approximate time or location matches. We believe that making this type of data, generated by many appliances, available will breed a number of innovative applications.
Our low-end choices of technology were also instructive. The content delivery mechanism we used (static web pages) although not sophisticated when compared to the various active content technologies proved surprisingly effective, portable, and resilient. Similarly, our choice of consumer-grade appliances demonstrated that many interesting pervasive computing applications can be constructed by combining ordinary equipment in innovative ways.
1 You can see a GTWeb example at http://www.spinellis.gr/gtweb/Chalkidiki.
2 We use italic characters to denote hyperlinks.
3 http://dbwww.essc.psu.edu/dbndx/tree/data/dem/etopo5.html
4 http://www.ngdc.noaa.gov/seg/topo/