=Paper=
{{Paper
|id=Vol-443/paper-2
|storemode=property
|title=Implementation and Evaluation of a Semantics-based User Interface for Web Gazetteers
|pdfUrl=https://ceur-ws.org/Vol-443/paper2.pdf
|volume=Vol-443
}}
==Implementation and Evaluation of a Semantics-based User Interface for Web Gazetteers==
Implementation and Evaluation of a Semantics-based User
Interface for Web Gazetteers
Krzysztof Janowicz Mirco Schwarz Marc Wilkes
Institute for Geoinformatics Institute for Geoinformatics Institute for Geoinformatics
University of Münster, University of Münster, University of Münster,
Germany Germany Germany
janowicz@uni-muenster.de mirco.schwarz@uni-muenster.de marc.wilkes@uni-muenster.de
ABSTRACT techniques developed in conjunction with the social Web
As the Web moves towards structured data, new communi- 2.0, such as search-while-you-type forms (also known as au-
cation paradigms and user interfaces become necessary to tocompletion) and tag clouds, with the reasoning capabilities
support users beyond simple keyword search. In contrast of the semantic Web.
to applications for expert users, these interfaces should hide
the complexity of ontology-based systems but still offer their GAZETTEERS AS APPLICATION AREA
reasoning capabilities. Using Web gazetteers as application Gazetteers are place name directories. Each gazetteer record
area, we discuss how subsumption and similarity reasoning consists at least of a triple (N, F, T ) where N corresponds
can be integrated into modern Web interfaces and explain the to one or more place names, F represents one or more geo-
made design decisions. To evaluate our approach, the results graphic footprints (i.e., locations), and T is the type (class)
from a first human participants test are presented. of the described feature (i.e., the representation of a real
world geographic entity). Hence, gazetteers support at least
Author Keywords two types of queries. First, they map between place names
Information Retrieval, Similarity, Geographic Feature Types and respective footprints: N → F ; and second, they map be-
tween names and geographic feature types: N → T . Note
ACM Classification Keywords that a named geographic place is an abstract individual de-
H.5.2 Information Interfaces and Presentation: User Inter- fined to refer to a physical region in space and categorized
faces—Graphical user interface; H.3.3 Information Storage according to commonly agreed characteristics. The place
and Retrieval: Information Search and Retrieval name is a handle to support communication. Hence, Place
is a social construct of interest for a particular community
INTRODUCTION AND MOTIVATION during a specific time span. A place may change its name,
The major benefit of semantics-based information retrieval location, and type (e.g., from city to ruin) over time [10, 12].
is the possibility to reach beyond simple keyword search,
and hence to support the user in answering complex queries. Gazetteers are key components of all georeferenced infor-
While there are several promising approaches to semantics- mation systems, including GIScience applications in many
based retrieval such as subsumption and similarity reason- diverse fields, Web-based mapping services such as Google
ing, there is surprisingly little work on how to hide the ad- MapsTM , and an increasing number of Web 2.0 applications
ditional complexity (e.g., the underlying ontology) from the such as plazes.com. Gazetteers are crucial for geoparsing
user. In addition, the results have to be presented in a way where references to geographic locations (by their names)
that they are also comprehensible for non-experts. The ex- are recognized in texts and converted to coordinate refer-
tended retrieval capabilities of the semantic Web should not ences [10]. Most gazetteers are either accessed via a Web
be slowed down by more complex user interfaces. page or through an application programming interface (API).
The geographic feature type is selected from a typing scheme,
Based on our previous work [14, 13], we discuss the im- in most cases from a feature type thesaurus. The type of the
plementation and evaluation of a semantics-based user in- described geographic feature is of special importance. For
terface for Web gazetteers. It makes use of subsumption instance, a search for places named Berlin might return more
and similarity reasoning to support the users in navigating than hundred places with a lot of different feature types vary-
through geographic feature types. Our idea is to combine ing from capital (located in Germany), over administrative
area (a state in Germany), to stream (located in Colombia).
One of the major Web gazetteers is the Alexandria Digital
Library (ADL) Gazetteer1 . The ADL Web interface depicted
in figure 1 includes a map to narrow down the search to a par-
ticular spatial extent, a form to enter the place name (or parts
of it), as well as a scrolldown list with about 200 (or 1200
Workshop on Visual Interfaces to the Social and the Semantic Web 1
(VISSW2009), IUI2009, Feb 8 2009, Sanibel Island, Florida, USA. http://www.alexandria.ucsb.edu/gazetteer/
Copyright is held by the author/owner(s).
1
if one includes the non-preferred) geographic feature types. SEMANTICS-BASED INFORMATION RETRIEVAL
To learn about the definitions of these types, the user has to In contrast to mere syntactic search, semantics-based infor-
switch to the ADL Feature Type Thesaurus (FTT) and read mation retrieval takes the underlying conceptualization (or
the (often ambiguous and brief) plain text descriptions. The attribution) into account to improve searching and brows-
thesaurus also contains information about super types and ing through structured data. Three approaches can be dis-
some manually selected related types (with many types re- tinguished: 1) those based on classical subsumption reason-
maining unrelated to others). Some gazetteers try to take up ing, 2) those based on so-called non-standard inference tech-
the participantion idea of the Web 2.0 and allow for user de- niques such as computing the least common subsumer (lcs)
fined places and types. In both cases, the navigation through or most specific concept (mcs) [18, 17], and finally 3) those
and understanding of the types is a major shortcoming [12]. approaches based on computing semantic similarity2 [5, 6,
3, 14]. As depicted in figure 2, subsumption reasoning can
The following example illustrates the navigation between be applied to vertical search, while similarity works best for
feature types in ADL. A user selects lakes from the feature horizontal search and should not be applied to compare sub
type scrolldown list using the ADL gazetteer Web interface. and super types.
Lakes is a narrower term [2] of hydrographic features which
is a top term in the FTT hierarchy. As the user cannot find
a particular geographic feature, she decides to change the
type to search for reservoirs. Unfortunately, reservoirs is
not a narrower term of hydrographic features, and therefore
she looks up the textual definition of lakes in the thesaurus.
Here, reservoirs is specified as related term [2] and she can
click on the link and read the definition of reservoirs. It is
defined as narrower term [2] of hydrographic structure. This
in turn is a narrower term of manmade features which is an-
other top term in the ADL FTT.
Figure 2. Vertical and horizontal retrieval within an ontology.
Formally, the result set for a user’s query using subsumption
reasoning is defined as RS = {C | C v Cs }; where Cs is
the search (or query) concept specified by the user. As each
concept returned to the user is a subsumee of the search con-
cept, it meets the user’s search criteria. This makes selecting
an appropriate search concept the major challenge for end
users. In fact, the search concept is an artificial construct
and not necessarily the searched concept [15]. If it is too
generic, i.e., at the top of the hierarchy, the user will get a
large part of the queried ontology back as (unsorted) result
set. In contrast, if the search concept is too specific, the re-
Figure 1. The Alexandria Digital Library Gazetteer Web interface sult set might be empty.
showing a search for reservoirs in Florida.
In case of similarity-based retrieval, the result set is defined
As discussed by Janowicz and Keßler [12], semantics-based
as RS = {Ct | sim(Cs , Ct ) > t}; where Ct is the compared-
user interfaces in conjunction with geographic feature type
to target concept and t a threshold defined by the user or ap-
ontologies would support the user in browsing through and
plication [14, 15]. In contrast to subsumption reasoning, the
searching in gazetteers. The alternative interface proposed in
search concept is the concept the user is really searching for
this paper aims at hiding the feature type scrolldown list (see
(no matter if it is part of the queried ontology or not). As
figure 1) from the user. Instead, we propose a search-while-
the concepts are compared for overlap (of their definitions
you-type form as well as a combination of horizontal and
or extensions), similarity is more flexible than subsumption
vertical search to improve the navigation between (related)
reasoning, but it is not guaranteed that the results match all of
feature types. These types will be determined based on their
the user’s search criteria. Note that the similarity estimations
formal specifications. Hence, users do not have to look up
between search and target concepts are asymmetric which is
potentially relevant types in the FTT by hand.
important for information retrieval. Usually, the results of a
2
We are focussing on measures which define similarity as degree
of conceptual overlap here. Other theories consider similarity as
(inverse) distance within a multi-dimensional space, or as a set of
transformations [7] from the search to the target concept.
2
similarity query are presented to the user as an ordered list
with descending similarity values.
Summing up, the benefits similarity offers during the re-
trieval phase, i.e., to deliver a flexible degree of (conceptual)
overlap to a searched concept, stands against shortcomings
during the usage of the retrieved information, namely that
the results not necessarily fit the user’s requirements.
To overcome these shortcomings, similarity theories such as
SIM-DL combine subsumption and similarity reasoning. In
a first step, the context of discourse [11] is defined by intro-
ducing a so-called context concept such that Cd = {Ct |Ct v
Cc }. Consequently, in the next step only such concepts are
compared for similarity which are subconcepts of Cc . This
way, the user can specify some minimal characteristics all Figure 3. Combining subsumption and similarity reasoning for in-
target concepts need to share. As depicted in figure 3, Cti formation retrieval. Cs is the search concept, Cti and Ctj are the
compared-to target concepts, while Cx is not a subsumee of the con-
and Ctj are compared for similarity to Cs while Cx is not. text concept Cc , and hence is not compared for similarity to Cs .
Note that for reasons of simplification, the figures 2 and 3
show a single hierarchy, while similarity takes all role-filler
pairs (e.g., the fact that a river has a spring as its origin content or the type of data. Van Kleek and colleagues [24]
∃ hasOrigin.Spring) into account to compute conceptual point to the obstacles users face when diving into the seman-
overlap. tic Web and introduce an agenda on how to improve knowl-
edge creation and access for end users. Van Ossenbruggen et
With regard to semantics-based user interfaces, subsumption al. argue why it is difficult to evaluate semantics-based user
reasoning can be applied to include lakes in the result set if interfaces [25]. Basically, one needs to distinguish between
the user is searching for (the string) waterbody. Mcs and lcs the quality of the underlying data, the involved reasoning
can be used to implement a query-by-example in where the engine, and the user interface as such. In this work, we fo-
result set is computed out of a set of prototypical individuals cus on the user interface. Additional information on the un-
(e.g., waterbodies) selected by the user [26, 15]. Similarity derlying data and ontologies is discussed in [12], while the
measures can be implemented to propose alternative or sup- similarity reasoning is described in [14].
plementary results such as reservoirs if the user is searching
for (the string) lake. However, to propose meaningful alter- The Atom interface [21] realizes a circular layout to support
natives the underlying similarity measure needs to be cog- users in navigating through semantic Web data. One exam-
nitively plausible, i.e., the rankings returned by such a mea- ple for a combination of Web 2.0 interaction and semantic
sure have to correlate with human similarity judgments for Web techniques is given by Heath and Motta [8], who de-
the same set of compared individuals or concepts based on scribe how non-experts can publish and consume RDF data
the same definitions3 . This has been shown for several mea- by means of tags in a reviewing Website. Whereas their
sures such as MDSM [19] and SIM-DL [14, 13]. As these work deals with tagging data and proposing related tags, we
similarity measures compare DL concepts defined within on- present an approach to interact with data classified in a tax-
tologies, the question what is similar to Lake depends on the onomy (of feature types) and focus on similarity and sub-
formal definition (and does not necessarily match the mental sumption instead of relatedness. A promising approach to
conceptualization of a particular human user). integrate semantic autocompletion for geographic locations
is described by Hildebrand et al. [9]. They propose an inter-
SEMANTICS-BASED USER INTERFACES face which allows for searching geographic places by name
While there are specifications on how to display RDF triples where the suggestions are grouped by country or feature
to the user [4, 20] and various semantic Web browsers such type. Instead of focusing on a search by name, in this paper,
as PowerMagpie4 , there is little work on how to integrate the we implement an additional feature type parameter, allowing
reasoning capabilities of the semantic Web within end user for a more restrictive search of places but also demanding for
interfaces for information retrieval. Schraefel and Karger an intuitive feature type selection process. An approach to
[22] argue against the assumption that data on the semantic ease concept selection is presented by Sinkkilä et al. [23].
Web should be displayed to end users as graphs simply be- They present an interface that combines semantic autocom-
cause graphs are used for the computational representation pletion with cross-ontological context navigation. Whereas
(as data model). As in contrast to HTML, XML (and there- Sinkkilä et al. chose to visualize the concept hierarchy as
fore RDF and OWL) separates design from content, the vi- well as related concepts, we suggest to hide the taxonomic
sualization should depend on the task to solve instead of the complexity from the user.
3
This is a weaker requirement than cognitive adequacy which
would imply that the computational similarity theory reproduces
the process of human similarity reasoning.
4
http://powermagpie.open.ac.uk/
3
IMPLEMENTATION and opacity visualize the ranked order of the similarity val-
In order to demonstrate and evaluate how semantics-based ues, where a big font size and high opacity indicate a high
information retrieval can be integrated in Web interfaces for similarity. Each feature type entry in the table is a hyper-
end users, we have implemented a prototypical interface for link. If a user clicks on one of the suggested feature types,
the ADL gazetteer. The interface can access the records it is selected as search parameter and the similar and super
stored in the gazetteer via the ADL API. To implement the types are computed thereupon. The color of the chosen fea-
search-while-you-type form as well as geographic feature ture type changes to orange to indicate that this type is part
type recommendations, the prototype makes use of AJAX of the new query.
(Asynchronous JavaScript and XML) which allows to up-
date certain parts of the Web interface without the need to The features resulting from a specific query are displayed at
reload the whole page. the right side of the interface and on the map. Clicking on
these links or map markers opens a text box which contains
As depicted in figure 4, the gazetteer Web interface allows the information from the ADL gazetteer (e.g., names, foot-
for specifying three search parameters: place name, geo- prints, broader administrative units).
graphic feature type, and location of the searched place. As
argued above, while specifying the name and location of a Figure 4 shows the result of an exemplary search for places
searched place is straightforward, choosing the correct fea- of type Reservoir located in southern Florida (without name
ture type is more difficult, especially in combination with the restriction). Using the interface, the search could be broad-
other parameters. An example is shown in figure 4; note that ened by clicking on the supertype Waterbody, leading to a
most of the places categorized as reservoirs are named lake larger result set (including the results for Reservoir). In case
or pond, hence they would not appear in the result list if the the search for reservoirs did not yield the desired result, the
user would search for geographic features of type Lake. user could also decide to search for similar geographic fea-
ture types such as Lake or Canal.
To point out why a semantics-based interfaces is valuable for
end users, imagine the following example. One of the search
results shown in figure 4 is Lake Manatee. Due to its name,
a user searching for this specific place might be tempted to
deduce that it is of type Lake. Specifying a query as shown
in figure 6 seems obvious, but will not return Lake Manatee,
since it is of type Reservoir. However, facing an empty result
set, the user could decide to either broaden the search by
selecting the supertype Waterbody, or try out a similar type
to Lake. In the latter case, searching for Reservoir, which is
most similar to Lake, yields the desired result.
EVALUATION
In this section we discuss first results and lessons learned
Figure 4. The semantics-based Web interface for the ADL gazetteer
showing the search result for reservoirs in southern Florida.
from a human participants test carried out using the new in-
terface.
When the users begin to enter characters in the feature type Our hypothesis was that an interface combining subsump-
input field a table appears showing suggested types, starting tion and similarity to support the user in selecting appro-
with the characters already entered in the input field. There- priate feature types would be more effective than interfaces
fore, users can immediately see which feature types are sup- solely based on either subsumption or similarity reasoning.
ported by the application without browsing the feature type We did not compare our interface to the original ADL inter-
thesaurus (or ontology) manually. In addition, the table pro- face. This is for the following reasons:
vides information about similar types and super types (which
are computed by the SIM-DL similarity server5 at query • The original ADL interface makes use of the ADL Fea-
time). These recommendations support the user in browsing ture Type Thesaurus while the new interface is based on
the underlying feature type ontology (vertically and horizon- a feature type ontology which contains only hydrographic
tally; see figure 3). feature types so far [12]. Additionally, we do not distin-
guish between preferred and non-preferred concepts and
To display the similarity rankings in the Web interface font have one root node instead of six.
size and opacity are used. This is known from tag clouds, as
used by several blogs or Web 2.0 recommendation portals,
and should therefore be familiar for the user. The font size • The new interface supports search-while-you-type, while
the ADL interface does not. Consequently, the partici-
5
The SIM-DL similarity server as well as the interface are free pants would have to scroll through the whole ADL feature
and open-source software and can be downloaded at http://sim- type list – while only some of these types are relevant for
dl.sourceforge.net/. the test.
4
• The ADL interface allows for multiple comparison oper- given to the participants. They were asked to speak out loud
ators for the place names (has any words, has all words, while filling in the test and each test was recorded on video.
has phrase, equals, matches pattern). To reduce complex- To make sure that the tasks are not biased towards subsump-
ity for end users, the new interface only supports has all tion or similarity reasoning, the questionnaire was designed
words for comparison. in a way that it involves both kinds of reasoning as well as in-
teraction with the map and the place name field. Finally, the
• The ADL interface map does not support map markers as participants could give general comments on whether they
used in the Google MapsTM mashup in the new interface liked the interface and how it could be improved.
to narrow down the search to a particular map extent.
As depicted in figure 4, all interfaces were designed in a way
To test the hypothesis we developed three versions of the that the search parameters form a sentence of the type
semantics-based interface, the combined, a similarity-based
and a subsumption-based version. While the subsumption- You are currently looking for places with . . . in the
based interface shows super and sub types, the combined in- place name, and . . . as place type, and that are located
terface only lists the super types (see figure 5). The underly- within this region: . . . .
ing assumption was that users will select specific types rather
than general ones and can use the super types to broaden
their search. Strictly speaking, this could be another hypoth-
esis for further testing. Showing similar, super, and sub types
may overload the interface and overstrain the user.
Figure 5. The geographic feature type recommendation part of the
interfaces: (1) combined interface, (2) similarity-based interface, (3)
subsumption-based interface.
To verify the hypothesis we investigated how many tasks Figure 6. The interface shows the sentence resulting from the user’s
were successfully solved per interface and how many user search parameters while querying.
interactions (e.g., clicks) were necessary. As the interface
acts as front-end for the ADL gazetteer (whose response During a series of pre-tests it turned out that many partici-
times differ), the time needed to solve the tasks cannot be pants are so accustomed to single keyword search fields (as
taken as criterion. A total of 30 participants performed the used in most common Web search engines7 ) that they put the
test6 , i.e., each interface was tested by 10 participants. To place name as well as the feature type into the place name
ensure that the participants understand how the interfaces field. This was still the case when they were asked to speak
work, a video was shown which stepwise presents how to out the query before submitting it. Therefore, the interfaces
solve an introductory task: were redesigned to display the sentence resulting from the
entered search parameters in the map area while loading (and
Ems-Task the participants were still asked to speak out the query). This
reduced the number of wrong queries.
• Find the river Ems in Germany. Make it the only
result shown in the ’found places’ list. Try using Analyzing the first results from our human participants test,
just the ’place name’ field. Try using both fields, it turns out that from a total of 40 tasks per interface, 72.5%
’place name’ AND ’place type’. were solved completely (i.e., including all sub tasks) using
the combined interface. The participants using the similarity-
• Find canals connected to the Ems. Try using based interface were able to solve 65% of the tasks, while
just the ’place name’ field. Try using both fields, 62.5% of the tasks were solved using the subsumption-only
’place name’ AND ’place type’. (Note that the interface.
number of ’found places’ differs. Do you see
why?) As depicted in figure 7, there is no clear difference in the
total number of user interactions necessary to solve the tasks.
Next, the participants were asked to solve the task on their However, the inter-quartile range differs clearly. It increases
own. Afterwards, four additional tasks (with sub tasks) were from the combined interface over the subsumption-based to
6
With an age ranging from 21 to 30 years, 9 were female and the similarity-based interface.
21 male (most of them students of either geoinformatics or com-
7
puter science). The questionnaire and the introductory video can In contrast to upcoming semantics-based search engines such as
be downloaded at http://sim-dl.sourceforge.net/downloads/. Powerset (http://www.powerset.com/).
5
In the current version of the interface, only ten entries of the
result set and their corresponding map markers are shown at
a time. Although it is possible to go forth (or back) to see
the next (or previous) ten entries, several participants stated
that they almost overlooked the fact that there are more than
ten result entries. As a consequence, they proposed to show
all markers on the map, where those currently not listed in
the sidebar could be greyed out. Some participants also re-
quested that features of similar types should be directly dis-
played on the map using colored markers to visualize their
similarity instead of the suggestion table.
Finally, a few participants mentioned that upon sending a
query to the server and reading the query summary, they re-
alized a mistake in one of the search parameters, and hence
missed a button to cancel the request. As this was not possi-
ble, the wrong query was counted as user interaction during
the test.
Figure 7. The box plot shows the median as well as the 0.25 and 0.75
quartiles for the number of total user interactions (clicking, typing in CONCLUSIONS AND FURTHER WORK
words, following suggestions, interacting with the map) per interface In this paper we discussed results from testing the integra-
type. Comb is the combined interface using subsumption and similarity,
while Sim only shows similar types as suggestions and Sub only sub and
tion of subsumption and similarity reasoning into end user
super types. interfaces to support horizontal and vertical retrieval. While
a combination of both seems to be the most promising ap-
proach, it is not surprising that there is a gap between being
While these first results support our hypothesis, the test also able to implement more intelligent user interfaces and mak-
points so some aspects which let us rethink the design of ing them intuitively useable [25]. One solution would be
the Web interface as well as the number of user interac- to hide even more of the complexity (e.g., the suggestions
tions as quality measure. Comparing manually entered ver- table) from the user by directly displaying features of very
sus clicked feature types, intuitively one would expect that similar types. However, this may be interpreted as pater-
the more types are suggested, the more will be clicked in- nalism and reduce the user’s trust in such interfaces. For
stead of being manually entered. Hence, the number of man- instance, while reservoirs and lakes might be similar enough
ually entered feature types should be lower in the combined for most tasks, some application areas such as water man-
interface compared to the other ones. This turns out to be agement may require a clear distinction between both. The
a wrong assumption. In fact, the ratio between clicked and extended retrieval capabilities of the semantic Web should
manually entered types was about one to one in case of the be used to offer support, while the final decision should still
combined and the similarity interface, while being higher be up to the user. This leads to the question of how to in-
for the subsumption-based interface. To understand these corporate contextual information in the user interfaces and
results, we reviewed the videos taken from the participants to which degree context influences the resulting suggestions
during the test. It turns out that several participants had en- [1, 16, 11]. From the viewpoint of the social Web, one could
tered the type manually after seeing it in the suggestion table. also think of allowing human users to add similar types by
This points towards two difficulties. First, we have to inves- hand to improve the suggestions.
tigate whether this was caused by the interface design, i.e.,
by the positioning of the forms and the table (for instance
using eye tracking). Second, this makes it more difficult to
understand to which degree the suggested types were helpful
to the user in selecting the next type to be displayed.
After completing the tasks, the participants were asked to an-
swer additional questions about the design and functionality
of the interface. While the participants liked the interfaces
in general and rated the type suggestion table to be useful
(giving both the second best median rank on a 5-level Likert Figure 8. Conceptual design for the feature type suggestions allowing
scale), it turns out that some of the participants still had prob- for vertical and horizontal navigation.
lems dealing with two input fields. A given suggestion for
improvement was to have just one input field and automat- Based on our experiences and the test results, we designed
ically identify keywords representing feature types. Some a new navigation component which will be integrated in the
participants recommended to cache previous queries, allow- next version of the Web interface (figure 8). This component
ing for an easy comparison of several result sets using the will replace the feature type suggestion table in the upper
browser’s navigation (back and forward) buttons. right part of the interface. It will appear as soon as a feature
6
type is selected or typed in by the user. The suggestion ta- Workshop on Ontology Matching (OM2007) at
ble itself will appear just below the feature type input field ISWC/ASWC2007, Busan, South Korea, 2007.
as usually implemented in autosuggestion forms. The new
component visualizes explicitly the difference between hor- 4. C. Bizer, R. Lee, and E. Pietriga. Fresnel - Display
izontal and vertical navigation. A vertical slider can be used Vocabulary for RDF. Technical report, The World Wide
to specify more general or more specific feature types than Web Consortium (W3C), June 2005.
the searched type. The result set only contains places that
5. A. Borgida, T. Walsh, and H. Hirsh. Towards
are of a type under the slider. The horizontal slider allows
Measuring Similarity in Description Logics. In
for including similar types in the result set. This integrates
Proceedings of the 2005 International Workshop on
the idea of a similarity threshold which was discussed in the
Description Logics (DL2005), volume 147 of CEUR
semantics-based information retrieval section. Here, only
Workshop Proceedings. CEUR, Edinburgh, Scotland,
those places are included in the result set that are to the left of
UK, 2005.
the slider. As in the presented user interface, less opacity in-
dicates decreasing similarity. The use of font size, however, 6. C. d’Amato, N. Fanizzi, and F. Esposito. A Semantic
differs. Whereas previously the font size also represented Similarity Measure for expressive Description Logics.
the similarity value, within the conceptual navigation com- In CILC 2005, Convegno Italiano di Logica
ponent the font size indicates the number of features found Computazionale, Rome, Italy, 2005.
within the map extent for that specific type; where a larger
font size points to a higher number of found places. 7. R. L. Goldstone and J. Son. Similarity. In K. Holyoak
and R. Morrison, editors, Cambridge Handbook of
In figure 8, the immediate super type of Reservoir is Water- Thinking and Reasoning. Cambridge University Press,
body, although another immediate super type of Reservoir 2005.
specified in the underlying ontology is Manmade. Allow-
ing for generalizing also towards Manmade now yields two 8. T. Heath, E. Motta, T. Heath, and E. Motta. Ease of
problems. First, the complexity of the navigation component Interaction Plus Ease of Integration: Combining Web 2.
would increase, ending up in a graph-like representation of 0 and the Semantic Web in a Reviewing Site. Web
the ontology (which is the opposite of our design goal). Sec- Semantics: Science, Services and Agents on the World
ond, generalizing towards multiple super types will result in Wide Web, 6(1):76–83, 2008. Semantic Web and Web
a large and diverse result set. That is why we propose to only 2.0.
generalize along one inheritance path. Deciding which path
of inheritance is shown in the navigation component depends 9. M. Hildebrand, J. van Ossenbruggen, A. Amin,
on the context concept Cc whose integration in the interface L. Aroyo, J. Wielemaker, and L. Hardman. The Design
is an open issue. Space of a Configurable Autocompletion Component.
Technical report, CWI technical report INS-E0708,
November 2007.
ACKNOWLEDGEMENTS
The comments from three anonymous reviewers provided 10. L. L. Hill. Georeferencing: The Geographic
useful suggestions to improve the content and clarity of the Associations of Information (Digital Libraries and
paper. This work is funded by the Semantic Similarity Mea- Electronic Publishing). The MIT Press, 2006.
surement for Role-Governed Geospatial Categories (SimCat)
project granted by the German Research Foundation (DFG 11. K. Janowicz. Kinds of Contexts and their Impact on
Ra1062/2-1), as well as the International Research Training Semantic Similarity Measurement. In 5th IEEE
Group on Semantic Integration of Geospatial Information Workshop on Context Modeling and Reasoning
(DFG GRK 1498). (CoMoRea’08) at the 6th IEEE International
Conference on Pervasive Computing and
Communication (PerCom’08), pages 441–446, Hong
REFERENCES Kong, March 2008. IEEE Computer Society.
1. R. Albertoni and M. D. Martino. Semantic Similarity of
Ontology Instances Tailored on the Application 12. K. Janowicz and C. Keßler. The Role of Ontology in
Context. In A. K. Dey, B. Kokinov, D. Leake, and Improving Gazetteer Interaction. International Journal
R. Turner, editors, ODBASE - OTM Conferences, of Geographical Information Science,
volume 4275 of Lecture Notes in Computer Science, 10(22):1129–1157, 2008.
pages 1020–1038. Springer, 2006.
13. K. Janowicz, C. Keßler, I. Panov, M. Wilkes,
2. ANSI/NISO. ANSI/NISO Z39.19 - 2005 Guidelines for M. Espeter, and M. Schwarz. A Study on the Cognitive
the Construction, Format, and Management of Plausibility of SIM-DL Similarity Rankings for
Monolingual Controlled Vocabularies. NISO Press, Geographic Feature Types. In L. Bernard,
2005. A. Friis-Christensen, and H. Pundt, editors, 11th
AGILE International Conference on Geographic
3. R. Araújo and H. S. Pinto. Towards Semantics-Based Information Science (AGILE 2008), Lecture Notes in
Ontology Similarity. In P. Shvaiko, J. Euzenat, Geoinformation and Cartography, pages 115–133,
F. Giunchiglia, and B. He, editors, Proceedings of the Girona, Spain, 5-8. May 2008. Springer.
7
14. K. Janowicz, C. Keßler, M. Schwarz, M. Wilkes, 24. M. Van Kleek, M. Bernstein, P. André, M. Perttunen,
I. Panov, M. Espeter, and B. Baeumer. Algorithm, D. Karger, and M. Schraefel. Simplifying Knowledge
Implementation and Application of the SIM-DL Creation and Access for End-Users on the SW. In CHI
Similarity Server. In Second International Conference 2008 SWUI Workshop, April 2008.
on GeoSpatial Semantics (GeoS 2007), number 4853 in
Lecture Notes in Computer Science, pages 128–145. 25. J. van Ossenbruggen, A. Amin, and M. Hildebrand.
Springer, 2007. Why Evaluating Semantic Web Applications is
Difficult. In Semantic Web User Interaction at CHI
15. K. Janowicz, M. Wilkes, and M. Lutz. Similarity-based 2008: Exploring HCI Challenges, 2008.
Information Retrieval and its Role within Spatial Data
26. M. Wilkes and K. Janowicz. A Graph-based Alignment
Infrastructures. In 5th International Conference on
Approach to Similarity between Climbing Routes. In
Geographic Information Science (GIScience 2008),
First International Workshop on Information Semantics
Park City, Utah, USA, September 2008; forthcoming.
and its Implications for Geographic Analysis (ISGA
16. C. Keßler. Similarity Measurement in Context. In B. M. ’08) held in conjunction with GIScience 2008, the 5th
Kokinov, D. C. Richardson, T. Roth-Berghofer, and International Conference on Geographic Information
L. Vieu, editors, 6th International and Interdisciplinary Science, Park City, Utah, USA, September 23 2008.
Conference, CONTEXT 2007, volume 4635, pages
277–290. Springer, Copenhagen, Denmark, August
2007.
17. R. Küsters. Non-Standard Inferences in Description
Logics, volume 2100 of Lecture Notes in Artificial
Intelligence. Springer, 2001.
18. R. Möller, V. Haarslev, and B. Neumann.
Semantics-Based Information Retrieval. In Proc.
IT&KNOWS-98: International Conference on
Information Technology and Knowledge Systems, 31.
August- 4. September, Vienna, Budapest, pages 49–56,
1998.
19. A. Rodrı́guez and M. Egenhofer. Comparing Geospatial
Entity Classes: An Asymmetric and Context-dependent
Similarity Measure. International Journal of
Geographical Information Science, 18(3):229–256,
2004.
20. L. Rutledge, J. van Ossenbruggen, and L. Hardman.
Making RDF Presentable: Integrated Global and Local
Semantic Web Browsing. In WWW ’05: Proceedings of
the 14th international conference on World Wide Web,
pages 199–206, New York, NY, USA, 2005. ACM.
21. K. Samp, C. Basca, B. McDaniel, and S. Decker. Atom
Interface - A Novel Interface for Exploring and
Browsing Semantic Space. In Semantic Web User
Interaction at CHI 2008: Exploring HCI Challenges,
2008.
22. M. Schraefel and D. Karger. The Pathetic Fallacy of
RDF. In International Workshop on the Semantic Web
and User Interaction (SWUI) 2006, Athens, Georgia,
USA, November 2006.
23. R. Sinkkilä, E. Mäkelä, E. Hyvönen, and T. Kauppinen.
Combining Context Navigation with Semantic
Autocompletion to Solve Problems in Concept
Selection. In K. Belhajjame, M. d’Aquin, P. Haase, and
P. Missier, editors, SeMMA, volume 346 of CEUR
Workshop Proceedings, pages 61–68. CEUR-WS.org,
2008.
8