=Paper=
{{Paper
|id=Vol-369/paper-16
|storemode=property
|title=Humboldt: Exploring Linked Data
|pdfUrl=https://ceur-ws.org/Vol-369/paper15.pdf
|volume=Vol-369
|dblpUrl=https://dblp.org/rec/conf/www/KobilarovD08
}}
==Humboldt: Exploring Linked Data==
Humboldt: Exploring Linked Data
Georgi Kobilarov Ian Dickinson
Hewlett-Packard Labs Hewlett-Packard Labs
Bristol, UK Bristol, UK
georgi.kobilarov@gmx.de ian.dickinson@hp.com
ABSTRACT Current research in the semantic web user interface area, es-
We present Humboldt, a novel user interface for browsing pecially in semantic web powered search engines, has tended
RDF data. Current user interfaces for browsing RDF data to focus on information retrieval tasks. We characterize
are reviewed. We argue that browsing tasks require both these tasks as those in which a user has a very specific ques-
a facet browser’s ability to select and process groups of re- tion or query in mind which he tries to answer or solve. Com-
sources at a time and a ’resource at a time’ browser’s ability panies like Powerset [4] focus on providing mechanisms to
to navigate anywhere in a dataset. We describe Humboldt answer tasks such as e.g. ’what is the population of Berlin?’,
which combines these two features in a single coherent in- ’which books are written by author X?’, etc. using semantic
terface. Our approach is based on the operation of pivoting, technologies.
which enables the user to move the focus of a browsing from
one set of resources to a set of related resources. With re-
peated use of the pivot operation the user can browse any- In this paper, we focus on exploratory tasks and character-
where in the data. We describe a preliminary evaluation of ize them as follows: in exploratory tasks, a user has a more
our approach and discuss its implications for further devel- vague idea of the question he wants to answer. His interests
opment. depend upon the changing characteristics of the information
context [7] [8].
Categories and Subject Descriptors We give two business related examples for this kind of tasks:
H5.2 [Information Interfaces and Presentation]: User
Interfaces; H5.4 [Information Interfaces and Presenta-
tion]: Hypertext/Hypermedia • A product manager plans a product update. He wants
to get an overview of the new features of competi-
General Terms tors’ products. User reports of his product have been
Design, Human Factors tracked by his companies support department. His de-
velopment team came up with new ideas for innova-
Keywords tive product features. The production department has
published technical data on new methods of cost re-
user interface, semantic web, faceted browsing, rdf, linked
duction factoring. The product manager needs to de-
data
cide which changes will be made with the next product
release.
1. INTRODUCTION
As more and more large Linked Data sources such as DBpe- • A financial analyst notices an irregularity in a dataset.
dia [5] have become available over the last year, the need for He seeks for an explanation. The company works in a
improved user interaction approaches for dealing with these broad market and it might be a competitor’s news as
highly interlinked RDF [21] datasets has become more ob- well as a companies news. The company is active in
vious. The quantity of data has reached a level where users foreign markets and depends on certain suppliers. Po-
are overwhelmed and feel lost in a similar way as in earlier litical news as well as changes in the suppliers market
days of the web. That phenomenon was described as ’Lost might also have influenced the financial data.
in Hyperspace’ [14].
Both examples highlight the difficulty of having a computer
system ’solve’ these problems by providing a single answer.
These tasks require gathering together structured and un-
structured data sources. In this paper, we focus on ex-
ploratory tasks using only structured data.
In the current document-centric web, a key mechanism of
web browsers to support information exploration is the brows-
ing history. Users can follow different links to other docu-
ments and, depending on the document’s relevance, decide than interfaces tailored to a specific domain. While there
to further explore or go back and follow a different path. are certain visual representations for particular data types,
The possibility of returning to a previous decision and fol- e.g. calendars and timelines for time-related data and maps
lowing a different path makes decisions less risky. for geographical data, there are no such representations for
Browsing histories and bookmarking functionality help users most other types of complex resources, e.g. people, films,
to store discovered information in the system instead of hav- companies etc. Hence we treat those specific representations
ing to memorize it for later reuse. as additional features to a more generic user interaction ap-
proach and do not include them in our analysis.
The Semantic Web with its access to more structured infor-
mation makes data aggregation techniques more important With growing availability of Linked Data [9] on the Web,
in exploratory tasks. Structured data can more easily be we argue that there is a need for generic approaches which
aggregated, supporting better overview of large amounts of do not restrict users to specific tasks or to specific data that
data. could be used, as both might not be known a priori.
In information retrieval tasks on web documents, a goal is
to find a specific document or - in terms of graph-theory - 2.1 Browsing
finding a particular node in the graph of linked documents.
There are various different semantic web browsers for Linked
While the path to that node or the edges connecting it are
Data available, which use a similar interface design to Tim
important for localizing the node, the node’s meaning is
Berners-Lee’s Tabulator [10]. A browser interface is typi-
mostly independent of which path was used to retrieve it.
cally designed to display one RDF resource at a time, and
enables the user to browse from resource to resource. Tab-
ulator’s original design uses a tree to show the relationship
That is different in Semantic Web tasks. The path used to
between resources. Other Linked Data browsers including
locate a node or the relationship between nodes is more im-
Disco [11] or the Zitgist DataViewer [15] use a design of
portant. The meaning of just retrieving the node of a person
one resource per page. The tree-view as well as the one-
named ’Ian’ is different than retrieving it as a link from a
resource-per-page design enables the user to browse detailed
person named ’Georgi’ by following a foaf:knows link. The
descriptions of single RDF resources, while the tree-view ad-
later represents ’Ian as a friend of Georgi’. With this exam-
ditionally preserves the browsing path.
ple we want to highlight how and why exploratory tasks are
different and why the user’s exploration process and path is
more meaningful that in the web of documents.
This design reflects a common user interaction on the web
of documents. There, the smallest piece of information is a
web page, and a web browser enables users to navigate from
A user will want a semantic web interface to follow a sim-
one page to another. This browsing approach has been ap-
ilar flexible approach as browsing backwards and forwards
plied to semantic web linked data, where the smallest piece
in exploratory tasks. An interface supporting that approach
of information is a RDF resource. We argue that this kind
would support user behaviour and exploration strategies, in
of interaction does not facilitate the key benefits of Seman-
which the user has a perceived low risk of making early de-
tic Web data. Much of the currently available Linked Data
cisions because he can go back and modify them later.
is either scraped from web pages, such as DBpedia, or also
available as human readable webpage as well like Geonames
[1] or Musicbrainz [3]. We do not see much benefit to users
We outline a simple example for casual end-users: a user
from viewing RDF data rendered by a Linked Data browser
might want to explore a dataset of films in order to find an
when a HTML representation of the same information is
interesting one for a movie night. He does not know upfront
available, which is specifically tailored to fit the need of
which films are available and has no specific preference of
human readers. Linked data browser will need to provide
the genre. Over the past, he has seen many movies and do
higher benefit by giving more control to the user than web
have some preferences for particular directors and actors,
pages could do.
but is willing to explore new ones.
However, this might change as more and more RDF data
The essence of this task is viewing the actual available data,
becomes available from databases, where no corresponding
which will influence his exploration strategy and might change
web pages are available. Semantic Web technologies could
his interests as described above.
serve as methods to create web documents, but the key in
browsing and analysing this data lies in our opinion in aggre-
gation of data, like the currently available Web2.0 mashups.
2. OVERVIEW OF EXISTING
APPROACHES
In this section, we review three different kinds of seman- Single resource at a time RDF browsers enable users to ex-
tic web user interfaces according to their ability to support plore large RDF datasets, but their design does not support
users in information exploration tasks. We identify the key the user in aggregation tasks. The browser design might
principles of each approach, and will then further discuss enable aggregation tasks when a kind of basket is available
how the advantages can be combined into one user interface to temporally store discovered resources, such as Piggybank
design. We are interested in generic user interfaces rather [18]. Piggybank focuses on collecting resources, while we
try to find ways of exploring these collections. Also, in their 2.3 Query Builder
current design, single resource at a time browsers give no Graph based query builders try to give the user the most ex-
suggestions of which links to follow because of their lack of pressivity in querying an RDF graph. An common approach
support in creating an overview for the user. is to provide a triple based user interface which helps to build
triple pattern to express queries. One example of such an
interface is the DBpedia Query Builder [6]. It supports the
For example, given a user’s FOAF [13] profile, browsers do user in building up triple pattern by using auto-completion
not give hints for the user which links to friends might be for triple predicates. For example if the users starts with
interesting, when users naturally want to deal with collec- the triple pattern ?film rdf:type film , auto-completion
tions of resources. A user strategy (see section 1) for ex- for next triple pattern will only show predicates which are
ploring such a list of friends with a Linked Data browser is used with actual instances in the repository of type film,
to browse to each friend, memorize details that might be of ordered by their usage number. This supports the user in
interest and repeat that procedure for other friends. This exploration tasks as he sees which relationships exist be-
strategy is very time consuming and impractical for larger tween instances. But a user has to have an explicit question
lists of friends. Here, faceted browsing or faceted filtering of in mind which he wants to formulate into a query. While
the friendlist might serve as a helpful overview method. the interface helps to analyse the relationships, it does not
show actual data while constructing a query. In addition,
users do not always have explicit queries upfront, but need
2.2 Faceted Filtering to explore the available data first in order to find out what
The method of faceted filtering [26] or clustering helps the information might be interesting to them.
user to get an overview of a given set of resources. The basic
concept of facets is to partition the information space using
orthogonal conceptual dimensions of the data. A widely Using actually available data for query construction is a con-
used customer application with a faceted filtering approach cept known in database environments as Query by Example
is iTunes, Apple’s music library and media player [20]. (QbE) [27]. Users can create queries by entering example
iTunes’ music library can be filtered according to different data and conditions, and the system then translates these
dimensions of the music files’ metadata. Users can filter the examples into a more formal query according to the actual
library on certain artists, albums or genres by selecting val- data schema. The Microsoft Access query builder uses an
ues from facets. interface which refers to the QbE concept in order to enable
casual users to construct database queries.
Exhibit [19] is a popular interface which provides a faceted 3. OVERVIEW OF OUR APPROACH
filtering view on graph based data. Exhibit’s user interface We present an interface design in which we combined the
presents different facets for a given set of resources. The key three previously discussed interaction approaches: Brows-
user interaction is faceted filtering, i.e. a list of resources can ing, Faceted Filtering and Query Building. We integrated
be filtered by selecting values within facets. E.g. films might the following three characteristics in our exploratory inter-
be filtered by selecting ’Steven Spielberg’ in a ’Director’ facet face: multi-resource at a time, browsing and implicit query
of a list of films. This action will filter down the list of films building. We argue that the multi-resource interface design
to those, whose director is Steven Spielberg. Exhibit uses a is most helpful for exploration tasks as it supports overview
declarative language to create a faceted view on a list of re- techniques and data aggregation as shown in faceted browsers.
sources. Other faceted browser like Longwell [2] or MSpace Therefore we also integrate a faceted filtering mechanism
[23] calculate facets automatically. similar to Exhibit. Our interface approach does not relate
specificly to RDF. It is applicable to any graph-based data
structure. For our experiments and demonstration we use
Faceted Filtering supports exploration tasks as it enables a highly interlinked dataset extracted from DBpedia (see
the user to quickly get an overview of what data is present. figure 1). This dataset contains data related to the movie
Compared to the previously described design of one resource domain and consists of films, directors, actors, cities, and
at a time, faceted browsers use a multi resource at a time companies. We modified the dataset in order to provide a
design. Even larger sets of resources can easily be clustered rich linking structure with n:n relations between instances,
or filtered by certain dimensions in order to quickly see what as well as circular relationships. Films have many actors,
data is available. For example, a large list of people might actors play in different films of different directors. Directors
be clustered by their birthplace or by the organization they get awards for films and work for companies which produce
work for. The user can then filter the set of people down to films.
a smaller set according to these facets.
4. INTERFACE DESIGN
We now describe the main elements of our interface (see
Current implementations work with a fixed list of resources
figure 2). The interface is presented as a fixed workspace of
on which faceted filtering is performed. With growing data-
three different elements: result list, facet list and history.
sets, the number of potential facet values also grows. A set
of thousands of films might have a facet with hundreds of
different actors. Hierarchical facets are one solution to fur- 4.1 Result List
ther cluster facet values. The core of our interface is the result list. The result list
presents a collection of RDF resources as unstructured list
prefix :
prefix p:
:Catch_Me_if_You_Can p:has_director :Steven_Spielberg .
:Catch_Me_if_You_Can p:starring :Tom_Hanks .
:Catch_Me_if_You_Can p:starring :Leonardo_DiCaprio .
:The_Departed p:has_director :Martin_Scorsese .
:The_Departed p:starring :Jack_Nicholson .
:The_Departed p:starring :Leonardo_DiCaprio .
:Martin_Scorsese p:birthPlace :Queens .
:Steven_Spielberg p:birthPlace :Cincinnati .
:Leonardo_DiCaprio p:birthPlace :Los_Angeles .
:The_Departed p:distributor :DreamWorks .
:DreamWorks p:keyPeople :Steven_Spielberg .
Figure 1: dataset sample
of items without exposing their relations. The result list is
used in two ways: for displaying an external RDF document
and for displaying query results from within the browser. Figure 2: Humboldt Browser
When an external document is loaded, all resources in this
document get rendered into a plain list independently of
their relationship. A tag cloud on top of the list can be used A list of films could be related to a facet of people. Users
to filter the resources by their rdf:type. This can be used want to be able to restrict these relations to certain prop-
to render the output of other applications, e.g. semantic erties, e.g. to people who directed or produced a film and
search interfaces. These applications can provide the URI exclude people who acted in a film. This operation could be
of a RDF document which contains a list of resources to be understood as a drill down into the result-facet relationship.
displayed and browsed with Humboldt.
As in Exhibit, we designed facets for filtering the result list.
The user can select a value in a facet to filter down the result
4.2 Facets list to all resources that are related to the selected resource.
The main point of interaction in our interface design is a
This will cause all other faceted to be recomputed based on
list of facets, presented on the right hand side of the re-
the now filtered list of results.
sult list. Facets are computed from all resources that are
related to resources in the result list and grouped by their
rdf:type. This method differs from other faceted browsers,
For example, having a list of films loaded into the result list,
where facets are usually constructed from a predicate that
the user can select Steven Spielberg from the Director facet.
connects two resources. By using rdf:type instead we want
This will filter the list of films to those, which are related
to test the hypothesis that introducing an additional level of
to Steven Spielberg by any predicate. The user could later
abstraction reduces the number of facets. This new level of
drill down into the Director facet to select only directed by
abstraction should help the user to get an overview over the
and awarded for as relations between these films and Steven
available data more quickly. During this exploration phase,
Spielberg.
the user does not need to see all details of existing relations.
4.3 Pivot Operation
For example, a list of people (results) in a movie related In order to combine a faceted layout with browsing capabil-
dataset can be connected to a list films (facets) with many ity, we use the Pivot operation, which is similar to Siderean
different predicates like directed, stars in, has produced, Seamark [24]. Based on the operations in data drilling [16],
has written, has edited, and was awarded for. If the our approach allows multiple representations of data accord-
dataset also contains cities that are related to the people, ing to different dimensions. In our faceted interface design,
facets like born in, lives in, died in, etc would appear. a facet or dimension of the dataset is treated as a first class
By grouping by type, the only facets are film and city. The object. Pivoting on a facet means to handle the values of
user can get an overview of the available data and later drill this facet as results and rebuild a faceted view around these
down by splitting one facet into several ones based on par- results (see figure 3). In this way, we added a browsing func-
ticular predicates or sub-classes. He thereby restricts the tionality by which the whole dataset can be explored. This
relationships between results and values in a certain facet. interaction has been called refocusing in a nested faceted
browsing prototype by David Huynh [17].
source, a per user-session in-memory triple store.
A controller provides methods to compute all result list
items and all facets on a given command queue and data
source at any given time. The command queue thereby rep-
resents the implicit construction of a query via different piv-
oting steps without having to store intermediate result sets.
On every event causing a page refresh, the result list is com-
puted by parsing the whole command queue and executing
SPARQL queries on the data source, which is populated via
HTTP GETs on RDF documents.
Figure 3: Pivoting example
In the current prototype we do not have a mechanism in
Furthermore, we apply selected filters to the whole dataset place to dynamically fetch further RDF data by deferencing
and all pivoting steps. When a resource list is filtered by cer- URIs. This could be archived by using an approach similar
tain facet values and the user then pivots on another facet, to the Semantic Web Client Library [12]. Such a mechanism
the interface uses the result of the filtering for building the will be included in a future version of the prototype. The
next view. This enables the user to build up a query while primary objective of the current phase of work was to eval-
browsing actual data and instantly seeing the result of each uate the effectiveness of the user interaction model.
incremental step of building up his query. Thereby, we sup-
port the implicit construction of queries.
In our prototype there are four different commands which
can be enqueued: SelectCommand, FilterCommand, Piv-
E.g. a user who filters films on certain directors and then otCommand, and StepCommand. The SelectCommand in-
pivots on actors will see all actors in the result list, who are cludes or excludes resources from the resultlist or filters the
related to any film in the previous result list. resultlist by type. The FilterCommand restricts results by
values in a certain facet. The PivotCommand tracks the
4.4 History user’s pivot operations and the StepCommand tracks steps
In order to support the user in building an implicit query in the history.
over a number of connected list (of facets), we suggest track-
ing the user’s path through the graph. We use a separate
History control, which displays the path the user has taken Through the following chapters, we will use the following
via pivoting facets from the starting point to the current example set of interactions:
page. We chose to display the path as a linear sequence
in the history control, which enables to step back to pre-
• filter a list of film by director Steven Spielberg
viously visited pages. While this represents the implicitly
built query, filters that are applied on a graph node affect • pivot on facet ’Actors’
other nodes as well.
• pivot on facet ’Films’
When a user pivots from films to actors, filters these ac- • filter on director Martin Scorsese
tors by their birthplace, and then steps back to films, the
filtered actors themselves affects the list of films presented.
A prototype of our user interface and an example dataset is
This enables drilling-down into facets. As described previ-
available at http://humboldt-project.org.
ously, facets are partitions of data according to different di-
mensions (see section 2.2). Actors are one dimension of film
data, but actors themselves have different other dimension,
e.g. their places of birth. Starting with film, users can drill
into their actors and their places of birth, and afterwards
zoom out to films again using the history control.
5. IMPLEMENTATION
We have implemented a prototype of our interface as server-
side web application in C#. The core of the implementa-
tion is a command queue which tracks all user interactions.
This implementation design follows the Command Pattern
[25]. This programming pattern is often used to implement
undo/redo functionality in graphical user interfaces. We
used this data structure to track user interactions in order
to represent the history internally in a flexible and extend-
able way. Each command queue is associated with one data Figure 4: Humboldt’s architecture
6. USER FEEDBACK Every user was able to articulate a correct description of
In this section we describe a preliminary evaluation our user what every single operation would cause. They correctly
interface approach. We used the previously mentioned film- described operation 3 as such, that it will show a list of all
related dataset extracted from DBpedia for an evaluation films in which one or more of the currently shown actors
of our interface prototype. A small group of semantic web has starred in. However, some users showed surprise when
researchers were given the task to explore the dataset by actually executing operation 3 and viewing the result. Some
setting different filters and pivoting between facets. They users remembered that operation 1 filtered the list of films
could filter films by particular directors or actor, pivot on a to Steven Spielberg related ones, and were thereby surprised
list of actors to filter them by birthplace or by other films to see more values in the director facet. Other users had
they stared in, etc. forgotten about operation 1, but showed the same surprise
when they were reminded by us about that filtering opera-
tion. Even with an understanding of the interface operation
We will discuss the issues our evaluation group had with our and the structure of the dataset, only a few users were able
prototype. We elected not to use a formal quantitative eval- to correctly predict the outcome of the described sequence
uation process, as our interface design is in an early stage of operations.
of development and we were interesting in getting qualita-
tive feedback from users while using the interface instead of
quantitatively measuring results. We remind readers that the dataset contains a n:n relation
between films and actors, and therefore the result set of the
sequence of all three operations is a superset of the result
6.1 Filtering set of operation 1 (above).
While using Humboldt, users easily understood the faceted
filtering operation. They were able to predict and under-
stand the result of selecting resources in facets and appreci- 6.3 History
ated the newly introduced level of abstraction of type-based Our users quickly understood that each pivoting operation
facets. However, they missed the feature of narrowing a type adds a field to the history control shown in the top right cor-
based facet to certain predicated describing the relationship ner of the screen. We asked them to note that the current
between e.g. a list of films and a list of directors. This fea- representation of the history as a one-dimension sequence is
ture (see section 4.2) was not implemented in the prototype just a test and that this layout can not represent all pivot
shown during our user study. operation. Although we could have chosen to represent the
history as a graph in order to give it the more expressivity,
we decided to just test how users would react to the possi-
As described in section 4.2, we introduced a higher level bility of navigating previous pivoting steps.
of abstraction using facets that are based on the type of
resources that are directly related to current resources in
the result list. The users’ understanding upfront was mixed. It was not
clear to them whether using the history would take them to
the same resultset as it was at that time, or whether the
6.2 Pivot later added filtering operation would be incorporated. It
When using the pivoting operation for the first time, our was unclear if the history preserves the state or navigates
users expressed some confusion. The pivoting operation one large query.
causes every control on the screen to update. The result list
is populated with new items and new facets appear. Even if 6.4 Review of user feedback
the user was familiar with the concept of pivoting, they still
In this section we discuss our interface design and the results
showed that reaction. We showed a different prototype to
of our user feedback. We were satisfied overall with the in-
each user, in which we used an animation in order to make
troduction of our pivoting operation. This approach has
the pivoting operation easier to understand. In this anima-
shown a way to enable users to more easily explore an inter-
tion, when clicking on ’pivot’ the items in the resultlist and
linked RDF dataset at a more aggregated level compared to
all facets except the pivoted one slowly disappear, the piv-
single-resource at-a-time browsing. Using animations helped
oted facet then ’flies’ into the result list and finally the new
to understand the pivoting operation and we will incorpo-
facets appear. The users appreciated this animation and
rate these animations into our main prototype.
noted that it makes the pivot operation itself much easier to
understand.
The history and its navigation functionality complemented
the pivoting operation successfully. However we came to the
We tried to test users’ understanding of multiple pivoting
conclusion that the way the history is currently presented is
operations by asking them to express their expectations. We
not optimal. Our initial concern was that having a linear
tested that by using the following sequence of actions:
history could be a major disadvantage and changing its lay-
out to a graph based design could serve the users needs much
1. filter films by director Steven Spielberg better. We are now considering that having a separate his-
tory control in the user interface might not work at all.
2. pivot on actors
3. pivot on films Our users did not only demand a history functionality which
represents the pivoting path, but also presents previously We additionally assume that the user might have set addi-
performed filtering operations. In combination with our con- tional filters on actors and on their films.
cern about the linear layout of the history, we conclude that If the user now discovers that his first decision of filtering
the history’s function needs to be better incorporated into only on Steven Spielberg films was incorrect, he could easily
the overall interface design instead of being a separate and go back and change that filter (maybe by adding another
for the user somehow disconnected user control. director) while retaining all other operations.
Hence, he can use a strategy of exploring a dataset and
building up a query graph by navigating the dataset first
Our current interface design is layout as a fixed workspace and afterwards fine-tune the result set by adding, changing
(result list facets), where data is put into (supported by or removing filters. Enabling this interaction strategy in the
our animations). For the next experiments we will try a user interface for graph-based data is the key contribution
design of a moving workspace on top of a two-dimensional of this paper.
dataset layout. While previous discussions such as Schraefel
and Karger [22] have highlighted issues of graph visualisa-
tion tools, we argue that using a faceted graph visualisation We conclude that this strategy is especially helpful and use-
layout instead of an instance based layout could produce ful for exploratory tasks and thereby our interface design
better results. successfully addressed our outlined hypothesis.
7. EVALUATION 8. CONCLUSION
We now discuss how well our user interaction approach serves In this paper, we presented a new approach for browsing
the in section 1 described needs of users performing ex- Linked Data that combines the benefits of being able to ex-
ploratory tasks on interlinked RDF data. As outlined in plore throughout a dataset offered by single resource browsers
section 1, exploratory tasks are performed with greater user such as Tabulator with the ability to manage groups of re-
satisfaction if the following characteristics are present: sources offered by faceted browsers such as Exhibit or Long-
well. We have reviewed current most common user inter-
• the interface design is consistent with the user’s be- face designs for RDF data and identified their key principles
haviour and expectations and benefits in order to developed a user interaction design
based on these principles which is tailored to data explo-
• the user does not need to memorize details of his explo- ration strategies.
ration, because the application supports him by track-
ing his interactions
9. FUTURE WORK
• the risk of following different paths is low In the future we would like to try optimizing the workspace
layout in order integrate the history functionality in a more
consistent way. We want to evaluate that layout in a more
Our user study showed that with minor tweaks, the overall
quantitatively measured usability study. We will also add
behaviour of our interface is predictable for the user. We ar-
the previously described facet splitting functionality and in-
gue that some of the issues described in the previous section,
tegrate the Semantic Web Client Library for dereferencing
e.g. the issue with n:n relations of films and people, might
URIs at runtime.
be omitted with some user training. These issues arose out
of the richness of our underlying dataset, which will be com-
mon in many RDF datasets and users might get a better 10. REFERENCES
understanding over time. [1] Geonames. http://www.geonames.org/.
[2] Longwell. http://simile.mit.edu/wiki/Longwell.
[3] Musicbrainz. http://musicbrainz.org/.
By implicitly tracking the users interaction and providing [4] Powerset. http://www.powerset.com.
a mechanism to review previous navigation steps, we also
[5] S. Auer, C. Bizer, G. Kobilarov, J. Lehmann,
satisfy the second need. Even if users were able to memo-
R. Cyganiak, and Z. Ives. Dbpedia: A nucleus for a
rize all their previous steps, they appreciate if they are not
web of open data.
expected to do so. We argue that with larger datasets users
[6] S. Auer and J. Lehmann. What have innsbruck and
were not even able to memorize their actions, and our his-
leipzig in common? extracting semantics from wiki
tory mechanism is thereby helpful as well as necessary.
content. In ESWC, pages 503–517, 2007.
Most importantly, we successfully lowered the user’s per-
ceived risk of bad decisions by disconnecting the building of [7] M. Q. W. Baldonado and T. Winograd. Sensemaker:
a query path and fine-tuning the filtering operations. We an information-exploration interface supporting the
want to highlight this argument with an example. Given contextual evolution of a user’s interests. In CHI ’97:
the previously used sequence of interaction Proceedings of the SIGCHI conference on Human
factors in computing systems, pages 11–18, New York,
NY, USA, 1997. ACM.
• filter a list of films by one particular director (Steven [8] M. J. Bates. The design of browsing and berrypicking
Spielberg) techniques for the online search interface. Online
Review, v13 n5 p407, 1989.
• pivot to all actors of these films
[9] T. Berners-Lee. Linked data, 2006.
• pivot to their films http://www.w3.org/DesignIssues/LinkedData.html.
[10] T. Berners-Lee, Y. Chen, L. Chilton, D. Connolly,
R. Dhanaraj, J. Hollenbach, A. Lerer, and D. Sheets.
Tabulator: Exploring and analyzing linked data on the
semantic web. In Proceedings of the 3rd International
Semantic Web User Interaction Workshop, 2006.,
2006.
[11] C. Bizer and T. Gauss. Disco - hyperdata browser,
2007.
http://www4.wiwiss.fu-berlin.de/bizer/ng4j/disco/.
[12] C. Bizer, T. Gauss, and R. Cyganiak. Semantic web
client library. http://www4.wiwiss.fu-
berlin.de/bizer/ng4j/semwebclient/.
[13] D. Brickley and L. Miller. Foaf: the Šfriend of a
friendŠ vocabulary, 2004. http://xmlns.com/foaf/0.1/.
[14] D. M. Edwards and L. Hardman. Lost in hyperspace:
cognitive mapping and navigation in a hypertext
environment. In Hypertext: theory into practice, pages
90–105, Exeter, UK, UK, 1999. Intellect Books.
[15] F. Giasson. Zitgist dataviewer.
http://dataviewer.zitgist.com/.
[16] J. Gray, A. Bosworth, A. Layman, and H. Pirahesh.
Data cube: A relational aggregation operator
generalizing group-by, cross-tab, and sub-totals. IEEE
12th International Conference on Data Engineering,
pages 152–159, 1996.
[17] D. Huynh. Nested faceted browsing.
http://people.csail.mit.edu/dfhuynh/projects/nfb/.
[18] D. Huynh, S. Mazzocchi, and D. Karger. Piggy bank:
Experience the semantic web inside your web browser.
Web Semant., 5(1):16–27, 2007.
[19] D. F. Huynh, D. R. Karger, and R. C. Miller. Exhibit:
lightweight structured data publishing. In WWW ’07:
Proceedings of the 16th international conference on
World Wide Web, pages 737–746, New York, NY,
USA, 2007. ACM.
[20] A. Inc. itunes. http://www.apple.com/itunes/.
[21] F. Manola and E. Miller. Rdf primer, Februar 2004.
[22] m.c. schraefel and D. Karger. The pathetic fallacy of
rdf. In International Workshop on the Semantic Web
and User Interaction (SWUI) 2006,
[”lib/utils:month_12911” not defined] 2006.
[23] m.c. schraefel, M. Wilson, A. Russell, and D. A.
Smith. mspace: improving information access to
multimedia domains with multimodal exploratory
search. Commun. ACM, 49(4):47–49, 2006.
[24] Siderean. Seamark navigator, 2007.
http://www.siderean.com.
[25] Wikipedia. Command pattern.
http://en.wikipedia.org/wiki/Command Pattern.
[26] K.-P. Yee, K. Swearingen, K. Li, and M. Hearst.
Faceted metadata for image search and browsing. In
CHI ’03: Proceedings of the SIGCHI conference on
Human factors in computing systems, pages 401–408,
New York, NY, USA, 2003. ACM.
[27] M. M. Zloof. Query-by-example: a data base
language. IBM Systems Journal, 1977.