<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>April</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Application of the Linked Data Visualization Model on Real World Data from the Czech LOD Cloud</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>OVM Agendas</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Linked Data, Visualization, Semantic Web</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Consolida ted Law</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Court decisions</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Figure 1: Czech Linked Open Data (CzLOD) Cloud</institution>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Geocoordi nates</institution>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Governmental Business-entities Geographical Statistical</institution>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>Jakub Klímek Czech Technical University in Prague Faculty of Information Technology</institution>
        </aff>
        <aff id="aff6">
          <label>6</label>
          <institution>Jirˇí Helmich Charles University in Prague Faculty of Mathematics and Physics</institution>
        </aff>
        <aff id="aff7">
          <label>7</label>
          <institution>Martin Necˇ aský Charles University in Prague Faculty of Mathematics and Physics</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2014</year>
      </pub-date>
      <volume>8</volume>
      <issue>2014</issue>
      <abstract>
        <p>In the recent years the Linked Open Data phenomenon has gained a substantial traction. This has lead to a vast amount of data being available on the Web in what is known as the LOD cloud. While the potential of this linked data space is huge, it fails to reach the non-expert users so far. At the same time there is even larger amount of data that is so far not open yet, often because its owners are not convinced of its usefulness. In this paper we re ne our Linked Data Visualization Model (LDVM) and show its application via its implementation Payola. On a real-world scenario built on real-world Linked Open Data created from Czech open data sources we show how end-user friendly visualizations can be easily achieved. Our rst goal is to show that using Payola, existing Linked Open Data can be easily mashed up and visualized using an extensible library of analyzers, transformers and visualizers. Our second goal is to give potential publishers of (Linked) Open Data a proof that simply by publishing their data in a right way can bring them powerful visualizations at virtually no additional cost.</p>
      </abstract>
      <kwd-group>
        <kwd>Research projects</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Categories and Subject Descriptors</title>
      <p>H.5.2 [User interfaces]: GUIs, Interaction styles; H.3.5
[Online Information Services]: Data sharing; H.3.5 [Online
Information Services]: Web-based services</p>
    </sec>
    <sec id="sec-2">
      <title>1. INTRODUCTION</title>
      <p>Recently, vast amount of data represented in a form of
Linked Open Data (LOD) has appeared on the Web. The
key LOD principle is linking data entities in a machine
interpretable way so that they form a huge data space
distributed across the Web: the LOD cloud. The LOD cloud
is not interesting for end-users until there are useful tools
available built on top of it. Very important are tools which
are able to present various kinds of LOD to users who want
to explore the data. This includes LOD browsers and
visualization tools. An end-user often does not know more about
datasets than that there are some data structures contained
which could be visualized. For example, entities in a dataset
can be geocoded. In that case, the user may require to start
the exploration on a visualization of those entities on a map.
Or the dataset may contain a hierarchical structure and
various techniques for hierarchy visualizations can be used.</p>
      <p>RUIAN
Czech
Public
Contracts</p>
      <p>CPV 2008</p>
      <p>LAU
regions</p>
      <p>NUTS
codes
TED
Public
Contracts</p>
      <p>Demogra
phy</p>
      <p>Budgets
COI.CZ</p>
      <p>Elections
results
Exchange
rates</p>
      <p>The reader may argue that this does not depend on the
fact that we work with LOD, which is true. Any
nonLOD dataset can also be explored this way. However, LOD
brings an important new dimension (besides the uniform
data model { RDF) to the problem of data presentation,
especially when we talk about visualization. Suppose that
we have a dataset with addresses, we geocoded them and
display them on a map. Suppose now that this is a LOD
dataset and that we have other LOD datasets without GPS
coordinates, but linked to the geocoded entities of the
former one. We can build a tool which displays any of these
linked datasets on a map as well. This, of course, applies
only when the links make sense in terms of location, but the
point is that compared to non-LOD datasets, it is easy to
create and use links in LOD.</p>
      <p>
        In our previous work, we presented the Linked Data
Visualisation Model (LDVM) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. It enables us to combine
various LOD datasets and visualize them with various kinds
of visualizations. It separates the part which prepares the
data for visualizations from the actual visualizers.
Visualizers then specify their expected input data structure (e.g.,
hierarchical with labels, geo-coordinates, etc.) in RDF using
broadly accepted vocabularies. This allows reuse of
visualizers for a broad range of LOD datasets. We focus on two
groups of users and their cooperation. First are expert users
who can easily prepare analyses and visualization
components and the second group are lay users. They can use and
combine components prepared for them by the experts
using a LDVM implementation without extensive knowledge of
RDF and SPARQL. An example of such a use by a lay user
can be visualizing a given analysis using various visualizers
or running the same analysis on various data sources.
      </p>
      <p>Contributions. The primary purpose of this paper is
to demonstrate the bene ts that LDVM brings to users on
real-world data. We show that our implementation Payola
allows any expert user to easily access his SPARQL endpoint
of choice or upload his RDF le, perform analyses using
SPARQL queries and visualize the results using a library
of visualizers. At the same time, the components created
by experts can be also easily used and reused by lay users.
We present several visualization scenarios of real-world data
from the Czech LOD (CzLOD) cloud. The Czech LOD cloud
contains various datasets we publish in our research group.
We describe these datasets brie y in this paper as well. Each
scenario takes several datasets from the CzLOD cloud,
combines them together, extracts a data structure which can be
visualized and o ers appropriate visualizers to the user.</p>
      <p>Outline. The rest of this article is organized as follows:
In Section 2 we survey the CzLOD cloud and describe the
currently available datasets. In Section 3 we present the
Linked Data Visualization Model (LDVM), a simple yet
powerful formalism for building analyses, transformations
and visualizations of Linked Data. In Section 4 we brie y
describe Payola, our implementation of LDVM. In Section 5
we present our real world examples of analyzers,
transformers and visualizers on our running LDVM instance and put
our contributions in a perspective of publishing data of
public administration bodies. In Section 6 we brie y survey
related work and nally, in Section 7 we conclude.</p>
    </sec>
    <sec id="sec-3">
      <title>CZECH LOD CLOUD</title>
      <p>In this section, we introduce a survey of the Czech Linked
Open Data (CzLOD) cloud. As the data itself is not the
main contribution of this paper, we will not go into too
much detail. We are working on the cloud continuously in
the OpenData.cz initiative since 2012 to show the owners of
the data, who are mainly public bodies, what bene ts can
be gained from proper publication. The cloud is accessible
at http://linked.opendata.cz/sparql and runs on
OpenLink Virtuoso 7 Open-Source triplestore 1 and currently
contains approximately 100 million triples not counting the
largest dataset, RUIAN, which is described later. Figure 1
contains a map of the CzLOD cloud similar to the global
LOD cloud. It is also color coded, red are the datasets about
Czech business entities, green are the geographical datasets,
yellow are the governmental datasets and blue are the
statistical datasets. In addition, the CzLOD cloud also includes
various e-Health datasets, which are, however, beyond the</p>
      <sec id="sec-3-1">
        <title>1https://github.com/openlink/virtuoso-opensource</title>
        <p>scope of this paper.</p>
        <p>Business entities datasets. In the heart of the CzLOD
cloud is the ARES dataset. It is data from various Czech
registries and mainly the Business registry. In the Czech
Republic, every business entity has its unique 8-digit identi
cation number. Based on this number, it is easy to devise a rule
for unique business entity URI creation. For example, the
Czech Ministry of Interior is identi ed by http://linked.
opendata.cz/resource/business-entity/CZ00007064. For
each business entity in the Business registry, the dataset
contains its o cial name, type, address of headquarters, kinds
of its activities, names of shareholders, etc.</p>
        <p>Another dataset about business entities is COI.CZ, which
contains data about inspections, resulting nes, and bans
issued by the Czech Trade Inspection Agency. Each inspection
record contains the business entity identi cation number,
location, region (NUTS code and LAU code) and information
about the resulting ne or ban. Again, this data links easily
to our other datasets about business entities via the URL
based on the identi cation number. The source data is
published as 3 star open data2 (CSV les) by the agency3.</p>
        <p>Our Research projects dataset contains information about
research grants funded by various Czech grant agencies. For
each project there is data about amounts of money paid to
each of the participants for each year of the project as well
as identi cation numbers of all participants and additional
information about the projects. The source data can be
exported as Excel les from a web portal maintained by the
Research, Development and Innovation Council 4.
Geographical datasets. Our newest and biggest
geographical dataset is RUIAN - register of territorial
identi cation, addresses and real estates. It has more than 600
million triples and contains data about all address points,
streets, parts of towns, towns, cities, various levels of regions
and also about every building and every lot in the Czech
Republic including the hierarchy. Each object in RUIAN has
assigned geocoordinates, which can be transformed to GPS
coordinates. This creates a powerful base for geocoding in
the Czech Republic. RUIAN is also linked to NUTS and
LAU codes, which are 5-level European codes for towns,
regions etc. The source data is in XML and freely accessible5.
Other geographical datasets contain the already mentioned
NUTS and LAU codes hierarchies. Additionally, the
(Geocoordinates) dataset contains geocoordinates for each address
found in our datasets created by geocoding.</p>
        <p>Governmental datasets. Currently, there are three kinds
of governmental datasets in the CzLOD cloud. The rst
kind contains information about institutions of public power
(OVM ), e.g., ministries, cities, but even public notaries, etc.
For each institution, that is also a business entity, there is
its identi cation number, address, type and also information
about its o ces and their opening hours. In addition, there
is a dataset with agendas of these institutions, that are also
linked to laws according to which they are established. This
data gives a good base for, e.g., mobile applications that
give the user his location and opening hours of the nearest
notary, etc. The second kind of our governmental datasets</p>
      </sec>
      <sec id="sec-3-2">
        <title>2http://5stardata.info/</title>
      </sec>
      <sec id="sec-3-3">
        <title>3http://www.coi.cz/cz/spotrebitel/</title>
        <p>open-data-databaze-kontrol-sankci-a-zakazu/</p>
      </sec>
      <sec id="sec-3-4">
        <title>4http://www.isvav.cz</title>
      </sec>
      <sec id="sec-3-5">
        <title>5http://vdp.cuzk.cz/</title>
        <p>Analyzer
Visualization
Transformer
Visualizer</p>
        <p>Visual Mapping Transformation</p>
        <sec id="sec-3-5-1">
          <title>Source RDF and non-RDF</title>
        </sec>
        <sec id="sec-3-5-2">
          <title>Data</title>
          <p>Data Transformation</p>
        </sec>
        <sec id="sec-3-5-3">
          <title>Analytical RDF</title>
        </sec>
        <sec id="sec-3-5-4">
          <title>Abstraction</title>
          <p>Visualization Transformation</p>
        </sec>
        <sec id="sec-3-5-5">
          <title>Visualization RDF</title>
        </sec>
        <sec id="sec-3-5-6">
          <title>Abstraction</title>
        </sec>
        <sec id="sec-3-5-7">
          <title>View</title>
          <p>Analytical
SPARQL
Operators
Visualization
Operators</p>
          <p>View
Operators
are law datasets. The main part consists of consolidated laws
of the Czech Republic. The other part consists of decisions
of Czech Supreme court linked to laws. In addition, there
are datasets with information about public contracts.
Statistical datasets. Our statistical datasets include data
about demography and budgets of cities linked to the cloud
via NUTS and LAU codes. We also have exchange rates of
all currencies to Euro from the European Central Bank.
Finally, there are results of elections to the Czech parliament.</p>
          <p>
            LINKED DATA VISUALIZATION MODEL
In this section we brie y go through the Linked Data
Visualization Model (LDVM), which we de ned in our previous
work [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ]. First, we give an overview of the model and then
we formalize its key elements.
3.1
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Overview of LDVM</title>
      <p>
        The Linked Data Visualization Model (LDVM) is an
adaptation of the general Data State Reference Model (DSRM) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]
for the speci cs of the visualization of RDF and Linked Data.
It is an abstract data process inspired by a typical
Knowledge Discovery Process [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. We extend DSRM with three
additional concepts { analyzers, transformers and
visualizers. They denote reusable software components that can
be chained to form an LDVM instance. Figure 2 shows an
overview of the LDVM. The names of the stages,
transformations and operators proposed by DSRM have been slightly
adapted to the context of RDF and Linked Data. LDVM
resembles a pipeline starting with raw source data (not
necessarily RDF) and results with a visualization of the source
data. It is organized into 4 stages that source data needs to
pass through:
1. Source RDF and non-RDF data: raw data that can
be RDF or adhering to other data models and formats
(e.g. XML, CSV) as well as semi-structured or even
non-structured data (e.g. HTML pages or raw text).
2. Analytical abstraction: extraction and representation
of relevant data in RDF obtained from source data.
3. Visualization abstraction: preparation of an RDF data
structure required by a particular visualization
technique (e.g., 1D, 2D, 3D or multi-dimensional data, tree
data, etc.)
4. View: creation of a visualization for the end user.
      </p>
      <p>Data is propagated through the LDVM pipeline by
applying 3 types of transformation operators:
1. Data transformation: transforms the raw data
represented in a source data model or format into a
representation in the RDF data model; the result forms the
base for creating the analytical RDF abstraction.
2. Visualization transformation: transforms the obtained
analytical abstraction into a visualization abstraction.
3. Visual mapping transformation: maps the
visualization abstraction data structure to a concrete visual
structure on the screen using a particular visualization
technique speci ed using a set of parameters.</p>
      <p>There are operators within the stages that allow for
instage data transformations:
1. Analytical SPARQL operators: transform the output
of the data transformation to the nal analytical
abstraction (e.g. aggregations, enrichment from LOD).
2. Visualization operators: further re ne the
visualization abstraction data structure (e.g., its condensation
if it is too large for a clear visualization).
3. View operators: allow a user to interact with the view
(e.g., rotate, scale, zoom, etc.).
3.2</p>
    </sec>
    <sec id="sec-5">
      <title>LDVM stages</title>
      <p>Source RDF and non-RDF Data Stage. The rst stage
considers RDF as well as non-RDF data sources. The data
transformation transforms the source data to an RDF
representation that forms a base for creating an analytical
abstraction. If the source RDF data does not have a suitable
structure for the following analysis, the transformation can
be a sequence of one or more SPARQL queries that map the
source data to the required structure.</p>
      <p>Analytical RDF Abstraction Stage. The output of the
second stage (analytical RDF abstraction) is produced by
applying a sequence of various analytical SPARQL operators
on the RDF output produced by the data transformation.
We call the sequence an analyzer (see Figure 2). Our goal is
to enable users to reuse existing analyzers for analyzing
various datasets. We want to enable users to nd analyzers that
can be applied for analyzing a given data set and, vice versa,
to nd datasets that may be analyzed by a given analyzer
automatically. Therefore, it is necessary to be able to decide
whether an analyzer can be applied on a given dataset, i.e.
whether the analyzer is compatible with the dataset. We
formalize the notion of compatibility later in Section 3.3.</p>
      <p>Visualization Abstraction Stage. We want to ensure that
visualizers are reusable for di erent analytical abstractions.
However, building speci c visualizers for particular
analytical abstractions would not enable such reuse. This is because
each visualization tool visualizes particular generic
characteristics captured by analytical abstractions. For example,
there can be a visualizer of tree structures using the TreeMap
technique or another visualizer of the same structures using
the SunBurst technique. And, another visualizer may
visuDBpedia</p>
      <p>EU Public
Contracts
Class Hierarchy</p>
      <p>Analyzer</p>
      <p>Property
Hierarchy Analyzer</p>
      <p>Public Spending</p>
      <p>Analyzer
ClassProp-2-SKOS
Vis. Transformer</p>
      <p>Place-2-SKOS Vis.</p>
      <p>Transformer</p>
      <p>PCO-2-GeoLocQB
Vis. Transformer
Sunburst
Visualizer</p>
      <p>TreeMap
Visualizer</p>
      <p>Columns on
GMaps Visualizer
alize 2-dimensional structures on Google Maps. An
analytical abstraction may contain encoded both the tree structure
as well as the 2-dimensional structure. All three mentioned
visualizers can be applied to the analytical abstraction as
well as on any other abstraction which contains the same
structures encoded. Therefore, we need to transform the
analytical abstraction into the form accepted by the desired
visualizers. An example can be that we have a visualizer for
a tree-like structure which accepts the SKOS6 vocabulary
with its skos:broader property for the hierarchy. The
analytical abstraction might already contain this hierarchy, then
no visualization transformation is required. Or, the
analytical abstraction might contain a tree-like hierarchy modeled
using rdfs:subClassOf property and it needs to be
transformed here. This transformation is performed by the
visualization abstraction stage. We call the transformation a
visualization transformer. Again, a user can reuse various
transformers for extracting visualization abstractions of the
desired kind from compatible analytical abstractions.</p>
      <p>View Stage. The output of the (view ) stage is produced
by a component called a visualizer. A view is a visual
representation of a visualization abstraction on the screen. A
visualizer performs visual mapping transformation that may
be con gured by a user using various parameters, e.g.
visualization technique, colors and shapes. The user can also
manipulate the nal view using the view in-stage operators
such as zoom and move. A visualizer can be reused for
visualizing various visualization abstractions that contain data
structures accepted by the visualizer.
3.3</p>
    </sec>
    <sec id="sec-6">
      <title>Formalization</title>
      <p>
        The core concepts of LDVM are reusable components, i.e.
analyzers, visualization transformers and visualizers.
Analyzers and visualization transformers consume RDF data
via their input interfaces and produce RDF data as their
output. Visualizers consume RDF data and produce a
visualization a user can interact with. The goal is to formally
introduce the concept of compatibility of a component with
6http://www.w3.org/TR/skos-reference/
its input RDF data. We formalized the model in our
previous work [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. However, since then as the implementation
progressed, we have simpli ed the formalization for it to be
more practical and with no e ect on its power. Given the
formalization, we are then able to decide whether a given
analyzer can be applied on a given RDF dataset. Similarly,
we can decide whether a visualization transformer can be
applied on a given analytical abstraction, etc. Our approach is
based on the idea to describe the expected input of a LDVM
component with an input signature and the expected
output with an output data sample. The signature and the
data sample are provided by the creator of the component.
Each component can then check whether its input signature
is compatible with the output sample of the previous
component. The input signature comprises a set of SPARQL
ASK queries which should be inexpensive so that they can
be evaluated quickly on a number of datasets. The output
data sample is a small RDF data sample that shows the
format of the output of the component. The input signature
of one component is then compatible with the output data
sample of another component when all the SPARQL ASK
queries of the signature are evaluated on the data sample as
true. Our rationale is to provide a simple and lightweight
solution, which allows to check the compatibility of a number
of components without complex reasoning.
      </p>
      <p>Definition 1 (Input signature). A set of SPARQL
ASK queries SC = fQ1; Q2; : : : ; Qng is an input signature
of a LDVM component C.</p>
      <p>Note that an analyzer can potentially extract data from
multiple data sources, e.g., SPARQL endpoints or RDF les.
Then the analyzer would have to have a separate input
signature for each data source. However, for simplicity, we omit
this slight extension. Analyzers and visualization
transformers provide an output data sample, against which an input
signature of another LDVM component can be checked.</p>
      <p>Definition 2 (Output data sample). RDF data DC
representing the maximum possible structure of the output
data format produced by a LDVM component C using
minimum amount of triples is an output data sample of C. This
only applies to analyzers and visualization transformers.</p>
      <p>Definition 3 (Compatibility). We say that LDVM
component C with input signature SC is compatible with
LDVM component D with output data sample DD i each
Qi 2 SC = fQ1; Q2; : : : ; Qng returns true when executed on
DD, i.e., Qn</p>
      <p>i=1 E(Qi; DD) = 1 where E(Qi; DD) 2 f0; 1g is
the evaluation of SPARQL ASK query Qi against data DD.</p>
      <p>Given the output data samples are small and the SPARQL
ASK queries are inexpensive, we can, for a given SPARQL
endpoint, automatically o er all possible visualizations
using available LDVM components to our lay users. The
process for checking of available visualizations using LDVM
starts with the analyzers. Each analyzer performs SPARQL
ASK queries from its input signature. If it is compatible (all
ASKs return true), it is marked as available. Next are the
visualization transformers. Because they are optional and
also can be chained, they need to perform their checks in
iterations. In the rst iteration, all transformers perform their
ASKs from their input signatures on the output data
samples of available analyzers. Those who succeed are marked
available. In the next iteration, all transformers that are not
available perform their ASKs on the output data samples of
the newly available transformers. This ends when there is
no new available transformer. Finally, all visualizers
perform their ASKs on all available analyzers and visualization
transformers. The result is a set of all possible combinations
of what can be visualized in the given SPARQL endpoint.
See Figure 3 for illustration.</p>
    </sec>
    <sec id="sec-7">
      <title>4. IMPLEMENTATION: PAYOLA</title>
      <p>
        Payola7 is a web framework for analyzing and
visualizing Linked Data [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. It enables users to build their own
instances of LDVM pipelines. Payola provides an LDVM
analyzer editor in which SPARQL queries and custom
plugins can be combined. Firstly, the user de nes a set of data
sources such as SPARQL endpoints or RDF les as input
data and then connects other plugins to them. Some of the
plugins are designed to provide simple SPARQL constructs.
Join and Union plugins enable users to analyze a dataset
created from multiple datasets stored in separate SPARQL
endpoints. It is also possible to transform results of an
analyzer with a custom transformer. When the pipeline is
evaluated, the user can choose a visualizer to see the results
in various forms. Throughout the LDVM pipeline all data
is RDF and the user can download the results in a form of
an RDF le.
      </p>
      <p>Payola also o ers collaborative features. A user is able to
create an analyzer and share it with the rest of the Payola
users. That enables them to run such an analyzer as well
as to create a new analytical plugin, which is based on that
analyzer. As analytical plugins have parameters that a ect
their behavior, a new analyzer{based plugin may also have
parameters, which can be chosen from the parameters of the
plugins of the original analyzer. This feature supports
formation of an ecosystem where expert users create analyzers
for those who are less experienced. Combining those
analyzers into new ones enables even inexperienced users to create
a complex analyzer with less e ort.</p>
      <p>It is possible to extend Payola with custom plugins for
analysis and visualization. For instance, a user is allowed to
upload a code snippet of a new analytical plugin via our web
interface. The framework compiles the code and integrates
the created plugin immediately into the application.</p>
      <p>
        Let us brie y describe some of the latest Payola features.
Based on the previous user evaluation presented in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], we
focused our work on improving the user experience. We
introduced changes to make it even easier for non{expert
users to browse LDVM pipeline results without extensive
knowledge of LOD principles or Payola itself.
      </p>
      <p>The latest Payola version o ers a one{click solution for
presenting results of an LDVM pipeline in a chosen
visualizer. When an LDVM pipeline is created, it is assigned
a unique URL. When a user accesses such a URL, Payola
automatically loads the pipeline and creates the desired
visualization (see Section 5.3.2). To speed things up, we also
implemented caching of analyzer results so that we can serve
more users in a shorter time without repeated analysis
evaluation. This brings us very close to what we see as a nal
stage of delivering a visualization to a non{expert user {
embedding an LDVM visualization based on an LDVM pipeline
into an external website. That is a part of our future work.</p>
      <sec id="sec-7-1">
        <title>7http://payola.cz</title>
        <p>5.</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>DEMONSTRATION OF LDVM</title>
      <p>In this section, we present our real world example of
implementation and usage of LDVM. We present various
analyzers, visualization transformers and visualizers, which are
actual LDVM components with input signatures and
output data samples. The examples run in Payola, our LDVM
implementation (see Section 4).
5.1</p>
    </sec>
    <sec id="sec-9">
      <title>Analyzers</title>
      <p>In this section, we describe two analyzers that create
analytical abstractions from the CzLOD cloud, their input
signatures and output data samples. An analyzer is a software
component that produces RDF data and for a given data
source (or possibly more data sources) can say whether it
can extract data from this data source or not. It can, for
instance, represent a complex computation over simple data
or it can simply be a SPARQL query, which is a case of our
two examples. Note that when an analyzer is in a form of
a SPARQL CONSTRUCT query, its input signature
corresponds to its WHERE clause and its output data sample is
an instance of its CONSTRUCT clause.
5.1.1</p>
      <p>A1: Institutions of public power</p>
      <p>The rst analyzer A1 takes data from 2 datasets:
Institutions of public power (OVM) and Geocoordinates. From the
OVM dataset, it extracts the institutions with their types
and addresses8. The types of the institutions are expected
to be skos:Concepts and the labels of the types are expected
to be skos:prefLabels. From the Geocoordinates dataset,
the analyzer extracts geocoordinates gained by geocoding
the OVM addresses. The input signature of the analyzer
consists of one ASK query SA1 = fQA1 g :
# Q of A1
[] s: name [] ;
ovm : typSubjektu ? type ;
s: address ? address .
? address s: streetAddress [];
s: postalCode [];
s: addressRegion [];
s: addressLocality [];
s: geo ?g.
? type skos : prefLabel [] .
?g s: longitude [];</p>
      <p>s: latitude [] .</p>
      <p>And an example of its output data sample DA1 is:
# D of A1
&lt;ovm &gt; s: geo &lt;geo &gt; ;
s: title " title ";
s: description " desc " ;
ovm : typSubjektu &lt;type &gt;.
&lt;type &gt; skos : prefLabel " Type " .
&lt;geo &gt; s: latitude " 50.088289 ";</p>
      <p>s: longitude " 14.404446 ".</p>
      <p>Our SPARQL endpoint contains this kind of data and
therefore QA1 returns true, which means that A1 is
compatible with our SPARQL endpoint.
5.1.2</p>
      <p>A2: Inspections of COI.CZ</p>
      <p>The second analyzer A2 takes data from 5 datasets:
Inspections of the Czech Trade Inspection Agency (COI.CZ),
ARES (Business Registry), NUTS codes hierarchy, LAU
codes hierarchy and also Geocoordinates. From COI.CZ it
extracts information about inspections which resulted into
8s is a pre x for http://schema.org/
sanctions. Speci cally, it extracts their dates, places,
resulting nes, links to business entities inspected and links to
LAU regions in which the inspection took place. From LAU
regions, the analyzer takes names of the regions and links
to broader NUTS codes. From NUTS codes, the analyzer
takes names of the regions and their hierarchy.</p>
      <p>From ARES, the analyzer extracts names of inspected
business entities. Finally, from Geocoordinates, it extracts
the geocoordinates of addresses found in COI.CZ. The
input signature of this analyzer consists of 2 SPARQL ASK
queries SA2 = fQ1A2 ; Q2A2 g :
# Q1 of A2
[] a s: CheckAction ;
s: location /s: location ? region ;
s: location /s: geo ? geo ;
s: object ? object ;
dcterms : date ? date ;
s: result ? result .
? result a coicz : Sanction ;</p>
      <p>s: result / gr : hasCurrencyValue [] .
? object gr : legalName [] .
? region a ec : LAURegion ;</p>
      <p>ec : level 2 .
? geo s: latitude [];</p>
      <p>s: longitude [].</p>
      <p>FILTER ( datatype (? date ) = xsd : date )</p>
      <p>Q1A2 checks for the inspections s:CheckAction, its region
(LAU), geocoordinates, business entity, date and ne. The
ne has to have an amount, the business entity has to have
a legal name, the region must be LAU level 2. Q2A2 checks
the LAU and NUTS datasets whether there is the region
hierarchy present and whether the regions have their names.
# Q2 of A2
[] a s: CheckAction ;
s: location /s: location ? region ;
s: result /s: result [] .
? region a ec : LAURegion ;
ec : level 2 ;
dcterms : title [] ;
ec : hasParentRegion ? lau1 .
? lau1 dcterms : title [] ;</p>
      <p>ec : hasParentRegion ? nuts3 .
? nuts3 rdfs : label [] ;</p>
      <p>ec : hasParentRegion ? nuts2 .
? nuts2 rdfs : label [] ;</p>
      <p>ec : hasParentRegion ? nuts1 .
? nuts1 rdfs : label [].</p>
      <p>This data is present in our SPARQL endpoint so both the
queries return true. Therefore, A2 is compatible with our
data source. An example of the output data sample DA2 is:
# D of A2
&lt;ca &gt; a s: CheckAction ;
s: location &lt;region &gt; ;
s: geo &lt;geo &gt;;
s: title " title ";
s: description " description ";
dcterms : date " 2014 -02 -16 " ^^ xsd : date ;
rdf : value 2 .
&lt;geo &gt; s: latitude " 50.088289 ";</p>
      <p>s: longitude " 14.404446 ".
&lt;region &gt; a ec : LAURegion ;
ec : level 2 ;
rdfs : label " label " ;
ec : hasParentRegion &lt;lau1 &gt;.
&lt;lau1 &gt; rdfs : label " label " ;</p>
      <p>ec : hasParentRegion &lt;nuts3 &gt; .
&lt;nuts3 &gt; rdfs : label " label " ;</p>
      <p>ec : hasParentRegion &lt;nuts2 &gt; .
&lt;nuts2 &gt; rdfs : label " label " ;</p>
      <p>ec : hasParentRegion &lt;nuts1 &gt; .
&lt;nuts1 &gt; rdfs : label " label ".</p>
      <p>In this section we describe a sample visualization
transformer. It can be used to connect output RDF data from our
analyzers or any other compatible analyzers to the inputs of
our visualizers. A visualization transformer can be any
software component that transforms data between di erent
formats or performs aggregations for better visualization. Note
that because we use RDF, the visualization transformers are
in fact SPARQL CONSTRUCT queries. Again, their input
signatures correspond to their FROM clauses and their
output data samples correspond to their CONSTRUCT clauses.
5.2.1 T1: Region hierarchy to SKOS hierarchy</p>
      <p>Because we have various tree structure visualizers that
accept tree structures using skos:Concepts for nodes with
skos:prefLabel for labels and skos:broader properties for
edges and also accept optional rdf:value for the size of a
leaf, we need to transform the hierarchy extracted in
analyzer A2 (see Section 5.1.2) accordingly. The region
hierarchy, that is in the output data sample of analyzer A2
consists of ec:LAURegions for regions, s:CheckAction for the
inspections made by COI.CZ. In addition, the inspections
have their sanction amounts in rdf:value, which we want
to visualize as sizes of their corresponding leaves in the
resulting tree visualization. Therefore, the input signature of
T1 consists of one SPARQL ASK query ST1 = fQT1 g which
corresponds to the output data sample of A2:
# Q of T1
[] a s: CheckAction ;
s: location ? region ;
s: title [] ;
rdf : value [] .
? region a ec : LAURegion ;
ec : level 2 ;
rdfs : label [] ;
ec : hasParentRegion ? lau1 .
? lau1 rdfs : label [] ;</p>
      <p>ec : hasParentRegion ? nuts3 .
? nuts3 rdfs : label [] ;</p>
      <p>ec : hasParentRegion ? nuts2 .
? nuts2 rdfs : label [] ;</p>
      <p>ec : hasParentRegion ? nuts1 .
? nuts1 rdfs : label [] .</p>
      <p>An example of its output data sample DT1 will correspond
to the input signature of visualizer V1 (see Section 5.3.2):
# D of T1
&lt;ca &gt; a skos : Concept ;
skos : prefLabel " title ";
rdf : value 100 ;
skos : broader &lt;region &gt; .
&lt;region &gt; a skos : Concept ;
skos : prefLabel " label " ;
skos : broader &lt;lau1 &gt; .
&lt;lau1 &gt; a skos : Concept ;
skos : prefLabel " label " ;
skos : broader &lt;nuts3 &gt;.
&lt;nuts3 &gt; a skos : Concept ;
skos : prefLabel " label " ;
skos : broader &lt;nuts2 &gt;.
&lt;nuts2 &gt; a skos : Concept ;
skos : prefLabel " label " ;
skos : broader &lt;nuts1 &gt;.
&lt;nuts1 &gt; a skos : Concept ;</p>
      <p>skos : prefLabel " label ".</p>
    </sec>
    <sec id="sec-10">
      <title>5.3 Visualizers</title>
      <p>In this section, we present sample visualizers which
visualize the results of the aforementioned analyzers. Moreover,
they demonstrate how visualizers bene t from the concept of
input signatures and compatibility checks. For each of
visualizers, we will describe its input signature. Since a product
of a visualizer is not a dataset, but a visualization, there is
no speci cation of an output data sample. The
compatibility check is, once again, a SPARQL ASK query which is,
in the case of a visualizer, executed against an output data
sample of the last transformer in a given LDVM pipeline.</p>
      <p>Since one of the main reasons why the LDVM was
proposed is to facilitate the process of LOD exploration, we
have chosen to utilize some well-known visualization
techniques to present a dataset in a form, which is
understandable by non-expert users. We have experimented with two
commonly used visualization techniques: a tree hierarchy
visualization and a map visualization. One of the goals of
our experiments was to show that it is possible to integrate
well-known visualization libraries into an application, which
works with RDF and is based on the LDVM.
5.3.1</p>
      <p>V1: Tree hierarchy visualizers</p>
      <p>Tree hierarchy visualization is a commonly used
visualization technique. The results of the analyzers A1 and A2
(followed by the transformer T1) contain hierarchical data
structures which can be visualized in this way. As described
before, we chose the SKOS vocabulary as a format for tree
visualizations and therefore we present QV1 as the input
signature for a tree hierarchy visualizer V1:
# Q of V1
[] a skos : Concept ;
skos : prefLabel [] ;
rdf : value [] ;
skos : broader ?b .
?b a skos : Concept ;</p>
      <p>skos : prefLabel [] .</p>
      <p>Query QV1 enforces that the visualized dataset contains a
leaf node with a value speci ed, as well as a reference to its
parent. To traverse the hierarchy, we use the skos:broader
property, which stands for the has parent relationship. It is
now easy to check compatibility of QV1 with DA1 and DT1 .</p>
      <p>We chose to implement 4 di erent tree hierarchy
visualizers. To demonstrate the exibility of a LDVM-compliant
framework, we decided to use a freely available
visualization techniques based on a well-known and commonly used
document manipulation library D3.js9. Speci cally, we
introduce the following visualizers: Zoomable Treemap,
Sunburst, Zoomable Sunburst, and Layout Packing. The library
provides a module which produces adjacency diagrams or a
hierarchical layout using recursive circle-packing for a given
tree structure. It is not a hard task to build the expected
tree structure of JavaScript objects based on the data that
conforms the described input signature. Among others, we
use Apache Jena10 to serialize the results of an analyzer or
a transformer into RDF/JSON 11. The serialization is
transferred to a user's web browser, processed by a visualizer
and passed to the visualization library, which computes the
visualization itself. Note that LDVM does not specify
implementation details, we could use JSON-LD, RDF/XML,
Turtle or any other serialization format, moreover an
arbitrary non-RDF format.</p>
      <p>We present some visualizations based on aforementioned
analyzers in Figure 4 (live demos 12 13 14 ).
5.3.2</p>
      <p>V2: Geo data visualizers</p>
      <p>Geo data visualization is another example of commonly
used visualization techniques. There are many Open Data
mashups that integrate map visualizations in order to
provide an eye{catching presentation of arbitrary datasets.</p>
      <p>The input signature of a map visualizer can be actually
very simple. We de ne QV2 to be the only query of an input
signature of the visualizer V2:
# Q of V2
?[] s: geo ?c ;</p>
      <p>s: description [] ;</p>
      <sec id="sec-10-1">
        <title>9http://d3js.org</title>
        <p>10https://jena.apache.org/
11https://dvcs.w3.org/hg/rdf/raw-file/default/rdf-json/index.html
12http://vis.payola.cz/coi-treemap
13http://vis.payola.cz/coi-z-sunburst
14http://vis.payola.cz/coi-packed</p>
        <p>Again, it is easy to see that such a query would return true
when matched against DA1 or DA2 . Hence, V2 is compatible
with A1 and A2.</p>
        <p>We integrated two di erent map visualization libraries in
order to provide three di erent visualizers. Two of them are
based on the Google Maps JavaScript API 15 and the third
one utilizes the ArcGIS API for JavaScript 16. The former
stands for a classic map visualization where a resource with
geo data is represented on a map by a single marker. The
other one generates a heatmap layer. The signature does not
contain any additional values. Each resource contributes to
the generated heatmap layer equally, while locally increasing
the intensity of the heatmap by one. The third visualizer
utilizes a clustering layer which automatically groups markers
that are close to each other. When zooming in, markers are
getting further apart. Therefore, the layer dissolves clusters
into smaller ones or reveals single markers.</p>
        <p>These map visualizers clearly motivate the notion of
input signatures. We have three di erent software
components doing the same task so it is very natural to unify their
input format. As in the case of hierarchies, we could
utilize other vocabularies with properties such as wgs84:lat or
geo:point, which, in fact, have the same semantic meaning.
That is also one of the reasons for the support of transformer
chaining in LDVM. We could have a LDVM pipeline, where
an analyzer outputs data in a proprietary geographic
coordinate system, followed by a transformer, which converts such
a system to WGS84 using the wgs84 ontology, followed by a
transformer, which converts wgs84 to s:geoCoordinates.</p>
        <p>To make the basic map visualizer more usable, we
extended it to provide a faceted browsing capability. Let us
use the results of analyzer A1 for demonstration. Consider
input signature QV2 and the output data sample DA1 . A
visualizer with this signature ignores other properties, such as
ovm:typSubjektu (type of institution). The types of
institutions are instances of skos:Concept, which is often used
as a type. This might suggest that the property links the
resource (institution) to its type. In our case, the institution
type can be a notary, a municipality, a ministry, etc. We
allow the users to use these properties as facets to customize
the visualization when exploring the dataset by letting them,
e.g., to change a color of a speci c type of institutions or even
hide them.</p>
        <p>To detect this type of properties, the following query is
executed against the visualized dataset:
select distinct ?p where
{
}
[] &lt; http :// schema . org / geo &gt; [];</p>
        <p>?p [] .</p>
        <p>It gives us a list of properties that might be used for
marker grouping. Since such a list contains also properties
de ned by the input signature, we need to involve property
blacklisting to exclude properties that would probably
create a group for each marker separately (titles, descriptions,
etc.).</p>
        <p>Examples of visualizations with faceted browsing can be
seen in Figures 517 and 618 (see footnotes for links to live
demos). When multiple properties are matched, we need to
solve some minor issues such as compute visibility based on
all lters or how to apply multiple color settings to a single
marker. In the case of the color, we let the user decide which
property is to be used to change colors of markers.
5.4</p>
      </sec>
    </sec>
    <sec id="sec-11">
      <title>Evaluation</title>
      <p>One of the main aims of LOD is to enable data reuse in
unforeseen ways by third parties. Speci cally, public bodies
representatives often state that they do not want to publish
raw data, because it does not have a nice visualization for
the public. And then they spend large amounts of money to
build custom portals with functionality that has been
implemented many times before that visualize their unpublished
data. They are hard to convince that publishing the data
itself, not to mention in some standardized format or even
as LOD, can bring them bene ts. This is because for the
general public the data is useless without interpretation in
a form of an application. What the public bodies do not
realize is that the development of those applications could
be left to third-parties. With these facts in mind we can
say that with Payola framework and LDVM as its formal
15https://developers.google.com/maps/documentation/javascript/
16https://developers.arcgis.com/javascript/
17http://vis.payola.cz/ovm-gmaps
18http://vis.payola.cz/coi-gmaps
background, we can show a library of analyzers,
transformers and visualizers that can be easily used and reused for
all Linked Data. In addition, we have concrete examples
as evidence of feasibility of this approach as shown in this
paper. We also showed that implementation of open source
visualizers such as those from D3js.org as plugins to Payola
can be done easily. The data from COI.CZ, that are
presented in this paper, are one of the COMSODE datasets.
The publisher of this data in COI.CZ now gets a free and
powerful visualization of its data and integration with other
datasets and all he had to do was to publish a CSV le.</p>
    </sec>
    <sec id="sec-12">
      <title>RELATED WORK</title>
      <p>
        More and more projects are focused on analyzing,
exploring and visualizing Linked Data. The most sophisticated
survey to date has been presented by Dadzie and Rowe [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
They concluded that most of the tools were not suitable to
be used by lay users and the situation has not signi cantly
improved since. One is still required to understand the
basics of the Semantic Web while using Linked Data browsers
such as Tabulator [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and Explorator [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The user is
expected to navigate a graph through tabular views displaying
property{value pairs of explored resources. However, they
do not o er features that would enable a user to overview a
whole dataset. At the time of writing of this paper,
Tabulator did not support current versions of web browsers and
therefore it was not possible to completely check up on its
progress. Compared to our one-click pipeline execution,
which enables an expert user to prepare a visualization and
share it with a non-expert one, we nd Explorator a rather
complicated tool. It is not very easy even for an expert
user to start using it. Another exploration tool is Freebase
Parallax 19, which o ers advanced visualizations like
timelines, maps and other rich snippets, but works with a xed
data source { Freebase. Semaplorer [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] is an exploration
mashup based on multiple large datasets. It demonstrates
the power of Linked Data while being focused on the tourism
data domain. It provides faceted browsing capabilities for 4
di erent types of facets (person, time, tag and location).
      </p>
      <p>
        There are several tools that visualize data based on
vocabularies in a similar way that our new visualizers do.
Let us start with visualizers like map4rdf 20, LinkedGeoData
browser [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] and Facete21, which understands spatial data.
The rst two focus on geographical data visualizations, but
both are built on top of speci c datasets. That means that
compared to Payola, the user is not able to apply the
visualizer to his own dataset. map4rdf supports faceted discovery
of Spanish institutions and enables the user to add a speci c
overlay containing statistical SCOVO-based data in a form
of a timeline visualization. According to [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], the authors
are currently working on DataCube vocabulary support. Its
most interesting feature is ltering of values by choosing
an arbitrary region on a map. LinkedGeoData browser
enables its users to explore POIs all over the world. Facete
is a JavaScript SPARQL-based Faceted Search library. It
enables users to explore an arbitrary dataset stored on an
arbitrary SPARQL endpoint. Using facets a user is able to
narrow down the volume of the explored data. Facete o ers
a basic table view, but it also provides some more
sophis19http://parallax.freebaseapps.com
20http://oeg-dev.dia.fi.upm.es/map4rdf/
21http://cstadler.aksw.org/facete/
ticated visualization widgets. One of them is focused on
visualization of spatial data. Since Facete is an exploration
tool, it completely lacks features that would provide data
analysis or transformation like Payola does. It just enables
a user to explore and lter data from a chosen SPARQL
endpoint. Another group are vocabulary-based visualizers.
CubeViz [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] o ers the same circle packing layout
visualization as Payola does, but based on the DataCube vocabulary.
Payola also o ers an experimental version of a DataCube
visualizer, but is not limited to it. FoaF Explorer 22 is
focused on visualizing FOAF pro les. One can also mention
ViDaX [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], a Java desktop Linked Data visualizer based
on the Prefuse 23 visualization library. Based on ontologies
and property types, it suggest suitable visualizations to its
users. However, we did not nd a copy of the tool anywhere
so we were unable to experiment with it. Rhizomer [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]
offers various types of visualizations for di erent datasets. It
suggests a reasonable work ow for datasets exploration: 1)
Overview, 2) Filter, 3) Visualize. It includes the treemap,
timeline, map and chart visualizers. However, it focuses just
on the visualization stage of a LDVM pipeline.
      </p>
      <p>
        There are also tools which let the user to build a custom
analyzer as Payola does. The best known is Deri Pipes [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ],
which is a platform that enables a user to create mashups
and perform data transformations. However, it is focused
just on data analysis which means that there could be a
Payola analytical plugin which would use a pipeline produced
by Deri Pipes as another analyzer data source. Open Data
Mashup 24 provides a very similar functionality, but it also
o ers visualizations based on vocabularies, including map
visualizations. It is based on two types of widgets a user
is able to combine together. The rst one is a data source,
the second one is a visualizer. However, it distinguishes only
two dataset types - statistical and spatial and lacks
exibility since a visualizer receives data a widget which combines
a data source, an analyzer and a transformer.
      </p>
      <p>
        We have also seen some generic graph visualizers like
VisualRDF 25, which is a work in progress and is being
currently developed while utilizing the D3.js library. Tools
like IsaViz [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], Fen re [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] and RDF{Gravity26 use the
well-known node-link visualization technique to represent a
dataset. Payola also o ers generic graph visualizations, but
on top of that, it provides a way of customizing the
visualization based on ontologies and user-de ned vocabularies.
Using an extensible library of visualizers, Payola is able to
visualize an arbitrary dataset.
      </p>
      <p>IsaVis also belongs to a group of tools implementing
Fresnel - Display Vocabulary for RDF 27, which speci es how a
resource should be visually represented by Fresnel-compliant
tools like LENA 28 and Longwell 29. Those are also focused
only on the visualization stage of LDVM. Fresnel vocabulary
could be perceived as a LDVM visualization abstraction.</p>
      <p>We have already mentioned Facete, which is a SPARQL
based JavaScript library. There are also other similar
li22http://xml.mfd-consult.dk/foaf/explorer/
23http://prefuse.org/
24http://ogd.ifs.tuwien.ac.at/mashup/
25https://github.com/alangrafu/visualRDF
26http://semweb.salzburgresearch.at/apps/rdf-gravity/
27http://www.w3.org/2005/04/fresnel-info/
28https://code.google.com/p/lena/
29http://simile.mit.edu/issues/browse/LONGWELL
braries like Sgvizler 30 or Visualbox 31, which enables a user
to embed a dataset visualization into their website.
Unlike Facete, they require a user to have a deep knowledge
of SPARQL language, since that is the only possible way of
using those tools. Last but not least, we mention a
publishing framework Exhibit 32. It enables the user to create web
pages with advanced search and ltering features providing
visualizations like maps, timelines or charts. However, it
requires the input data to be in a form of JSON and
recommends using Babel 33 service to transform RDF and other
data formats into the desired JSON variant.</p>
    </sec>
    <sec id="sec-13">
      <title>CONCLUSIONS</title>
      <p>In this paper, we presented the Czech LOD cloud { a set of
interlinked LOD datasets we have published in our research
group and we used it for demonstration of the bene ts of
the Linked Data Visualization Model (LDVM) for LOD
visualization. We brie y recapitulated the basic principles of
LDVM, updated its formalization and shortly described our
own implementation of LDVM - Payola. Then we presented
several visualization scenarios of datasets from the Czech
LOD cloud. The scenarios demonstrated bene ts of LDVM
for users { how they can combine various LDVM components
to extract required data structures from their datasets (with
so called analyzers and visualization transformers ) and how
even lay users can easily reuse suitable visualizers to
visualize the extracted structures.</p>
    </sec>
    <sec id="sec-14">
      <title>ACKNOWLEDGMENTS</title>
      <p>This work was partially supported by a grant from the
European Union's 7th Framework Programme number 611358
provided for the project COMSODE and also partially by
the TACR grant no. TA02010182.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Araujo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Shwabe</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Barbosa</surname>
          </string-name>
          .
          <article-title>Experimenting with Explorator: a Direct Manipulation Generic RDF Browser and Querying Tool. In WS on Visual Interfaces to the Social and the Semantic Web (VISSW2009</article-title>
          ),
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>T.</given-names>
            <surname>Berners-Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chilton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Connolly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Dhanaraj</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hollenbach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lerer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Sheets</surname>
          </string-name>
          . Tabulator:
          <article-title>Exploring and analyzing linked data on the semantic web</article-title>
          .
          <source>In 3rd Int. Semantic Web User Interaction WS</source>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>J.</given-names>
            <surname>Brunetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Gil</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Garcia</surname>
          </string-name>
          .
          <article-title>Facets and Pivoting for Flexible and Usable Linked Data Exploration</article-title>
          .
          <source>In Interacting with Linked Data Workshop</source>
          , ILD'
          <fpage>12</fpage>
          ,
          <string-name>
            <surname>Crete</surname>
          </string-name>
          , Greece, May
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Brunetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Garc</surname>
          </string-name>
          <string-name>
            <surname>a</surname>
          </string-name>
          , J. Kl mek, and
          <string-name>
            <given-names>M.</given-names>
            <surname>Necasky</surname>
          </string-name>
          .
          <article-title>Formal Linked Data Visualization Model</article-title>
          .
          <source>In Proceedings of the 15th International Conference on Information Integration and Web-based Applications &amp; Services (IIWAS'13)</source>
          , pages
          <fpage>309</fpage>
          {
          <fpage>318</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>E. H.</given-names>
            <surname>Chi</surname>
          </string-name>
          .
          <article-title>A Taxonomy of Visualization Techniques Using the Data State Reference Model</article-title>
          .
          <source>In IEEE Symposium on Information Vizualization</source>
          <year>2000</year>
          , INFOVIS '00, Washington, DC, USA,
          <year>2000</year>
          . IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.-S.</given-names>
            <surname>Dadzie</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Rowe</surname>
          </string-name>
          .
          <article-title>Approaches to visualising Linked Data</article-title>
          .
          <source>Semantic Web</source>
          ,
          <volume>2</volume>
          (
          <issue>2</issue>
          ):
          <volume>89</volume>
          {
          <fpage>124</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>A. de Leon</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Wisniewki</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Villazon-Terrazas</surname>
            , and
            <given-names>O. Corcho.</given-names>
          </string-name>
          <article-title>Map4rdf - Faceted Browser for Geospatial Datasets</article-title>
          .
          <source>In Proceedings of the First Workshop on USING OPEN DATA. W3C</source>
          ,
          <year>June 2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>B.</given-names>
            <surname>Dumas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Broche</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Hoste</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Signer</surname>
          </string-name>
          . Vidax:
          <article-title>An interactive semantic data visualisation and exploration tool</article-title>
          .
          <source>In Proceedings of the International Working Conference on Advanced Visual Interfaces</source>
          ,
          <source>AVI '12</source>
          , pages
          <fpage>757</fpage>
          {
          <fpage>760</fpage>
          , New York, NY, USA,
          <year>2012</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>I.</given-names>
            <surname>Ermilov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Martin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          .
          <article-title>Linked open data statistics: Collection and exploitation</article-title>
          . In P. Klinov and D. Mouromtsev, editors,
          <source>Knowledge Engineering and the Semantic Web</source>
          , volume
          <volume>394</volume>
          of Communications in Computer and Information Science, pages
          <volume>242</volume>
          {
          <fpage>249</fpage>
          . Springer Berlin Heidelberg,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>U.</given-names>
            <surname>Fayyad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Piatetsky-Shapiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and P.</given-names>
            <surname>Smyth</surname>
          </string-name>
          .
          <article-title>From data mining to knowledge discovery in databases</article-title>
          .
          <source>AI magazine</source>
          ,
          <volume>17</volume>
          (
          <issue>3</issue>
          ):
          <fpage>37</fpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>T.</given-names>
            <surname>Hastrup</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cyganiak</surname>
          </string-name>
          , and
          <string-name>
            <given-names>U.</given-names>
            <surname>Bojars</surname>
          </string-name>
          .
          <article-title>Browsing Linked Data with Fen re. In Linked Data on the Web (LDOW2008) workshop</article-title>
          , in conjunction with WWW 2008 conference,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>J. Kl mek</surname>
          </string-name>
          , J. Helmich, and
          <string-name>
            <given-names>M.</given-names>
            <surname>Necasky</surname>
          </string-name>
          . Payola:
          <article-title>Collaborative Linked Data Analysis and Visualization Framework</article-title>
          .
          <source>In 10th Extended Semantic Web Conference (ESWC</source>
          <year>2013</year>
          ), pages
          <fpage>147</fpage>
          {
          <fpage>151</fpage>
          . Springer,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>D.</given-names>
            <surname>Le-Phuoc</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Polleres</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hauswirth</surname>
          </string-name>
          , G. Tummarello, and
          <string-name>
            <given-names>C.</given-names>
            <surname>Morbidoni</surname>
          </string-name>
          .
          <article-title>Rapid prototyping of semantic mash-ups through semantic web pipes</article-title>
          .
          <source>In Proceedings of the 18th international conference on World wide web, WWW '09</source>
          , pages
          <fpage>581</fpage>
          {
          <fpage>590</fpage>
          , New York, NY, USA,
          <year>2009</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>E. Pietriga.</surname>
          </string-name>
          <article-title>IsaViz: a Visual Environment for Browsing and Authoring RDF Models</article-title>
          .
          <source>In WWW 2002, the 11th World Wide Web Conference</source>
          , Honolulu, Hawaii, USA,
          <year>2002</year>
          . World Wide Web Consortium.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>S.</given-names>
            <surname>Schenk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Saatho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Staab</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <surname>A. Scherp.</surname>
          </string-name>
          <article-title>SemaPlorer|interactive semantic exploration of data and media based on a federated cloud infrastructure</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          ,
          <volume>7</volume>
          (
          <issue>4</issue>
          ):
          <volume>298</volume>
          {
          <fpage>304</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>C.</given-names>
            <surname>Stadler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          <article-title>Ho ner, and</article-title>
          <string-name>
            <surname>S. Auer.</surname>
          </string-name>
          <article-title>LinkedGeoData: A Core for a Web of Spatial Open Data</article-title>
          .
          <source>Semantic Web Journal</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>