<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>April</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>On the Design and Usage of voiD, the “Vocabulary Of Interlinked Datasets”</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Keith Alexander Talis Ltd.</string-name>
          <email>keith.alexander@talis.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Jun Zhao Department of Zoology, University of Oxford</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Michael Hausenblas DERI, National University of Ireland</institution>
          ,
          <addr-line>Galway</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Richard Cyganiak DERI, National University of Ireland</institution>
          ,
          <addr-line>Galway</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2009</year>
      </pub-date>
      <volume>20</volume>
      <issue>2009</issue>
      <abstract>
        <p>In this paper we discuss the design and implementation of voiD, the \Vocabulary Of Interlinked Datasets", a vocabulary that allows to formally describe linked RDF datasets. We report on use cases for voiD, the current state of the speci cation and its potential applications in the context of linked datasets.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>
        With the growth of the number of linked datasets [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ],
automating certain tasks, such as discovery, selection and
optimisation, becomes more and more important. Now, one
might argue that URIs and RDF [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] are all one needs to
explore the linked datasets; follow-your-nose1, however bears
some inherent problems. The possible links that can be
followed from a starting URI raises both performance and trust
issues. The main reason for these issues lies in the
granularity level of the available descriptions. Additionally, the
dynamics of the data-sources [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] also has an impact on the
performance of, say, a crawl over a collection of datasets;
and the reliability of secondary data resources [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ].
      </p>
      <p>
        In the early days of linked data (2006 and 2007) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] the
main focus of the community was on publishing data and
nding good practices. Now, in the second phase, other
issues such as usability, quality, performance, reliability of
the infrastructure and the data in the linked data ecosystem
are increasingly recognized to be important.
      </p>
      <p>How can we overcome the limitations of follow-your-nose
while retaining the self-descriptive momentum and being
able to exploit available tools, methodologies, etc.? A
simple yet e ective approach is to decrease granularity. Rather
than talking about single resources, we talk about something
which up to now only existed in drawings, such as in the
LOD cloud2, which graphically represents the landscape of
2.</p>
      <p>
        In the following we will describe our motivation use cases
for voiD. In the context of linked data, we basically di
erentiate between:
on the one hand linked data publisher (a person or
organisation exposing structured data as RDF on the
Web and interlink it with other datasets), and
linked data consumer on the other hand; these might
be machines, for example using a semantic indexer or
a query engine or, as well, humans, e.g., when using a
Web of Data browser such as Tabulator [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>It is worth noting that the following use cases are not
necessarily restricted to the linked data domain.
2.1</p>
    </sec>
    <sec id="sec-2">
      <title>Efficient Discovery of Datasets</title>
      <p>2.1.1</p>
      <sec id="sec-2-1">
        <title>Dataset Publisher</title>
        <p>A dataset publisher might not be identical with the party
who created the raw datasets, but one who publishes them
onto the Web in a more accessible format. A dataset
publisher wants to be able to publish metadata about the dataset
such that:</p>
        <p>The dataset can be found and aggregated by search
engine applications, or discovered in relevant searches;
The metadata provides clear licensing information so
that consumers can know how they can use the data
and to whom they should attribute credits for
creating/publishing the dataset;
That consumers can obtain information about access
interfaces, such as APIs and SPARQL endpoints.</p>
        <p>It is in the best interest of a dataset publisher to provide
potential users of the data with information that supports
them in accessing and using the dataset.
2.1.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Search Engine Provider</title>
        <p>A search engine provider wants to discover detailed
descriptions about datasets e ciently. A crawler has
stumbled upon an individual RDF document on the Web. How
will it discover metadata that applies to the entire dataset
and cannot be repeated in every single document? The
simple approach of just putting the voiD description online and
linking it from somewhere on the site does not meet our
needs, as it could take the crawler a long time to nd the
description. It is important that the voiD description is
discoverable as soon as the crawler nds the rst RDF
document, so that the crawler can use voiD metadata to guide
its processing of the data.</p>
        <p>
          The Sindice search engine [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ] already uses Semantic
Sitemaps [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] to enable discovery and e cient processing of
datasets. It seems natural to address the situations above by
building on Semantic Sitemaps.
2.2
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Expressing Research Data</title>
      <p>A developer working together with biologists wants to help
domain experts to nd research data published by their peer
colleagues. These are often produced for a particular
experiment, for a particular study or publication, or hosted by
a particular public database. Scientists might know whose
datasets they would prefer to access because they often have
a clear idea about their content or they trust more on that
data provider. When looking for new datasets, they might
search for datasets that provide relevant content (such as
information about genes, proteins, or micro-array gene
expression), that are produced in a right experiment environment,
or that provide additional information that will complement
their local experiment results.</p>
      <p>To nd the right dataset and to make this dataset
accessible for biologists in a user-facing application, the developer
often has to go through the following process:</p>
      <p>Locate a dataset that contains information relevant
to biologists' research interests, such as information
about a speci c organism; or more speci cally,
genomic information about a particular organism;
Find out how this dataset can be programmatically
accessed, as an RDF dump, through SPARQL endpoint
or any other protocol;
Find out the licence associated with the dataset,
making sure that data are accessible under open-access
licence or certain attribution;
Understand the content of the dataset in order to
perform an alignment with other datasets. Information
about URIs used in the dataset can help one with the
data identity alignment, schema(s) used in the dataset
for data schema alignment, and its links with other
datasets for assisting data integration.
2.3</p>
      <p>A consumer may have discovered several datasets, for
example as a result of an indexer query. The question then
arises how to select appropriate datasets from this list of
potential candidates. The consumer, either a human or a
query federation engine, might wish to de ne
\appropriateness" along the following criteria:
1. The content of the dataset, that is, what the dataset
mainly is about. Based on some kind of categorisation
scheme a selection could take place;
2. The interlinking to other datasets, that is, to which
other datasets and how the dataset is interlinked;
3. Vocabularies used in the dataset.</p>
      <p>The criteria listed above can be understood in terms of
quality and quantity. For example, one might be interested
only in datasets containing foaf:interest links to a certain
other dataset. Or, where the number of links are of
interest, one may specify that only datasets with more than one
million links should be taken into account.
2.4</p>
    </sec>
    <sec id="sec-4">
      <title>Query Optimisation</title>
      <p>With many datasets now on the Web both connectable
(through shared vocabulary terms) and connecting (by
linking to resources in other datasets), it is naturally desirable
to query across multiple datasets at once with SPARQL.</p>
      <p>
        Optimisation of SPARQL queries can be achieved in a
static way. A set of logical rules [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] can be applied to a
query engine, to calculate all equivalent query plans for a
given query and then choose the most optimised query plan
to be executed. To optimise SPARQL queries dynamically,
i.e., deciding the best execution approach during the
execution phase [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], one can use the statistical information
about datasets, such as how much information is provided
about a particular entity or property. This information can
be used by the query mediator to optimise query plans by,
for example, modifying the order in which a query pattern is
executed according to the estimated size of data results [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
3.
      </p>
    </sec>
    <sec id="sec-5">
      <title>VOCABULARY DESIGN</title>
      <p>
        The Vocabulary of Interlinked Datasets (voiD) [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] is a
vocabulary and a set of instructions that enables the discovery
and usage of linked datasets. The principle of the voiD
effort is to use real requirements to guide the scope of the
design, and to re-use existing vocabularies wherever
possible instead of creating our own. Therefore, we have kept
the creation of new classes and properties under the voiD
namespace (http://rdfs.org/ns/void#) to the minimum.
3.1
      </p>
    </sec>
    <sec id="sec-6">
      <title>Datasets</title>
      <p>In the following, we will de ne and explain the basic
concepts voiD operates with. A fundamental entity in voiD is
a dataset.</p>
      <p>Definition 1. A dataset is a set of RDF triples that are
published, maintained or aggregated by a single provider.</p>
      <p>
        We think of a dataset as a meaningful collection of triples,
that deal with a certain topic, originate from a certain source
or process, are hosted on a certain server, or are aggregated
by a certain custodian. The term thus has a social dimension
that is not easy to capture in a formal de nition. This
differentiates datasets from RDF graphs [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], which are purely
mathematical constructs. Any arbitrary set of RDF triples
is an RDF graph, by de nition, regardless of the triples'
semantics. Also, typically a dataset is accessible on the Web,
for example through resolvable HTTP URIs or through a
SPARQL endpoint, and it contains su ciently many triples
that there is bene t in providing a concise summary.
      </p>
      <p>The ultimate purpose of creating a void:Dataset instance
is that this single resource represents the entire RDF dataset,
and thus allows us to make statements about the entire
dataset within the standard RDF model. The relationship
between a void:Dataset instance and the concrete triples
contained in the dataset is established only in an operational
manner: A voiD description usually contains access
information, such as the address of a SPARQL endpoint where the
triples can be accessed.</p>
      <p>We nd that most datasets describe a well-de ned set of
resources. Hence, a dataset can also be seen as a set of
descriptions for certain resources, which often share a common
URI pre x (such as http://dbpedia.org/resource/).</p>
      <p>
        HTTP URIs have \owners", due to their use of DNS
domain names. URI ownership is de ned as \a relation
between a URI and a social entity, such as a person,
organisation, or speci cation."[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] Information about a URI that is
provided by the URI owner is called authoritative
information. We use this notion to de ne authoritative datasets:
      </p>
      <p>Definition 2. A dataset is authoritative with respect to
a certain URI namespace if it contains information about
resources named by URIs in this namespace, and is published
by the URI owner.</p>
      <p>A straightforward method of publishing authoritative
datasets is by using resolvable HTTP URIs in the linked data
style. The URI owner also con gures the server that
responds when the URI is resolved. Therefore, if resolving
yields a description of the resource named by the URI, then
the data is authoritative.</p>
      <p>The notion of authoritative information supports the
social convention that a URI owner gets to decide what a URI
identi es. Providing authoritative information is how the
URI owner communicates this decision to the world. Even
if third parties disagree with that information, they can still
agree that they are talking about the same thing, which
would be much harder without the grounding provided by
the existence of an authoritative source.
3.2</p>
    </sec>
    <sec id="sec-7">
      <title>Linksets</title>
      <p>Besides datasets, voiD also deals with interlinking
between datasets. Interlinking in voiD is a rst-class citizens,
hence modelled as a class.</p>
      <p>The conceptual model of voiD links is depicted in Fig. 1.
Let us assume there are two datasets. One of them contains
links to the other, that is, it contains RDF triples that
connect resources from both datasets. We model this in voiD
using two instances of void:Dataset, and another dataset
:LS1 which is a subset of one of the datasets, and declared
to be of type void:Linkset. We de ne void:Linkset as:</p>
      <p>Definition 3. A linkset LS is a set of RDF triples where
for all triples ti = hsi; pi; oii 2 LS, the subject is in one
dataset, i.e. all si are described in DSsrc, and the object is
in another dataset, i.e. all oi are described in DSsink.</p>
      <p>The natural expectation is that both DSsrc and DSsink
are themselves described in voiD. We note that the triples
ti are often referred to as \interlinking triples".</p>
      <sec id="sec-7-1">
        <title>3.2.1 Inline links vs. 3rd-party links</title>
        <p>In voiD we are able to model two di erent situations: the
classic LOD3 case vs. the 3rd-party case (Fig. 2). In the
classic LOD case, the linkset is a subset of one of the two
involved datasets, while in the 3rd-party case a third dataset
is involved that actually contains the linkset.</p>
        <p>
          Though the 3rd-party cases is not yet widely implemented
in the context of linked data, this pattern of keeping links
separate from interlinked datasets has been well argued in
existing research such as found in the Hypertext
community [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. In LOD, there are already rst applications (RKB
explorer, see section 5.1), and it is very likely that such
systems will evolve over time and grow considerably.
        </p>
      </sec>
      <sec id="sec-7-2">
        <title>3.2.2 Interlinking Regarding Directionality</title>
        <p>Independent of the former situation, voiD distinguishes
between the non-directed vs. directed cases. In some
cases one is interested in stating the direction of the
interlinking (for example with foaf:interest), and in other
situations the direction is of no interest (e.g., owl:sameAs),
as shown in Fig. 3.</p>
        <p>In order to express the interlinking as outlined above, voiD
o ers the following RDF properties:
void:subset to state where the interlinking triples
reside (read: a dataset :DS has a subset :LS);
void:target to declare an interlinking target (for the
non-directed case); in the directed case, one can use
void:subjectsTarget and void:objectsTarget to
determine the direction (both being sub-properties of
void:target);
void:linkPredicate to express the RDF property (type)
of the interlinking in a linkset.</p>
        <p>We note that it is expected that per RDF predicate a
respective instance of void:Linkset could be created,
depending on the needs of an application. Further, one may take
into consideration that due to the modelling of void:target
and its sub-properties, light-weight subsumption inferencing
may be necessary to apply generic queries that will not
distinguish between the directed and non-directed case.</p>
        <p>In listing 1 a sample voiD description is depicted
describing the interlinking from DBpedia to DBLP. It is an example
3LOD ... Linking Open Data datasets, see http://esw.
w3.org/topic/SweoIG/TaskForces/CommunityProjects/
LinkingOpenData
(a) Classic LOD case: Describing the :DBpedia
dataset and its contained :DBpedia2DBLP linkset
(b) 3rd-party case: Describing the stand-alone
:DBpedia2DBLP linkset
for a directed case. The description de nes nothing about
who published this voiD description about DBpedia, which
means that it could also be an example for a 3rd-party case.
Further, the listing 2 shows a SPARQL query that is
executed against listing 1 to search for a dataset that is about
\computer science" and which is linked from DBpedia. The
result yields the dataset :DBLP.
3.3</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Reuse of Other Vocabularies</title>
      <p>
        In the voiD guide [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] we describe the reuse of other
vocabulary terms not directly de ned in the core voiD vocabulary
alongside with voiD. Some important properties from other
vocabularies are listed in the following. We note that there
are many other aspects one may want to choose to describe
in a dataset. A complete description of recommended usage
can be found from the voiD user guide [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>Properties from the dcterms namespace for general
metadata, such as the publishing organization and
publishing date of a dataset;</p>
      <p>
        (b) Directed case.
foaf:homepage of the dataset's homepage should be
used, to allow one to connect di erent descriptions of
the same dataset provided in di erent places on the
Web. The recommended process in voiD is IFP
smushing on foaf:homepage property;
dcterms:subject should be used to categorise a dataset.
For the general case, we recommend the use of a
DBpedia resource URI (http://dbpedia.org/resource/
XXX) to categorise a dataset, where XXX stands for
the thing which best describes the main topic of what
the dataset is about. However, DBpedia might not
contain concepts for describing some domain speci c
datasets. For example, there are no exact DBpedia
resource URIs for describing that a dataset is about \in
situ hybridisation image". We hence encourage
publishers to describe such datasets using concepts widely
adopted in their own communities, so that they can
not only capture precisely the categorisation of their
datasets but also ensure that these datasets could be
connected with other relevant data from their domains;
Statistical information represented using the
\Statistical Core Vocabulary" (SCOVO)4 [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
3.4
      </p>
    </sec>
    <sec id="sec-9">
      <title>Dataset Licensing</title>
      <p>
        As stated in Section 2, it is crucial for a data publisher to
associate appropriate licensing information with their
published data, so that potential users of the dataset would
4http://purl.org/NET/scovo
know under which terms they can use it and what
attribution they should apply. The dcterms:license property
should be used to to point to the license under which a
dataset has been published. Further, to allow automatic
analysis of datasets, voiD also recommends a set of
canonical identi ers for well-known licenses [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The example
below states that the DBpedia dataset is published under the
terms of the GNU Free Documentation License.
1 : DBpedia a void : Dataset ;
2 dcterms : license
      </p>
      <p>&lt; http :// www . gnu . org / copyleft / fdl . html &gt; .</p>
      <p>Listing 3: An exemplary voiD description about
data license.</p>
      <p>
        Licensing of datasets is a complex issue. Datasets are
collections of facts rather than creative works, and di erent
laws apply. Scientists are most cautious about publishing
their datasets onto the Web and they might request very
speci c or strict policies for sharing their data. Most
licenses such as Creative Commons or the GPL are based on
copyright and are designed to protect creative works, but
not databases, and applying them to datasets might not
have the desired legal result. Meanwhile, e orts such as
Open Data Commons [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] and Science Commons [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] are
developing dedicated licenses for data.
3.5
      </p>
    </sec>
    <sec id="sec-10">
      <title>Statistics</title>
      <p>Of special interest to distributed SPARQL agents will
be the statistics about the triples available in the dataset,
described with the void:statItem predicate. We adopt
SCOVO for representing statistics. The main class in SCOVO
is the scovo:Item, which records a single number or
statistical value along with so called dimensions. We provide two
types of information for describing statistics:</p>
      <p>Statistics concerning the whole dataset or linkset, such
as overall triple count or ne-grained statistics,
expressing the number of instances of a class or
property by using di erent pre-de ned dimensions,
including void:numberOfResources, etc.;
Attributing statistics to a source, recording where a
statistical datum stems from.</p>
      <p>Listing 4 demonstrates possible statistic information one can
publish for their dataset. The current modelling of statistics
1 : DBpedia a void : Dataset ;
2 void : statItem [
3 rdf : value 20000;
4 scovo : dimension void : numberOfResources ;
5 scovo : dimension foaf : Person ;
6 dcterms : source &lt; http :// wiki . dbpedia . org / &gt; ;
7 ] .</p>
      <p>Listing 4: Expressing statistics about a dataset in
voiD.
in voiD is still experimental. We had to make choices
between (i) a precise usage of scovo through a rather verbose
expression and (ii) creations of shortcuts to express statistics
needed for describing linked datasets:</p>
      <p>Scovo has an implicit assumption that all scovo:Items
associated with the dataset they describe share the
same dimensions. This does not t well with our
requirements for being able to mix items of di erent
dimensions for a dataset. On the other hand, the correct
Scovo modelling would lead to awkwardly complex and
verbose notation for simple statistics.</p>
      <p>We encourage the use of classes and properties in places
where scovo requires an instance of scovo:Dimension.
This breaks the symmetry of the scovo model. scovo
would require us to create a scovo:Dimension for each
class or property. This would be quite verbose.</p>
      <p>Because of the issues above, queries for statistics
information using SPARQL can be awkward. It will often require
a verbose check to make sure that an item has only certain
dimensions and no others.
3.6</p>
    </sec>
    <sec id="sec-11">
      <title>Additional Terms in voiD</title>
      <p>
        RDF datasets use one or more RDF-Schema vocabularies
or OWL ontologies, hence we provide the void:vocabulary
to list vocabularies used in a dataset. To express technical
features of a dataset, such as formats in which the data is
available, one can use void:feature. Further, a SPARQL
endpoint that provides access to a dataset via the SPARQL
protocol can be announced using the void:sparqlEndpoint
property. Listing 5 shows the usage of the terms described
above. We note that a complete list of the terms is available
from the voiD user guide [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
1 : DBpedia void : sparqlEndpoint
      </p>
      <p>&lt; http :// dbpedia . org / sparql &gt; ;
2 void : feature [ dcterms : format</p>
      <p>" application / rdf + xml " ; ] ;
3 void : vocabulary</p>
      <p>&lt; http :// xmlns . com / foaf /0.1/ &gt; .</p>
      <p>Listing 5: Additional voiD terms usage.</p>
      <p>PUBLICATION AND CONSUMPTION
We envision dataset publisher to o er a voiD description
along with their dataset. A voiD description typically has
two parts, (i) manually created part (categorisation,
vocabulary, license, etc.), and (ii) automatically generated part,
mainly regarding statistics.</p>
      <p>In the following we will discuss the publication process of
voiD descriptions and their discovery in order to consume
them.
4.1</p>
    </sec>
    <sec id="sec-12">
      <title>Publication</title>
      <p>
        Publishing a voiD le means to physically deploy it on
the Web in an RDF serialisation. We have detailed out the
options in the voiD guide [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>For dataset that are published as a collection RDF
documents, as commonly seen in the linked data publishing style,
one can use a dcterms:isPartOf triple in each document to
link back to the URI identifying the voiD dataset, as shown
in listing 6. Resolving the dataset URI will answer a voiD
descriptions about the entire dataset, allowing agents to
discover the voiD description when encountering an individual
document from the collection. The intuition behind using
the dcterms:isPartOf property is that the RDF document
contains an RDF graph whose triples are part of the dataset.
1 &lt; http :// dbpedia . org / data / Berlin &gt; dcterms : isPartOf
: DBpedia .</p>
      <p>Listing 6: Use backlinks publish voiD description of
a dataset.</p>
      <p>
        As discussed in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], we can imagine that voiD descriptions
are crawled and indexed by semantic search engines (such
as Sindice [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] or Yahoo's search monkey [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]) in order to
provide a central point of lookup.
4.2
      </p>
    </sec>
    <sec id="sec-13">
      <title>Discovery via Sitemaps</title>
      <p>A discovery mechanism for use by RDF-harvesting web
crawlers (Fig. 4) has been de ned as follows:
1. Given a domain name, the client gets the le robots.txt
and searches for a line that starts with Sitemap:; the
rest of that line is the URI of a sitemap;
2. The semantic sitemaps extension to the sitemap
protocol de nes a &lt;sc:dataset&gt; element that can have a
&lt;sc:datasetURI&gt; child element. If present, the value
of that element is a URI that identi es the dataset
datasetURI;
3. The dataset URI datasetURI is dereferenced which
yields the voiD description of the dataset.</p>
      <p>After releasing the rst edition of voiD in early 2009, we
have seen a certain community uptake. People and
organisations would start using in di erent areas and for di erent
purposes, potentially far beyond what we have envisioned in
the realm of our own use cases. We report on known usages
of voiD in the following and point out potential application
areas.
5.1</p>
    </sec>
    <sec id="sec-14">
      <title>Existing Applications</title>
      <p>5.1.1</p>
      <sec id="sec-14-1">
        <title>Tools for Creating voiD Descriptions</title>
        <p>To boot-strap the process of creating voiD descriptions,
several tools are available: ve, the voiD editor (Fig. 5),
liftSSM5, an XSLT script able to boot-strap from a Semantic
Sitemap, and, for creating the quantitative, statistical data,
a new release of the NX parser6, o ering a voiD export for
statistics.
5.1.2</p>
        <p>“Linked Datasets Explorer” (LDE)</p>
        <p>To let user browse and explore a collection of voiD
descriptions, we have developed the LDE demonstrator. Fig. 6
shows the current state of LDE7 which operates on a
manually created, so called \seed" set of voiD descriptions.
5.1.3</p>
      </sec>
      <sec id="sec-14-2">
        <title>RKB explorer</title>
        <p>The RKB explorer has a voiD site8 which enables querying
and browsing for CRS datasets. Further, the interlinking of
the RKB sites can be visualised using the underlying voiD
descriptions (Fig. 7).
5.1.4</p>
      </sec>
      <sec id="sec-14-3">
        <title>Query Federation</title>
        <p>Only recently Clarck-Parsia announced their voiD
support9:
\There is a touch point with the linked data
effort, which meant that the new voiD vocabulary
5http://rdfs.org/ns/void-guide#sec_4_3_Publishing_
tools
6http://sw.deri.org/2006/08/nxparser/release/
nxparser-1.1.jar
7http://ld2sd.deri.org/lde/
8http://void.rkbexplorer.com/
9http://clarkparsia.com/weblog/2009/02/04/
distributed-query-pellet-into-the-void/
for describing datasets turns out to be very useful
for describing the distributed data sources that
we query over, including their interrelations."</p>
        <p>Further, OpenLink plans to release its \Smart SPARQL
Federation capabilities", based on voiD, soon.
5.1.5</p>
      </sec>
      <sec id="sec-14-4">
        <title>Middleware</title>
        <p>OpenLink's Sponger Middleware uses voiD for
generating linked data from non-RDF data sources such as HTML
pages. An example10 from http://linkeddata.uriburner.
com/ with the voiD description deployed as XHTML+RDFa
is shown in Fig. 8. Further, the statistics maintenance in
their Virtuoso Quad Store is performed based on voiD.
5.2</p>
      </sec>
    </sec>
    <sec id="sec-15">
      <title>Potential Applications</title>
      <p>
        We envision voiD to be applied in many scenarios, some
of which we have identi ed earlier in section 2. Only
recently, for example, we have started to develop a dataset
ranking algorithm based on voiD descriptions; this is subject
to more research. One could further apply voiD to DARQ
(Distributed ARQ) [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ].
      </p>
      <p>A totally di erent application domain is visualisation: for
example, \The Map of Data"11 in Sindice can be generated
automatically thanks to voiD.</p>
      <p>Ultimately, to be of use, one wants applications that
bene t from voiD. Put in other words, this means that, given
there are applications that consume voiD and o er some
added value, the incentive for publishers to provide voiD
descriptions is self-evident. One such application could be
a sort of dynamic dataset selector which, con gured with a
speci cation of the dataset (topics, license, interlinking with
certain other datasets) would at run-time of an application
discover and select appropriate datasets according to the
search speci cation.</p>
    </sec>
    <sec id="sec-16">
      <title>RELATED WORK</title>
      <p>
        To the best of our knowledge, no comparable approach
to voiD exists. That is, in the context of the Web of Data,
we are not aware of any speci cation that allows the
description of datasets and their interlinking the way voiD
does. However, we acknowledge previous work of Semantic
Sitemaps [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and build upon it.
      </p>
      <p>
        In the scope of the Web of Documents, we note that at the
time of writing a W3C Working Draft of POWDER
(Protocol for Web Description Resources) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] is available.
POWDER aims at providing information about Web resources,
such as scope, authoritative information, etc., without
retrieving the resources themselves. POWDER comes in two
avours, (i) as human-legible XML, and (ii) in an RDF
version. It also provides a GRDDL transformation to turn the
former into the latter. The descriptions can be applied to
groups of resources de ned via listing of URIs, regular
expressions, etc. Several publishing methods are suggested
(via HTML &lt;link&gt; in the header, HTTP Link: header or
using XHTML+RDFa). Especially in the Web of Trust,
POWDER is expected to play a vital role, though
implementation complexity might hinder wide-spread adoption.
      </p>
      <p>
        Further, OASIS's XRDS (eXtensible Resource Descriptor
Sequence) [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] is an XML format for metadata discovery
10http://linkeddata.uriburner.com/about/html/http:
//twitter.com/mhausenblas#Dataset
11http://sindice.com/map
about a resource. The discovery protocol for XRDS
documents given a URI was de ned in 2006 as part of Yadis,
focusing on services such as OpenID and OAuth. In early
2008, XRDS-Simple was proposed12, but is now obsolete.
      </p>
      <p>
        Only very recently, the latest draft of \/host-meta" [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]
was proposed. The core of this proposal is a single
\wellknown location", /host-meta, acting as a directory of the
interesting metadata about a Web site. The format allows
di erent types of site metadata to be referenced by an URI
or included inline.
      </p>
      <p>
        One could understand the \HTTP Link: header" [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]
proposal related to voiD, as it also supports discovery, o ering
metadata about resources by resurrecting a (currently
deprecated) feature of HTTP. This proposal is at the time of
writing still under vivid discussion and not yet seen stable.
      </p>
      <p>
        Regarding federated SPARQL queries, DARQ (Distributed
ARQ) [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] proposes so called \service descriptions" that
are able to specify capabilities of a SPARQL endpoints.
The service descriptions enable the DARQ query engine
to decompose a query into sub-queries, each of which can
be answered by an individual service using query
rewriting and cost-based query optimisation to speed-up query
execution. Further, we note an attempt called \SPARQL
Endpoint Description"13 that aimed to allow the
announcement of endpoint capabilities and contents, support
discovery through service directories, and supply browsing and
federation hints. Both proposals seem to be not further
maintained and/or have not reached wide-spread adoption.
      </p>
      <p>Finally, we note that the W3C Technical Architecture
Group (TAG) started to contemplate about \Uniform
Access to Metadata"14, basically being a survey regarding the
problem of specifying a uniform method for obtaining
information pertaining to a resource without necessarily having
to parse a representation of the resource.
7.</p>
    </sec>
    <sec id="sec-17">
      <title>OUTLOOK</title>
      <p>
        We have released the voiD vocabulary and voiD user guide
to linked data communities in January this year. In this
release, we have used the use cases presented in section 2 to
guide the design scope of the voiD vocabulary. Supports
for describing the quality, provenance and versions of linked
datasets are to be addressed in the next release of voiD.
Also, the statistics modelling in the current voiD model is
still experimental. We are communicating with user
communities and the SCOVO team in order to propose a more
stable modelling in the coming release15. Additionally we
will liaison with initiatives such as the \Ontology Metadata
Vocabulary" [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] sharing similar goals.
      </p>
      <p>To test and evaluate the usefulness of voiD, we need tools
that use voiD to support the discovery of datasets or the
SPARQL query federation. Fortunately, semantic query
engines like Sindice and SPARQL query processing systems
(like OpenLink) are adopting voiD in their implementations.
It is challenging to completely automate the creation of voiD
descriptions. We need tools like the NX parser to take as
12http://www.hueniverse.com/hueniverse/2008/03/
putting-xrds-si.html
13http://esw.w3.org/topic/SparqlEndpointDescription
14http://www.w3.org/2001/tag/doc/
uniform-access-20090205.html
15See http://code.google.com/p/void-impl/issues/
list?can=2&amp;q=milestone:Release2.0forplannedissues.
much as possible of the heavy lifting for non-technical data
publishers as possible.</p>
    </sec>
    <sec id="sec-18">
      <title>Acknowledgements</title>
      <p>Our work has partly been supported by the European
Commission under Grant No. 217031, FP7/ICT-2007.1.2, project
\Domain Driven Design and Mashup Oriented Development
based on Open Source Java Metaframework for Pragmatic,
Reliable and Secure Web Development" (Romulus)16, and
the Joint Information Systems Committee [Project
\FlyWeb"]. The authors would further like to thank
(alphabetically): Orri Erling, Hugh Glaser, Olaf Hartig, Tom Heath,
Andreas Langegger, Ian Millard, Marc-Alexandre Nolin, Yves
Raimond, Yrjana Rankka, Francois Schar e, and Giovanni
Tummarello.
16http://www.ict-romulus.eu/</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>K.</given-names>
            <surname>Alexander</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cyganiak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hausenblas</surname>
          </string-name>
          , and
          <string-name>
            <surname>J. Zhao.</surname>
          </string-name>
          <article-title>voiD guide|Using the Vocabulary of Interlinked Datasets</article-title>
          . Community Draft, voiD working group,
          <year>2009</year>
          . http://rdfs.org/ns/void-guide/.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>K.</given-names>
            <surname>Alexander</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cyganiak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hausenblas</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhao</surname>
          </string-name>
          .
          <article-title>voiD, the \Vocabulary of Interlinked Datasets"</article-title>
          .
          <source>Community Draft</source>
          , voiD working group,
          <year>2009</year>
          . http://rdfs.org/ns/void/.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>P.</given-names>
            <surname>Archer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Smith</surname>
          </string-name>
          , and
          <string-name>
            <given-names>A.</given-names>
            <surname>Perego</surname>
          </string-name>
          .
          <article-title>Protocol for Web Description Resources (POWDER): Description Resources</article-title>
          .
          <source>W3C Working Draft 14 November</source>
          <year>2008</year>
          , POWDER Working Group,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>T.</given-names>
            <surname>Berners-Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chilton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Connolly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Dhanaraj</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hollenbach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lerer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Sheets</surname>
          </string-name>
          . Tabulator:
          <article-title>Exploring and analyzing linked data on the Semantic Web</article-title>
          . In
          <source>In Proceedings of the 3rd International Semantic Web User Interaction Workshop (SWUI06)</source>
          , Athens, Georgia, USA,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Heath</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Idehen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Berners-Lee</surname>
          </string-name>
          .
          <article-title>Linked Data on the Web (LDOW2008)</article-title>
          .
          <source>In Linked Data on the Web Workshop(WWW2008)</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L. A.</given-names>
            <surname>Carr</surname>
          </string-name>
          , D. C. DeRoure, W. Hall, and
          <string-name>
            <given-names>G. J.</given-names>
            <surname>Hill</surname>
          </string-name>
          .
          <article-title>The Distributed Link Service: A tool for publishers, authors and readers)</article-title>
          .
          <source>In Proceedings of the 4th International World Wide Web Conference: The Web Revolution)</source>
          , pages
          <fpage>647</fpage>
          {
          <fpage>656</fpage>
          , Boston, USA,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R.</given-names>
            <surname>Cyganiak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Stenzhorn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Delbru</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Decker</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Tummarello. Semantic Sitemaps</surname>
          </string-name>
          :
          <article-title>E cient and exible access to datasets on the Semantic Web</article-title>
          .
          <source>In Proceedings of the 5th European Semantic Web Conference</source>
          , volume
          <volume>5021</volume>
          , pages
          <fpage>690</fpage>
          {
          <fpage>704</fpage>
          ,
          <string-name>
            <surname>Tenerife</surname>
          </string-name>
          , Spain,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>O.</given-names>
            <surname>Hartig</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Heese</surname>
          </string-name>
          .
          <article-title>The SPARQL query graph model for query optimization</article-title>
          .
          <source>In Proceedings of the 4th European Semantic Web Conference</source>
          <year>2007</year>
          , pages
          <fpage>564</fpage>
          {
          <fpage>578</fpage>
          ,
          <string-name>
            <surname>Innsbruck</surname>
          </string-name>
          , Austria,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>J.</given-names>
            <surname>Hartmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sure</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Haase</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Palma</surname>
          </string-name>
          , and
          <string-name>
            <surname>M. del Carmen</surname>
          </string-name>
          Suarez-Figueroa.
          <article-title>OMV { Ontology Metadata Vocabulary</article-title>
          . In C. Welty, editor,
          <source>ISWC 2005 - In Ontology Patterns for the Semantic Web</source>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M.</given-names>
            <surname>Hausenblas</surname>
          </string-name>
          .
          <article-title>Discovery and usage of linked datasets on the Web of data</article-title>
          .
          <source>In Talis NodMag 4</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Hausenblas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Halb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Raimond</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Feigenbaum</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Ayers</surname>
          </string-name>
          . SCOVO:
          <article-title>Using statistics on the Web of data</article-title>
          .
          <source>In 6th European Semantic Web Conference (ESWC2009)</source>
          ,
          <source>Semantic Web in Use Track</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Hausenblas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Halb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Raimond</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Heath</surname>
          </string-name>
          .
          <article-title>What is the size of the Semantic Web</article-title>
          .
          <source>In Proceedings of I-Semantics</source>
          <year>2008</year>
          , Graz, Austria,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Hausenblas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Slany</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Ayers</surname>
          </string-name>
          .
          <article-title>A performance and scalability metric for virtual RDF graphs</article-title>
          .
          <source>In 3rd Workshop on Scripting for the Semantic Web (SFSW07)</source>
          , Innsbruck, Austria,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>HP</given-names>
            <surname>Lab</surname>
          </string-name>
          . TDB/Optimizer. http://jena.hpl.hp.com/wiki/TDB/Optimizer, 25 October,
          <year>2008</year>
          . Accessed in March
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>I.</given-names>
            <surname>Jacobs</surname>
          </string-name>
          and
          <string-name>
            <given-names>N.</given-names>
            <surname>Walsh</surname>
          </string-name>
          .
          <source>Architecture of the World Wide Web</source>
          , Volume One.
          <source>W3C Recommendation 15 December</source>
          <year>2004</year>
          ,
          <source>W3C Technical Architecture Group (TAG)</source>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>J.</given-names>
            <surname>Klump</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Bertelmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Brase</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Diepenbroek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Grobe</surname>
          </string-name>
          , H. Hock, M. Lautenschlager,
          <string-name>
            <given-names>U.</given-names>
            <surname>Schindler</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Sens</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Wa</surname>
          </string-name>
          <article-title>chter. Data publication in the open access initiative</article-title>
          .
          <source>Data Science Journal</source>
          ,
          <volume>5</volume>
          :
          <fpage>79</fpage>
          {
          <fpage>83</fpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>G.</given-names>
            <surname>Klyne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Carroll</surname>
          </string-name>
          , and
          <string-name>
            <surname>B. McBride.</surname>
          </string-name>
          <article-title>RDF/XML Syntax Speci cation (Revised)</article-title>
          .
          <source>W3C Recommendation</source>
          , RDF Core Working Group,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>P.</given-names>
            <surname>Mika</surname>
          </string-name>
          .
          <article-title>Microsearch: An interface for semantic search</article-title>
          . In Semantic Search,
          <source>International Workshop located at the 5th European Semamntic Web Conference (ESWC</source>
          <year>2008</year>
          ), volume
          <volume>334</volume>
          <source>of CEUR Workshop Proceedings</source>
          , pages
          <volume>79</volume>
          {
          <fpage>88</fpage>
          . CEUR-WS.org,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>P.</given-names>
            <surname>Miller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Styles</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Heath</surname>
          </string-name>
          .
          <article-title>Open data commons, a license for open data</article-title>
          .
          <source>In Proceedings of the Workshop on Linked Data on the Web (WWW2008)</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>M.</given-names>
            <surname>Nottingham</surname>
          </string-name>
          .
          <article-title>Link relations and HTTP header linking</article-title>
          .
          <source>Internet-Draft, 1 December</source>
          <year>2008</year>
          , IETF Network Working Group,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>M.</given-names>
            <surname>Nottingham</surname>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Hammer-Lahav</surname>
          </string-name>
          .
          <article-title>Host metadata for the Web</article-title>
          .
          <source>Internet-Draft, 10 February</source>
          <year>2009</year>
          , IETF Network Working Group,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>B.</given-names>
            <surname>Quilitz</surname>
          </string-name>
          and
          <string-name>
            <given-names>U.</given-names>
            <surname>Leser</surname>
          </string-name>
          .
          <article-title>Querying distributed RDF data sources with SPARQL</article-title>
          .
          <source>In Proceedings of the 5th European Semantic Web Conference</source>
          <year>2008</year>
          ), pages
          <fpage>524</fpage>
          {
          <fpage>538</fpage>
          . Springer,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>G.</given-names>
            <surname>Tummarello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Delbru</surname>
          </string-name>
          , and
          <string-name>
            <surname>E. Oren. Sindice.</surname>
          </string-name>
          <article-title>com: Weaving the open linked data</article-title>
          .
          <source>Proceedings of the 6th International Semantic Web Conference 2007 (ISWC2007)</source>
          ,
          <volume>4825</volume>
          :
          <fpage>552</fpage>
          {
          <fpage>565</fpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>G.</given-names>
            <surname>Wachob</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Reed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chasen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Tan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Churchill</surname>
          </string-name>
          .
          <source>Extensible Resource Identi er (XRI) Resolution Version 2.0. Committee Draft 03 28 February</source>
          <year>2008</year>
          ,
          <article-title>OASIS eXtensible Resource Identi er (XRI) TC</article-title>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>S.</given-names>
            <surname>Weibel</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. S.</surname>
          </string-name>
          <article-title>for Information Science, and</article-title>
          <string-name>
            <surname>Technology.</surname>
          </string-name>
          <article-title>The Dublin Core: A simple content description model for electronic resources</article-title>
          .
          <source>Bulletin of the American Society for Information Science and Technology</source>
          ,
          <volume>24</volume>
          (
          <issue>1</issue>
          ):9{
          <fpage>11</fpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Miles</surname>
          </string-name>
          , G. Klyne, and
          <string-name>
            <given-names>D.</given-names>
            <surname>Shotton</surname>
          </string-name>
          .
          <article-title>Linked data and provenance in biological data webs</article-title>
          .
          <source>Brie ngs in Bioinformatics</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>