<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Describing Linked Datasets On the Design and Usage of voiD, the &quot;Vocabulary Of Interlinked Datasets&quot;</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Keith</forename><surname>Alexander</surname></persName>
							<email>keith.alexander@talis.com</email>
						</author>
						<author>
							<persName><forename type="first">Talis</forename><forename type="middle">Ltd Richard</forename><surname>Cyganiak</surname></persName>
						</author>
						<author>
							<persName><forename type="first">Michael</forename><surname>Hausenblas</surname></persName>
							<email>michael.hausenblas@deri.org</email>
						</author>
						<author>
							<persName><forename type="first">Jun</forename><surname>Zhao</surname></persName>
							<email>jun.zhao@zoo.ox.ac.uk</email>
						</author>
						<author>
							<affiliation key="aff0">
								<orgName type="department">DERI</orgName>
								<orgName type="institution">National University of Ireland</orgName>
								<address>
									<settlement>Galway</settlement>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff1">
								<orgName type="department">DERI</orgName>
								<orgName type="institution">National University of Ireland</orgName>
								<address>
									<settlement>Galway</settlement>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff2">
								<orgName type="department">Department of Zoology</orgName>
								<orgName type="institution">University of Oxford</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Describing Linked Datasets On the Design and Usage of voiD, the &quot;Vocabulary Of Interlinked Datasets&quot;</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">21EB31AD8FBE3F9537DEB6AD81DF73AA</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T18:53+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this paper we discuss the design and implementation of voiD, the "Vocabulary Of Interlinked Datasets", a vocabulary that allows to formally describe linked RDF datasets. We report on use cases for voiD, the current state of the specification and its potential applications in the context of linked datasets.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">INTRODUCTION</head><p>With the growth of the number of linked datasets <ref type="bibr" target="#b11">[12]</ref>, automating certain tasks, such as discovery, selection and optimisation, becomes more and more important. Now, one might argue that URIs and RDF <ref type="bibr" target="#b15">[17]</ref> are all one needs to explore the linked datasets; follow-your-nose 1 , however bears some inherent problems. The possible links that can be followed from a starting URI raises both performance and trust issues. The main reason for these issues lies in the granularity level of the available descriptions. Additionally, the dynamics of the data-sources <ref type="bibr" target="#b12">[13]</ref> also has an impact on the performance of, say, a crawl over a collection of datasets; and the reliability of secondary data resources <ref type="bibr" target="#b25">[26]</ref>.</p><p>In the early days of linked data (2006 and 2007) <ref type="bibr" target="#b4">[5]</ref> the main focus of the community was on publishing data and finding good practices. Now, in the second phase, other issues such as usability, quality, performance, reliability of the infrastructure and the data in the linked data ecosystem are increasingly recognized to be important.</p><p>How can we overcome the limitations of follow-your-nose while retaining the self-descriptive momentum and being able to exploit available tools, methodologies, etc.? A simple yet effective approach is to decrease granularity. Rather than talking about single resources, we talk about something which up to now only existed in drawings, such as in the LOD cloud 2 , which graphically represents the landscape of linked data on the Web using bubbles for datasets and arcs between bubbles for the links between these datasets. The voiD vocabulary, the "Vocabulary Of Interlinked Datasets", allows one to describe datasets (the bubbles) and linksets (the arcs between the bubbles), and in turn enables a number of tasks to be automated in a scalable manner.</p><p>The remainder of this paper is as follows: in Section 2 we discuss some use cases that provide the motivation for voiD. Then, in Section 3 we report on the design of the voiD core vocabulary and its usage along with other vocabularies such as Dublin Core <ref type="bibr" target="#b24">[25]</ref>. We describe the publication and the consumption of voiD descriptions in Section 4. In Section 5 we discuss current and potential applications of voiD and report on related work in Section 6. We discuss future plans and conclude in Section 7.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">USE CASES</head><p>In the following we will describe our motivation use cases for voiD. In the context of linked data, we basically differentiate between:</p><p>• on the one hand linked data publisher (a person or organisation exposing structured data as RDF on the Web and interlink it with other datasets), and</p><p>• linked data consumer on the other hand; these might be machines, for example using a semantic indexer or a query engine or, as well, humans, e.g., when using a Web of Data browser such as Tabulator <ref type="bibr" target="#b3">[4]</ref>.</p><p>It is worth noting that the following use cases are not necessarily restricted to the linked data domain.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Efficient Discovery of Datasets</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.1">Dataset Publisher</head><p>A dataset publisher might not be identical with the party who created the raw datasets, but one who publishes them onto the Web in a more accessible format. A dataset publisher wants to be able to publish metadata about the dataset such that:</p><p>• The dataset can be found and aggregated by search engine applications, or discovered in relevant searches;</p><p>• The metadata provides clear licensing information so that consumers can know how they can use the data and to whom they should attribute credits for creating/publishing the dataset;</p><p>• That consumers can obtain information about access interfaces, such as APIs and SPARQL endpoints.</p><p>It is in the best interest of a dataset publisher to provide potential users of the data with information that supports them in accessing and using the dataset.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.2">Search Engine Provider</head><p>A search engine provider wants to discover detailed descriptions about datasets efficiently. A crawler has stumbled upon an individual RDF document on the Web. How will it discover metadata that applies to the entire dataset and cannot be repeated in every single document? The simple approach of just putting the voiD description online and linking it from somewhere on the site does not meet our needs, as it could take the crawler a long time to find the description. It is important that the voiD description is discoverable as soon as the crawler finds the first RDF document, so that the crawler can use voiD metadata to guide its processing of the data.</p><p>The Sindice search engine <ref type="bibr" target="#b22">[23]</ref> already uses Semantic Sitemaps <ref type="bibr" target="#b6">[7]</ref> to enable discovery and efficient processing of datasets. It seems natural to address the situations above by building on Semantic Sitemaps.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Expressing Research Data</head><p>A developer working together with biologists wants to help domain experts to find research data published by their peer colleagues. These are often produced for a particular experiment, for a particular study or publication, or hosted by a particular public database. Scientists might know whose datasets they would prefer to access because they often have a clear idea about their content or they trust more on that data provider. When looking for new datasets, they might search for datasets that provide relevant content (such as information about genes, proteins, or micro-array gene expression), that are produced in a right experiment environment, or that provide additional information that will complement their local experiment results.</p><p>To find the right dataset and to make this dataset accessible for biologists in a user-facing application, the developer often has to go through the following process:</p><p>• Locate a dataset that contains information relevant to biologists' research interests, such as information about a specific organism; or more specifically, genomic information about a particular organism;</p><p>• Find out how this dataset can be programmatically accessed, as an RDF dump, through SPARQL endpoint or any other protocol;</p><p>• Find out the licence associated with the dataset, making sure that data are accessible under open-access licence or certain attribution;</p><p>• Understand the content of the dataset in order to perform an alignment with other datasets. Information about URIs used in the dataset can help one with the data identity alignment, schema(s) used in the dataset for data schema alignment, and its links with other datasets for assisting data integration.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3">Effective Dataset Selection</head><p>A consumer may have discovered several datasets, for example as a result of an indexer query. The question then arises how to select appropriate datasets from this list of potential candidates. The consumer, either a human or a query federation engine, might wish to define "appropriateness" along the following criteria:</p><p>1. The content of the dataset, that is, what the dataset mainly is about. Based on some kind of categorisation scheme a selection could take place;</p><p>2. The interlinking to other datasets, that is, to which other datasets and how the dataset is interlinked;</p><p>3. Vocabularies used in the dataset.</p><p>The criteria listed above can be understood in terms of quality and quantity. For example, one might be interested only in datasets containing foaf:interest links to a certain other dataset. Or, where the number of links are of interest, one may specify that only datasets with more than one million links should be taken into account.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4">Query Optimisation</head><p>With many datasets now on the Web both connectable (through shared vocabulary terms) and connecting (by linking to resources in other datasets), it is naturally desirable to query across multiple datasets at once with SPARQL.</p><p>Optimisation of SPARQL queries can be achieved in a static way. A set of logical rules <ref type="bibr" target="#b21">[22]</ref> can be applied to a query engine, to calculate all equivalent query plans for a given query and then choose the most optimised query plan to be executed. To optimise SPARQL queries dynamically, i.e., deciding the best execution approach during the execution phase [14], one can use the statistical information about datasets, such as how much information is provided about a particular entity or property. This information can be used by the query mediator to optimise query plans by, for example, modifying the order in which a query pattern is executed according to the estimated size of data results <ref type="bibr" target="#b7">[8]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">VOCABULARY DESIGN</head><p>The Vocabulary of Interlinked Datasets (voiD) <ref type="bibr" target="#b1">[2]</ref> is a vocabulary and a set of instructions that enables the discovery and usage of linked datasets. The principle of the voiD effort is to use real requirements to guide the scope of the design, and to re-use existing vocabularies wherever possible instead of creating our own. Therefore, we have kept the creation of new classes and properties under the voiD namespace (http://rdfs.org/ns/void#) to the minimum.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Datasets</head><p>In the following, we will define and explain the basic concepts voiD operates with. A fundamental entity in voiD is a dataset. Definition 1. A dataset is a set of RDF triples that are published, maintained or aggregated by a single provider.</p><p>We think of a dataset as a meaningful collection of triples, that deal with a certain topic, originate from a certain source or process, are hosted on a certain server, or are aggregated by a certain custodian. The term thus has a social dimension that is not easy to capture in a formal definition. This differentiates datasets from RDF graphs <ref type="bibr" target="#b15">[17]</ref>, which are purely mathematical constructs. Any arbitrary set of RDF triples is an RDF graph, by definition, regardless of the triples' semantics. Also, typically a dataset is accessible on the Web, for example through resolvable HTTP URIs or through a SPARQL endpoint, and it contains sufficiently many triples that there is benefit in providing a concise summary.</p><p>The ultimate purpose of creating a void:Dataset instance is that this single resource represents the entire RDF dataset, and thus allows us to make statements about the entire dataset within the standard RDF model. The relationship between a void:Dataset instance and the concrete triples contained in the dataset is established only in an operational manner: A voiD description usually contains access information, such as the address of a SPARQL endpoint where the triples can be accessed.</p><p>We find that most datasets describe a well-defined set of resources. Hence, a dataset can also be seen as a set of descriptions for certain resources, which often share a common URI prefix (such as http://dbpedia.org/resource/). HTTP URIs have "owners", due to their use of DNS domain names. URI ownership is defined as "a relation between a URI and a social entity, such as a person, organisation, or specification." <ref type="bibr" target="#b13">[15]</ref> Information about a URI that is provided by the URI owner is called authoritative information. We use this notion to define authoritative datasets: Definition 2. A dataset is authoritative with respect to a certain URI namespace if it contains information about resources named by URIs in this namespace, and is published by the URI owner.</p><p>A straightforward method of publishing authoritative datasets is by using resolvable HTTP URIs in the linked data style. The URI owner also configures the server that responds when the URI is resolved. Therefore, if resolving yields a description of the resource named by the URI, then the data is authoritative.</p><p>The notion of authoritative information supports the social convention that a URI owner gets to decide what a URI identifies. Providing authoritative information is how the URI owner communicates this decision to the world. Even if third parties disagree with that information, they can still agree that they are talking about the same thing, which would be much harder without the grounding provided by the existence of an authoritative source.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Linksets</head><p>Besides datasets, voiD also deals with interlinking between datasets. Interlinking in voiD is a first-class citizens, hence modelled as a class. The conceptual model of voiD links is depicted in Fig. <ref type="figure" target="#fig_0">1</ref>. Let us assume there are two datasets. One of them contains links to the other, that is, it contains RDF triples that connect resources from both datasets. We model this in voiD using two instances of void:Dataset, and another dataset :LS1 which is a subset of one of the datasets, and declared to be of type void:Linkset. We define void:Linkset as: Definition 3. A linkset LS is a set of RDF triples where for all triples ti = si, pi, oi ∈ LS, the subject is in one dataset, i.e. all si are described in DSsrc, and the object is in another dataset, i.e. all oi are described in DS sink .</p><p>The natural expectation is that both DSsrc and DS sink are themselves described in voiD. We note that the triples ti are often referred to as "interlinking triples".</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.1">Inline links vs. 3rd-party links</head><p>In voiD we are able to model two different situations: the classic LOD<ref type="foot" target="#foot_0">3</ref> case vs. the 3rd-party case (Fig. <ref type="figure" target="#fig_2">2</ref>). In the classic LOD case, the linkset is a subset of one of the two involved datasets, while in the 3rd-party case a third dataset is involved that actually contains the linkset.</p><p>Though the 3rd-party cases is not yet widely implemented in the context of linked data, this pattern of keeping links separate from interlinked datasets has been well argued in existing research such as found in the Hypertext community <ref type="bibr" target="#b5">[6]</ref>. In LOD, there are already first applications (RKB explorer, see section 5.1), and it is very likely that such systems will evolve over time and grow considerably.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.2">Interlinking Regarding Directionality</head><p>Independent of the former situation, voiD distinguishes between the non-directed vs. directed cases. In some cases one is interested in stating the direction of the interlinking (for example with foaf:interest), and in other situations the direction is of no interest (e.g., owl:sameAs), as shown in Fig. <ref type="figure" target="#fig_4">3</ref>.</p><p>In order to express the interlinking as outlined above, voiD offers the following RDF properties:</p><p>• void:subset to state where the interlinking triples reside (read: a dataset :DS has a subset :LS);</p><p>• void:target to declare an interlinking target (for the non-directed case); in the directed case, one can use void:subjectsTarget and void:objectsTarget to determine the direction (both being sub-properties of void:target);</p><p>• void:linkPredicate to express the RDF property (type) of the interlinking in a linkset.</p><p>We note that it is expected that per RDF predicate a respective instance of void:Linkset could be created, depending on the needs of an application. Further, one may take into consideration that due to the modelling of void:target and its sub-properties, light-weight subsumption inferencing may be necessary to apply generic queries that will not distinguish between the directed and non-directed case.</p><p>In listing 1 a sample voiD description is depicted describing the interlinking from DBpedia to DBLP. It is an example  for a directed case. The description defines nothing about who published this voiD description about DBpedia, which means that it could also be an example for a 3rd-party case. Further, the listing 2 shows a SPARQL query that is executed against listing 1 to search for a dataset that is about "computer science" and which is linked from DBpedia. The result yields the dataset :DBLP.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3">Reuse of Other Vocabularies</head><p>In the voiD guide <ref type="bibr" target="#b0">[1]</ref> we describe the reuse of other vocabulary terms not directly defined in the core voiD vocabulary alongside with voiD. Some important properties from other vocabularies are listed in the following. We note that there are many other aspects one may want to choose to describe in a dataset. A complete description of recommended usage can be found from the voiD user guide <ref type="bibr" target="#b0">[1]</ref>.</p><p>• Properties from the dcterms namespace for general metadata, such as the publishing organization and publishing date of a dataset;  • foaf:homepage of the dataset's homepage should be used, to allow one to connect different descriptions of the same dataset provided in different places on the Web. The recommended process in voiD is IFP smushing on foaf:homepage property;</p><p>• dcterms:subject should be used to categorise a dataset. For the general case, we recommend the use of a DBpedia resource URI (http://dbpedia.org/resource/ XXX) to categorise a dataset, where XXX stands for the thing which best describes the main topic of what the dataset is about. However, DBpedia might not contain concepts for describing some domain specific datasets. For example, there are no exact DBpedia resource URIs for describing that a dataset is about "in situ hybridisation image". We hence encourage publishers to describe such datasets using concepts widely adopted in their own communities, so that they can not only capture precisely the categorisation of their datasets but also ensure that these datasets could be connected with other relevant data from their domains;</p><p>• Statistical information represented using the "Statistical Core Vocabulary" (SCOVO)<ref type="foot" target="#foot_1">4</ref>  <ref type="bibr" target="#b10">[11]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4">Dataset Licensing</head><p>As stated in Section 2, it is crucial for a data publisher to associate appropriate licensing information with their published data, so that potential users of the dataset would 1 @prefix owl : &lt; http :// www . w3 . org / 2 0 0 2 / 0 7 / owl # &gt; . 2 @prefix foaf : &lt; http :// xmlns . com / foaf /0.1/ &gt; . 3 @prefix dc : &lt; http :// purl . org / dc / terms / &gt; . 4 @prefix void : &lt; http :// rdfs . org / ns / void # &gt; . 5 @prefix dbp : &lt; http :// dbpedia . org / r e s o u r c e / &gt; .   know under which terms they can use it and what attribution they should apply. The dcterms:license property should be used to to point to the license under which a dataset has been published. Further, to allow automatic analysis of datasets, voiD also recommends a set of canonical identifiers for well-known licenses <ref type="bibr" target="#b0">[1]</ref>. The example below states that the DBpedia dataset is published under the terms of the GNU Free Documentation License. Listing 3: An exemplary voiD description about data license.</p><p>Licensing of datasets is a complex issue. Datasets are collections of facts rather than creative works, and different laws apply. Scientists are most cautious about publishing their datasets onto the Web and they might request very specific or strict policies for sharing their data. Most licenses such as Creative Commons or the GPL are based on copyright and are designed to protect creative works, but not databases, and applying them to datasets might not have the desired legal result. Meanwhile, efforts such as Open Data Commons <ref type="bibr" target="#b18">[19]</ref> and Science Commons <ref type="bibr" target="#b14">[16]</ref> are developing dedicated licenses for data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.5">Statistics</head><p>Of special interest to distributed SPARQL agents will be the statistics about the triples available in the dataset, described with the void:statItem predicate. We adopt SCOVO for representing statistics. The main class in SCOVO is the scovo:Item, which records a single number or statistical value along with so called dimensions. We provide two types of information for describing statistics:</p><p>• Statistics concerning the whole dataset or linkset, such as overall triple count or fine-grained statistics, expressing the number of instances of a class or property by using different pre-defined dimensions, including void:numberOfResources, etc.;</p><p>• Attributing statistics to a source, recording where a statistical datum stems from.</p><p>Listing 4 demonstrates possible statistic information one can publish for their dataset. The current modelling of statistics 1 : DBpedia a void : Dataset ; Listing 4: Expressing statistics about a dataset in voiD.</p><p>in voiD is still experimental. We had to make choices between (i) a precise usage of scovo through a rather verbose expression and (ii) creations of shortcuts to express statistics needed for describing linked datasets:</p><p>• Scovo has an implicit assumption that all scovo:Items associated with the dataset they describe share the same dimensions. This does not fit well with our requirements for being able to mix items of different dimensions for a dataset. On the other hand, the correct Scovo modelling would lead to awkwardly complex and verbose notation for simple statistics.</p><p>• We encourage the use of classes and properties in places where scovo requires an instance of scovo:Dimension. This breaks the symmetry of the scovo model. scovo would require us to create a scovo:Dimension for each class or property. This would be quite verbose.</p><p>Because of the issues above, queries for statistics information using SPARQL can be awkward. It will often require a verbose check to make sure that an item has only certain dimensions and no others.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.6">Additional Terms in voiD</head><p>RDF datasets use one or more RDF-Schema vocabularies or OWL ontologies, hence we provide the void:vocabulary to list vocabularies used in a dataset. To express technical features of a dataset, such as formats in which the data is available, one can use void:feature. Further, a SPARQL endpoint that provides access to a dataset via the SPARQL protocol can be announced using the void:sparqlEndpoint property. Listing 5 shows the usage of the terms described above. We note that a complete list of the terms is available from the voiD user guide <ref type="bibr" target="#b0">[1]</ref>. Listing 5: Additional voiD terms usage.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">PUBLICATION AND CONSUMPTION</head><p>We envision dataset publisher to offer a voiD description along with their dataset. A voiD description typically has two parts, (i) manually created part (categorisation, vocabulary, license, etc.), and (ii) automatically generated part, mainly regarding statistics.</p><p>In the following we will discuss the publication process of voiD descriptions and their discovery in order to consume them.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1">Publication</head><p>Publishing a voiD file means to physically deploy it on the Web in an RDF serialisation. We have detailed out the options in the voiD guide <ref type="bibr" target="#b0">[1]</ref>.</p><p>For dataset that are published as a collection RDF documents, as commonly seen in the linked data publishing style, one can use a dcterms:isPartOf triple in each document to link back to the URI identifying the voiD dataset, as shown in listing 6. Resolving the dataset URI will answer a voiD descriptions about the entire dataset, allowing agents to discover the voiD description when encountering an individual document from the collection. The intuition behind using the dcterms:isPartOf property is that the RDF document contains an RDF graph whose triples are part of the dataset.</p><p>Listing 6: Use backlinks publish voiD description of a dataset.</p><p>As discussed in <ref type="bibr" target="#b9">[10]</ref>, we can imagine that voiD descriptions are crawled and indexed by semantic search engines (such as Sindice <ref type="bibr" target="#b22">[23]</ref> or Yahoo's search monkey <ref type="bibr" target="#b17">[18]</ref>) in order to provide a central point of lookup.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">Discovery via Sitemaps</head><p>A discovery mechanism for use by RDF-harvesting web crawlers (Fig. <ref type="figure" target="#fig_11">4</ref>) has been defined as follows:</p><p>1. Given a domain name, the client gets the file robots.txt and searches for a line that starts with Sitemap:; the rest of that line is the URI of a sitemap;</p><p>2. The semantic sitemaps extension to the sitemap protocol defines a &lt;sc:dataset&gt; element that can have a &lt;sc:datasetURI&gt; child element. If present, the value of that element is a URI that identifies the dataset datasetURI;</p><p>3. The dataset URI datasetURI is dereferenced which yields the voiD description of the dataset. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">VOID IN THE WILD</head><p>After releasing the first edition of voiD in early 2009, we have seen a certain community uptake. People and organisations would start using in different areas and for different purposes, potentially far beyond what we have envisioned in the realm of our own use cases. We report on known usages of voiD in the following and point out potential application areas.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1">Existing Applications</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.1">Tools for Creating voiD Descriptions</head><p>To boot-strap the process of creating voiD descriptions, several tools are available: ve, the voiD editor (Fig. <ref type="figure" target="#fig_12">5</ref>), liftSSM <ref type="foot" target="#foot_3">5</ref> , an XSLT script able to boot-strap from a Semantic Sitemap, and, for creating the quantitative, statistical data, a new release of the NX parser<ref type="foot" target="#foot_4">6</ref> , offering a voiD export for statistics.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.2">"Linked Datasets Explorer" (LDE)</head><p>To let user browse and explore a collection of voiD descriptions, we have developed the LDE demonstrator. Fig. <ref type="figure" target="#fig_14">6</ref> shows the current state of LDE <ref type="foot" target="#foot_5">7</ref> which operates on a manually created, so called "seed" set of voiD descriptions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.3">RKB explorer</head><p>The RKB explorer has a voiD site <ref type="foot" target="#foot_6">8</ref> which enables querying and browsing for CRS datasets. Further, the interlinking of the RKB sites can be visualised using the underlying voiD descriptions (Fig. <ref type="figure" target="#fig_5">7</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.4">Query Federation</head><p>Only recently Clarck-Parsia announced their voiD support <ref type="foot" target="#foot_7">9</ref> : "There is a touch point with the linked data effort, which meant that the new voiD vocabulary for describing datasets turns out to be very useful for describing the distributed data sources that we query over, including their interrelations." Further, OpenLink plans to release its "Smart SPARQL Federation capabilities", based on voiD, soon.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.5">Middleware</head><p>OpenLink's Sponger Middleware uses voiD for generating linked data from non-RDF data sources such as HTML pages. An example<ref type="foot" target="#foot_8">10</ref> from http://linkeddata.uriburner. com/ with the voiD description deployed as XHTML+RDFa is shown in Fig. <ref type="figure" target="#fig_16">8</ref>. Further, the statistics maintenance in their Virtuoso Quad Store is performed based on voiD.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2">Potential Applications</head><p>We envision voiD to be applied in many scenarios, some of which we have identified earlier in section 2. Only recently, for example, we have started to develop a dataset ranking algorithm based on voiD descriptions; this is subject to more research. One could further apply voiD to DARQ (Distributed ARQ) <ref type="bibr" target="#b21">[22]</ref>.</p><p>A totally different application domain is visualisation: for example, "The Map of Data"<ref type="foot" target="#foot_9">11</ref> in Sindice can be generated automatically thanks to voiD.</p><p>Ultimately, to be of use, one wants applications that benefit from voiD. Put in other words, this means that, given there are applications that consume voiD and offer some added value, the incentive for publishers to provide voiD descriptions is self-evident. One such application could be a sort of dynamic dataset selector which, configured with a specification of the dataset (topics, license, interlinking with certain other datasets) would at run-time of an application discover and select appropriate datasets according to the search specification.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">RELATED WORK</head><p>To the best of our knowledge, no comparable approach to voiD exists. That is, in the context of the Web of Data, we are not aware of any specification that allows the description of datasets and their interlinking the way voiD does. However, we acknowledge previous work of Semantic Sitemaps <ref type="bibr" target="#b6">[7]</ref> and build upon it.</p><p>In the scope of the Web of Documents, we note that at the time of writing a W3C Working Draft of POWDER (Protocol for Web Description Resources) <ref type="bibr" target="#b2">[3]</ref> is available. POW-DER aims at providing information about Web resources, such as scope, authoritative information, etc., without retrieving the resources themselves. POWDER comes in two flavours, (i) as human-legible XML, and (ii) in an RDF version. It also provides a GRDDL transformation to turn the former into the latter. The descriptions can be applied to groups of resources defined via listing of URIs, regular expressions, etc. Several publishing methods are suggested (via HTML &lt;link&gt; in the header, HTTP Link: header or using XHTML+RDFa). Especially in the Web of Trust, POWDER is expected to play a vital role, though implementation complexity might hinder wide-spread adoption.</p><p>Further, OASIS's XRDS (eXtensible Resource Descriptor Sequence) <ref type="bibr" target="#b23">[24]</ref> is an XML format for metadata discovery about a resource. The discovery protocol for XRDS documents given a URI was defined in 2006 as part of Yadis, focusing on services such as OpenID and OAuth. In early 2008, XRDS-Simple was proposed <ref type="foot" target="#foot_10">12</ref> , but is now obsolete.</p><p>Only very recently, the latest draft of "/host-meta" <ref type="bibr" target="#b20">[21]</ref> was proposed. The core of this proposal is a single "wellknown location", /host-meta, acting as a directory of the interesting metadata about a Web site. The format allows different types of site metadata to be referenced by an URI or included inline.</p><p>One could understand the "HTTP Link: header" <ref type="bibr" target="#b19">[20]</ref> proposal related to voiD, as it also supports discovery, offering metadata about resources by resurrecting a (currently deprecated) feature of HTTP. This proposal is at the time of writing still under vivid discussion and not yet seen stable.</p><p>Regarding federated SPARQL queries, DARQ (Distributed ARQ) <ref type="bibr" target="#b21">[22]</ref> proposes so called "service descriptions" that are able to specify capabilities of a SPARQL endpoints. The service descriptions enable the DARQ query engine to decompose a query into sub-queries, each of which can be answered by an individual service using query rewriting and cost-based query optimisation to speed-up query execution. Further, we note an attempt called "SPARQL Endpoint Description"<ref type="foot" target="#foot_11">13</ref> that aimed to allow the announcement of endpoint capabilities and contents, support discovery through service directories, and supply browsing and federation hints. Both proposals seem to be not further maintained and/or have not reached wide-spread adoption.</p><p>Finally, we note that the W3C Technical Architecture Group (TAG) started to contemplate about "Uniform Access to Metadata"<ref type="foot" target="#foot_12">14</ref> , basically being a survey regarding the problem of specifying a uniform method for obtaining information pertaining to a resource without necessarily having to parse a representation of the resource.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">OUTLOOK</head><p>We have released the voiD vocabulary and voiD user guide to linked data communities in January this year. In this release, we have used the use cases presented in section 2 to guide the design scope of the voiD vocabulary. Supports for describing the quality, provenance and versions of linked datasets are to be addressed in the next release of voiD. Also, the statistics modelling in the current voiD model is still experimental. We are communicating with user communities and the SCOVO team in order to propose a more stable modelling in the coming release <ref type="foot" target="#foot_13">15</ref> . Additionally we will liaison with initiatives such as the "Ontology Metadata Vocabulary" <ref type="bibr" target="#b8">[9]</ref> sharing similar goals.</p><p>To test and evaluate the usefulness of voiD, we need tools that use voiD to support the discovery of datasets or the SPARQL query federation. Fortunately, semantic query engines like Sindice and SPARQL query processing systems (like OpenLink) are adopting voiD in their implementations. It is challenging to completely automate the creation of voiD descriptions. We need tools like the NX parser to take as much as possible of the heavy lifting for non-technical data publishers as possible.     </p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Interlinking modelling in voiD.</figDesc><graphic coords="3,76.17,612.86,191.28,96.46" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head></head><label></label><figDesc>(a) Classic LOD case: Describing the :DBpedia dataset and its contained :DBpedia2DBLP linkset (b) 3rd-party case: Describing the stand-alone :DBpedia2DBLP linkset</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Interlinking regarding authoritative datasets.</figDesc><graphic coords="4,77.71,259.10,191.29,177.90" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head></head><label></label><figDesc>(a) Non-directed case. (b) Directed case.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Interlinking regarding direction.</figDesc><graphic coords="4,328.77,202.32,215.20,136.29" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>6 7 :</head><label>7</label><figDesc>DBpedia a void : Dataset ; 8 foaf : homepage &lt; http :// dbpedia . org / &gt; ; 9 void : subset : DBpedia2DBLP .</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_6"><head>10 11 :</head><label>11</label><figDesc>DBLP a void : Dataset ; 12 foaf : homepage &lt; http :// dblp . l3s . de / d2r / &gt; ; 13 dc : subject dbp : C o m p u t e r _ s c ie n c e ; 14 dc : subject dbp : Journal ; 15 dc : subject dbp : Proceedings .</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_7"><head>3 ? 5 ? 7 } 2 :</head><label>3572</label><figDesc>dataset a void : Dataset ; 4 dcterms : subject dbp : C o m p u t e r _s c i e n c e . linkset void : s ubjectsT arget : DBpedia ; 6 void : objectsTarget ? dataset .Listing An exemplary query on voiD.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_8"><head>1 :</head><label>1</label><figDesc>DBpedia a void : Dataset ; 2 dcterms : license &lt; http :// www . gnu . org / c o p y l e f t / fdl . html &gt; .</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_9"><head></head><label></label><figDesc>void : n u m b e r O f R e s o u r c es ; 5 scovo : dimension foaf : Person ; 6 dcterms : source &lt; http :// wiki . dbpedia . org / &gt; ; 7 ] .</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_10"><head>1 :</head><label>1</label><figDesc>DBpedia void : sparqlEn dpoint &lt; http :// dbpedia . org / sparql &gt; ; 2 void : feature [ dcterms : format " application / rdf + xml " ; ] ; 3 void : vocabulary &lt; http :// xmlns . com / foaf /0.1/ &gt; .</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_11"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: The voiD discovery-via-sitemaps process.</figDesc><graphic coords="6,328.77,53.80,215.18,153.52" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_12"><head>Figure 5 :</head><label>5</label><figDesc>Figure 5: Manual creation of voiD descriptions with ve.</figDesc><graphic coords="9,89.66,73.16,430.38,246.15" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_13"><head></head><label></label><figDesc>(a) Looking-up datasets. (b) Browsing datasets.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_14"><head>Figure 6 :</head><label>6</label><figDesc>Figure 6: Linked Datasets Explorer (LDE).</figDesc><graphic coords="9,309.46,388.77,215.19,271.58" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_15"><head>Figure 7 :</head><label>7</label><figDesc>Figure 7: Visualisation of RKB interlinking.</figDesc><graphic coords="10,125.53,86.30,358.65,311.36" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_16"><head>Figure 8 :</head><label>8</label><figDesc>Figure 8: OpenLink's instant-voiD-generator for structured HTML pages.</figDesc><graphic coords="10,125.53,492.81,358.66,169.82" type="bitmap" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_0">LOD ... Linking Open Data datasets, see http://esw. w3.org/topic/SweoIG/TaskForces/CommunityProjects/ LinkingOpenData</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_1">http://purl.org/NET/scovo</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_2">&lt; http :// dbpedia . org / data / Berlin &gt; dcterms : isPartOf : DBpedia .</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_3">http://rdfs.org/ns/void-guide#sec_4_3_Publishing_ tools</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_4">http://sw.deri.org/2006/08/nxparser/release/ nxparser-1.1.jar</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_5">http://ld2sd.deri.org/lde/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_6">http://void.rkbexplorer.com/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="9" xml:id="foot_7">http://clarkparsia.com/weblog/2009/02/04/ distributed-query-pellet-into-the-void/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="10" xml:id="foot_8">http://linkeddata.uriburner.com/about/html/http: //twitter.com/mhausenblas#Dataset</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="11" xml:id="foot_9">http://sindice.com/map</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="12" xml:id="foot_10">http://www.hueniverse.com/hueniverse/2008/03/ putting-xrds-si.html</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="13" xml:id="foot_11">http://esw.w3.org/topic/SparqlEndpointDescription</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="14" xml:id="foot_12">http://www.w3.org/2001/tag/doc/ uniform-access-20090205.html</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="15" xml:id="foot_13">See http://code.google.com/p/void-impl/issues/ list?can=2&amp;q=milestone:Release2.0forplannedissues.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="16" xml:id="foot_14">http://www.ict-romulus.eu/</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Acknowledgements Our work has partly been supported by the European Commission under Grant No. 217031, FP7/ICT-2007.1.2, project "Domain Driven Design and Mashup Oriented Development based on Open Source Java Metaframework for Pragmatic, Reliable and Secure Web Development" (Romulus) <ref type="bibr" target="#b14">16</ref> , and the Joint Information Systems Committee [Project "Fly-Web"]. The authors would further like to thank (alphabetically): Orri Erling, Hugh Glaser, Olaf Hartig, Tom Heath, Andreas Langegger, Ian Millard, Marc-Alexandre Nolin, Yves Raimond, Yrjänä Rankka, Francois Scharffe, and Giovanni Tummarello.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">voiD guide-Using the Vocabulary of Interlinked Datasets</title>
		<author>
			<persName><forename type="first">K</forename><surname>Alexander</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Cyganiak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hausenblas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Zhao</surname></persName>
		</author>
		<ptr target="http://rdfs.org/ns/void-guide/" />
	</analytic>
	<monogr>
		<title level="m">Community Draft, voiD working group</title>
				<imprint>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">voiD, the &quot;Vocabulary of Interlinked Datasets</title>
		<author>
			<persName><forename type="first">K</forename><surname>Alexander</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Cyganiak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hausenblas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Zhao</surname></persName>
		</author>
		<ptr target="http://rdfs.org/ns/void/" />
	</analytic>
	<monogr>
		<title level="m">Community Draft, voiD working group</title>
				<imprint>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Protocol for Web Description Resources (POWDER): Description Resources</title>
		<author>
			<persName><forename type="first">P</forename><surname>Archer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Smith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Perego</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">W3C Working Draft 14</title>
				<imprint>
			<publisher>POWDER Working Group</publisher>
			<date type="published" when="2008-11">November 2008. 2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Tabulator: Exploring and analyzing linked data on the Semantic Web</title>
		<author>
			<persName><forename type="first">T</forename><surname>Berners-Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Chilton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Connolly</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Dhanaraj</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hollenbach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Lerer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Sheets</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 3rd International Semantic Web User Interaction Workshop (SWUI06)</title>
				<meeting>the 3rd International Semantic Web User Interaction Workshop (SWUI06)<address><addrLine>Athens, Georgia, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Linked Data on the Web (LDOW2008)</title>
		<author>
			<persName><forename type="first">C</forename><surname>Bizer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Heath</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Idehen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Berners-Lee</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Linked Data on the Web Workshop(WWW2008)</title>
				<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">The Distributed Link Service: A tool for publishers, authors and readers</title>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">A</forename><surname>Carr</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">C</forename><surname>Deroure</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Hall</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">J</forename><surname>Hill</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 4th International World Wide Web Conference: The Web Revolution)</title>
				<meeting>the 4th International World Wide Web Conference: The Web Revolution)<address><addrLine>Boston, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="1995">1995</date>
			<biblScope unit="page" from="647" to="656" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Semantic Sitemaps: Efficient and flexible access to datasets on the Semantic Web</title>
		<author>
			<persName><forename type="first">R</forename><surname>Cyganiak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Stenzhorn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Delbru</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Decker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Tummarello</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 5th European Semantic Web Conference</title>
				<meeting>the 5th European Semantic Web Conference<address><addrLine>Tenerife, Spain</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2008">2008</date>
			<biblScope unit="volume">5021</biblScope>
			<biblScope unit="page" from="690" to="704" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">The SPARQL query graph model for query optimization</title>
		<author>
			<persName><forename type="first">O</forename><surname>Hartig</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Heese</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 4th European Semantic Web Conference 2007</title>
				<meeting>the 4th European Semantic Web Conference 2007<address><addrLine>Innsbruck, Austria</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2007">2007</date>
			<biblScope unit="page" from="564" to="578" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">OMV -Ontology Metadata Vocabulary</title>
		<author>
			<persName><forename type="first">J</forename><surname>Hartmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Sure</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Haase</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Palma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Del</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Carmen</forename><surname>Suárez-Figueroa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ISWC 2005 -In Ontology Patterns for the Semantic Web</title>
				<editor>
			<persName><forename type="first">C</forename><surname>Welty</surname></persName>
		</editor>
		<imprint>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Discovery and usage of linked datasets on the Web of data</title>
		<author>
			<persName><forename type="first">M</forename><surname>Hausenblas</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Talis NodMag</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">SCOVO: Using statistics on the Web of data</title>
		<author>
			<persName><forename type="first">M</forename><surname>Hausenblas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Halb</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Raimond</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Feigenbaum</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Ayers</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">6th European Semantic Web Conference (ESWC2009), Semantic Web in Use Track</title>
				<imprint>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">What is the size of the Semantic Web</title>
		<author>
			<persName><forename type="first">M</forename><surname>Hausenblas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Halb</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Raimond</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Heath</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of I-Semantics 2008</title>
				<meeting>I-Semantics 2008<address><addrLine>Graz, Austria</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">A performance and scalability metric for virtual RDF graphs</title>
		<author>
			<persName><forename type="first">M</forename><surname>Hausenblas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Slany</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Ayers</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">3 rd Workshop on Scripting for the Semantic Web (SFSW07)</title>
				<meeting><address><addrLine>Innsbruck, Austria</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Architecture of the World Wide Web</title>
		<author>
			<persName><forename type="first">I</forename><surname>Jacobs</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Walsh</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">W3C Recommendation</title>
				<imprint>
			<date type="published" when="2004-12">December 2004. 2004</date>
			<biblScope unit="volume">15</biblScope>
		</imprint>
		<respStmt>
			<orgName>W3C Technical Architecture Group (TAG</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Data publication in the open access initiative</title>
		<author>
			<persName><forename type="first">J</forename><surname>Klump</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Bertelmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Brase</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Diepenbroek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Grobe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Höck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lautenschlager</surname></persName>
		</author>
		<author>
			<persName><forename type="first">U</forename><surname>Schindler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Sens</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wächter</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Data Science Journal</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page" from="79" to="83" />
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">RDF/XML Syntax Specification</title>
		<author>
			<persName><forename type="first">G</forename><surname>Klyne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">J</forename><surname>Carroll</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Mcbride</surname></persName>
		</author>
		<imprint>
			<publisher>W3C</publisher>
		</imprint>
	</monogr>
	<note>Revised</note>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<title level="m">RDF Core Working Group</title>
				<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Microsearch: An interface for semantic search</title>
		<author>
			<persName><forename type="first">P</forename><surname>Mika</surname></persName>
		</author>
		<ptr target="CEUR-WS.org" />
	</analytic>
	<monogr>
		<title level="m">Semantic Search, International Workshop located at the 5th European Semamntic Web Conference (ESWC 2008)</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<imprint>
			<date type="published" when="2008">2008</date>
			<biblScope unit="volume">334</biblScope>
			<biblScope unit="page" from="79" to="88" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Open data commons, a license for open data</title>
		<author>
			<persName><forename type="first">P</forename><surname>Miller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Styles</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Heath</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Workshop on Linked Data on the Web (WWW2008)</title>
				<meeting>the Workshop on Linked Data on the Web (WWW2008)</meeting>
		<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<title level="m" type="main">Link relations and HTTP header linking</title>
		<author>
			<persName><forename type="first">M</forename><surname>Nottingham</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2008-12-01">1 December 2008. 2008</date>
			<publisher>IETF Network Working Group</publisher>
		</imprint>
	</monogr>
	<note>Internet-Draft</note>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<title level="m" type="main">Host metadata for the Web</title>
		<author>
			<persName><forename type="first">M</forename><surname>Nottingham</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Hammer-Lahav</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2009-02-10">10 February 2009. 2009</date>
			<publisher>IETF Network Working Group</publisher>
		</imprint>
	</monogr>
	<note>Internet-Draft</note>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Querying distributed RDF data sources with SPARQL</title>
		<author>
			<persName><forename type="first">B</forename><surname>Quilitz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">U</forename><surname>Leser</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 5th European Semantic Web Conference</title>
				<meeting>the 5th European Semantic Web Conference</meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2008">2008. 2008</date>
			<biblScope unit="page" from="524" to="538" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Sindice. com: Weaving the open linked data</title>
		<author>
			<persName><forename type="first">G</forename><surname>Tummarello</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Delbru</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Oren</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 6th International Semantic Web Conference</title>
				<meeting>the 6th International Semantic Web Conference</meeting>
		<imprint>
			<date type="published" when="2007">2007. ISWC2007. 2007</date>
			<biblScope unit="volume">4825</biblScope>
			<biblScope unit="page" from="552" to="565" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Extensible Resource Identifier (XRI) Resolution Version 2.0</title>
		<author>
			<persName><forename type="first">G</forename><surname>Wachob</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Reed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Chasen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Tan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Churchill</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">OASIS eXtensible Resource Identifier (XRI) TC</title>
				<imprint>
			<date type="published" when="2008-02">February 2008. 2008</date>
		</imprint>
	</monogr>
	<note>Committee Draft 03 28</note>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Information Science, and Technology. The Dublin Core: A simple content description model for electronic resources</title>
		<author>
			<persName><forename type="first">S</forename><surname>Weibel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">S</forename><surname>For</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Bulletin of the American Society for Information Science and Technology</title>
		<imprint>
			<biblScope unit="volume">24</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="9" to="11" />
			<date type="published" when="1997">1997</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Linked data and provenance in biological data webs</title>
		<author>
			<persName><forename type="first">J</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Miles</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Klyne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Shotton</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Briefings in Bioinformatics</title>
		<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
