<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Dataset Dynamics Compendium: A Comparative Study</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Jürgen</forename><surname>Umbrich</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Digital Enterprise Research Institute</orgName>
								<orgName type="institution">National University of Ireland</orgName>
								<address>
									<settlement>Galway</settlement>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Boris</forename><surname>Villazón-Terrazas</surname></persName>
							<affiliation key="aff1">
								<orgName type="department" key="dep1">OEG-DIA</orgName>
								<orgName type="department" key="dep2">Facultad de Informática</orgName>
								<orgName type="institution">Universidad Politécnica de Madrid</orgName>
								<address>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Michael</forename><surname>Hausenblas</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Digital Enterprise Research Institute</orgName>
								<orgName type="institution">National University of Ireland</orgName>
								<address>
									<settlement>Galway</settlement>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Dataset Dynamics Compendium: A Comparative Study</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">F0942D6EF5B93A59C6FC254323BB6FA7</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T18:29+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>At the time of writing there exists no consensus about the approaches to detect, propagate and describe changes in resources and datasets of the Linked Open Data Web. This survey gives a comprehensive overview of the current technical solutions and a comparison of such based requirements we derived from use cases the community came up with. We give a detailed overview about the aspects of discovery, granularity level, and description of the changes, as well as the detection algorithms and notification mechanisms. Moreover, we present a high-level dataset dynamics stack that integrates the current technical solutions for dealing with changes in datasets of the Web of Data.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>1 On the Importance of Dynamics of Linked Datasets So far, Linked Data principles and practices are being adopted by an increasing number of data providers, getting as result a global data space on the Web containing billions of RDF triples <ref type="bibr" target="#b7">[8]</ref>. However, there are still various research challenges that must be overcome. One particular research challenge is datasets dynamics, and the Linked Data community is realizing the importance of this research area in the last recent months.</p><p>Although, there are some efforts to solve this problem in databases, e.g. <ref type="bibr" target="#b26">[28]</ref>, <ref type="bibr" target="#b17">[18]</ref>, and <ref type="bibr" target="#b5">[6]</ref>, among others; they do not cover all the aspects of the Linked Data datasets. This is because linked data can be classified as a self-organising ecosystem, i.e. many units participate in a parallel and distributed manner by creating, publishing and interlinking information. Moreover, enterprises or organisations which are using the paradigm of integrating data by applying the Linked Data principles have to face and deal with the adhering dynamics.</p><p>The topic of dataset dynamics covers all kinds of aspects related to changes of and between Linked Data resources and datasets. This research area includes: <ref type="bibr" target="#b0">(1)</ref> the design of vocabularies to describe dynamic characteristics and changes of datasets, (2) the auto-discovery of those descriptions, (3) web-scale communication methods for the interaction between consumers and producers for different change granularity levels and (4) algorithms to compute efficiently deltas between two data snapshots.</p><p>The players participating in the Linked Data ecosystem are manifold; for instance, the research community, Web 2.0 portals, and Facebook among others. The research community around Linked Data and its efforts to provide tools for data publishers to share their data as Linked Data. The result of these efforts are several software libraries which convert information from arbitrary formats into RDF, for example: D2R Server<ref type="foot" target="#foot_0">3</ref> , XLWrap<ref type="foot" target="#foot_1">4</ref> , Any23<ref type="foot" target="#foot_2">5</ref> , etc.</p><p>Also, more and more Web 2.0 portals start to expose their data as Linked Data, for example widely used content management systems like Drupal 7 <ref type="foot" target="#foot_3">6</ref> and knowledge management system like SemWiki<ref type="foot" target="#foot_4">7</ref> , News portal and multimedia domains, such as the New York Times or BBC, publish their articles and programs according to the principles of Linked Data. Moreover, companies adapt to describe their products using the GoodRelation ontology <ref type="foot" target="#foot_5">8</ref> which leads to better search results for the major search engines like Google or Yahoo. Governments contribute by integrating their data into the LOD cloud. All these players continuously contribute information to the LOD cloud.</p><p>The data producers and consumers put a tremendous effort into the interlinking of the valuable information pieces. Furthermore, data publishers continuously try to improve the quality of their data by changing vocabularies, adding new information and relations, or deleting obsolete ones.</p><p>Clearly, one can assume and even observe that Linked Data is very dynamic. However, at the time of writing there is no solid solution nor a clear research direction of the big picture of the problem. There are a number of use cases, derived requirements and proposals as we will show in this survey. Nevertheless, we can clearly state that none of the available proposals solve the problem of handling and communicating dataset dynamics in a sufficient way.</p><p>The contribution of this work can be summarised as follows: (1) a fundamental overview about the topic of dataset dynamics; (2) the presentation of use cases and requirements agreed by the Linked Data community; (3) a survey and comparison of proposals which are addressing the issues and partial solve them; and (4) an abstract dataset dynamics stack.</p><p>Moreover, it is worth to mention that dataset dynamics is important to provide an "efficient" consumption of Linked Data through the discovery, synchronisation, caching, and linkage of the datasets.</p><p>The remainder of this paper is organised as follow: Section 2 reviews some existing solutions and Section 3 presents identified use cases. Next, Section 4 discusses requirements derived from the use cases and depicts the high-level architecture of the solution. Then, Section 5 describes the vocabularies and mechanisms that cover the requirements identified. Finally, Section 6 provides some conclusions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Existing Deployed Systems</head><p>In this section we review some deployed systems that deal with the dataset dynamics. These systems are the result of preliminary efforts to solve the identified problems of handling and communicating dataset dynamics.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Sitemap Protocol</head><p>Website crawling can be made more efficient and predictable by using the Sitemap Protocol [21], originally developed by Google and now supported by all major search engines, as well as data search engines such as Sindice <ref type="bibr" target="#b22">[24]</ref>. It consists of a sitemap.xml file that is usually placed in the website root directory and contains a list of all the URLs to be crawled. The Sitemap protocol format consists of XML tags and it defines several elements, being the most importance in our context:</p><p>url, entry for each URL, the remaining elements are children of this.</p><p>changefreq, which defines how frequently the page is likely to change. This value provides general information to search engines and may not correlate exactly to how often they crawl the page. The sitemap will be fetched with the highest frequency indicated by the URLs contained in it. Because of this, it can save bandwidth if terms with the same change frequency are grouped into separate sitemaps.</p><p>lastmod, which represents the date of last modification of the file. This date should be in W3C Datetime<ref type="foot" target="#foot_6">9</ref> format. For example, Sindice uses the lastmod element to decide if the given URL has to be re-indexed or not. This is the most important information, as it can reduce the number of requests Sindice will to make to a specific site.</p><p>Thanks to the Sitemap protocol, websites that publish RDF datasets are ready for effective discovery and synchronization.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">DBpedia Live</head><p>DBpedia is community effort to extract information from Wikipedia and to make this information available on the Web. Nevertheless, a manual effort is necessary to produce a new release and the extracted information is not up-to-date. DBpedia Live <ref type="bibr" target="#b8">[9]</ref> is an extension of DBpedia, and is created to tackle the challenging problem of processing tens of thousands of changes per day in order to consume the constant stream of Wikipedia updates. Basically the DBpedia Live provides (1) up-to-date information and (2) a mechanism for allowing the Wikipedia community to maintain the DBpedia ontology collaboratively. The DBpedia Live framework consists of the following main components:</p><p>-PageCollections. Abstractions of local or remote resources of Wikipedia articles.</p><p>-Destinations. They store extracted RDF triples.</p><p>-Extractors. These extractors convert a specific type of wiki markup into triples.</p><p>-Parsers. These components help the extractors by identifying datatypes, and converting its corresponding values. -ExtractionJobs. They consist of a page collection, extractors, and a destination.</p><p>-Extraction Manager. This component manages the process of passing Wikipedia articles to the extractors and delivers their output to the destination.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3">PubSubHubbub</head><p>PubSubHubbub <ref type="bibr" target="#b15">[16]</ref> is a decentralized real-time Web protocol that delivers data to subscribers the moment it becomes available. This protocol extends the Atom <ref type="bibr" target="#b16">[17]</ref> and RSS <ref type="bibr" target="#b18">[19]</ref> protocols for data feeds, basically it turns Atom and RSS feeds into real-time streams. In a nutshell this protocol has three main participants:</p><p>-Publisher, a owner of a topic. It notifies the hub when the topic feed has been updated. -Hub, a server which implements both sides of this protocol. There are some public hubs, for example: App Engine<ref type="foot" target="#foot_7">10</ref> , Superfeedr<ref type="foot" target="#foot_8">11</ref> , and RabbitHub<ref type="foot" target="#foot_9">12</ref> -Subscriber, an entity (program or human being) that wants to be notified of changes on a topic.</p><p>Next, we present a very simple example that shows how it works: 1. A blogger or content creator creates a feed and specifies a hub. 2. A consumer subscribes to the blog using the RSS feed in the normal way.</p><p>3. New content is created and the source pings the hub saying "i have new content!" 4. The hub in turn "fat pings" the subscribers saying "Hey, the blog has new content, here it is!"</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4">SparqlPuSH</head><p>The goal of sparqlPuSH <ref type="bibr" target="#b20">[22]</ref> is to enable proactive notification of changes happening in RDF stores, whatever they deal with: new data of a particular type being added, updated statements about a given resource, etc. To this end, sparqlPuSH relies on the aforementioned PubSubHubbub protocol to broadcast these updates.</p><p>In a nutshell the sparqlPuSH consists in the following steps: 1. to register the SPARQL queries related to the updates that must be monitored in a RDF store, 2. to broadcast changes when data mapped to these queries is updated in the store.</p><p>Moreover, it can be used as an interface on the top of any SPARQL endpoint and also comes with an ARC2<ref type="foot" target="#foot_10">13</ref> interface. Finally, this push approach can become a default model in various RDF store implementations, enabling more capabilities to monitor, in real-time, changes related to the RDF data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Use Cases</head><p>We present the uses cases which are collected from the breakout session of the W3C LOD Track at WWW 2010 <ref type="foot" target="#foot_11">14</ref> . Overall, we identified four general use cases which we describe in detail in the remainder. The use cases are ordered by increasing complexity to handle and process changes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Use Cases UC 1: Synchronisation</head><p>A dataset consumer wants to mirror or replicate (parts of) a dataset from the LOD cloud. Ideally, the consumer wants to be informed about the statements that have been added/removed at a certain time point. The notification about the changes enables him an efficient way to keep his dataset up-to-date. Most commonly, the synchronisation is either for a single data source or for a set of data sources.</p><p>Real World Example. The semantic web index, Sindice, wants to keep its index always updated with the current version of the available LOD datasets, for instance DBPedia. A notification from the DBpedia publishers enables the index systems to decide at which time they want to update their index, instead of actively checking periodically if there exists a new version of a dataset dump. Ideally, Sindice would get a notification about a change of a dataset in general, and can request details about the changes to decide if it is necessary to perform the update immediately or at late point in time.</p><p>Requirements. The requirements to fulfill this use case are: The dataset publisher needs adynamic description [DD] of the dataset which allows consumers to learn about high-level dynamics and the communication mechanism [CM] to learn about new changes. Further, the consumer has to be able to learn about this description by using a discovery mechanism <ref type="bibr">[DM]</ref>. The publisher has to provide the change description [CD] which contains machine readable and understandable information about what and how much has changed. Finally, all the above requirements should be compliant with the Architecture of the Web <ref type="bibr" target="#b9">[10]</ref> [CW] and have to scale to the size of the Web [SW].</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Use Cases UC 2: Smart Caching</head><p>A developer uses one or more datasets from the LOD cloud in her application(s). Rather than implementing custom-code for keeping the local data in the application up-to-date (HTTP-level, dataset-level), the developer wants to use a smart cache that offers the functionality as required (e.g., 304/HTTP-level for small set of resources, notification for bulk-updates).</p><p>Real World Example. The execution of SPARQL queries directly over the LOD Web guarantees: on the one hand always fresh results, but is, on the other hand, very time and resource consuming. Each query triple pattern is executed directly over the dereferenced content of the URI constants in the triple patterns. The integration of a smart cache into the query processor could significantly increase the performance of such a system. The smart cache could be able to store statements or the whole content of resources which are rather very dynamic and frequently appear in queries. Such statements could be rdf:type statements.</p><p>Requirements. The requirements to fulfill this use case are the same as for UC1 with additionally: The dynamic and change descriptions of the datasets have to contain information for different granularity levels [GL], e.g., information about changes at a statement, source or dataset level.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3">Use Cases UC 3: Link Maintenance</head><p>In many scenarios, we can integrate information from various datasets by using or creating links between the these datasets. A crucial point for any application which relies on these links is the problem that links can change or resources can disappear or moved. The application should be able to learn if a link type changed (e.g. the relationship between two instances was refined) or if the linked resources vanished or its identifier changed.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Real World Example.</head><p>A music website enriches the information it provides about bands (for example http://dbpedia.org/resource/Green_Day) and artists with multilingual biography information retrieved from DBpedia. The website keeps the retrieved biographies in a local cache and updates them in regular intervals (e.g., once a day). Additionally it exposes its local information as Linked Data on the Web not including the cached biographies, but links to the corresponding DBpedia resources. The web application must be informed whenever the dependent resource (a representation of it) at DBpedia changes. Especially, if the resource is updated, deleted, or moved to another URI location (http://dbpedia.org/resource/Green_ Day_(band)). If the web application is unaware of the remote changes it risks to (i) loose the ability to updated its cached biographies if remote resource become unavailable or (ii) expose dead or semantically invalid links in its local linked dataset. Additionally, it would be desirable to be informed about new artists and band biographies becoming available in the course of time.</p><p>Requirements. The requirements to fulfill this use case are the requirements from UC1 and UC2 -that is the explicit representation of the dynamics and changes (CD &amp; DD) and the ability to discover the descriptions (DM) and which mechanism is used to communicate the changes (CM) in different granularity levels (GL). Further, for such a use cases the methods should be compliant to the architecture of the WWW and be scalable (CW &amp; SW).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4">Use Cases UC 4: Vocabulary Evolution and Versioning</head><p>A given LOD dataset contains a set of resources that conform to a particular vocabulary. In other words, the vocabulary provides classes and properties for expressing the data of the dataset. Whenever the vocabulary changes (evolves), i.e. a new version of the vocabulary is available, there has to be some support for the propagation of the vocabulary/ontology changes to the dataset. The resources of the dataset have to be updated, and in this way they will be conformed to the new version of the vocabulary.</p><p>Real World Example. The FOAF ontology <ref type="foot" target="#foot_12">15</ref> provides classes and properties for expressing some DBpedia resources. Last January, a new version of foaf ontology was released. This new version updates some properties, e.g., foaf:givenname to foaf:givenName. A notification of the new version of the ontology, with its changes, is sent to the related datasets, including DBpedia. Each dataset will decide when perform the propagation of those changes.</p><p>Requirements. In this use case the requirements to fulfill are the same of UC1, but taking into account the ontology/vocabulary as well: (DD) the explicit representation of the ontology/vocabulary changes, (DM &amp; CM) the notification and change propagation approaches that allow to discover and register/subscribe the changes of the vocabulary/ontology, and (GL) the selection of the right granularity level of the changes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Dataset Dynamics Requirements</head><p>We identified already the core requirements to solve our presented use cases, clustered into: description, mechanism to discover and communicate changes for certain granularity levels. Further, dealing with Linked Data according to the four principles puts also some architectural requirements in place.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Description [DD &amp; CD]</head><p>Dealing with dataset dynamics, we clearly need a way to describe that a dataset is dynamic and how a data consumer can learn about that something has changed and, in addition, what has changed. The descriptions should be machine readable and even more important understandable -that is that the description should use the same set of RDF vocabularies. Further, the descriptions should contain the important attributes: 1) general information about the expected frequency of changes 2) information and pointers to notification mechanisms. In addition, the following optional and welcomed attributes are: 3) information about the average change volume and 4) the type of change,e.g., most of changes are updates of available information or the add of information.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Granularity Levels [GL]</head><p>The majority of our use cases need or could benefit from different granularity levels by how changes are detected and communicated. Certain applications require change notifications on the statement level (e.g. the link type or the object value changed), whereas for others it is sufficient enough to know that there appeared a change in a dataset. For coarse grained levels it might be necessary to also know about the change fraction; e.g. 50% of the dataset changed. In more detail, we identified the necessity for the following change levels:</p><p>Mandatory Levels d Dataset level -the dataset DS changed r Resource level -the resource URI changed s Statement level -the statement (URI URI VALUE ) was deleted or added (this covers also update operations) Optional Levels g Graph Structural Level -e.g. adding removing named graphs which can be collections of statements from different sources</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Communication Mechanism [CM]</head><p>We need scalable mechanisms to communicate the change event and the changes itself. Data consumers should be able to either actively check (pull) or listen (push) for such events and the change descriptions. A typical publisher/subscriber mechanism with different communication channels would be ideal. There should be a communication channel that contains information that something has changed and how much has changed. Another grouped of channels should then contain detailed information about the changes for different granularity levels. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Abstract Dataset Dynamics Stack</head><p>Next, we introduce the abstract and high-level dataset dynamics architecture and technology stack as depict in Figure <ref type="figure" target="#fig_0">1</ref>. We have datasets which are undergoing changes as we already motivated above. This information is consumed by software agents and humans, representative examples of such producer/consumer interactions are motivated in our use cases. The architecture stack exists of two description layers-both on the dataset and consumer sideand communication mechanisms which enables the interaction between consumers and dataset providers. A dataset needs to describe its general dynamic attributes (e.g., the average expected change frequency) and how a consumer can learn about occurring changes (cf. dynamic semantics in Figure <ref type="figure" target="#fig_0">1</ref>). The consumer, on the other hand, needs to learn and understand that a dataset has undergoing a change and what exactly has changed (cf. change semantics in Figure <ref type="figure" target="#fig_0">1</ref>). Further, we can see that we have to deal with many-to-many relationship between consumers and datasets which requires that the communication mechanisms needs to be highly scalable on both sides. We can expect to have potentially millions of consumers and thousands of datasets, millions of resources and billions of triples. So each dataset can be consumed by millions of agents and each agent can consume thousands if not millions of information pieces. In addition, the communication mechanisms should be compliant to the architecture of the WWW which means concrete solutions have to make use of URIs and the HTTP protocol layer. We deliberately omitted here the discovery layer which allows consumers to automatically locate the resource dynamic description of a dataset. The aspects of resource description discovery is surveyed in <ref type="bibr" target="#b23">[25]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Survey of Approaches</head><p>In this section we describe the most significant approaches that treats the aspects related to datasets dynamics, taking into account the identified requirements in Section 4. We have grouped the approaches into the two identified layers of Figure <ref type="figure" target="#fig_0">1</ref>, descriptions and communication mechanisms.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1">Descriptions</head><p>There are several approaches for describing the dataset dynamics and the semantics of changes in a dataset. Next, we present the most representative approaches we have found in the literature. The summary of this survey and the comparison with the requirements are listed in Table <ref type="table">1</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Descriptions</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Dynamic description</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Change description</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Granularity levels</head><formula xml:id="formula_0">DaDy √ - - DSNotify Eventset Vocabulary √ √ s Talis Changeset Vocabulary √ √ r OWL 2 change ontology √ √ d,r CHAO √ √ r</formula><p>Table <ref type="table">1</ref>: Summary of description proposals and their coverage of the requirements. Granularity levels: dataset (d), resource (r) and statement (s) level.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Dady</head><p>The Dataset Dynamics Vocabulary, DaDy <ref type="bibr" target="#b2">[3]</ref>, can represent information about the regularity (regular, irregular) and frequency (no, low, mid, high) of updates and provide a link to the update notification source URI. It is designed to be used with voiD <ref type="bibr" target="#b1">[2]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>DSNotify Eventset</head><p>The DSNotify Eventset Vocabulary <ref type="bibr" target="#b3">[4]</ref> [12], is a vocabulary for change events in linked data sources, and can be used to describe timely-ordered sets of events that modify resources in linked data resources. Eventsets<ref type="foot" target="#foot_13">16</ref> are associated with two voiD:Datasets a source and a target dataset.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Talis Changeset</head><p>The Talis Changeset Vocabulary <ref type="bibr" target="#b4">[5]</ref> defines a set of terms for describing changes to resource descriptions. In the context of this vocabulary, a resource description is the set of triples that in some way include a description of a resource. Moreover, the vocabulary introduces the notion of a ChangeSet which encapsulates the delta between two versions of a resource description. The delta is represented by two set of triples: additions and removals. A ChangeSet can be used to modify a resource description by first removing all triples from the description that are in the removals set and adding the triples in the additions set.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>OWL 2 change ontology</head><p>The OWL 2 change ontology <ref type="bibr" target="#b0">[1]</ref> is a fined-grained taxonomy of ontology changes that considers the lowest-level atomic operations that can be performed in an ontology, but also on other abstraction levels, for example: atomic, entity and composite changes. This ontology allows to describe on a fine grained level how an ontology has changed from one version to another.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>CHAO</head><p>The Change and Annotations Ontology, CHAO <ref type="bibr" target="#b12">[13]</ref>, represents ontology changes within the ontology-evolution system of Protégé. The ontology consists of two parts. The basis is an ontology of basic change operations and there is an extension that defines complex change operations. Instances of these ontologies record information about chances including meta information about them, e.g., author, timestamp, annotations, etc.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2">Communication Mechanisms</head><p>The set of communication mechanisms include discovery mechanism, communication protocols and delta computations. As mentioned earlier, we exclude the discovery and delta computation mechanisms from this survey since they deserve a comprehensive survey on its own. The summary of this survey and the comparison with the requirements are listed in Table <ref type="table" target="#tab_0">2</ref>. Current approaches for the communication between dataset consumers and publisher can be split into two kinds of notification mechanisms: 1) a pull mechanism (e.g., feed subscription, web crawlers or monitored queries) and 2) a push mechanism which are mainly implemented by a publisher/subscriber model. A detailed comparison about push vs. pull mechanisms is given by Bhide et.al <ref type="bibr" target="#b6">[7]</ref>. Further, the authors propose a combination of a push and pull based approach as an nearly optimal solution for the communication process (cf. PubSubHubbub). A hybrid push and pull approach seems to be able to establish a stable system which can deal with arbitrarily large numbers of subscribers and changes and rapid changes of the dataset.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Mechanisms</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Pull Based Approaches</head><p>In short, Pull based approaches have to deal with a large communication overhead for the messages exchanged and a large number of clients and further, have problems to deal with rapidly changing data. Atom Atom <ref type="bibr" target="#b16">[17]</ref> is an XML-based Web content and metadata syndication format, and an application-level protocol for publishing and editing Web resources belonging to periodically updated websites. Atom is a relatively recent spec and is much more robust and feature-rich than RSS <ref type="bibr" target="#b16">[17]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>SDShare</head><p>The protocol for the Syndication of Semantic Descriptions, SDShare <ref type="bibr" target="#b19">[20]</ref>, defines how a RESTful web service can publish a series of web accessible feeds that describe snapshots and changes to collections of semantic descriptions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>OAI-PMH</head><p>The Open Archives Initiative Protocol for Metadata Harvesting, OAI-PMH <ref type="bibr" target="#b13">[14]</ref>, provides an application-independent interoperability framework based on metadata harvesting. There are two classes of participants in the OAI-PMH framework: 1) Data Providers administer systems that support the OAI-PMH as a means of exposing metadata; and 2) Service Providers use metadata harvested via the OAI-PMH as a basis for building value-added services.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>RDFSync</head><p>RDFsync <ref type="bibr" target="#b10">[11]</ref> is an approach for the efficient synchronization of RDF models. Because of the RDF semantics, RDF models cannot be efficiently synchonized by the rsync<ref type="foot" target="#foot_14">17</ref> or similar algorithms. RDFSync is based on the decomposition of a model into Minimum Self-Contained graphs (MSGs).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Push Based Approaches</head><p>Basically, push based mechanisms can deal very efficiently with high frequently changing data, but on the other side these push mechanisms have to maintain the list subscriber and the states of open connections which can cause also scalability problems. Another problem is that the notification messages can be potentially very large (especially if a client requests changes on a statement level). PingTheSemanticWeb PingTheSemanticWeb <ref type="bibr" target="#b14">[15]</ref> is a web service archiving the location of recently created/updated RDF documents on the Web. If one of those documents is created or updated, its author can notify PTSW that the document has been created or updated by pinging the service with the URL of the document.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Semantic Pingback</head><p>Semantic Pinback <ref type="bibr" target="#b21">[23]</ref> tackles the quality, timeliness and coherence as well as direct end user benefits of the emerging Linked Data Web. Semantic Pingback extends the well-known Pingback method, which is technological cornerstone of the blogosphere. It is based on the advertising of an RPC service for propagating typed RDF links between Data Web resources.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>WebHooks</head><p>WebHooks <ref type="bibr" target="#b25">[27]</ref> are HTTP callbacks which uses HTTP POST operations to learn that something happens and for notifications. Clients just register a webhook to a URL and receive notifications whenever a event occurs. The publisher just needs to send a HTTP POST to the specific URL with the event description.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>SPARUL</head><p>The updated language for RDF graphs, Sparql/Updates <ref type="bibr" target="#b24">[26]</ref>, is able to express updates to an RDF store. It is intended to be an standard mechanism by which updates to a remote RDF store can be described, communicated and stored.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3">Comparison Overview</head><p>After having analyzed the existing descriptions and communication mechanisms, that treats the aspects related to datasets dynamics, we present the comparison of the existing research works according to the identified requirements described in Section 4.</p><p>Regarding the existing descriptions, we can state that: DD is covered by all the approaches; CD is covered by the Talis ChangeSet, the OWL 2 change ontology, and the Change and Annotations Ontology; GL is covered by (1) DSNotify Eventset Vocabulary at statement level, (2) Talis ChangeSet at the resource description level, (3) OWL 2 change ontology at entity and composite changes, and (4) Change and Annotations Ontology, at basic and complex operations. Regarding the communication mechanisms, we can say that most fall into either push-based or pull-based approaches, with only few supporting both. However, the communication mechanisms are not mature enough.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Conclusion</head><p>The importance of this work is motivated by the problematic of web-scale handling dataset dynamics; the discovery of the description of the dynamic of a dataset, how a data consumer can learn about changes and the actual protocol to communicate changes and methods to efficiently compute changes between two versions of a dataset. This survey encompasses that there exists not a clear and solid solution to solve the highlighted problems of discovering, describing and communicating the change dynamics of Linked Data (re)sources.</p><p>We compared a number of deployed systems and technologies concerning requirements we derived from real world use cases the community came up with. Further, we introduced an abstract dataset dynamics stack, which provides the community with a framework and can potentially serve as a basis for further work into benchmarking and comparing solutions on a wider range.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>DiscoveryFig. 1 :</head><label>1</label><figDesc>Fig. 1: Abstract Dataset Dynamics Stack.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 2 :</head><label>2</label><figDesc>Summary of communication mechanism proposals and their coverage of the requirements. Granularity levels: dataset (d), resource (r) and statement (s) level.</figDesc><table><row><cell></cell><cell>Discovery</cell><cell cols="2">Granularity</cell></row><row><cell></cell><cell>mechanism</cell><cell>level</cell><cell></cell></row><row><cell>Atom PubSubHubbub SDShare OAI-PMH PingTheSemanticWeb</cell><cell>√ √ √ √ √</cell><cell cols="2">Pull Push d,r -d,r,s d,r,s d,r,s -d,r --r</cell></row><row><cell>SemanticPingback Web Hooks</cell><cell>-√</cell><cell>--</cell><cell>r d,r</cell></row><row><cell>RDFSync SPAURL DSNotify</cell><cell>-√ √</cell><cell>d,r,s -d,r,s</cell><cell>-s -</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_0">http://www4.wiwiss.fu-berlin.de/bizer/d2r-server/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_1">http://xlwrap.sourceforge.net/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_2">http://any23.org/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_3">http://drupal.org/node/725382</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_4">http://km.aifb.kit.edu/ws/semwiki2006/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_5">http://www.heppnetz.de/projects/goodrelations/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="9" xml:id="foot_6">http://www.w3.org/TR/NOTE-datetime</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="10" xml:id="foot_7">http://pubsubhubbub.appspot.com/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="11" xml:id="foot_8">http://blog.superfeedr.com/api/http/pubsubhubbub/pubsubhubbub/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="12" xml:id="foot_9">http://github.com/tonyg/rabbithub/#readme</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="13" xml:id="foot_10">http://arc.semsol.org/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="14" xml:id="foot_11">http://www.w3.org/2010/04/w3c-track.html</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="15" xml:id="foot_12">http://xmlns.com/foaf/spec/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="16" xml:id="foot_13">A Eventset is a container of events that occur in a dataset.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="17" xml:id="foot_14">rsync is a software application for Unix systems which synchronizes files and directories from one location to another while minimizing data transfer using delta encoding when appropriate.</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgements</head><p>The research leading to these results has received funding from the European Community's Seventh Framework Programme (FP7/2007-2013) under Grant Agreement n 256975 , LOD Around-The-Clock (LATC) Support Action.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<ptr target="http://omv.ontoware.org/OWLChanges" />
		<title level="m">Owl 2 change ontology</title>
				<imprint>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<ptr target="http://vocab.deri.ie/void/guide" />
		<title level="m">Vocabulary of interlinked datasets</title>
				<imprint>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<ptr target="http://purl.org/NET/dady" />
		<title level="m">Dataset dynamics (dady) vocabulary</title>
				<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<ptr target="http://dsnotify.org/vocab/eventset/0.1/" />
		<title level="m">Dsnotify eventsets: A vocabulary for change events in linked data sources</title>
				<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<ptr target="http://vocab.org/changeset/schema" />
		<title level="m">Talis changeset vocabulary</title>
				<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Semantics and implementation of schema evolution in object-oriented databases</title>
		<author>
			<persName><forename type="first">J</forename><surname>Banerjee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H.-J</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">F</forename><surname>Korth</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">SIGMOD &apos;87: Proceedings of the 1987 ACM SIGMOD international conference on Management of data</title>
				<meeting><address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="1987">1987</date>
			<biblScope unit="page" from="311" to="322" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Adaptive push-pull: Disseminating dynamic web data</title>
		<author>
			<persName><forename type="first">M</forename><surname>Bhide</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Deolasee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Katkar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Panchbudhe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Ramamritham</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Shenoy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Transactions on Computers</title>
		<imprint>
			<biblScope unit="volume">51</biblScope>
			<biblScope unit="page" from="652" to="668" />
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Linked Data -The Story So Far</title>
		<author>
			<persName><forename type="first">C</forename><surname>Bizer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Heath</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Berners-Lee</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal on Semantic Web and Information Systems</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="1" to="22" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Dbpedia live extraction</title>
		<author>
			<persName><forename type="first">S</forename><surname>Hellmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Stadler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Lehmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Auer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of 8th International Conference on Ontologies, DataBases, and Applications of Semantics (ODBASE)</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<meeting>of 8th International Conference on Ontologies, DataBases, and Applications of Semantics (ODBASE)</meeting>
		<imprint>
			<date type="published" when="2009">2009</date>
			<biblScope unit="volume">5871</biblScope>
			<biblScope unit="page" from="1209" to="1223" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Architecture of the world wide web, volume one</title>
		<author>
			<persName><forename type="first">I</forename><surname>Jacobs</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Walsh</surname></persName>
		</author>
		<idno>REC-webarch-20041215</idno>
	</analytic>
	<monogr>
		<title level="m">World Wide Web Consortium</title>
				<imprint>
			<date type="published" when="2004-12">December 2004</date>
		</imprint>
	</monogr>
	<note type="report_type">Recommendation</note>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Rdfsync: efficient remote synchronization of rdf models</title>
		<author>
			<persName><forename type="first">C</forename><surname>Morbidoni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Tummarello</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Erling</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Bachmann-Gmr</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007)</title>
				<editor>
			<persName><forename type="first">K</forename><surname>Aberer</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">K.-S</forename><surname>Choi</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">N</forename><surname>Noy</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><surname>Allemang</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">K.-I</forename><surname>Lee</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><forename type="middle">J B</forename><surname>Nixon</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Golbeck</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">P</forename><surname>Mika</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><surname>Maynard</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Schreiber</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">P</forename><surname>Cudr-Mauroux</surname></persName>
		</editor>
		<meeting>the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007)<address><addrLine>Busan, South Korea; Berlin, Heidelberg</addrLine></address></meeting>
		<imprint>
			<publisher>Springer Verlag</publisher>
			<date type="published" when="2007-11">November 2007</date>
			<biblScope unit="volume">4825</biblScope>
			<biblScope unit="page" from="533" to="546" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Dsnotify: Handling broken links in the web of data</title>
		<author>
			<persName><forename type="first">P</forename><surname>Niko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Bernhard</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Nineteenth International WWW Conference (WWW2010)</title>
				<meeting><address><addrLine>Raleigh, NC, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2010">2 2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">A framework for ontology evolution in collaborative environments</title>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">F</forename><surname>Noy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Chugh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Musen</surname></persName>
		</author>
		<author>
			<persName><surname>Musen M</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">5th International Semantic Web Conference</title>
				<imprint>
			<publisher>Springer-LNCS</publisher>
			<date type="published" when="2006">2006</date>
			<biblScope unit="page" from="544" to="558" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<author>
			<persName><surname>Oai-Pmh</surname></persName>
		</author>
		<ptr target="http://www.openarchives.org/OAI/openarchivesprotocol.html" />
		<title level="m">The open archives initiative protocol for metadata harvesting</title>
				<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<author>
			<persName><surname>Pingthesemanticweb</surname></persName>
		</author>
		<ptr target="http://pingthesemanticweb.com/" />
		<title level="m">The open archives initiative protocol for metadata harvesting</title>
				<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<ptr target="http://code.google.com/p/pubsubhubbub/" />
		<title level="m">A simple, open, web-hook-based pubsub protocol</title>
				<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<ptr target="http://www.ietf.org/rfc/rfc4287.txt" />
		<title level="m">The atom syndication format</title>
				<imprint>
			<date type="published" when="2010">rfc4287. 2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">A survey of schema versioning issues for database systems</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">F</forename><surname>Roddick</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Information and Software Technology</title>
		<imprint>
			<biblScope unit="volume">37</biblScope>
			<biblScope unit="page" from="383" to="393" />
			<date type="published" when="1995">1995</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<ptr target="http://www.rss.com/" />
		<title level="m">Really simple syndication</title>
				<imprint>
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<title level="m" type="main">Protocol for the syndication of semantic descriptions</title>
		<author>
			<persName><surname>Sdshare</surname></persName>
		</author>
		<ptr target="http://www.egovpt.org/fg/CWA_Part_1b" />
		<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<author>
			<persName><surname>Sparqlpush</surname></persName>
		</author>
		<ptr target="http://code.google.com/p/sparqlpush/" />
		<title level="m">pubsubhubbub (push) interface for sparql endpoints</title>
				<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Weaving a Social Data Web with Semantic Pingback</title>
		<author>
			<persName><forename type="first">S</forename><surname>Tramp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Frischmuth</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Ermilov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Auer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the EKAW 2010 -Knowledge Engineering and Knowledge Management by the Masses</title>
				<meeting>the EKAW 2010 -Knowledge Engineering and Knowledge Management by the Masses</meeting>
		<imprint>
			<date type="published" when="2010-10">October, 2010. 2010</date>
			<biblScope unit="page" from="135" to="149" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Sindice.com: Weaving the Open Linked Data</title>
		<author>
			<persName><forename type="first">G</forename><surname>Tummarello</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Oren</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Delbru</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007)</title>
				<meeting>the 6th International Semantic Web Conference and 2nd Asian Semantic Web Conference (ISWC/ASWC2007)<address><addrLine>Busan, South Korea</addrLine></address></meeting>
		<imprint>
			<biblScope unit="page" from="547" to="560" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<title level="m" type="main">Discovering resources on the web -a comparison of discovery mechanism for the web of data and the web of documents</title>
		<author>
			<persName><forename type="first">J</forename><surname>Umbrich</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hausenblas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Archer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Hammer-Lahav</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Wilde</surname></persName>
		</author>
		<ptr target="http://uldis.deri.ie" />
		<imprint>
			<date type="published" when="2009">8. 2009</date>
		</imprint>
		<respStmt>
			<orgName>Linked Data Research Centre</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Technical Report 1</note>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">A language for updating rdf graphs</title>
		<author>
			<persName><forename type="first">S</forename><surname>Update</surname></persName>
		</author>
		<ptr target="http://www.w3.org/Submission/SPARQL-Update/" />
	</analytic>
	<monogr>
		<title level="m">w3c member submission</title>
				<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<monogr>
		<title/>
		<author>
			<persName><surname>Webhooks</surname></persName>
		</author>
		<author>
			<persName><surname>Webhooks</surname></persName>
		</author>
		<ptr target="http://www.webhooks.org/" />
		<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">A framework for schema updates in an object-oriented database system</title>
		<author>
			<persName><forename type="first">R</forename><surname>Zicari</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Building an object-oriented database system: the story of 02</title>
				<meeting><address><addrLine>San Francisco, CA, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Morgan Kaufmann Publishers Inc</publisher>
			<date type="published" when="1992">1992</date>
			<biblScope unit="page" from="146" to="182" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
