<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Mining and Managing Large-Scale Linked Open Data</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Ansgar</forename><surname>Scherp</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">University of Kiel</orgName>
								<address>
									<addrLine>Christian-Albrechts-Platz 4</addrLine>
									<postCode>24118</postCode>
									<settlement>Kiel</settlement>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="department">ZBW - Leibniz Information Centre for Economics</orgName>
								<orgName type="institution">Kiel University</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Mining and Managing Large-Scale Linked Open Data</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">A04F93636F5BCC3714321C38ACA1B028</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-19T18:01+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Linked Open Data (LOD) is about publishing and interlinking data of different origins and purposes on the web. The Resource Description Framework (RDF) is used to describe data on the LOD cloud. In contrast to relational databases, RDF does not provide a fixed, predefined schema. Rather, RDF allows the data schema to be modeled flexibly by attaching RDF types and properties to entities. Our schema-level index, called SchemEX, supports search over large-scale RDF graph data. The index can be computed efficiently and with reasonable accuracy over large data sets with billions of RDF triples, the smallest unit of information on the LOD cloud. An index such as SchemEX is needed because the LOD cloud is growing quickly. As the LOD cloud evolves, its data changes frequently. We show that the data schema also changes, in terms of the combinations of RDF types and properties used. Because individual changes alone cannot capture the dynamics of the LOD cloud, current work includes temporal clustering and finding periodicities in entity dynamics over large-scale snapshots of the LOD cloud, with about 100 million triples per week for more than three years.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body/>
		<back>
			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>at UC Irvine, CA, USA from 2006 to 2007. Subsequently, he led work packages in EU projects such as WeKnowIt and SocialSensor at the University of Koblenz-Landau. Currently, Ansgar is scientific coordinator of the EU H2020 project MOVING (http://moving-project.eu/), which trains users from all societal sectors to improve their information literacy by learning how to choose, use, and evaluate data mining methods in connection with their daily research tasks, and so to become data-savvy information professionals.</p></div>			</div>
			<div type="references">

				<listBibl/>
			</div>
		</back>
	</text>
</TEI>
