<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Towards A Unified Knowledge Graph Data Management System</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Baozhu</forename><surname>Liu</surname></persName>
							<email>liubaozhu@tju.edu.cn</email>
							<affiliation key="aff0">
								<orgName type="department">College of Intelligence and Computing</orgName>
								<orgName type="institution">Tianjin University</orgName>
								<address>
									<settlement>Tianjin</settlement>
									<country key="CN">China</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Xin</forename><surname>Wang</surname></persName>
							<email>wangx@tju.edu.cn</email>
							<affiliation key="aff0">
								<orgName type="department">College of Intelligence and Computing</orgName>
								<orgName type="institution">Tianjin University</orgName>
								<address>
									<settlement>Tianjin</settlement>
									<country key="CN">China</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Pengkai</forename><surname>Liu</surname></persName>
							<email>liupengkai@tju.edu.cn</email>
							<affiliation key="aff0">
								<orgName type="department">College of Intelligence and Computing</orgName>
								<orgName type="institution">Tianjin University</orgName>
								<address>
									<settlement>Tianjin</settlement>
									<country key="CN">China</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Sizhuo</forename><surname>Li</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">College of Intelligence and Computing</orgName>
								<orgName type="institution">Tianjin University</orgName>
								<address>
									<settlement>Tianjin</settlement>
									<country key="CN">China</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Towards A Unified Knowledge Graph Data Management System</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">6201A30BCF2292726938C5915AB441AA</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T08:55+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Knowledge graphs currently have two main data models: the RDF graph and the property graph. The query language for RDF graphs is SPARQL, while the query language for property graphs is mainly Cypher. Different data models and query languages hinder the wider application of knowledge graphs. In this paper, we propose a unified interoperable knowledge graph database system, which can effectively manage both RDF and property graphs.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">INTRODUCTION</head><p>With the proliferation of Knowledge Graphs (KGs), applications of KGs have grown rapidly in recent years. RDF (Resource Description Framework) graphs and property graphs are the two mainstream data models of KGs. On the one hand, RDF has become the World Wide Web Consortium recommendation for representing KGs and is widely used by triple stores, such as gStore <ref type="bibr" target="#b0">[1]</ref>. On the other hand, property graphs are widely used in graph databases, such as Neo4j <ref type="bibr" target="#b1">[2]</ref>. It has been widely recognized that the data models and query languages need to be unified for KG database management. To this end, we propose a unified KG data management system consisting of three components, i.e., a storage manager, a query processing coordinator, and a Web interface, which makes multiple KGs manageable in a single database management system. The query processing coordinator translates queries into unified semantics expressed in relational algebra. In the storage manager, RDF graphs and property graphs are shredded into relations using approaches specific to each data model.</p></div>
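<div xmlns="http://www.tei-c.org/ns/1.0"><p>The idea of shredding both data models into relations can be illustrated with a minimal sketch. It assumes a single vertex table and a single edge table in SQLite; the table names, column names, and helper functions are hypothetical and are not the schema of the proposed system. RDF literals are treated as ordinary vertices for simplicity. The final query shows the relational form that a one-edge pattern in either SPARQL or Cypher could be translated into.</p>
```python
import sqlite3

# Hypothetical unified schema (an assumption for illustration only).
SCHEMA = """
CREATE TABLE vertex (id TEXT PRIMARY KEY, property TEXT);
CREATE TABLE edge (id INTEGER PRIMARY KEY, source TEXT, target TEXT, property TEXT);
"""


def shred_rdf(conn, triples):
    """Store each (s, p, o) triple as an edge; subjects and objects become vertices."""
    for s, p, o in triples:
        for node in (s, o):
            conn.execute(
                "INSERT OR IGNORE INTO vertex (id, property) VALUES (?, NULL)", (node,)
            )
        conn.execute(
            "INSERT INTO edge (source, target, property) VALUES (?, ?, ?)", (s, o, p)
        )


def shred_property_graph(conn, nodes, rels):
    """Store nodes as vertices (properties kept as text) and relationships as edges."""
    for node_id, props in nodes:
        conn.execute(
            "INSERT OR REPLACE INTO vertex (id, property) VALUES (?, ?)", (node_id, props)
        )
    for src, label, dst in rels:
        conn.execute(
            "INSERT INTO edge (source, target, property) VALUES (?, ?, ?)", (src, dst, label)
        )


def knows_pairs(conn):
    """Relational form of a one-edge pattern such as SPARQL `?x :knows ?y`
    or Cypher `MATCH (x)-[:knows]->(y)`: a selection on the edge table."""
    rows = conn.execute(
        "SELECT source, target FROM edge WHERE property = 'knows' ORDER BY source, target"
    )
    return rows.fetchall()


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
shred_rdf(conn, [("alice", "knows", "bob")])
shred_property_graph(
    conn,
    [("carol", "age=30"), ("dave", "age=41")],
    [("carol", "knows", "dave")],
)
pairs = knows_pairs(conn)
print(pairs)
```
<p>Because both graphs end up in the same two relations, the same selection answers the pattern regardless of which data model the data came from.</p></div>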
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">APPROACH AND NOVELTY</head><p>As shown in Fig. <ref type="figure" target="#fig_0">1</ref>, to the best of our knowledge, the system proposed in this paper is the first KG database system that realizes a unified storage scheme, facilitates the interoperability of SPARQL and Cypher, and provides a Web interface to visualize query results and explanations. (1) Based on the relational model, a unified storage scheme is used to efficiently store RDF graphs and property graphs and to support the query requirements of knowledge graphs. (2) Using the characteristic-set-based method, the storage problem of untyped entities is addressed. (3) The interoperability of SPARQL and Cypher is realized, enabling them to operate interchangeably on the same knowledge graph. (4) With a unified Web interface, users can query the same KG in two different languages and visualize query results and explanations.</p><p>Thanks to the unified storage scheme and query processing method, it is easier to manage multiple KGs in one database. Users no longer need to switch among different database systems to obtain storage and query support for different data models.</p><p>To verify the effectiveness and efficiency of the proposed system, extensive experiments were conducted on several data sets. The experimental results show that our system outperforms gStore <ref type="bibr" target="#b0">[1]</ref> and Neo4j <ref type="bibr" target="#b1">[2]</ref>, which are two state-of-the-art KG database systems. The comparison of the features supported by the systems is shown in Table <ref type="table">1</ref>.</p><p>Table <ref type="table">1</ref>: System Comparison.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: The Overall Architecture.</figDesc></figure>
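<div xmlns="http://www.tei-c.org/ns/1.0"><p>The characteristic-set-based method mentioned above is not detailed in this paper. As a hedged sketch of the general idea only, the snippet below uses the common definition of a characteristic set (the set of predicates that occur with a given subject) and groups entities that have no type triple by that set, so that each group could be stored in its own relation. The function names and the "rdf:type" predicate string are assumptions made for illustration.</p>
```python
from collections import defaultdict


def characteristic_sets(triples):
    """Map each subject to the frozenset of predicates it appears with."""
    preds = defaultdict(set)
    for s, p, _o in triples:
        preds[s].add(p)
    return {s: frozenset(ps) for s, ps in preds.items()}


def group_by_characteristic_set(triples, type_predicate="rdf:type"):
    """Group subjects that lack a type triple by their characteristic set."""
    typed = {s for s, p, _o in triples if p == type_predicate}
    groups = defaultdict(list)
    for s, cs in characteristic_sets(triples).items():
        if s not in typed:
            groups[cs].append(s)
    return {cs: sorted(members) for cs, members in groups.items()}


triples = [
    ("e1", "name", "Alice"), ("e1", "email", "a@example.org"),
    ("e2", "name", "Bob"), ("e2", "email", "b@example.org"),
    ("e3", "title", "Graph Databases"),
    ("e4", "rdf:type", "Person"), ("e4", "name", "Carol"),
]
groups = group_by_characteristic_set(triples)
for cs, members in groups.items():
    print(sorted(cs), members)
```
<p>In this toy input, e1 and e2 share the predicates name and email and therefore fall into one group, e3 forms its own group, and the typed entity e4 is left to its declared type.</p></div>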
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Components of Figure 1 (text recovered from the architecture diagram)</head><label></label><figDesc>Vertex tables have the columns id and property; edge tables have the columns id, source, target, and property.</figDesc><table><row><cell>Web Interface</cell><cell>SPARQL; Cypher</cell></row><row><cell>Unified Knowledge Graph Query Processing</cell><cell>Lexical Parser; Syntax Parser; Semantic Translation</cell></row><row><cell>Unified Knowledge Graph Storage</cell><cell>Vertex Table 1; Vertex Table 2; Vertex Table 3; Edge Table 1; Edge Table 2</cell></row><row><cell>Knowledge Graph</cell><cell>Property Graph or RDF Graph</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>ACKNOWLEDGMENTS</head><p>This work is supported by the National Key Research and Development Program of China (2019YFE0198600); the National Natural Science Foundation of China (61972275, 61972402); and the CCF-Huawei Database Innovation Research Plan (CCF-Huawei DBIR2019004B).</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>System Comparison: recovered column headers</head><p>System; Storage (RDF, Property Graph); Query (BGP, Text Search, Graph Analysis).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">FUTURE WORKS</head><p>To meet the storage and query requirements of large-scale KG data, we will focus on distributed KG data management systems in the future. Moreover, more query features will be supported in the unified KG management system.</p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">gStore: a graph-based SPARQL query engine</title>
		<author>
			<persName><forename type="first">Lei</forename><surname>Zou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">Tamer</forename><surname>Özsu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Lei</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Xuchuan</forename><surname>Shen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ruizhe</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dongyan</forename><surname>Zhao</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The VLDB Journal</title>
		<imprint>
			<biblScope unit="volume">23</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="565" to="590" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Graph database applications and concepts with Neo4j</title>
		<author>
			<persName><forename type="first">Justin</forename><forename type="middle">J</forename><surname>Miller</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Southern Association for Information Systems Conference</title>
				<meeting>the Southern Association for Information Systems Conference<address><addrLine>Atlanta, GA, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2013">2013</date>
			<biblScope unit="volume">2324</biblScope>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
