<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards A Unified Knowledge Graph Data Management System</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Baozhu Liu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xin Wang</string-name>
          <email>wangx@tju.edu.cn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pengkai Liu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sizhuo Li</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>College of Intelligence and Computing, Tianjin University</institution>
          ,
          <addr-line>Tianjin</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Reference Format: Baozhu Liu, Xin Wang, Pengkai Liu, Sizhuo Li. Towards A Unified Knowledge Graph Data Management System. In the 2nd Workshop on Search, Exploration, and Analysis in Heterogeneous Datastores, SEA Data 2021</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Knowledge graphs currently have two main data models: the RDF graph and the property graph. The query language for RDF graphs is SPARQL, while the main query language for property graphs is Cypher. Different data models and query languages hinder the wider application of knowledge graphs. In this paper, we propose a unified, interoperable knowledge graph database system that can effectively manage both RDF graphs and property graphs.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        With the proliferation of Knowledge Graphs (KGs), applications
of KGs have grown rapidly in recent years. The RDF (Resource
Description Framework) graph and the property graph are the two mainstream
data models of KGs. On the one hand, RDF is a World Wide
Web Consortium (W3C) recommendation for representing KGs and is widely
used by triple stores, such as gStore [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. On the other hand, property
graphs are widely used in graph databases, such as Neo4j [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. It
is widely recognized that the data models and query languages
for KG database management need to be unified. To this
end, we propose a unified KG data management system that
consists of three components: a storage manager, a query processing
coordinator, and a Web interface. Together they make multiple KGs manageable in
a single database management system. The query processing coordinator
translates queries into unified semantics expressed in relational
algebra. In the storage manager, RDF graphs
and property graphs are shredded into relations using approaches
specific to each data model.
      </p>
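      <p>As a minimal sketch of what shredding both kinds of graph into relations can look like, the following Python snippet loads a few RDF triples and a small property graph into one shared pair of vertex and edge tables in SQLite. The table layout, the function names, the JSON encoding of properties, the rule that stores literal-valued triples as vertex properties, and the sample data are all illustrative assumptions; they are not the storage scheme actually used by the system.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import json
import sqlite3


def create_store():
    """Create an in-memory store with one vertex table and one edge table (assumed layout)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE vertex (id TEXT PRIMARY KEY, property TEXT)")
    conn.execute(
        "CREATE TABLE edge (id INTEGER PRIMARY KEY, source TEXT, target TEXT, property TEXT)"
    )
    return conn


def upsert_vertex(conn, vertex_id, props):
    """Insert a vertex, or merge new properties into an existing one."""
    row = conn.execute("SELECT property FROM vertex WHERE id = ?", (vertex_id,)).fetchone()
    merged = json.loads(row[0]) if row else {}
    merged.update(props)
    conn.execute(
        "INSERT OR REPLACE INTO vertex (id, property) VALUES (?, ?)",
        (vertex_id, json.dumps(merged, sort_keys=True)),
    )


def insert_edge(conn, source, target, props):
    """Insert an edge and make sure both endpoints exist as vertices."""
    upsert_vertex(conn, source, {})
    upsert_vertex(conn, target, {})
    conn.execute(
        "INSERT INTO edge (source, target, property) VALUES (?, ?, ?)",
        (source, target, json.dumps(props, sort_keys=True)),
    )


def shred_rdf(conn, triples):
    """Shred (subject, predicate, object, object_is_literal) tuples.

    Assumption: a literal object becomes a vertex property, and a resource
    object becomes an edge labelled with the predicate.
    """
    for subject, predicate, obj, is_literal in triples:
        if is_literal:
            upsert_vertex(conn, subject, {predicate: obj})
        else:
            insert_edge(conn, subject, obj, {"label": predicate})


def shred_property_graph(conn, vertices, edges):
    """Shred (id, props) vertices and (source, label, target, props) edges."""
    for vertex_id, props in vertices:
        upsert_vertex(conn, vertex_id, props)
    for source, label, target, props in edges:
        insert_edge(conn, source, target, {"label": label, **props})


conn = create_store()
shred_rdf(conn, [
    ("ex:alice", "ex:name", "Alice", True),
    ("ex:alice", "ex:knows", "ex:bob", False),
])
shred_property_graph(
    conn,
    vertices=[("n1", {"name": "Carol"})],
    edges=[("n1", "KNOWS", "ex:bob", {"since": 2020})],
)

vertex_count = conn.execute("SELECT COUNT(*) FROM vertex").fetchone()[0]
edge_count = conn.execute("SELECT COUNT(*) FROM edge").fetchone()[0]
alice_props = json.loads(
    conn.execute("SELECT property FROM vertex WHERE id = 'ex:alice'").fetchone()[0]
)
print(vertex_count, edge_count, alice_props)
```
]]></preformat>
      <p>In this sketch both data models end up in the same two relations, so a single relational query can touch data that originated from either model. A real system would likely partition vertices and edges into several typed tables rather than one table each.</p>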
    </sec>
    <sec id="sec-2">
      <title>APPROACH AND NOVELTY</title>
      <p>To the best of our knowledge, the system proposed in this paper
(shown in Fig. 1) is the first KG database system that realizes a
unified storage scheme, facilitates the interoperability of SPARQL
and Cypher, and provides a Web interface to
visualize query results and explanations. (1) Based on the relational
model, a unified storage scheme is used to efficiently store RDF
graphs and property graphs and to support the query requirements
of knowledge graphs. (2) Using the characteristic-set-based method,</p>
      <p>Copyright © 2021 for the individual papers by the papers’ authors. Copyright © 2021
for the volume as a collection by its editors. This volume and its papers are published
under the Creative Commons License Attribution 4.0 International (CC BY 4.0).
Published in the Proceedings of the 2nd Workshop on Search, Exploration, and
Analysis in Heterogeneous Datastores, co-located with VLDB 2021 (August 16-20, 2021,
Copenhagen, Denmark) on CEUR-WS.org.</p>
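      <p>To illustrate how SPARQL and Cypher can interoperate over a relational store, the following Python sketch translates one two-hop query, written once in each language, by hand into the same relational algebra plan and evaluates that plan over a small edge relation. The operator classes, the edge relation with columns source, target, and label, and the helper names are hypothetical. No real SPARQL or Cypher parser is involved, and this is not the translation procedure of the proposed system.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

Row = Dict[str, str]


# Hypothetical relational algebra operators, defined only for this sketch.
@dataclass(frozen=True)
class Scan:
    relation: str


@dataclass(frozen=True)
class Select:
    child: object
    column: str
    value: str


@dataclass(frozen=True)
class Rename:
    child: object
    mapping: Tuple[Tuple[str, str], ...]  # (old column, new column) pairs


@dataclass(frozen=True)
class Project:
    child: object
    columns: Tuple[str, ...]


@dataclass(frozen=True)
class Join:  # natural join on all shared column names
    left: object
    right: object


def edge_pattern(src_var: str, label: str, dst_var: str) -> Project:
    """Plan for one labelled edge pattern, binding its endpoints to variables."""
    selected = Select(Scan("edge"), "label", label)
    renamed = Rename(selected, (("source", src_var), ("target", dst_var)))
    return Project(renamed, (src_var, dst_var))


def plan_from_sparql() -> Project:
    # Hand translation of: SELECT ?x ?z WHERE { ?x :knows ?y . ?y :knows ?z }
    joined = Join(edge_pattern("x", "knows", "y"), edge_pattern("y", "knows", "z"))
    return Project(joined, ("x", "z"))


def plan_from_cypher() -> Project:
    # Hand translation of: MATCH (x)-[:knows]->(y)-[:knows]->(z) RETURN x, z
    first_hop = edge_pattern("x", "knows", "y")
    second_hop = edge_pattern("y", "knows", "z")
    return Project(Join(first_hop, second_hop), ("x", "z"))


def evaluate(plan: object, db: Dict[str, List[Row]]) -> List[Row]:
    """Evaluate a plan with bag semantics (duplicates are kept)."""
    if isinstance(plan, Scan):
        return [dict(row) for row in db[plan.relation]]
    if isinstance(plan, Select):
        return [r for r in evaluate(plan.child, db) if r[plan.column] == plan.value]
    if isinstance(plan, Rename):
        mapping = dict(plan.mapping)
        return [{mapping.get(k, k): v for k, v in r.items()} for r in evaluate(plan.child, db)]
    if isinstance(plan, Project):
        return [{c: r[c] for c in plan.columns} for r in evaluate(plan.child, db)]
    if isinstance(plan, Join):
        out = []
        for left_row in evaluate(plan.left, db):
            for right_row in evaluate(plan.right, db):
                shared = set(left_row) & set(right_row)
                if all(left_row[k] == right_row[k] for k in shared):
                    out.append({**left_row, **right_row})
        return out
    raise TypeError(f"unknown operator: {plan!r}")


db = {
    "edge": [
        {"source": "alice", "target": "bob", "label": "knows"},
        {"source": "bob", "target": "carol", "label": "knows"},
        {"source": "bob", "target": "acme", "label": "worksAt"},
    ]
}

sparql_plan = plan_from_sparql()
cypher_plan = plan_from_cypher()
same_plan = sparql_plan == cypher_plan
result = evaluate(sparql_plan, db)
print(same_plan, result)
```
]]></preformat>
      <p>Because both translations produce structurally equal plans, a single evaluator can serve queries from either language. The nested-loop join is chosen for brevity only.</p>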
      <p>[Figure 1. Labels recovered from the figure: Web Interface (SPARQL, Cypher, Visualization); Lexical Parser; Syntax Parser; Query; Knowledge Graph; Unified Knowledge Graph Storage, comprising Vertex Tables 1 to 3 (columns: id, property) and Edge Tables 1 and 2 (columns: id, source, target, property).]</p>
      <p>[Table. Recovered rows: Graph Analysis: √, √, √; RPQ: √, ×, ×. Column headers are not available.]</p>
    </sec>
    <sec id="sec-3">
      <title>FUTURE WORK</title>
      <p>To meet the storage and query requirements of large-scale
KG data, we will focus on distributed KG data management
in future work. We also plan to support more query features
in the unified KG management system.</p>
    </sec>
    <sec id="sec-4">
      <title>ACKNOWLEDGMENTS</title>
      <p>This work is supported by the National Key Research and Development
Program of China (2019YFE0198600), the National Natural Science
Foundation of China (61972275, 61972402), and the CCF-Huawei
Database Innovation Research Plan (CCF-Huawei DBIR2019004B).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Lei</given-names>
            <surname>Zou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Tamer</given-names>
            <surname>Özsu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Lei</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Xuchuan</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Ruizhe</given-names>
            <surname>Huang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Dongyan</given-names>
            <surname>Zhao</surname>
          </string-name>
          .
          <article-title>gStore: a graph-based SPARQL query engine</article-title>
          .
          <source>The VLDB Journal</source>
          ,
          <volume>23</volume>
          (
          <issue>4</issue>
          ):
          <fpage>565</fpage>
          -
          <lpage>590</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Justin J.</given-names>
            <surname>Miller</surname>
          </string-name>
          .
          <article-title>Graph database applications and concepts with Neo4j</article-title>
          .
          <source>In Proceedings of the Southern Association for Information Systems Conference</source>
          , Atlanta, GA, USA, volume
          <volume>2324</volume>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>