<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Simple visualization of structures of interrelated concepts in the FRBRoo ontology</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Krzysztof Sielski</string-name>
          <email>sielski@man.poznan.pl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marcin Werla</string-name>
          <email>mwerla@man.poznan.pl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Poznań Supercomputing and Networking Center</institution>
          ,
          <addr-line>ul. Noskowskiego 12/14, 61-704 Poznań</addr-line>
          ,
          <country country="PL">Poland</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The Knowledge Base which was created by Poznań Supercomputing and Networking Center (PSNC) as a part the SYNAT1 project integrates information from distributed heterogeneous sources such as digital libraries, digital museums, scientific and technical information systems. The gathered knowledge is stored in an RDF semantic database and is represented in FRBRoo ontology with some custom extensions, which had to be introduced in order to represent all the information without any semantic loss. As of the beginning of 2013, the Knowledge Base contained information from over 3,100,000 metadata records, which were originally encoded in various schemas: PLMET (data obtained from Polish Digital Libraries Federation), MARC 21 XML (from union catalog of Polish research libraries NUKAT), MONA (from the National Museum in Warsaw) or CDWA LITE (from the National Museum in Krakow). These records were converted to FRBRoo ontology using jMet2Ont[1] tool. Some auxiliary data sources such as VIAF, Geonames, KABA Subject Headings and Lexvo have been used to enrich the records with detailed information. Currently, the number of RDF triples building the Knowledge Base is 536M, which includes 235M explicit and 301M implicit triples. The implicit triples have been added by the inference engine with our custom rule set, which is a subset of OWL 2 RL/RDF entailment rules. Unlike traditional relational databases, data represented as triples does not have a precise schema with strict constraints. Instead, OWL ontologies describe the structure of concepts and relations between them. As FRBRoo is a complex ontology with many classes, this model is often converted to a simpler one when presented to an user. The contents of the Knowledge Base can be explored in a couple different ways: - a raw SPARQL endpoint, which is aimed at expert users who know the ontology very well and have precisely defined goals;</p>
      </abstract>
      <kwd-group>
        <kwd>Semantic Web</kwd>
        <kwd>ontology</kwd>
        <kwd>OWL</kwd>
        <kwd>RDF</kwd>
        <kwd>FRBRoo</kwd>
        <kwd>RDF Unit</kwd>
        <kwd>knowledge base</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>- a full text search application, which searches for keywords provided by user
in RDF literals from the triplestore and uses the Query Processing Module (QPM)
which maps on-the-fly information represented in the FRBRoo ontology to a
simplified model, consisting of the following concepts: works, items, persons, places,
legal bodies, and subjects;</p>
      <p>- a geographical search application, which allows user to select an area on a
map to find all objects connected with places contained in that area (e.g. all
publications whose subject is a particular city);</p>
      <p>- an application to explore semantic database with dynamically fetched
portions of data describing particular object from the triplestore, which are presented
as interrelated FRBRoo concepts in a legible way understandable by non-experts.</p>
      <p>
        The last named application was built as a proof of concept of RDF Unit[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. RDF
Units are graphs which consist of several ontology objects of different classes that are
needed to provide all the essential information about a certain resource. For example,
an RDF Unit for a particular instance of Publication Expression from the Knowledge
Base would include objects representing its Title, Publication Event and Place of
Publishing, but not geographical coordinates of that place. RDF Units are dynamically
constructed based on the metaproperties of ontology relations and actual data in the
triplestore.
      </p>
      <p>Such graphs are transformed into a tree structure, in which the examined resource
becomes a root. Then, the obtained RDF Unit tree is prepared for presentation by
replacing names of predicates with more user friendly labels and by flattening some
long predicate paths to a single dummy edge in order to provide information in a
straightforward way. This transformations are represented as a set of rules which take
into consideration a predicate and classes of a subject and an object. Examples of such
rules include (here [?] stands for any class):</p>
      <p>- [E21_Person] P100_i_died_in [E69_Death] P4_has_time_span
[E52_Time-Span] P1_is_identified_by [?]→ date of death</p>
      <p>- [F18_Serial_Work] P148_has_component
[F14_Individual_Work] → series element</p>
      <p>- [?] P9_consists_of [F28a_Contribution] P14_carried_out_by
[?] → contributor</p>
      <p>- [?] P9_consists_of [F28a_Contribution] P2_has_type [?] →
in the role of</p>
      <p>Figure 1 presents a result of mapping one record in MARC 21 XML schema to
FRBRoo. It is a graph of 47 connected FRBRoo objects represented by 108 RDF
triples. Figure 2 presents a view in our application that represents an RDF Unit of an
Individual Work resource which was created in mentioned mapping. This unit
contains all the information from source record except for author's and contributors'
dates of life, which can be examined in those resources’ view.</p>
      <p>The described Knowledge Base browser application was prepared for dynamic
viewing of FRBRoo data from the triplestore, but this approach is generic and should
work for another ontologies as well. It uses no predefined SPARQL queries and is
based only on a relatively small configuration: a set of graph path flattening rules for
presentation and a set of single metaproperty for each ontology predicate which is
used to build RDF Units.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Walkowska</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Werla</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          . (
          <year>2012</year>
          ).
          <article-title>Advanced Automatic Mapping from Flat or Hierarchical Metadata Schemas to a Semantic Web Ontology</article-title>
          .
          <source>TPDL 2012. Lecture Notes in Computer Science</source>
          , vol.
          <volume>7489</volume>
          , pp.
          <fpage>260</fpage>
          -
          <lpage>272</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Sielski</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Walkowska</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Werla</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          . (
          <year>2013</year>
          ).
          <article-title>Methodology for Dynamic Extraction of Highly Relevant Information Describing Particular Object from Semantic Web Knowledge Base</article-title>
          .
          <source>TPDL 2013. Lecture Notes in Computer Science</source>
          , vol.
          <volume>8092</volume>
          , pp.
          <fpage>260</fpage>
          -
          <lpage>271</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>