<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Enabling Contextualized Knowledge Description with Non-symbolic Integration</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Elton Soares</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Raphael Thiago</string-name>
          <email>raphaeltg@br.ibm.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Wallas Santos</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rodrigo Santos</string-name>
          <email>rodrigo.costag@ibm.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marcio Moreno</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IBM Research, Brazil</institution>
          ,
          <addr-line>Av Pasteur 146 Rio de Janeiro - RJ</addr-line>
          ,
          <country country="BR">Brazil</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Formal knowledge descriptions have been one of the main tools for enabling knowledge representation and reasoning in AI. The W3C Semantic Web initiative proposed a set of standards, including on a common data interchange format that enabled greater interoperability and integration between knowledge descriptions generated by di erent organizations in the academy and industry. Nonetheless, these standards have been unable to provide contextualized integration of knowledge descriptions e ectively, as they do not provide constructs to explicitly de ne contextualized n-dimensional (e.g. time and space, n-ary and hierarchical facts, etc.) relationships between concepts from di erent descriptions, leading to ine ciencies in the reasoning and query processing over decontextualized relationships, and inconsistencies in the modeling approaches utilized for dealing with this limitation. The main goal of this demo is to present how a hybrid knowledge representation model and description language, namely Hyperknowledge, allows contextualized integration of knowledge descriptions using constructs that enable the explicit de nition of rich contextual relationships. This demonstration will be performed using the Knowledge Explorer System (KES), that supports visualization and management of the e ective contextualization of Hyperknowledge expressivity w.r.t. n-dimensional relationships between concepts from multiple descriptions and between those concepts with their corresponding non-symbolic content.</p>
      </abstract>
      <kwd-group>
        <kwd>Hyperknowledge</kwd>
        <kwd>Hybrid Knowledge Representation</kwd>
        <kwd>Knowledge Management</kwd>
        <kwd>Knowledge Visualization</kwd>
        <kwd>Knowledge Curation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>As the AI industry and research community grow, the number and size of
knowledge descriptions produced and shared using the W3C Semantic Web
Standards 2 (WSWS) tend to grow at a similar pace.</p>
      <p>Copyright c 2020 for this paper by its authors. Use permitted under Creative
Commons License Attribution 4.0 International (CC BY 4.0).
2 https://www.w3.org/standards/semanticweb/</p>
      <p>
        Meanwhile, it has become clear that knowledge descriptions are not always
absolute, but instead are assumed to hold under certain circumstances that de ne
a speci c context [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Descriptions may need to be contextualized, for example,
with regards to dimensions such as time and space, as a fact that holds in a
certain period, or a speci c location of the world, might not be true in another
period or location.
      </p>
      <p>Existing alternatives based on the WSWS, such as RDF graphs/datasets 3,
the OWL Time Ontology 4 or Ontology Patterns for N-ary relations 5 can be
used to work around the expressivity limitations of the underlying conceptual
model, but they fail to provide a general approach in the following aspects,
respectively: Multiple valid interpretations depending on the assumptions made
for graph naming and meaning; addresses only one speci c contextual dimension
using the same constructs that are used to represent the knowledge descriptions
associated with it; propose multiple ways of representing the same n-ary relations
using binary relations, as the underlying conceptual model is unable to represent
them as rst-class constructs.</p>
      <p>Therefore, as the WSWS does not prescribe a standard approach to
contextualize knowledge descriptions, the context of those descriptions is often expressed
in their identi ers or using textual labels and annotations, which is both ine
cient, w.r.t reasoning and query processing but also hard to standardize across
multiple organizations.</p>
      <p>
        Several theories of context have been investigated in seminal works of the
elds of AI and knowledge representation, for example, the formalization of
contexts as rst-class constructs [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and the use of a "box" metaphor to properly
represent contexts [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Both of these theories served as inspiration for the approach
presented in this work, that not only addresses the problem of contextualized
knowledge description integration, but also its integration with non-symbolic
content (i.e. multimodal content such as machine learning (ML) model, image,
video, text, audio, etc.) in a given context.
      </p>
      <p>
        The integration of knowledge and non-symbolic content has been deeply
inuenced by the semantic gap [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] between what the content means and its
knowledge representation, which has been an open issue until the recent proposal of
a hybrid conceptual model, namely Hyperknowledge [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], capable of representing
relationships between conceptual description and non-symbolic content.
      </p>
      <p>Previous solutions from the non-symbolic AI community, such as metadata
standards and models, attempted to bridge this gap by allowing the speci cation
of prede ned elds to describe low-level aspects of non-symbolic content (e.g.
MPEG-7 6 and Dublin Core 7). Meanwhile, solutions from the symbolic AI
3 https://www.w3.org/TR/rdf11-datasets
4 https://www.w3.org/TR/owl-time/
5 https://www.w3.org/2001/sw/BestPractices/OEP/n-aryRelations-20060323
6 https://mpeg.chiariglione.org/standards/mpeg-7
7 https://dublincore.org/speci cations/dublin-core/
community, such as the WSWS, focused on formally describing abstract concepts
and semantic relationships between them (e.g., RDF 8, OWL 9).</p>
      <p>
        Hyperknowledge lls the gap left by previous solutions by providing
rstclass constructs that can be used to explicitly describe relationships between
concepts and n-dimensional fragments of non-symbolic content [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. It also
provides constructs that enable the contextualization of those relationships,
therefore enabling the description of rich relationships involving concepts from
multiple ontologies in a given context.
2
      </p>
      <p>
        The Hyperknowledge Conceptual Model
The Hyperknowledge conceptual model is composed of three main groups of
entities: terminal nodes, composite nodes, and links [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>A terminal node is composed of a collection of information units. The exact
notion of what constitutes an information unit is part of the node de nition
and depends on its specialization. A context node is a composite node (or a
set) that may contain links, terminal nodes, and composite nodes. Links de ne
relationships among nodes' anchors. There are di erent types of links such as
SPO (subject-predicate-object) links, causal links, and constraint links.</p>
      <p>Every node, either composite or terminal, can be connected with other nodes
through anchors, that represent portions or fragments of a node. Anchors can
be used to represent n-dimensional fragments of a node (e.g time and space
dimensions), enabling the creation of links between either concept or content
nodes within those dimensions. Also, as every entity in the Hyperknowledge
model can be contained in a composite node, it is possible to create links that
represent the relationships between nodes in a given context.</p>
      <p>Therefore, the n-dimensional anchors and the context nodes can be used
to explicitly contextualize n-dimensional relationships between concepts from
multiple knowledge descriptions and corresponding non-symbolic content, which
are represented as content nodes.
3</p>
      <p>Knowledge Description and Non-Symbolic Integration
To enable the integration of knowledge descriptions and non-symbolic content
using Hyperknowledge, we devised an architecture that enables both human
users (e.g., knowledge engineers, domain experts, and developers) and other
systems (e.g. AI assistants and tools) to interact with the same Hyperknowlege
Base (HKBase).</p>
      <p>
        An overview of this architecture is depicted in Figure 1 where we highlight
the process of ingestion of knowledge descriptions, expressed using WSWS (e.g.,
8 https://www.w3.org/RDF/
9 https://www.w3.org/OWL/
RDF and OWL), and non-symbolic content (e.g., ML models and images).
Human users can perform this process using KES while other systems can
communicate directly with the HKBase using a RESTful API [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. During the
ingestion of the knowledge descriptions, HKBase converts those descriptions to
the Hyperknowledge representation and stores the original descriptions and the
corresponding Hyperknowledge descriptions using one of its storage options.
      </p>
      <p>For this demonstration, the main storage option used for the
Hyperknowledge descriptions is a triple store as it enables e cient ingestion and querying
of knowledge descriptions based on WSWS, which makes it easier to integrate
applications that were originally implemented using WSWS. As illustrated by
the dashed arrows and boxes in Figure 1 the Hyperknowledge descriptions could
be stored and queried using a document or graph database as well.</p>
      <p>For the ingestion of non-symbolic content, we currently make use of object
storage but future implementations will make use of optimized storage options
for each type of content.</p>
      <p>Our main object storage option is IBM Cloud Object Storage 10 but our
system can be used with any AWS S3 11 compatible object storage. In that sense, we
currently have support to all Hyperknowledge features using either MongoDB 12
or JanusGraph 13, but the extended RDF compatibility and SPARQL support
are optimized when using a triple store, which in the current implementation is
Apache Jena 14.</p>
      <p>Datasets</p>
      <p>Knowledge
Descriptions
Non-Symbolic</p>
      <p>Content</p>
      <p>Knowledge Engineers,Domain</p>
      <p>Experts &amp; Developers
Knowledge Explorer</p>
      <p>System (KES)</p>
      <p>AI Assistants &amp;Tools
Hyperknowledge Base</p>
      <p>Hyperknowledge</p>
      <p>Descriptions
Non-Symbolic
Content</p>
      <p>Storage</p>
      <p>Triple Store
Document Database</p>
      <p>
        Graph Database
Object Storage
10 https://www.ibm.com/cloud/object-storage
11 https://docs.aws.amazon.com/AmazonS3
12 https://www.mongodb.com/
13 https://janusgraph.org/
14 https://jena.apache.org/
The main goal of this demo is to show Hyperknowledge features that support the
contextualized integration of multiple knowledge descriptions and non-symbolic
content through KES. In this sense, we de ned a dataset containing tra c
simulation images extracted from a visual perception benchmark [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], concepts
extracted from these images, and several ontologies used in Intelligent
Transportation Systems to show how KES can support the contextualized speci cation of
rich relationships between concepts from multiple knowledge descriptions for the
Smart Mobility domain. The demo shows the collaborative capabilities of the
system by illustrating di erent users having a synchronized view of the same
knowledge description, while also being able to explore it individually, by expanding
nodes of interest or visualizing the result of queries. The images will be exploited
to demonstrate KES capability of contextualized integration of knowledge
descriptions with non-symbolic content. The use of the ontologies is to illustrate
the support to contextualized knowledge description integration. Finally, queries
performed using KES will demonstrate its capability to answer contextualized
spatial queries over non-symbolic content and knowledge descriptions.
Demo video 1: Collaborative ingestion and contextualized integration of tra c
knowledge descriptions and non-symbolic content.
https://ibm.box.com/v/iswc2020-kes-demo1-video1
Demo video 2: Individualized exploration of the knowledge descriptions.
https://ibm.box.com/v/iswc2020-kes-demo1-video2
Demo video 3: Visualization of contextualized spatial queries over tra c
images and knowledge descriptions.
https://ibm.box.com/v/iswc2020-kes-demo1-video3
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>MCCARTHY</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          : Notes on formalizing context,
          <source>ijcai (</source>
          <year>1993</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Moreno</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , Brand~ao, R.,
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sousa</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cerqueira</surname>
          </string-name>
          , R.:
          <article-title>Representing experts' interpretive trails with hyperknowledge speci cations</article-title>
          .
          <source>In: Proceedings of the 5th International Conference on Frontiers of Educational Technologies</source>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Moreno</surname>
            ,
            <given-names>M.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brandao</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cerqueira</surname>
          </string-name>
          , R.:
          <article-title>Extending hypermedia conceptual models to support hyperknowledge speci cations</article-title>
          .
          <source>International Journal of Semantic Computing</source>
          <volume>11</volume>
          (
          <issue>01</issue>
          ) (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Moreno</surname>
            ,
            <given-names>M.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santos</surname>
          </string-name>
          , R.C.,
          <string-name>
            <surname>dos Santos</surname>
            ,
            <given-names>W.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cerqueira</surname>
          </string-name>
          , R.: Kes:
          <article-title>The knowledge explorer system</article-title>
          . In: International Semantic Web Conference (P&amp;D/Industry/BlueSky) (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Moreno</surname>
            ,
            <given-names>M.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>R.C.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>W.H.S.d.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fiorini</surname>
            ,
            <given-names>S.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Silva</surname>
          </string-name>
          , R.M.d.G.:
          <article-title>Multimedia search and temporal reasoning</article-title>
          . arXiv preprint arXiv:
          <year>1911</year>
          .
          <volume>08225</volume>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Richter</surname>
            ,
            <given-names>S.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hayder</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koltun</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Playing for benchmarks</article-title>
          .
          <source>In: IEEE International Conference on Computer Vision</source>
          ,
          <string-name>
            <surname>ICCV</surname>
          </string-name>
          <year>2017</year>
          . pp.
          <volume>2232</volume>
          {
          <issue>2241</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Sera</surname>
            <given-names>ni</given-names>
          </string-name>
          , L.,
          <string-name>
            <surname>Homola</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Contextualized knowledge repositories for the semantic web</article-title>
          .
          <source>Journal of Web Semantics</source>
          <volume>12</volume>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Smeulders</surname>
            ,
            <given-names>A.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Worring</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santini</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gupta</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jain</surname>
          </string-name>
          , R.:
          <article-title>Content-based image retrieval at the end of the early years</article-title>
          .
          <source>IEEE Transactions on pattern analysis and machine intelligence</source>
          <volume>22</volume>
          (
          <issue>12</issue>
          ) (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>