<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>OntoQAV: A Pipeline for Visualising Ontology Quality</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Silvio Mc Gurk</string-name>
          <email>silvio.mcgurk.15@um.edu.mt</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jeremy Debattista</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Charlie Abela</string-name>
          <email>charlie.abela@um.edu.mt</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ADAPT Centre, School of Computer Science and Statistics, Trinity College Dublin</institution>
          ,
          <country country="IE">Ireland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Malta</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Ontological Quality has been the subject of extensive research due to the importance of ensuring that a system's underlying ontologies are t for purpose. Understanding quality problems may not be straightforward, depending on the ontologies' magnitude and complexity, the nature and extent of the problems, and the metrics used in its quality assessment. In this poster paper we present an innovative pipeline3 linking together a quality assessment framework (Luzzu) and an ontology visualisation framework (WebVOWL) in order to establish an ecosystem whereby knowledge engineers can assess and interactively understand quality problems within concepts and properties in ontologies.</p>
      </abstract>
      <kwd-group>
        <kwd>Ontology Quality</kwd>
        <kwd>Quality Framework</kwd>
        <kwd>Quality Assessment</kwd>
        <kwd>Quality Visualisation</kwd>
        <kwd>Ontology Visualisation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The use of ontologies has become widespread across many domains (e.g.
biological, geographical, government, etc: : :) as they provide the means for sharing
concepts and data among di erent organisations [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Together with the RDF
standard, the idea behind ontologies was to solve data interoperability
problems. However, choosing a t-for-use ontology for a system might not be the
simplest task. Research carried out by [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], amongst others, resulted in a
number of metrics being proposed to help identify quality problems. Nonetheless,
without the right tools, ontology stakeholders still encounter di culties when
choosing the right ontology for the task at hand.
      </p>
      <p>
        Inspired by the progress achieved on Linked Data quality frameworks [
        <xref ref-type="bibr" rid="ref1 ref4">1, 4</xref>
        ] and
ontology visualisation frameworks [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], we aim to exploit these state-of-the-art
frameworks in order to address a missing niche in ontology quality, that is,
assisting system engineers in nding the t-for-use ontology. In this poster
paper we present OntoQAV, a pipeline integrating Luzzu [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], a generic Linked
Data quality assessment framework, and WebVOWL [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], a widely used tool
to represent RDF-based ontologies in a visual format. The rational behind
this pipeline is to create an ecosystem with the aim of allowing stakeholders
to assess ontologies on various quality metrics and to visualise any identi ed
problems. We aim to provide the possibility of assessing multiple ontologies at
once and presenting a comparative visualisation and summary of the quality
problems within the ontologies being assessed. The main contributions of this
pipeline are: (1) implementing ontology quality metrics for Luzzu identi ed in
our previous work [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]; and (2) implementing a plug-in for WebVOWL which
takes as input a quality problem report from Luzzu and displays the problems
in WebVOWL. Metrics for Luzzu and the WebVOWL plug-in are available in a
public repository, along with instructions and links for a demonstration of the
pipeline (available at http://github.com/silviomcgurk/OntoQAV).
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>The OntoQAV Pipeline</title>
      <p>Therefore, the result is an augmented visualisation which gives a graphical
representation of the nodes and links within the ontology, with an additional
visualisation layer of the problems identi ed by the quality assessment framework.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Visualising Ontology Quality Problems</title>
      <p>
        Luzzu [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and WebVOWL [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] are the two tools chosen in our proposed pipeline
to demonstrate a proof-of-concept for visualising ontology quality. Furthermore,
in the future, we aim to make this pipeline generic, implementing and making
use of mechanisms and vocabularies that facilitates the data exchange between
existing and future tools. Luzzu is an extensible Linked Data quality assessment
framework. It also provides quality metadata and problem reports that can be
leveraged in semantic-driven frameworks for other tasks. WebVOWL is an
ontology visualisation tool, representing concepts and properties of ontologies in a
way that can be easily understood. Upon completion of the quality assessment
for a given ontology, Luzzu provides a Linked Data structured problem report
(cf. Listing 1.1 for a snippet) highlighting problematic concepts and properties
for each assessed metric. This problem report is then converted into a JSON-LD
serialisation and used during the modi cation of the WebVOWL DOM objects
to represent the problematic concepts.
ex : QualityProblem a qpro : QualityProblem ;
qpro : i s D e s c r i b e d B y ex : C y c l e s I n O n t o l o g y M e t r i c ;
qpro : p r o b l e m a t i c T h i n g [
a r d f : Statement ;
r d f : o b j e c t ex : P i z z a ;
r d f : p r e d i c a t e ex : subClassOf ;
r d f : s u b j e c t ex : NamedPizza ] .
      </p>
      <p>
        Listing 1.1: Snippet from Luzzu Problem Report (Turtle)
Together with the JSON-LD serialisation of the problem report, the assessed
ontology is loaded into WebVOWL for its visualisation. Following the loading of
the ontology, the user is given a choice to view the problematic concepts, upon
which the proposed rendering of quality visualisation is triggered. At this stage,
the pipeline plug-in interacts with the rendered visualisation of the ontology and
modi es it through the browser's Document Object Model (DOM) to augment
the visualisation with quality information. Shading of the red colour has been
selected to represent quality issues within an ontology. For every problem identi ed
by Luzzu, the plug-in gives a red shade to the problematic components (nodes,
properties or relationships). Components that fail more than once (with di erent
metrics) will have a darker shade. As a result, the shading of red from light to
dark colour indicates the extent of possible quality issues of the respective
component. Additional information regarding the quality problems of concepts and
properties of the visualised ontology are shown to the user in the WebVOWL
sidebar. Figure 2a shows the visualisation of the pizza ontology rendered in
WebVOWL. Once the quality visualisation plug-in is invoked, problematic concepts
are highlighted in red (cf. Figure 2b). In this example, the pizza ontology has
been assessed by the Cycles in Ontology Metric [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The assessment identi ed the
concepts American, NamedPizza, and Pizza and properties rdfs:subClassOf
violating this metric, and thus are highlighted in red.
(a) Ontology Visualisation in
VOWL
      </p>
      <p>Web(b) Augmented Quality Layer and
Sidebar Quality Information</p>
    </sec>
    <sec id="sec-4">
      <title>Final Remarks and Future Work</title>
      <p>In this poster paper we proposed an innovative pipeline that links together a
quality assessment framework and visualisation tool in e ort to establish an
ecosystem to enhance the evaluation of ontologies from a quality perspective,
providing an intuitive way of looking at various quality problems an ontology
might have. Our future work includes a plan to do a comprehensive quality
assessment on LOV4 ontologies and make available their respective quality
assessment, quality problem reports and augmented visualisation.
4 http://lov.okfn.org/dataset/lov/</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Debattista</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Auer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lange</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <string-name>
            <surname>Luzzu - A Methodology</surname>
          </string-name>
          and
          <article-title>Framework for Linked Data Quality Assessment</article-title>
          .
          <source>Journal of Data and Information Quality</source>
          . (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Lohmann</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Negru</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haag</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ertl</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Visualizing ontologies with VOWL</article-title>
          .
          <source>Semantic Web</source>
          .
          <volume>7</volume>
          ,
          <fpage>399</fpage>
          -
          <lpage>419</lpage>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Mc</surname>
            Gurk
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Abela</surname>
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Debattista</surname>
            <given-names>J</given-names>
          </string-name>
          .:
          <article-title>Towards Ontology Quality Assessment</article-title>
          .
          <source>Joint proceedings of the MEPDaW 2017 and LDQ</source>
          <year>2017</year>
          .
          <article-title>(</article-title>
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Poveda-Villaln</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gomez-Perez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suarez-Figueroa</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>OOPS! (OntOlogy Pitfall Scanner!):</article-title>
          .
          <source>Intl. Journal on Semantic Web and Information Systems</source>
          . (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Ristoski</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paulheim</surname>
          </string-name>
          , H.:
          <article-title>Semantic Web in data mining and knowledge discovery: A comprehensive survey</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web. 36</source>
          ,
          <fpage>1</fpage>
          -
          <lpage>22</lpage>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>