<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>B-Annot: Supplying Background Model Annotations for Ontology Coherence Testing</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Vojtech Svatek</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Simone Serra</string-name>
          <email>serrazimone@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Miroslav Vacura</string-name>
          <email>vacuramg@vse.cz</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martin Homola</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jan Kluka</string-name>
          <email>klukag@fmph.uniba.sk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Comenius University in Bratislava</institution>
          ,
          <addr-line>Mlynska dolina, 842 48 Bratislava</addr-line>
          ,
          <country country="SK">Slovakia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Univ. of Economics</institution>
          ,
          <addr-line>Prague, W. Churchill Sq.4, 130 67 Prague 3</addr-line>
          ,
          <country country="CZ">Czech Republic</country>
        </aff>
      </contrib-group>
      <fpage>59</fpage>
      <lpage>66</lpage>
      <abstract>
        <p>The demo paper presents B-Annot, a Protege plugin for annotation of ontologies and linked data vocabularies by background model distinctions. In addition, it briefly demonstrates the subsequent use of the annotations created by B-Annot, for verifying the ontological coherence of the ontologies/vocabularies at the level of meta-models. Finally, possible further extensions of the tool and its role in the (background-model-driven) ontological engineering workflow are briefly discussed.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>remaining two instance-level resources in the diagram (2 and 4) are the `offering'
itself and the value `90' (minutes), understood as `typical' and thus modeled as a
resource rather than a literal.<sup>5</sup> In the lower part of the diagram we approximate the
ontological background of this fragment (omitting the entities that would be types
in both diagrams, for easier readability). Among other things, we see that the notion
of `album', originally the value of the object property mo:release_type,
now becomes an additional type of the product offered, and that the `Offering'
object becomes absorbed by the `offers' relationship (now with arity &gt;2).</p>
      <p>
        Obviously, modelling the ontological background for each individual data
fragment is infeasible. The mapping between the `foreground view' of the
domain (as contained in the vocabulary) and the corresponding `background view'
thus has to be established at the level of entity types, which means, indirectly
(note that especially less expressive vocabularies are just collections of unlinked
entities whose connection is only established at the level of instance data). On
the one side of the mapping is an ontological foreground model (OFM), i.e., the
structure of an RDFS/OWL ontology; on the other side is an analogous
ontological background model (OBM). OBMs should be represented in a suitable
OBM language of modelling primitives (OBML). Two such languages are:
        <list list-type="bullet">
          <list-item>
            <p>OntoClean [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], which labels OFM classes with the ontological notions of
essentiality, rigidity (e.g., in the first example mentioned, `permanent' classes
Human and Robot would be rigid while the `temporary' class Student would
be anti-rigid), identity and unity.</p>
          </list-item>
          <list-item>
            <p>The recently designed PURO OBML [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ],<sup>6</sup> aiming to capture the background
distinctions of OFM entities as in the bottom part of Fig. 1: that between
objects (`particulars') and their types (`universals') and that between
relationships (or `valuations' by a quantitative value) and self-standing objects.</p>
          </list-item>
        </list>
OntoClean has proven useful for taxonomy-centric ontologies that dominate, e.g.,
in bioinformatics. On the other hand, PURO has been specifically designed for
`relation-centric' ontologies/vocabularies [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], which are prominent in LD.
Another important phenomenon in LD is that an existing entity might be
systematically used with a different background distinction than foreseen in the
vocabulary specification; for example, a property that is assumed to have
categories of objects in its range might refer to individual objects in some dataset.
Therefore, `generic' annotation of vocabularies might not be sufficient; we should
also be able to annotate vocabularies `as they are used' in a specific dataset.
      </p>
      <p>
        By their capacity to underpin the entities from various operational
(typically, domain-restricted) knowledge models with background ontological
distinctions, OBMLs are analogous to foundational ontologies. The difference is in the
way the `surface' and `deep' models are interconnected. A foundational ontology
provides root concepts upon which the `surface model' concepts are grafted; both
models thus share the same space. In contrast, OBMs reside in their own `layer';
when connecting an OFM with an OBM, we thus need to `inject' a `proxy' of one
model into the other model, so that neither interferes with the formal
semantics of the other. Two alternatives for creating such a `proxy', assuming
both layers are to be expressed in OWL-DL, are as follows:
        <list list-type="bullet">
          <list-item>
            <p>OBM entities could become values of specific OWL annotation properties,
and be saved as an unobtrusive part of (a copy of) the OFM.</p>
          </list-item>
          <list-item>
            <p>OFM entities (classes, properties and individuals) could be uniformly
meta-modelled as syntactic instances to be inserted as an A-Box into a
meta-modelling ontology, where their mapping to the OBM can be captured.</p>
          </list-item>
        </list>
The first alternative favours the visibility of the OBM distinctions to a
human working with the OFM. The second alternative, in turn, makes it possible
to carry out conceptual coherence checking according to constraints defined in
the meta-modelling ontology, via a generic OWL DL reasoning mechanism. This
approach has been previously tested for the OntoClean OBML in [
        <xref ref-type="bibr" rid="ref10 ref2">2, 10</xref>
        ], and
later for the PURO OBML by us [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>5 Such modelling is not common in MO but, rather, in GR-compliant
ontologies, cf. http://www.ebusiness-unibw.org/ontologies/opdm/#ontologies.</p>
      <p>6 A more extensive description is in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].</p>
      <p>
        In this system/demonstration paper we present B-Annot: a Protege plugin<sup>7</sup>
that allows the user to create and save meta-models of a selected vocabulary with respect
to either OntoClean or PURO, and (especially for the latter) in two
modalities, `generic' and `dataset-specific'. (Storage of OBM distinctions in annotation
property values, as well as other enhancements, is forthcoming.) We also briefly
demonstrate how the annotations can be used for conceptual coherence checking;
in contrast to the PURO-only coherence checking described in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], we now rely
on a modular set of ontologies that also includes an OntoClean module.
      </p>
    </sec>
    <sec id="sec-2">
      <title>B-Annot Functionality</title>
      <p>
        In summary, for a vocabulary already loaded into the Protege editor,
the tool allows the user:
        <list list-type="bullet">
          <list-item>
            <p>to select the meta-ontology (either OntoClean or PURO) and decide whether
generic or dataset-level annotation is going to take place;</p>
          </list-item>
          <list-item>
            <p>for dataset-specific annotation, to inspect the statistics of presence of entities
from the given vocabulary in different datasets, fetched online from
LODStats [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], to select an appropriate dataset, and to view the list of entities
from the vocabulary that occur in this dataset;</p>
          </list-item>
          <list-item>
            <p>for dataset-specific annotation, to browse a pre-computed summary of the
dataset (inspired by [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]), with entities from the vocabulary highlighted;</p>
          </list-item>
          <list-item>
            <p>to select an entity (in one of the Protege tabs) and annotate it with a
background model distinction;</p>
          </list-item>
          <list-item>
            <p>to save the whole annotation set to an RDF file, and load it back.</p>
          </list-item>
        </list>
      </p>
      <p>7 Available from http://patomat.vse.cz/cz.vse.bannotation.plugin.view.jar.</p>
      <p>[Fig. 1: diagram not reproduced. Upper part: an RDF data fragment built from MO and GR entities (among them mo:MusicalManifestation, mo:album, mo:ean, gr:Offering, gr:SomeItems, gr:BusinessEntity, gr:QuantitativeValueFloat, gr:offers, gr:includes, gr:eligibleRegions and ex:recordedLength, with the values 90 and "US-CA"). Lower part: the approximated ontological background, in which an `offers' relationship connects who offers, instances of what (Album, with a recorded length of instances of 90 min) and where (territory of USA and Canada).]</p>
      <p>We will now describe the scenario of dataset-specific annotation, since generic
annotation is essentially a subset of it. Furthermore, we will use the PURO
meta-ontology as more relevant in the dataset-specific mode. (OntoClean would be
applied in the same way.) Fig. 2 shows the B-Annot interface after the choice of
PURO and the dataset-specific annotation mode (FOAF has been previously loaded
into Protege as the ontology to be annotated). The user can see that of the 14
datasets for which the statistics have been fetched, 10 use some number of FOAF
entities, ranging from 1 to 23; these are relevant to the annotation session. After
clicking on the `summary' button for the Geospecies dataset, an ordered
listing of frequent `class-property-class' paths is displayed, a part of which is shown in Fig. 3.<sup>8</sup>
FOAF entities, here the properties depiction, isPrimaryTopicOf, primaryTopic
and topic, are displayed in red. Finally, the actual annotation takes place. In
Fig. 4 we see that the user, based on the observation that foaf:topic is usually
valued by biological taxa<sup>9</sup> in this dataset, assigns this property the PURO label
`PrT' (`property whose range is a type') from the pull-down menu (with items
picked from the meta-ontology depending on the entity type to be annotated:
class, property or individual). Entity annotations are subsequently listed in the
bottom part of the window, and can eventually be saved (and reloaded) in bulk,
as a set of hasLabel<sup>10</sup> triples.</p>
      <p>8 We also experiment with `class-property-class-property-class' paths, but they are not
implemented in the current version of the system.</p>
      <p>9 For the sake of this example, we omit the philosophical discussion of whether and for
what purpose a taxon should indeed be understood as a universal.</p>
      <p>10 Every meta-modelling ontology has its own hasLabel property; here it is the one
from the PURO ontology.</p>
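      <p>The bulk saving and reloading of annotations can be pictured with a minimal sketch that writes each annotation as one N-Triples line and reads it back. This is not the plugin's code: the paper does not specify the serialization, so the #hasLabel and #PrT fragment identifiers appended to the PURO ontology URL, and the helper names to_ntriples and from_ntriples, are assumptions made for illustration only.</p>
      <preformat><![CDATA[
```python
# Hypothetical IRIs: the base URL is the PURO ontology address given in the
# paper; the "#hasLabel" and "#PrT" fragments are assumptions.
PURO_NS = "http://patomat.vse.cz/puro_v1.1.owl#"
HAS_LABEL = PURO_NS + "hasLabel"


def to_ntriples(annotations):
    """Serialize {entity IRI: label name} as N-Triples lines sorted by entity."""
    lines = []
    for entity, label in sorted(annotations.items()):
        lines.append("<%s> <%s> <%s> ." % (entity, HAS_LABEL, PURO_NS + label))
    return "\n".join(lines)


def from_ntriples(text):
    """Load annotations back; only lines produced by to_ntriples are supported."""
    annotations = {}
    for line in text.splitlines():
        subject, predicate, obj, _dot = line.split()
        if predicate.strip("<>") == HAS_LABEL:
            # Drop the namespace prefix to recover the bare label name.
            annotations[subject.strip("<>")] = obj.strip("<>")[len(PURO_NS):]
    return annotations


# The annotation from the walkthrough: foaf:topic labelled 'PrT'.
annotations = {"http://xmlns.com/foaf/0.1/topic": "PrT"}
serialized = to_ntriples(annotations)
print(serialized)
```
]]></preformat>
      <p>One triple per annotated entity keeps the saved file independent of the annotated vocabulary itself, which matches the idea that a dataset-specific annotation set can be stored and exchanged separately.</p>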
    </sec>
    <sec id="sec-3">
      <title>Coherence Checking Examples</title>
      <p>
        For each OBML considered, the distinctions underlying a particular OFM can
be compared to a predefined set of coherence rules. For OntoClean there are four
standard coherence rules [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]: given two properties (in the OntoClean sense, i.e., classes) p and q, when q subsumes
p then: a) if q is anti-rigid then p must be anti-rigid, b) if q carries an identity
criterion then p must carry the same criterion, c) if q carries a unity criterion
then p must carry the same criterion, and d) if q has anti-unity then p must
also have anti-unity. The PURO OBML, in turn, specifies three constraints: for
a) entity coherence, b) type coherence, and c) relation coherence (for details
see [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]). We demonstrate the coherence checking on two example annotations.
      </p>
      <p>The first is a fragment of the GR ontology annotated with the PURO OBML,<sup>11</sup>
containing the class ProductOrService with subclasses Individual and
ProductOrServiceModel. Applying DL consistency checking to the PURO meta-ontology<sup>12</sup> and
this fragment leads to the inferred membership of the class ProductOrService in a special
`diagnostic' class of the PURO ontology: Incoherent-TPU. This class
contains meta-models of classes that `do not have homogeneous instances' in terms
of PURO, specifically, classes whose instances can be both particulars and universals.</p>
      <p>11 http://patomat.vse.cz/gr_mm.owl</p>
      <p>
        The second example is the annotation of a fragment of the ontology used to
demonstrate OntoClean inconsistencies in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. This fragment<sup>13</sup> includes
meta-entities representing six classes of the original ontology annotated with
OntoClean labels. The OntoClean meta-ontology used for coherence checking<sup>14</sup>
allows for validation of all four coherence rules. The ontology contains four classes
(Incoherent-Antiunity, Incoherent-Identity, Incoherent-Rigidity,
Incoherent-Unity) that are, as a result of inference, filled with individuals that represent classes
in the meta-model that are incoherent with regard to the respective OntoClean rules.
For example, the class AmountOfWater was annotated with the OntoClean labels
+O ~U +R. Its subclass LivingBeing was annotated with the OntoClean labels
+O +U +R. The defect of the model is that a class with the anti-unity label
(simply said, a class of objects whose arbitrary `section' is again an instance of
the same class) cannot subsume a class with the unity label (i.e., containing objects
that have `strict boundaries around themselves'). Therefore it is inferred that
the individual meta-modelling the class LivingBeing belongs to the diagnostic
class Incoherent-Antiunity.
      </p>
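      <p>The effect of the four OntoClean rules on this example can be imitated with a small procedural sketch. It is only an approximation under stated assumptions: the paper obtains the diagnosis by OWL DL reasoning over a meta-modelling ontology, not by a script; the function name diagnose and the data layout are hypothetical; the anti-unity label is written as ~U; and the identity and unity rules are simplified to checking that the +I or +U label is present, without comparing the actual criteria.</p>
      <preformat><![CDATA[
```python
# Each rule: (label found on the subsuming class q,
#             label then required on the subsumed class p,
#             diagnostic class named in the paper).
RULES = [
    ("~R", "~R", "Incoherent-Rigidity"),
    ("+I", "+I", "Incoherent-Identity"),
    ("+U", "+U", "Incoherent-Unity"),
    ("~U", "~U", "Incoherent-Antiunity"),
]


def diagnose(labels, subclass_of):
    """Return {diagnostic class: sorted list of incoherent subclasses}.

    labels maps a class name to its OntoClean label string, e.g. "+O ~U +R";
    subclass_of lists (p, q) pairs, each meaning that q subsumes p.
    """
    parsed = {name: set(text.split()) for name, text in labels.items()}
    report = {}
    for p, q in subclass_of:
        for on_q, needed_on_p, diagnostic in RULES:
            # A rule fires when q carries the label and p lacks the required one.
            if on_q in parsed[q] and needed_on_p not in parsed[p]:
                report.setdefault(diagnostic, set()).add(p)
    return {name: sorted(members) for name, members in report.items()}


# The two classes and labels from the example above.
labels = {"AmountOfWater": "+O ~U +R", "LivingBeing": "+O +U +R"}
report = diagnose(labels, [("LivingBeing", "AmountOfWater")])
print(report)
```
]]></preformat>
      <p>Under these assumptions only the anti-unity rule fires, placing LivingBeing in Incoherent-Antiunity, in line with the inference described above.</p>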
    </sec>
    <sec id="sec-4">
      <title>Conclusions and Future Work</title>
      <p>
        The B-Annot plugin represents the first proof-of-concept implementation of
annotation technology for ontologies and vocabularies that is (1) not restricted
to a single theoretical framework but supports multiple OBMLs, and (2)
interconnects the browsing/editing of ontologies (as supported by common
ontological editors) with LD summaries. It is a part of a prospective eco-system of
tools (other existing ones include, e.g., pattern-based ontology transformation
tools [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]) supporting (informed rather than merely intuitive) reuse and design of
ontologies on the semantic web.
      </p>
      <p>Serious usability testing and requirements collection for B-Annot are planned only
after some of the envisaged enhancements have been implemented.</p>
      <p>A straightforward extension of B-Annot will be the possibility to also store
annotations in OWL annotation properties of a copy of the annotated ontology.
This will allow for easy browsing of the annotations in their original context.</p>
      <p>
        Background annotation by distinctions referring to notions like `rigid' (in
OntoClean) or `particular' (in PURO) risks discouraging even reasonably
experienced ontological engineers without a philosophical background. The threshold
should thus be set as low as possible in the future, via operationalized
annotation guidelines. For OntoClean's rigidity alternatives, a promising approach has
already been shown by Seyed, who designed a wizard relying on common-sense
verbalization of the meaning of these alternatives [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. For the PURO OBML
distinctions, textual guidelines with examples have already been designed and
tested in a classroom assignment; the experience gained will be used to design
verbalisation templates similar to those from [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>12 http://patomat.vse.cz/puro_v1.1.owl</p>
      <p>13 http://patomat.vse.cz/ontoclean-coherence-check-1.owl</p>
      <p>14 http://patomat.vse.cz/ontoclean-v.1.0.owl</p>
      <p>As the number of mature vocabularies and their stable entities is still low,<sup>15</sup>
their purely manual annotation via B-Annot is feasible. In the long term, however,
partial automation could be achieved by leveraging two different sources: (1)
linguistic parsing of associated texts, especially the values of rdfs:comment,
and (2) logical inference of the most likely annotations based on previously
assigned annotations of interrelated entities, e.g., from superclasses to subclasses.</p>
      <p>This work has been supported by the EU ICT FP7 under no. 257943 (LOD2
project), by the VSE IGA project no. 34/2014, by the Slovak VEGA project
no. 1/1333/12, and by project APVV-0513-10.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Auer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Demter</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>LODStats - An Extensible Framework for High-Performance Dataset Analytics</article-title>
          .
          <source>In: EKAW</source>
          <year>2012</year>
          , Galway, Springer LNCS 7603.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Glimm</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rudolph</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , Volker, J.:
          <article-title>Integrated metamodeling and diagnosis in OWL 2</article-title>
          .
          <source>In: Proc. ISWC</source>
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Guarino</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Welty</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>An Overview of OntoClean</article-title>
          . In:
          <string-name>
            <surname>Staab</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Studer</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , eds.:
          <source>The Handbook on Ontologies</source>
          , pp.
          <fpage>151</fpage>
          -
          <lpage>172</lpage>
          , Springer-Verlag,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Presutti</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          , et al.:
          <article-title>Extracting core knowledge from Linked Data</article-title>
          .
          <source>In: Proceedings of the Second Workshop on Consuming Linked Data (COLD2011)</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Seyed</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>A Method for Evaluating Ontologies - Introducing the BFO-Rigidity Decision Tree Wizard</article-title>
          .
          <source>In: FOIS</source>
          <year>2012</year>
          :
          <fpage>191</fpage>
          -
          <lpage>204</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Svab-Zamazal</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dudas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Svatek</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>User-Friendly Pattern-Based Transformation of OWL Ontologies</article-title>
          .
          <source>In: Proc. EKAW'12</source>
          ,
          Galway
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Svatek</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Homola</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kluka</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vacura</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Ontological Distinctions for Linked Data Vocabularies</article-title>
          .
          <source>Technical Report TR-2013-039</source>
          . Comenius University, Bratislava,
          <year>2013</year>
          . Available online: http://kedrigern.dcs.fmph.uniba.sk/reports/display.php?id=54
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Svatek</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Homola</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kluka</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vacura</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Mapping Structural Design Patterns in OWL to Ontological Background Models</article-title>
          .
          <source>In: Proc. K-CAP</source>
          <year>2013</year>
          , ACM.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Svatek</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Homola</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kluka</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vacura</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Metamodeling-Based Coherence Checking of OWL Vocabulary Background Models</article-title>
          .
          <source>In: Proc. OWLED</source>
          <year>2013</year>
          , online http://ceur-ws.org/Vol-1080/owled2013_6.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Welty</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>OntOWLClean: Cleaning OWL ontologies with OWL</article-title>
          .
          <source>In: Proc. FOIS</source>
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          15 The statistics at http://lov.okfn.org/dataset/lov/stats/ reveal that out of the several thousand entities referenced in LD, there are only about 150 that are at the same time reused by more than one other vocabulary and instantiated by at least 100 LOD instances.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>