<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>OWL Support for (Some) Non-Deductive Scenarios of Ontology Usage</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Vojetcˇh Sva´tek</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Miroslav Vacura</string-name>
          <email>vacuram@vse.cz</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martin Ralbovsky´</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ondreˇj Sˇva´b-Zamazal</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bijan Parsia</string-name>
          <email>bparsia@cs.man.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Economics</institution>
          ,
          <addr-line>W. Churchill Sq.4, 130 67 Prague 3</addr-line>
          ,
          <country country="CZ">Czech Republic</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Manchester</institution>
          ,
          <addr-line>Oxford Road, Manchester M13 9PL</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Applications of ontologies exist that go beyond standard deductive reasoning and rather have the character of empirical discovery in knowledge/data. We analyse the inventory of OWL with respect to two such applications, namely to pattern-based ontology matching and to ontology-aware knowledge discovery from databases.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Most design decisions about the OWL standard are motivated by its DL
semantics, reflecting deductive (in the general sense) scenarios of ontology exploitation.
We however assume that OWL ontologies will be frequently used and
sometimes even built by people primarily interested in computerised processing of
information beyond the scenarios foreseen by logicians. The inventory of OWL
could possibly be revisited with such applications in mind. We elaborate on two
such scenarios we exploited in our research: pattern-based ontology matching and
ontology-based knowledge discovery from databases (KDD). Other areas worth
similar analysis could be e.g. ontology retrieval over the web [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] or ontology-based
information extraction [
        <xref ref-type="bibr" rid="ref3 ref6">3, 6</xref>
        ].
By our experience from the Ontology Alignment Evaluation Initiative (OAEI),3
ontology matching systems often fail on discrepancies that are not due to
inherently different conceptualisations but due to different logical patterns (including
those listed in http://www.w3.org/2001/sw/BestPractices/OEP/) used when
adapting the conceptualisations to the restricted language. Let us present two
such cases: indicating reified n-ary relations and capturing the criteria of
concept partition. To illustrate them, we borrow from the ‘conference organisation’
domain as one of those examined within OAEI.4
      </p>
      <sec id="sec-1-1">
        <title>3 http://oaei.ontologymatching.org/2007/ 4 http://nb.vse.cz/∼ svabo/oaei2008/</title>
        <p>
          Problem A.1 A reviewer can submit a review of a paper. There is inherently
a ternary relationship between the reviewer, review and paper. The
modeller of ontology A may use the SWBPD n-ary relation pattern:5 creating
a concept such as ReviewSubmission, with properties reviewSubmitted,
submittedBy and reviewForPaper. The modeller of ontology B may
however prefer a simpler (though lossy) solution: creating multiple properties
expressing alternative variants of review, e.g. submittedPositiveReview
and submittedNegativeReview. If the ReviewSubmission concept were
annotated as ‘reiefid relation’ in ontology A, the task for a (pattern-aware)
mapping system, aligning A with B via a heterogeneous mapping [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], and
possibly enriching one other via a re-engineering pattern, would be easier.
Problem A.2 Submitted papers can be under review, accepted or rejected. At
the same time, papers can be submitted as e.g. full papers, position papers
or posters. In ontology A, this can be done by class SubmittedPaper
having disjoint partition subclasses such as PaperUnderReview, AcceptedPaper
and RejectedPaper, as well as FullPaper, PositionPaper and Poster.
In ontology B, the same can be done using two data properties for the
SubmittedPaper class: phase and category (with values analogous to the
classes above). If the two disjoint partition axioms in ontology A were
annotated as (informally) criterion=phase and criterion=category,
respectively, the task for a mapping system aligning A with B would be easier, again
yielding heterogeneous mappings (between an axiom and a data property).
3
        </p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Ontology-Based KDD</title>
      <p>
        In the context of KDD, formal ontologies allow, among other, to form the data
mining task more easily and accurately and to constrain the search space [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. In
the context of data pre-processing for association mining,6 as the most critically
needed part of domain information were identified important values dividing
the domain of a data attribute, and attribute groupings. We illustrate them on
medical examples inspired by [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>Problem B.1 For the human blood pressure, values that separate normal blood
pressure from hypertension and hypotension could be used to create
categories of the respective attributes; such categories could have solid chance for
yielding strong associations and also make these hypotheses more easily
interpretable by experts. Cutting values can be implicitly derived from
restrictions on data properties. For example, if class Hypertension Observation
is defined as having the value of data property hasSystolicBPValue higher
than 140 and the value of data property hasDiastolicBPValue higher than
90 then these values can be suggested as cutting values in the data
preparation phase of KDD. The designer of the ontology should however also be
able to specify cut values of a data property directly.</p>
      <sec id="sec-2-1">
        <title>5 http://www.w3.org/TR/swbp-n-aryRelations</title>
        <p>6 We do not see the this information specifically relevant to association mining; it can
probably be used in numerous other KDD tasks.</p>
        <p>Problem B.2 Attributes such as Angina Pectoris, Myocardial Ischemia and
Hypertension should be grouped together, as they all correspond to
cardiovascular diseases. The same holds for attributes such as Height, Weight or
Girth (biometric measures). While the former are likely to appear as classes
in an OWL ontology, the latter could sensibly be data properties.
4</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Possible Solutions</title>
      <p>
        It seems that (entities or axioms in) OWL ontologies could be enriched with
the required information (expressing e.g. relation reicfiation, partition criteria,
property cut values and property groupings) in at least four ways:
Solution S0 Enriching the specicfiation of the OWL language itself
Solution S1 Using taxonomic inheritance
Solution S2 Using meta-modelling
Solution S3 Using the forthcoming annotation system of OWL [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>The rfist option is clearly hard to put through due to language parsimony
concerns. We will thus not consider it for the moment.</p>
      <p>The second option is essentially harmless: additional information is attached
to the given entity by subordinating it to an entity from a specicfi ‘structural’
ontology. This however only allows to assign binary features (and not e.g.
numerical values) and only to atomic entities (not to axioms).</p>
      <p>
        The third option consists in introducing new properties that have as their
domain either OWL meta-classes such as rdfs:Class or owl:DataProperty or
their purposefully defined subclasses. This allows to enrich all entities that are
instances of the given meta-class; it however immediately lifts the ontology to
OWL Full and also contaminates the domain model with information irrelevant
to deduction. A prototype of such practice is the way of modelling so-called
meta-properties in OntoClean [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]: as discussed in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], the OntoClean
methodology requires putting properties on classes in such a way that OntoClean conflicts
manifest as an inconsistency. However, it seems likely that modelers need to keep
OntoClean coniflcts and domain modeling inconsistencies separate. Notably, an
OntoClean coniflct could be due to an incorrect subsumption; making the
ontology inconsistent makes it harder to understand and debug that subsumption.
      </p>
      <p>The fourth option looks most universal; it however depends on the final
structure of the OWL 2 annotation system. An advantage might be the possibility to
store additional information in a separate file (information space).</p>
      <p>Let us now relate the four problems above to the solutions S1-S3:
A.1 Mere concepts have to be annotated. It is thus possible to use S1, namely,
to create a special ontology with class such as ReifiedRelation, to
import this ontology, and to subordinate the ReviewSubmission class to it. A
disadvantage of this solution is the fact that an ontological concept was
introduced for a notion that is merely anchored in the language (here, OWL as
language lacking n-ary relation constructs) and has no real-world semantics.</p>
      <p>Alternatively, S3 could presumably be used.</p>
      <p>A.2 Disjoint partition axioms have to be annotated. The only applicable
solution seems to be S3.</p>
      <p>
        B.1 In our implemented prototype [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] we used S2, i.e. meta-modelling: data
properties such as domainDividingValue were assigned to rdfs:Class.7
Alternatively, S3 could presumably be used.
      </p>
      <p>B.2 As OWL allows to specify a taxonomy for both classes and properties
(including data properties), no information external to the actual domain
ontology is presumably needed in most cases, as attribute grouping can reeflct
the taxonomic closeness of the respective ontology entities. If necessary, an
ontology specifically expressing ‘groupings relevant to data mining’ could
be imported, with artificial classes such as Group Concept, i.e. realizing S1.
If such ‘hard-coding’ were undesirable, S2 could alternatively be used, via
introducing an object property such as sameAttributeGroup having
metaclasses in its domain and range. Finally, S3 could be applied in a similar
manner as S1.</p>
      <p>In general, it seems that exploiting a rich annotation system would help avoid
possible negative impact on formal complexity and/or domain accuracy.
This work was partially supported by the IGA VSE grant no.20/08 “Evaluation
and matching ontologies via patterns” and by the CSF project no.201/08/0802,
“Application of Knowledge Engineering Methods in Knowledge Discovery from
Databases”.
7 This was influenced by the concept-centric nature of UMLS; for a more
DB-schemalike ontology, owl:dataProperty or its new subclass would fit well, in turn.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Annotation</given-names>
            <surname>System. OWL WG</surname>
          </string-name>
          <article-title>Work-in-Progress document</article-title>
          , http://www.w3.org/ 2007/OWL/wiki/Annotation System.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>d'Aquin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gridinoc</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Angeletou</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sabou</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motta</surname>
          </string-name>
          , E.:
          <article-title>Characterizing Knowledge on the Semantic Web with Watson</article-title>
          .
          <source>In: EON'07 Workshop</source>
          at ISWC'
          <volume>07</volume>
          ,
          <string-name>
            <surname>Busan</surname>
          </string-name>
          , Korea,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Embley</surname>
            ,
            <given-names>D. W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tao</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liddle</surname>
            ,
            <given-names>D. W.</given-names>
          </string-name>
          :
          <article-title>Automatically extracting ontologically specified data from HTML tables of unknown structure</article-title>
          .
          <source>In: Proc. ER '02</source>
          , pp.
          <fpage>322</fpage>
          -
          <lpage>337</lpage>
          , London, UK,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Ghidini</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Serafini</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Reconciling concepts and relations in heterogeneous ontologies</article-title>
          .
          <source>In: Proc. ESWC</source>
          <year>2006</year>
          , Budva, Montenegro, LNCS
          <volume>4011</volume>
          ,
          <fpage>50</fpage>
          -
          <lpage>64</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Guarino</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Welty</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>An Overview of OntoClean</article-title>
          . In: Staab,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Studer</surname>
          </string-name>
          , R., eds.:
          <source>The Handbook on Ontologies</source>
          , Springer-Verlag,
          <fpage>151</fpage>
          -
          <lpage>172</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6. Labsky´,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Nekvasil</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            , Sva´tek, V.,
            <surname>Rak</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.</surname>
          </string-name>
          :
          <article-title>The Ex Project: Web Information Extraction using Extraction Ontologies</article-title>
          .
          <source>In: Proc. PriCKL'07, ECML/PKDD Workshop on Prior Conceptual Knowledge in Machine Learning and Knowledge Discovery. Warsaw</source>
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7. Sva´tek V.,
          <string-name>
            <surname>Rauch</surname>
            <given-names>J.</given-names>
          </string-name>
          , Ralbovsky´ M.:
          <article-title>Ontology-Enhanced Association Mining</article-title>
          . In: Ackermann, Berendt (eds.).
          <source>Semantics, Web and Mining</source>
          , Springer-Verlag,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Zeman</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ralbovsky</surname>
          </string-name>
          ´ M., Sva´tek V.,
          <string-name>
            <surname>Rauch</surname>
            <given-names>J</given-names>
          </string-name>
          .:
          <article-title>Ontology-Driven Data Preparation for Association Mining</article-title>
          . Online http://keg.vse.cz/onto-kdd-draft.pdf.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>