<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>What Formalism for the Semantic Web?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Hassan A¨ıt-Kaci</string-name>
          <email>hassan.ait-kaci@univ-lyon1.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ANR Chair of Excellence CE DAR Project LIRIS Universite ́</institution>
          <addr-line>Claude Bernard Lyon 1</addr-line>
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The world is changing. The World Wide Web is changing. It started out as a set of purely notational conventions for interconnecting information over the Internet. The focus of information processing has now shifted from local disconnected disc-bound silos to Internet-wide interconnected clouds. The nature of information has also evolved. From raw uniform data, it has now taken the shape of semi-structured data and meaningcarrying so-called “Knowledge Bases.” While it was sufficient to process raw data with structure-aware querying, it has now become necessary to process knowledge with contents-aware reasoning. Computing must therefore adapt from dealing with mere explicit data to inferring implicit knowledge. How to represent such knowledge and how inference therefrom can be made effective (whether reasoning or learning) is thus a central challenge among the many now facing the world wide web. So called “ontologies” are being specified and meant to encode formally encyclopedic as well as domain-specific knowledge. One early (still on-going) such effort has been the Cyc1 system. It is a knowledge-representation system (using LISP syntax) that makes use of a set of varied reasoning methods, altogether dubbed “commonsense.” A more recent formalism issued of Description Logic (DL)-viz. the Web Ontology Language (OWL2)-has been adopted as a W3C recommendation. It encodes knowledge using a specific standardized (XML, RDF) syntax. Its constructs are given a modeltheoretic semantics which is usually realized operationally using tableau3-based reasoning.4 The point is that OWL is clearly designed for a specific logic and reasoning method. Saying that OWL is the most adequate interchange formalism for Knowledge Representation (KR) and automated reasoning (AR) is akin to saying that English is the best designed human language for facilitating information interchange among humans-notwithstanding the fact that it was simply imposed by the most recent pervasive ruling power, just as Latin was Europe's Lingua Franca for centuries. Thus, it is fair to ask one's self a simple question: “Is there, indeed, a single most adequate knowledge representation and reasoning method that can be such a norm? ” 1 http://www.cyc.com/platform/opencyc 2 http://www.w3.org/TR/owl-features/ 3 http://en.wikipedia.org/wiki/Method_of_analytic_tableaux 4 Using of tableau methods is the case of the most prominent SW reasoner [6, 5, 7]. Systems using alternative reasoning methods must first translate the DL-based syntax of OWL into their own logic or RDF query processing. This may be costly [9] and/or incomplete [8].</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>I personally do not think so. In this regard, I share the general philosophy of Doug
Lenat5, Cyc’s designer—although not the haphazard approach he has chosen to follow.6</p>
      <p>If one ponders what characterizes an ontology making up a knowledge base, some
specific traits most commonly appear. For example, it is universally acknowledged that,
rather than being a general set of arbitrary formal logical statements describing some
generic properties of “the world,” a formal knowledge base is generally organized as
a concept-oriented information structure. This is as important a change of perspective,
just as object-oriented programming was with respect to traditional method-oriented
programming. Thus, some notion of property “inheritance” among partially-ordered
“concepts” (with an “is-a” relation) is a characteristic aspect of KR formalisms. In
such a system, a concept has a straightforward semantics: its denotes of set of elements
(its “instances”) and the “is-a” relation denotes set inclusion. Properties attached to a
concept denote information pertaining to all instances of this concept. All properties
verified by a concept are therefore inherited by all its subconcepts.</p>
      <p>Sharing this simple characteristic, formal KR formalisms have emerged from
symbolic mathematics that offer means to reason with conceptual information, depending
on mathematical apparatus formalizing inheritance and the nature of properties attached
to concepts. In Description Logic7, properties are called “roles” and denote binary
relations among concepts. On the other hand, Formal Concept Analysis (FCA8) uses an
algebraic approach whereby an “is-a” ordering is automatical derived from
propositional properties encoding the concepts that are attached to as bit vectors. A concept is
associated an attribute with a boolean marker (1 or “true”) if it possesses it, and with
a (0 or “false”) otherwise. The bit vectors are simply the rows of the “property
matrix” relating concepts to their attributes. This simple and powerful method, originally
proposed by Rudolf Wille, has a dual interpretation when matching attributes with
concepts possessing them. Thus, dually, it views attributes also as partially ordered (as the
columns of the binary matrix). An elegant Galois-connection ensues that enables
simple extraction of conceptual taxonomies (and their dual attribute-ordered taxonomies)
from simple facts. Variations such as Relational Concept Analysis (RCA9) offer more
expressive, and thus more sophisticated, knowledge while preserving the essential
algebraic properties of FCA. It has also been shown how DL-based reasoning (e.g. OWL)
can be enhanced with FCA.10</p>
      <p>Yet another formalism for taxonomic attributed knowledge, which I will present
in more detail in this presentation, is the Order-Sorted Feature (OSF ) constraint
formalism. This approach proposes to see everything as an order-sorted labelled graph.
5 http://en.wikipedia.org/wiki/Douglas_Lenat
6 However, I may stand corrected in the future since knowledge is somehow fundamentally
haphazard. My own view is that, even for dealing with a heterogenous world, I would rather
favor mathematically formal representation and reasoning methods dealing with uncertainty
and approximate reasoning, whether probabilistic, fuzzy, or dealing with inconsistency (e.g.
rough sets, paraconsistency).
7 http://en.wikipedia.org/wiki/Description_logic
8 http://en.wikipedia.org/wiki/Formal_concept_analysis
9 http://www.hse.ru/data/2013/07/04/1286082694/ijcai_130803.pdf
10 http://ijcai-11.iiia.csic.es/files/proceedings/</p>
      <p>T13-ijcai11Tutorial.pdf
Sorts are set-denoting and partially ordered with an inclusion-denoting “is-a” relation,
and so form a conceptual taxonomy. Attributes, called “features,” are function-denoting
symbols labelling directed edges between sort-labelled nodes. Such OSF graphs are a
straightforward generalization of algebraic First-Order Terms (FOTs) as used in Logic
Programming (LP) and Functional Programming (FP). Like FOTs, they form a lattice
structure with OSF graph matching as the partial ordering, OSF graph unification
as infimum (denoting set intersection), and OSF graph generalization as supremum.11
Both operations are very efficient. These lattice-theoretic properties are preserved when
one endows a concept in a taxonomy with additional order-sorted relational and
functional constraints (using logical conjunction for unification and disjunction for
generalization for the attached constraints). These constraints are inherited down the
conceptual taxonomy in such a way as to be incrementally enforceable as a concept becomes
gradually refined.</p>
      <p>
        The OSF system has been the basis of Constraint Logic Programming for KR
and ontological reasoning (viz. LIF E ) [
        <xref ref-type="bibr" rid="ref1 ref2">2, 1</xref>
        ]. As importantly, OSF graph-constraint
technology has been at work with great success in two essential areas of AI: NLP and
Machine Learning:
– it has been a major paradigm in the field of Natural Language Processing (NLP)
for a long time; notably, in so-called “Head-driven Phrase Structure Grammar”
(HPSG12) and Unification Grammar (UG13) technology [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. This is indeed not
surprising given the ease with which feature structure unification enables combining
both syntactic and semantic information in a clean, declarative, and efficient way.14
– Similarly, while most of the attention in the OSF literature has been devoted to
unification, its dual—namely, generalization—is just as simple to use, and computes
the most specific OSF term that subsumes two given terms [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. This operation is
central in Machine Learning and with it, OSF technology lends itself to be
combined with popular Data Mining techniques such as Support Vector Machines using
frequency or probabilistic information.
      </p>
      <p>In this presentation, I will give a rapid overview of the essential OSF formalism
for knowledge representation along its reasoning method which is best formalized as
order-sorted constraint-driven inference. I will also illustrate its operational efficiency
and scalability in comparison with those of prominent DL-based reasoners used for the
Semantic Web.</p>
      <p>The contribution of this talk to answering the question in its title is that the Semantic
Web effort should not impose a priori putting all our eggs in one single (untested)
basket. Rather, along with DL, other viable alternatives such as the FCA and OSF
formalisms, and surely others, should be combined for realizing a truly semantic web.
11 This supremum operation, however, does not (always) denote set union—as for FOT
subsumption, it is is not modular (and hence neither is it distributive).
12 http://en.wikipedia.org/wiki/Head-driven_phrase_structure_
grammar
13 http://www.cs.haifa.ac.il/˜shuly/malta-slides.pdf
14 http://citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.51.2021</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. A¨
          <string-name>
            <surname>IT-KACI</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <article-title>Data models as constraint systems-a key to the Semantic Web</article-title>
          .
          <source>Constraint Processing Letters</source>
          <volume>1</volume>
          (
          <year>November 2007</year>
          ),
          <fpage>33</fpage>
          -
          <lpage>88</lpage>
          . online: http://cs.brown.edu/ people/pvh/CPL/Papers/v1/hak.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. A¨
          <string-name>
            <surname>IT-KACI</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <article-title>AND PODELSKI, A. Towards a meaning of LIFE</article-title>
          .
          <source>Journal of Logic Programming</source>
          <volume>16</volume>
          ,
          <fpage>3</fpage>
          -
          <lpage>4</lpage>
          (
          <year>1993</year>
          ),
          <fpage>195</fpage>
          -
          <lpage>234</lpage>
          . online: http://hassan-ait-kaci.net/pdf/ meaningoflife.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3. A¨
          <string-name>
            <surname>IT-KACI</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , AND SASAKI,
          <string-name>
            <surname>Y.</surname>
          </string-name>
          <article-title>An axiomatic approach to feature term generalization</article-title>
          .
          <source>In Proceedings of European Cinference on Machine Learning (ECML</source>
          <year>2001</year>
          )
          <article-title>(Freiburg</article-title>
          , Germany,
          <year>2001</year>
          ),
          <string-name>
            <given-names>L. D.</given-names>
            <surname>Raedt</surname>
          </string-name>
          and P. Flach, Eds.,
          <source>LNAI 2167</source>
          , Springer-Verlag, pp.
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          . online: http://www.hassan
          <article-title>-ait-kaci</article-title>
          .net/pdf/ecml01.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>CARPENTER</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <article-title>Typed feature structures: A generalization of first-order terms</article-title>
          .
          <source>In Proceedings of the 1991 International Symposium on Logic Programming</source>
          (Cambridge, MA, USA,
          <year>1991</year>
          ),
          <string-name>
            <given-names>V.</given-names>
            <surname>Saraswat</surname>
          </string-name>
          and
          <string-name>
            <given-names>K.</given-names>
            <surname>Ueda</surname>
          </string-name>
          , Eds., MIT Press, pp.
          <fpage>187</fpage>
          -
          <lpage>201</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>MOTIK</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>SHEARER</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , AND HORROCKS,
          <string-name>
            <surname>I.</surname>
          </string-name>
          <article-title>Hypertableau reasoning for description logics</article-title>
          .
          <source>Journal of Artificial Intelligence Research</source>
          <volume>36</volume>
          ,
          <issue>1</issue>
          (
          <year>September 2009</year>
          ),
          <fpage>165</fpage>
          -
          <lpage>228</lpage>
          . online: https://www.jair.org/media/2811/live-2811-4689-jair.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>SHEARER</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , MOTIK,
          <string-name>
            <surname>B.</surname>
          </string-name>
          ,
          <article-title>AND HORROCKS, I. HermiT: A highly-efficient OWL reasoner</article-title>
          .
          <source>In Proceedings of the 5th International Workshop on OWL Experiences and Directions</source>
          (Karlsruhe, Germany,
          <year>October 2008</year>
          ), U. Sattler and C. Dolbear, Eds., OWLED'08, CEUR Workshop Proceedings. online: http://www.cs.ox.ac.uk/ian.horrocks/ Publications/download/2008/ShMH08b.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>SIRIN</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>PARSIA</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>GRAU</surname>
            ,
            <given-names>B. C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>KALYANPUR</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , AND KATZ,
          <string-name>
            <surname>Y.</surname>
          </string-name>
          <article-title>Pellet: A practical OWL-DL reasoner</article-title>
          .
          <source>Journal of Web Semantics</source>
          <volume>5</volume>
          ,
          <issue>2</issue>
          (
          <year>June 2007</year>
          ),
          <fpage>51</fpage>
          -
          <lpage>53</lpage>
          .
          <article-title>This is a summary; full paper: online: http://pellet</article-title>
          .owldl.com/papers/sirin05pellet.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. STOILOS,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>CUENCA</surname>
          </string-name>
          <string-name>
            <surname>GRAU</surname>
          </string-name>
          ,
          <string-name>
            <surname>B.</surname>
          </string-name>
          ,
          <article-title>AND HORROCKS, I. How incomplete is your semantic web reasoner</article-title>
          ?
          <source>In Proceedings of the 24th National Conference on Artificial Intelligence (AAAI</source>
          <volume>10</volume>
          )
          <article-title>(Atlanta, Georgia</article-title>
          , USA, July
          <volume>11</volume>
          -
          <issue>15</issue>
          ,
          <year>2010</year>
          ),
          <string-name>
            <given-names>M.</given-names>
            <surname>Fox</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Poole</surname>
          </string-name>
          , Eds., AAAI, AAAI Publications, pp.
          <fpage>1431</fpage>
          -
          <lpage>1436</lpage>
          . online: http://www.cs.ox.ac.uk/ian. horrocks/Publications/download/2010/StCH10a.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>THOMAS</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>PAN</surname>
            ,
            <given-names>J. Z.</given-names>
          </string-name>
          , AND REN,
          <string-name>
            <surname>Y.</surname>
          </string-name>
          <article-title>TrOWL: Tractable OWL 2 reasoning infrastructure</article-title>
          .
          <source>In Proceedings of the 7th Extended Semantic Web Conference (Heraklion</source>
          , Greece, May-June 2010), L. Aroyo,
          <string-name>
            <given-names>G.</given-names>
            <surname>Antoniou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Hyvnen</surname>
          </string-name>
          , A. ten Teije,
          <string-name>
            <given-names>H.</given-names>
            <surname>Stuckenschmidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Cabral</surname>
          </string-name>
          , and T. Tudorache, Eds.,
          <source>ESWC'10</source>
          , Springer-Verlag, pp.
          <fpage>431</fpage>
          -
          <lpage>435</lpage>
          . online: http: //homepages.abdn.ac.uk/jeff.z.pan/pages/pub/TPR2010.pdf.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>