<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Polysemy in Controlled Natural Language Texts</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Normunds Grūzītis</string-name>
          <email>normundsg@ailab.lv</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Guntis Bārzdiņš</string-name>
          <email>guntis@latnet.lv</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute of Mathematics and Computer Science, University of</institution>
          <country country="LV">Latvia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Controlled natural languages (CNL) and computational semantics in general do not address word sense disambiguation, i.e., they tend to interpret only some functional words that are crucial in the construction process of discourse representation structures. We present two alternative frameworks for supporting polysemy in controlled languages. The approaches result in more natural CNLs suitable for description and translation of multi-domain texts.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        There are several sophisticated controlled natural languages (CNL), which cover
relatively large subsets of English grammar providing seemingly informal means for
knowledge representation
        <xref ref-type="bibr" rid="ref8">(Schwitter et.al., 2008)</xref>
        . CNLs typically support
bidirectional mapping to some formal language like first-order logic (FOL) or its
decidable subset OWL DL
        <xref ref-type="bibr" rid="ref3">(Kaljurand and Fuchs, 2006)</xref>
        that allows to apply existing
tools for reasoning, consistency checking or even basic satisfiability model building.
      </p>
      <p>Two commonly accepted restrictions are used in CNLs to enable construction of
unambiguous discourse representation structures (DRS): a set of interpretation rules
for potentially ambiguous syntactic constructions, and a monosemous lexicon —
words are treated as predicate identifiers whose ‘meaning’ is defined only by FOL
formulas derived from the text being analyzed. While the first restriction limits only
syntactic sophistication of a language, the second one causes essential communication
limitations as the natural language lexicon is inherently polysemous (e.g. “The
library[collection] consists of million books”; “The city is constructing a new
library[building]”). The polysemy of natural language is not a deficiency, but rather a
gateway for referring to the rich background knowledge invoked by the same lexemes
in different contexts thus leading to multiple word-senses. In this paper we address
the latter limitation.</p>
      <p>
        The root cause of polysemy in a natural language is that at any given moment there
is only a ‘finite’ number of lexemes, which speakers of the given language know and
thus can use for communication. Meanwhile there is a potentially unlimited number
of new concepts (discourses) that might need to be named and communicated about.
Metaphoric reuse of existing lexemes therefore is unavoidable in the natural language,
which can be summed in a saying: language is a graveyard of ‘dead’ metaphors
        <xref ref-type="bibr" rid="ref4">(Leary, 1994)</xref>
        . Fortunately, various metaphoric senses of the same lexeme typically
fall in radically different domains, which is helpful in word sense disambiguation
(WSD).
A monosemous lexicon (terminology), of course, is appropriate for descriptions that
verbalize single-domain knowledge (i.e., consistent OWL DL ontologies). However,
even seemingly consistent descriptions might need to be artificially split into two or
more micro-domain descriptions to avoid lexical ambiguities and to maintain
compliance with existing naming conventions. A possible alternative in such cases is
to introduce artificial lexemes by explicitly pointing out the specific meanings (e.g.
“library-building” versus “library-collection”), but then the language becomes rather
un-natural and dependent on specific domain-ontology naming.
      </p>
      <p>Internally consistent domain ontologies that follow lexicon-driven naming
conventions we call micro-ontologies. Table 1 illustrates WSD problem when the text
references different domains that use overlapping and potentially inconsistent
terminology.</p>
      <p>Micro-ontologies (ontological text)</p>
      <p>Axioms in ACE
Every building is a construction and has a roof.</p>
      <p>Every library is a building.</p>
      <p>Every collection is an abstract-entity that contains some items.</p>
      <p>Every library is a collection that contains some publications.</p>
      <p>Every construction is a physical-entity.</p>
      <p>No physical-entity is an abstract-entity.</p>
      <p>Assertions (factual text)
x
o
-B There is a library[buildings] that has a green roof.</p>
      <p>A The library[collection] contains some valuable publications.</p>
      <p>The role of polysemy is most clearly apparent from the multilingual point of a
view: it is impossible to avoid interpretation of lexemes when translating a text. In our
case, interpretation means selection of the appropriate micro-ontology (see Table 2).</p>
      <p>Although grammar constructions (OWL DL mappings) and lexicons for the source
and target languages would differ, the interlingua — OWL DL micro-ontologies and
their consistent mergers — remain the same. Moreover, by attaching translation
equivalents to the ontological concepts, micro-ontologies simultaneously serve as
monosemous multi-lingual lexicons facilitating the translation process. The term
interlingua we mean in a wider sense: not only in the multi-lingual context, but also
for monolingual multi-domain communication.</p>
      <p>The problems of WSD and ontology merging are tightly intertwined and, in our
view, the lack of definitive success is largely due to addressing these issues
separately. Therefore we address both of these problems simultaneously — OWL DL
formal semantics can be used to dynamically handle micro-ontologies for WSD over
polysemous factual sentences. From all the available micro-ontologies for each
sentence (or clause) are selected those that can be invoked (directly or via some
merger) by a target lexeme (typically, a predicate) or other syntactically related
lexemes (syntactic links are mapped onto ontology properties). In general, more than
one micro-ontology can be invoked by an assertion due to the fact that different word
senses are not necessarily mutually inconsistent. Selecting the largest micro-ontology
merger likely unveils the most specific meaning (and facilitates further reasoning
tasks). However, it is not necessary to get rid of the consistent alternatives — in case
of later inconsistency they can be used during backtracking. As long as the current
discourse remains consistent, its merged ontology is gradually augmented; otherwise
additional discourse ontology is created separately.</p>
      <p>
        The proposed concept of micro-ontology is similar to the Cyc concept of
microtheories
        <xref ref-type="bibr" rid="ref5">(Lenat, 1995)</xref>
        , where all world-knowledge is split into narrow domain
microtheories (ontologies). In our approach micro-ontology is one of many internally
consistent domain-ontologies (or their dynamic mergers) described in OWL DL
(through a CNL or directly in an ontology editor), against which the polysemous
lexemes of the factual sentences can be mapped.
      </p>
    </sec>
    <sec id="sec-2">
      <title>3 Alternative approach to polysemy and discourse in CNL</title>
      <p>
        The above proposed rather ‘classic’ solution for adding polysemy to CNLs is
theoretically plausible, but it also raises a critical question: is this really how the
natural language works? It is well acknowledged in linguistic and cognitive sciences
that polysemes are etymologically and therefore semantically related, and typically
originate from metaphoric usage
        <xref ref-type="bibr" rid="ref7">(Ravin and Leacock, 2000)</xref>
        . The metaphoric view
erases the strict borders between polysemous word senses — these borders are
shifting with each creative use of a metaphor, and dictionaries or ontologies shall be
viewed only as short-lived snapshots of currently common word usages. To illustrate,
a metaphoric statement “She is a star” in natural language implies only that a person
possesses some aspect (e.g. being prominent) of a true star. Meanwhile a
monosemous CNL would likely interpret it literally as a light-emitting celestial star.
      </p>
      <p>
        Frame-semantic linguistic theory
        <xref ref-type="bibr" rid="ref1">(Fillmore et.al., 2003)</xref>
        has already come up with a
way to avoid such ‘tyranny’ of literal word meanings. Through extensive corpus
analysis FrameNet has identified approximately 700 frames which can be invoked by
actual words or sentences — regardless of being used literally or metaphorically.
Translation of the input text into FrameNet frames would resolve the problem of
polysemy. A CNL could help with translation disambiguation as explained below.
      </p>
      <p>The ultimate purpose of a CNL is to build a formal DRS capturing the full
semantics of the input text. Although one could try to merge the classic DRS
construction techniques rooted in FOL with FrameNet for a more natural polysemous
CNL, this would not aid the disambiguation problem. Therefore we propose an
alternative discourse representation approach based on PDDL (Planning Domain
Description Language) leading to a new kind of CNLs not rooted in FOL anymore.</p>
      <p>
        PDDL
        <xref ref-type="bibr" rid="ref6">(McDermott et.al., 1998)</xref>
        is designed to formalize dynamic models, where
actions guide the model through a series of states — in contrast to static models
specified by FOL. But most importantly — PDDL maps directly to FrameNet: a
PDDL action in most cases is the same FrameNet frame. Thus PDDL adds a formal
structure FrameNet was lacking — it introduces strict object and event identification
and therefore allows for syntactic subordination and global co-referential anaphora
encoding that is crucial for building large discourse structures. PDDL also comes with
a powerful constraint mechanism — actions in PDDL have formal precondition and
effect, which must be coordinated in consecutive actions to achieve a valid discourse
model. These PDDL action constraints along with global anaphora resolution enable
disambiguation of the FrameNet frame to be invoked by the particular lexeme.
      </p>
    </sec>
    <sec id="sec-3">
      <title>4 Discussion</title>
      <p>The idea to differentiate two kinds of sentences in natural language — the ontological
and the factual ones (T-Box and A-Box in Tables 1 and 2) — turns out to be a helpful
principle. Although natural language expressions occasionally might be a mixture of
both kinds of sentences, mostly such distinction on sentence level is possible.</p>
      <p>Polysemy is less relevant for the background knowledge (ontological sentences),
which essentially define language-independent abstract concepts in some, usually
monosemous, domain ontology. Meanwhile polysemy support is crucial for the
factual communication, which typically does not explicitly reference the background
knowledge (T-Box), which needs to be guess-mapped through the polysemous words
used in the text. In a CNL the corresponding T-Box has to be introduced explicitly
along with an A-Box, usually by manual sharing of ontological sentences among
factual texts. This forbids possibility for (inconsistent) polysemy in existing CNLs.</p>
      <p>While it is disputable whether a CNL is a more convenient approach for describing
ontologies (T-Boxes) than the formal languages and their graphic editors, a CNL is
definitely an advantage when describing concrete situations through factual sentences.
Vast amounts of such descriptions already exist in a written form: consider, for
example, information extraction from a newspaper archive, which is predominantly a
factual text.</p>
      <p>In contrast to the universal macro-ontologies, micro-ontologies offer several
significant advantages: (a) they do not impose a single consistent scheme, allowing
many distinct points of view to co-exist; (b) they can be seen as snapshots of some
aspects of ‘reality’, supporting non-stable and temporal entities — existing ontologies
don’t have to be updated each time the reality changes; alternative ontologies should
be introduced instead — it is a task of a reasoner to choose the appropriate ones; and
(c) they scale well — things don’t have to be compressed in a restricted number of
categories thus avoiding ‘signal losses’; the only restriction is the size of a lexicon.</p>
      <p>In Sections 2 and 3 we have proposed two very different approaches for adding
polysemy to CNLs — the micro-ontology approach in Section 2 is more traditional
and compatible with existing CNLs, while the PDDL approach in Section 3 is more
radical.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Fillmore C. J.</given-names>
            ,
            <surname>Johnson</surname>
          </string-name>
          <string-name>
            <given-names>C. R.</given-names>
            and
            <surname>Petruck M. R. L</surname>
          </string-name>
          .
          <article-title>Background to FrameNet</article-title>
          . In
          <source>International Journal of Lexicography 16</source>
          ,
          <year>2003</year>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Johansson R.</given-names>
            ,
            <surname>Berglund</surname>
          </string-name>
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Danielsson</surname>
          </string-name>
          <string-name>
            <given-names>M.</given-names>
            and
            <surname>Nugues P. Automatic</surname>
          </string-name>
          text
          <article-title>-to-scene conversion in the traffic accident domain</article-title>
          .
          <source>In Proceedings of the 19th International Joint Conference on Artificial Intelligence</source>
          ,
          <year>2005</year>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Kaljurand</surname>
            <given-names>K.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Fuchs</surname>
            <given-names>N. E.</given-names>
          </string-name>
          <article-title>Bidirectional mapping between OWL DL and Attempto Controlled English</article-title>
          .
          <source>In Proceedings of the 4th Workshop on Principles and Practice of Semantic Web Reasoning</source>
          ,
          <year>2006</year>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Leary D</surname>
          </string-name>
          . E. (Ed.)
          <article-title>Metaphors in the history of psychology</article-title>
          . Cambridge University Press,
          <year>1994</year>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Lenat D. Cyc</surname>
          </string-name>
          :
          <article-title>A Large-Scale Investment in Knowledge Infrastructure</article-title>
          .
          <source>In Communications of the ACM</source>
          ,
          <volume>38</volume>
          :
          <fpage>11</fpage>
          ,
          <year>1995</year>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>McDermott D.</surname>
          </string-name>
          et al.
          <article-title>PDDL - The Planning Domain Definition Language</article-title>
          .
          <source>Technical report</source>
          . Yale University,
          <year>1998</year>
          . Available at: http://www.cs.yale.edu/homes/dvm/ [13/03/2009]
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Ravin Y.</given-names>
            and
            <surname>Leacock</surname>
          </string-name>
          <string-name>
            <given-names>C.</given-names>
            <surname>Polysemy</surname>
          </string-name>
          . Oxford University Press,
          <year>2000</year>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Schwitter R.</given-names>
            ,
            <surname>Kaljurand</surname>
          </string-name>
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Cregan</surname>
          </string-name>
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Dolbear</surname>
          </string-name>
          <string-name>
            <given-names>C.</given-names>
            and
            <surname>Hart</surname>
          </string-name>
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>A Comparison of three Controlled Natural Languages for OWL 1.1</article-title>
          .
          <source>In Proceedings of the 4th International Workshop on OWL Experiences and Directions</source>
          , 2008
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>