<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Design of a CNL to Involve Domain Experts in Modeling</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sivlie Spreeuwenberg</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jeroen van Grondelle</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ronald Heller</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gartjan Grijzen</string-name>
          <email>g.grijzeng@beinformed.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Be Informed</institution>
          ,
          <addr-line>Apeldoorn</addr-line>
          ,
          <country country="NL">Netherlands</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>LibRT</institution>
          ,
          <addr-line>Amsterdam</addr-line>
          ,
          <country country="NL">Netherlands</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Need for Controlled Natural Languages in Modeling</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Involving domain experts in modeling is important since knowledge needs to be captured in a model and only domain experts can establish whether the models are correct. We have experienced that a natural language based representation of a model helps them to understand the semantics of a model and has advantages over a visual representation. Therefore a controlled natural language (CNL) is designed for our existing semantic reasoning tool based on conceptual graphs (Be Informed). The resulting CNL has a formal logical basis but the goal of the CNL representation is to improve readability for human readers. We report on the challenge to develop a CNL that 1) is easy and intuitively readable for domain experts with no background in formal logics, 2) can be easily generated from the formal representation and 3) can be easily adjusted for other natural languages and cultural preferences. The solution uses patterns to represent the CNL that map to the conceptual graph. The patterns are based on SBVR's RuleSpeak and can be easily adjusted for local di erences.</p>
      </abstract>
      <kwd-group>
        <kwd>Controlled Natural Language</kwd>
        <kwd>Business Rules</kwd>
        <kwd>Speci cations</kwd>
        <kwd>Knowledge Representation</kwd>
        <kwd>CNL design and evaluation</kwd>
        <kwd>SBVR</kwd>
        <kwd>RuleSpeak</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>services based on these knowledge models. Knowledge representation in Be
Informed is based on concept graphs. To add semantics, the concepts, relations and
properties are typed, using types from a metamodel associated with the graph.
The tool represents the knowledge as a network diagram. A visual syntax maps
icons, line styles and colors to metamodel types.</p>
      <p>
        A rst version of the textual representation presented in this paper was used
to communicate a risk taxonomy to classify shipments of goods to insurance
underwriters. Although the sentences produced were very basic and consisted of
just the subject and object of a triple with a verb in between encoding for the
relation type, the underwriters immediately spotted constructs that appeared
odd to them. This resulted in an improved recall rate of modeling errors. This
early success has motivated further research at the Dutch Immigration O ce [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
A next version [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] of the text generator was used to validate candidate policy
decisions for consistency before they are accepted. In workshops with business
representatives and legal advisors, the policy is de ned in the tool that also will
be used to execute this policy. Both a visual graph oriented representation and
the textual representation discussed in this paper were used. It is important to
note that the parties involved here were unfamiliar with formal representation
techniques and would normally express any policy in unrestricted, natural
language. The expectation that the textual representation was preferred over the
diagrams was con rmed by the participants. An interesting new observation was
that the sentence should be a grammatically correct sentence.
      </p>
      <p>This paper reports on the design and implementation of a CNL that helps
Be Informed customers to actively participate in modeling knowledge.
2</p>
      <p>
        Design and Implementation of a CNL for Be Informed
Controlled languages are often classi ed in one of two categories [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]: those that
improve readability for human readers and those that enable reliable automatic
semantic analysis of the language. The language that we designed has a formal
logical basis. But all too often languages in the second category do not read very
naturally. The challenge for Be Informed was to design a language that can be
easily generated from a conceptual graph and is natural and understandable for
people to read.
      </p>
      <p>
        Using CNLs to represent ontology's has been done before, for instance in
Attempto Controlled Language [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and CLOnE [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. They both use natural
language generation (NLG) to create a textual representation and natural language
processing (NLP) to roundtrip the ontology based on the changed text.
      </p>
      <p>
        The textual syntax de nition proposed in this paper is quite similar to the
de nition used in CLOnE. Our approach towards editing a model based on a
natural language representation does not use NLP and has more in common
with Conceptual Authoring [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Editing is not performed by manipulating text
but by performing editing operations at the concept level, with the text being
updated to re ect concept-level changes.
2.1
      </p>
      <p>Pattern Based Generation Approach and SBVR's RuleSpeak
The mechanism we use is based on pattern sentences that map to a concept
graph. The formal model remains the single source at all times. The textual
representation is just used as a view on the formal model and editing operations
by the user in the view are translated into updates to the underlying formal
model.</p>
      <p>The structure of the textual representation of the formal graph is de ned by
pattern sentences. A pattern sentence consists of static text fragments and
subject, object and property placeholders. Fragments and placeholders are grouped
into sentence parts, in order to make certain parts of the sentence optional.
Cardinality in the (meta) model can be represented using multiple sentences or using
enumerated lists of relations of the same type. The sentences are hand-crafted
to communicate the semantics of the graph constructs they represent but are
re-used for di erent projects.</p>
      <p>
        The advantage of pattern sentences that map to the formal (meta) model
directly is that no NLP or parsing needs to be performed on the textual
representation. This provides freedom in choosing or updating the pattern sentences,
eventually based on audience-speci c preferences, without constraints from NLP
techniques. Methods such as RuleSpeak [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and the OMG standard SBVR [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]
have rationalized the use of natural language for the business by introducing
syntactic guidelines and best practices. Our latest sentence patterns include
RuleSpeak keywords (must, always) and follow the guideline that a rule should
be expressed by a grammatically complete, correct and readable sentence. These
patterns make sentences easier to read and place a natural and intuitive
emphasis on the fact that the sentence introduces an obligation (and is not `just' a
potential statement) for business experts with no background in formal logics.
The sentence patterns also provide guidance to direct people into being more
formal.
2.2
      </p>
      <p>
        Implementation and Example
The following sentences illustrate the mapping of pattern sentences to a product
model about product purchases and applicable discounts and are taken from a
full example in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The sentence parts are placed between quotation marks, the
mapping to the metamodel is placed between braces.
      </p>
      <p>This pattern sentence encodes for which discount applies: \The discount D
is always an applicable discount if a customer orders the product P" fDiscount,
requires, Productg \with option O." fDiscount, requires, Optiong.</p>
      <p>A rule sentence based on this pattern sentence is: \The discount early adopters
is always an applicable discount for a customer if the customer orders the product
basic telephony with option voip."</p>
      <p>Valid rule sentences are created in the editor by choosing relevant pattern
sentences and completing the variable parts from a drop-down list. The editor
uses the knowledge in the instantiated model and will only present concepts that
are de ned as a discount in the discount drop-down list.</p>
      <p>A consequence of the tight connection between the metamodel and the
sentence patterns is that patterns need to explicitly deal with the plural variation
of the rule sentence: \The discount triple play is always an applicable discount
for a customer if the customer orders all of the following products:
fast adsl
digital tv
basic telephony."</p>
      <p>Adding knowledge in the algorithm on plural, gender and verb may
eliminate this redundancy in the sentence patterns but introduces complexity in the
mapping of the sentence to an update on the formal model. It will make the
algorithm (natural) language-speci c, dependent on the availability of corpora
containing language information on large sets of (often specialized) terminology
and results in a more complex user interface for the end-user. These drawbacks
have withheld us until now from implementing this strategy.
3</p>
      <p>Conclusions and Research Directions
Because no NLP is used in this approach, Be Informed has a lot of freedom in
choosing sentence patterns, but has to explicitly deal with grammatical
variations. We are interested in hybrid solutions where NLG creates the variations of
patterns.</p>
      <p>
        Furthermore, contextualization of the syntax in projects widens the audience
(e.g. explanation dialogs, brochures and websites), but increases implementation
e orts. To facilitate this trade-o , we need measures that evaluate how well a
CNL grammar is suited for an audience. Research in this area and reports on
user evaluations (like [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]) are welcome.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Heller</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teeseling</surname>
          </string-name>
          , F. van, Gulpers, M.:
          <article-title>A Knowledge Infrastructure for the Dutch Immigration O ce</article-title>
          .
          <source>7th Extended Semantic Web Conference, Heraklion. LNCS</source>
          , vol.
          <volume>6089</volume>
          , pp.
          <fpage>386</fpage>
          -
          <lpage>390</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Grondelle</surname>
            , J.C. van, Heller,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haandel</surname>
            , E. van, Verburg,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Involving Business Users in Formal Modelling using Natural Language Pattern Sentences</article-title>
          .
          <source>Proceedings of EKAW</source>
          , Lisbon, LNCS (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Clark</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harrison</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Murray</surname>
            ,
            <given-names>W.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thompson</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , [ed] Fuchs,
          <string-name>
            <surname>N.E.</surname>
          </string-name>
          :
          <article-title>Naturalness vs</article-title>
          .
          <source>Predictability: A Key Debate in Controlled Languages.Pre-Proceedings of the Workshop on Controlled Natural Language, Marettimo Island. CEUR</source>
          , vol.
          <volume>448</volume>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Kaljurand</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fuchs</surname>
            ,
            <given-names>N.E.</given-names>
          </string-name>
          :
          <article-title>Verbalizing OWL in Attempto Controlled English</article-title>
          .
          <source>In Proceedings of Third International Workshop on OWL: Experiences and Directions</source>
          , Innsbruck, Austria (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Funk</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tablan</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bontcheva</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cunningham</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Handschuh</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>CLOnE: Controlled Language for Ontology Editing</article-title>
          .
          <source>Proceedings of the Sixth International Semantic Web Conference (ISWC)</source>
          , Busan, Korea,
          <string-name>
            <surname>LNCS</surname>
          </string-name>
          , vol.
          <volume>4825</volume>
          , pp.
          <fpage>142</fpage>
          -
          <lpage>155</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Power</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scott</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Evans</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>What you see is what you meant: direct knowledge editing with natural language feedback</article-title>
          .
          <source>In Proceedings of the 13th Biennial European Conference on Arti cial Intelligence</source>
          , pp.
          <fpage>675</fpage>
          -
          <lpage>681</lpage>
          , Brighton, UK (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ross</surname>
          </string-name>
          , R.G.: RuleSpeak [Online] http://www.rulespeak.com (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. Object Management Group:
          <article-title>Semantics of Business Vocabulary</article-title>
          and Rules [Online] http://www.omg.org/spec/SBVR/1.0 (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Engelbrecht</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hart</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dolbear</surname>
          </string-name>
          , C. [ed.]
          <string-name>
            <surname>Fuchs</surname>
            ,
            <given-names>N.E.</given-names>
          </string-name>
          :
          <article-title>Talking Rabbit: a User Evaluation of Sentence Production</article-title>
          .
          <source>Pre-Proceedings of the Workshop on Controlled Natural Language, Marettimo Island. CEUR</source>
          , vol.
          <volume>448</volume>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>