<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Design of a CNL to Involve Domain Experts in Modeling</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Sivlie</forename><surname>Spreeuwenberg</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">LibRT</orgName>
								<address>
									<settlement>Amsterdam</settlement>
									<country key="NL">Netherlands</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Jeroen</forename><surname>Van Grondelle</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Be Informed</orgName>
								<address>
									<settlement>Apeldoorn</settlement>
									<country key="NL">Netherlands</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ronald</forename><surname>Heller</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Be Informed</orgName>
								<address>
									<settlement>Apeldoorn</settlement>
									<country key="NL">Netherlands</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Gartjan</forename><surname>Grijzen</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Be Informed</orgName>
								<address>
									<settlement>Apeldoorn</settlement>
									<country key="NL">Netherlands</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Design of a CNL to Involve Domain Experts in Modeling</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">98F5C82B3F206E6B0180490BC5EA5D31</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T09:17+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Controlled Natural Language</term>
					<term>Business Rules</term>
					<term>Specifications</term>
					<term>Knowledge Representation</term>
					<term>CNL design and evaluation</term>
					<term>SBVR</term>
					<term>RuleSpeak</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Involving domain experts in modeling is important since knowledge needs to be captured in a model and only domain experts can establish whether the models are correct. We have experienced that a natural language based representation of a model helps them to understand the semantics of a model and has advantages over a visual representation. Therefore a controlled natural language (CNL) is designed for our existing semantic reasoning tool based on conceptual graphs (Be Informed). The resulting CNL has a formal logical basis but the goal of the CNL representation is to improve readability for human readers. We report on the challenge to develop a CNL that 1) is easy and intuitively readable for domain experts with no background in formal logics, 2) can be easily generated from the formal representation and 3) can be easily adjusted for other natural languages and cultural preferences. The solution uses patterns to represent the CNL that map to the conceptual graph. The patterns are based on SBVR's RuleSpeak and can be easily adjusted for local differences.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Need for Controlled Natural Languages in Modeling</head><p>The adoption of model driven technologies such as Enterprise Decision Management and Business Process Management is growing. As a result, involving business users in modeling is more important than ever. Their ability to capture business knowledge in models correctly is a key factor in the adoption of these technologies. The main challenge in involving business users in knowledge modeling is the fact that most business users are not trained in formal knowledge representation techniques. A formal, concise, visual representation can be quite intimidating to the uninitiated. Consequently they will not be able to verify the accuracy of the model directly.</p><p>Be Informed develops a software suite that is used by complex, knowledge intensive organizations to capture their business knowledge and run model driven services based on these knowledge models. Knowledge representation in Be Informed is based on concept graphs. To add semantics, the concepts, relations and properties are typed, using types from a metamodel associated with the graph. The tool represents the knowledge as a network diagram. A visual syntax maps icons, line styles and colors to metamodel types.</p><p>A first version of the textual representation presented in this paper was used to communicate a risk taxonomy to classify shipments of goods to insurance underwriters. Although the sentences produced were very basic and consisted of just the subject and object of a triple with a verb in between encoding for the relation type, the underwriters immediately spotted constructs that appeared odd to them. This resulted in an improved recall rate of modeling errors. This early success has motivated further research at the Dutch Immigration Office <ref type="bibr" target="#b0">[1]</ref>. A next version <ref type="bibr" target="#b1">[2]</ref> of the text generator was used to validate candidate policy decisions for consistency before they are accepted. In workshops with business representatives and legal advisors, the policy is defined in the tool that also will be used to execute this policy. Both a visual graph oriented representation and the textual representation discussed in this paper were used. It is important to note that the parties involved here were unfamiliar with formal representation techniques and would normally express any policy in unrestricted, natural language. The expectation that the textual representation was preferred over the diagrams was confirmed by the participants. An interesting new observation was that the sentence should be a grammatically correct sentence.</p><p>This paper reports on the design and implementation of a CNL that helps Be Informed customers to actively participate in modeling knowledge.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Design and Implementation of a CNL for Be Informed</head><p>Controlled languages are often classified in one of two categories <ref type="bibr" target="#b2">[3]</ref>: those that improve readability for human readers and those that enable reliable automatic semantic analysis of the language. The language that we designed has a formal logical basis. But all too often languages in the second category do not read very naturally. The challenge for Be Informed was to design a language that can be easily generated from a conceptual graph and is natural and understandable for people to read.</p><p>Using CNLs to represent ontology's has been done before, for instance in Attempto Controlled Language <ref type="bibr" target="#b3">[4]</ref> and CLOnE <ref type="bibr" target="#b4">[5]</ref>. They both use natural language generation (NLG) to create a textual representation and natural language processing (NLP) to roundtrip the ontology based on the changed text.</p><p>The textual syntax definition proposed in this paper is quite similar to the definition used in CLOnE. Our approach towards editing a model based on a natural language representation does not use NLP and has more in common with Conceptual Authoring <ref type="bibr" target="#b5">[6]</ref>. Editing is not performed by manipulating text but by performing editing operations at the concept level, with the text being updated to reflect concept-level changes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Pattern Based Generation Approach and SBVR's RuleSpeak</head><p>The mechanism we use is based on pattern sentences that map to a concept graph. The formal model remains the single source at all times. The textual representation is just used as a view on the formal model and editing operations by the user in the view are translated into updates to the underlying formal model. The structure of the textual representation of the formal graph is defined by pattern sentences. A pattern sentence consists of static text fragments and subject, object and property placeholders. Fragments and placeholders are grouped into sentence parts, in order to make certain parts of the sentence optional. Cardinality in the (meta) model can be represented using multiple sentences or using enumerated lists of relations of the same type. The sentences are hand-crafted to communicate the semantics of the graph constructs they represent but are re-used for different projects.</p><p>The advantage of pattern sentences that map to the formal (meta) model directly is that no NLP or parsing needs to be performed on the textual representation. This provides freedom in choosing or updating the pattern sentences, eventually based on audience-specific preferences, without constraints from NLP techniques. Methods such as RuleSpeak <ref type="bibr" target="#b6">[7]</ref> and the OMG standard SBVR <ref type="bibr" target="#b7">[8]</ref> have rationalized the use of natural language for the business by introducing syntactic guidelines and best practices. Our latest sentence patterns include RuleSpeak keywords (must, always) and follow the guideline that a rule should be expressed by a grammatically complete, correct and readable sentence. These patterns make sentences easier to read and place a natural and intuitive emphasis on the fact that the sentence introduces an obligation (and is not 'just' a potential statement) for business experts with no background in formal logics. The sentence patterns also provide guidance to direct people into being more formal.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Implementation and Example</head><p>The following sentences illustrate the mapping of pattern sentences to a product model about product purchases and applicable discounts and are taken from a full example in <ref type="bibr" target="#b1">[2]</ref>. The sentence parts are placed between quotation marks, the mapping to the metamodel is placed between braces. This pattern sentence encodes for which discount applies: "The discount D is always an applicable discount if a customer orders the product P" {Discount, requires, Product} "with option O." {Discount, requires, Option}.</p><p>A rule sentence based on this pattern sentence is: "The discount early adopters is always an applicable discount for a customer if the customer orders the product basic telephony with option voip."</p><p>Valid rule sentences are created in the editor by choosing relevant pattern sentences and completing the variable parts from a drop-down list. The editor uses the knowledge in the instantiated model and will only present concepts that are defined as a discount in the discount drop-down list. A consequence of the tight connection between the metamodel and the sentence patterns is that patterns need to explicitly deal with the plural variation of the rule sentence: "The discount triple play is always an applicable discount for a customer if the customer orders all of the following products: − fast adsl − digital tv − basic telephony."</p><p>Adding knowledge in the algorithm on plural, gender and verb may eliminate this redundancy in the sentence patterns but introduces complexity in the mapping of the sentence to an update on the formal model. It will make the algorithm (natural) language-specific, dependent on the availability of corpora containing language information on large sets of (often specialized) terminology and results in a more complex user interface for the end-user. These drawbacks have withheld us until now from implementing this strategy.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Conclusions and Research Directions</head><p>Because no NLP is used in this approach, Be Informed has a lot of freedom in choosing sentence patterns, but has to explicitly deal with grammatical variations. We are interested in hybrid solutions where NLG creates the variations of patterns.</p><p>Furthermore, contextualization of the syntax in projects widens the audience (e.g. explanation dialogs, brochures and websites), but increases implementation efforts. To facilitate this trade-off, we need measures that evaluate how well a CNL grammar is suited for an audience. Research in this area and reports on user evaluations (like <ref type="bibr" target="#b8">[9]</ref>) are welcome.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Pattern based generation approach</figDesc><graphic coords="3,222.64,214.20,170.08,58.05" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. Task centric and word processor style editing</figDesc><graphic coords="4,158.86,257.31,297.64,217.81" type="bitmap" /></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">A Knowledge Infrastructure for the Dutch Immigration Office</title>
		<author>
			<persName><forename type="first">R</forename><surname>Heller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Teeseling</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Van, Gülpers</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">7th Extended Semantic Web Conference</title>
		<title level="s">Heraklion. LNCS</title>
		<imprint>
			<date type="published" when="2010">2010</date>
			<biblScope unit="volume">6089</biblScope>
			<biblScope unit="page" from="386" to="390" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Involving Business Users in Formal Modelling using Natural Language Pattern Sentences</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">C</forename><surname>Grondelle</surname></persName>
		</author>
		<author>
			<persName><surname>Van</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Heller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Haandel</surname></persName>
		</author>
		<author>
			<persName><surname>Van</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Verburg</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of EKAW</title>
				<meeting>EKAW<address><addrLine>Lisbon</addrLine></address></meeting>
		<imprint>
			<publisher>LNCS</publisher>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Naturalness vs. Predictability: A Key Debate in Controlled Languages</title>
		<author>
			<persName><forename type="first">P</forename><surname>Clark</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Harrison</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">R</forename><surname>Murray</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Thompson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">E</forename><surname>Fuchs</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Pre-Proceedings of the Workshop on Controlled Natural Language</title>
		<title level="s">Marettimo Island. CEUR</title>
		<imprint>
			<date type="published" when="2009">2009</date>
			<biblScope unit="volume">448</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Verbalizing OWL in Attempto Controlled English</title>
		<author>
			<persName><forename type="first">K</forename><surname>Kaljurand</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">E</forename><surname>Fuchs</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of Third International Workshop on OWL: Experiences and Directions</title>
				<meeting>Third International Workshop on OWL: Experiences and Directions<address><addrLine>Innsbruck, Austria</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">CLOnE: Controlled Language for Ontology Editing</title>
		<author>
			<persName><forename type="first">A</forename><surname>Funk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Tablan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Bontcheva</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Cunningham</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Davis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Handschuh</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Sixth International Semantic Web Conference (ISWC)</title>
				<meeting>the Sixth International Semantic Web Conference (ISWC)<address><addrLine>Busan, Korea</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2007">2007</date>
			<biblScope unit="volume">4825</biblScope>
			<biblScope unit="page" from="142" to="155" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">What you see is what you meant: direct knowledge editing with natural language feedback</title>
		<author>
			<persName><forename type="first">R</forename><surname>Power</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Scott</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Evans</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 13th Biennial European Conference on Artificial Intelligence</title>
				<meeting>the 13th Biennial European Conference on Artificial Intelligence<address><addrLine>Brighton, UK</addrLine></address></meeting>
		<imprint>
			<date type="published" when="1998">1998</date>
			<biblScope unit="page" from="675" to="681" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">RuleSpeak</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">G</forename><surname>Ross</surname></persName>
		</author>
		<ptr target="http://www.rulespeak.com" />
		<imprint>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">Object Management Group: Semantics of Business Vocabulary and Rules</title>
		<ptr target="http://www.omg.org/spec/SBVR/1.0" />
		<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Talking Rabbit: a User Evaluation of Sentence Production</title>
		<author>
			<persName><forename type="first">P</forename><surname>Engelbrecht</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Hart</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Dolbear</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">E</forename><surname>Fuchs</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Pre-Proceedings of the Workshop on Controlled Natural Language</title>
		<title level="s">Marettimo Island. CEUR</title>
		<imprint>
			<date type="published" when="2009">2009</date>
			<biblScope unit="volume">448</biblScope>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
