<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Risk Analysis and Prevention in Procedures: extraction and preliminary results</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Patrick</forename><surname>Saint-Dizier</surname></persName>
							<email>stdizier@irit.fr</email>
							<affiliation key="aff0">
								<orgName type="institution">IRIT-CNRS</orgName>
								<address>
									<addrLine>118 route de Narbonne</addrLine>
									<postCode>31062</postCode>
									<settlement>Toulouse cedex</settlement>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Risk Analysis and Prevention in Procedures: extraction and preliminary results</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">859A47921F6634F6A08D6C3F1771DFD1</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T23:55+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Maintenance operations as well as production launches are essentially based on procedures which describe how to install and use a product and how to maintain it. Due to the complexity of to-day's equipements, and to the complexity of their interactions it is difficult to maintain up-to-date documentations. These procedural documents become more and more complex, even if simplified language constraints and revision scenarios are imposed. According to several analysis, out of 377 technicians working in different domains, 45% of them indicate that they have identified major errors in maintenance documents. About 75% indicate that there are major gaps (missing instructions) or obscure or imcomplete instructions, and 78% admit that often need help because they feel they are not operating the right way. We are all confronted to situations where we wish to follow instructions (DIY, software installation, etc.) with pictures, diagrams, etc. and that these are not understandable, have obvious gaps or do not correspond to the situation at stake. In some industrial areas, such difficulties are common and lead to accidents (aeronautics, nuclear energy, health, etc.). Risk analysis and prevention are therefore a major concern.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">A DOMAIN-INDEPENDENT ANALYSIS OF RISKY SITUATIONS FROM TEXT ANAL-YSIS</head><p>Procedural texts consist of a sequence of instructions, designed with some accuracy in order to reach a goal (e.g. assemble a computer) <ref type="bibr" target="#b2">[2,</ref><ref type="bibr" target="#b3">3,</ref><ref type="bibr" target="#b7">7,</ref><ref type="bibr" target="#b8">8]</ref>. Procedural texts are complex structures, they often exhibit a quite complex rational (the goal-instructions) and 'irrational' structure which is mainly composed of advice, conditions, preferences, evaluations, user stimulations, etc. They form what we call the explanation structure <ref type="bibr" target="#b6">[6]</ref>, which motivates and justifies <ref type="bibr" target="#b4">[4]</ref> the goal-instructions structure, viewed as the backbone of procedural texts. A number of these elements are forms of argumentation <ref type="bibr" target="#b1">[1,</ref><ref type="bibr">9]</ref>, they appear to be very useful, some-times as important as instructions: they provide a strong and essential internal cohesion and coherence to procedural texts. They also indicate, among other things, the difficulties, the risks to avoid, and the consequences on the target goal of an incorrect or incomplete execution of the associated instruction.</p><p>This is realized in the &lt;TextCoop&gt; <ref type="bibr" target="#b3">[3]</ref> project, where a number of structures are tagged. An example, in readable form, from didactics, is given hereafter.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Measuring the intrinsic difficulty rate d of an instruction</head><p>It is of much interest to be able to measure the inherent complexity or difficulty of an instruction. This notion obviously depends on the reader profile. Nevertheless, we think that some linguistic features introduce some inherent difficulties in any situation.</p><p>The most frequently encountered parameters are, informally:</p><p>-presence of 'complex' manners (e. g. very slowly), by complex we mean either a manner which is inherently difficult to realize or a manner reinforced by an adverb of intensity, -technical complexity of the verb or the verb compound used: if most instructions include a verb which is quite simple, some exhibit quite technical verbs, metaphorical uses, or verbs applied to unexpected situations, for which an elaboration is needed.</p><p>-duration of execution as specified in the instruction (the longer the more difficult), -synchronization between actions, in particular in instructional compounds, -uncommon tools, or uncommon uses of basic tools (open the box with a sharp knife) however this is quite difficult to chracaterize, besides statistical analysis (e.g. via bootstrapping on the net), -presence of evaluation statements or resulting states, for example to indicate the termination of the action (as soon as the sauce turns brown add flour).</p><p>For some of these criteria, some application-dependent knowledge linguistic resources are needed: some lexical data, basic ontological data, and a few business rules. These observations allow us to introduce a very preliminary measure of complexity. To be able to have an indicative evaluation, each of the points above counts for 1, independently of its importance or strength in the text. Complexity c therefore </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Measuring the expliciteness rate t of an instruction</head><p>Expliciteness characterizes the degree of accuracy of an instruction. Several marks, independently of the domain, contribute to making more explicit an instruction:</p><p>-when appropriate: existence of means or instruments, -pronominal references as minimal as possible, and predicate argument constructions as comprehensive as possible, -length of action explicit when appropriate (stir for 10 minutes), -list of items to consider as explicit and low level as possible (mix the flour with the sugar, eggs and oil), -presence of an argument, advice or warning, -presence of some help elements like images, diagrams, etc.</p><p>-presence of elaborations, illustrations or goal specification, -presence of a frame or a condition to limit the scope of the action.</p><p>Those criteria may be dependent on the domain, for example length of an action is very relevant in cooking, somewhat in do-it-yourself, and much less in the society domain. Similarly as for d, each item counts for 1 at the moment, expliciteness e therefore ranges from 0 to 8. The expliciteness rate is ti = e/8 to keep it in [0,1]. Note also that the higher ti is, the more chances the instruction has to succeed since it is very explicit and has a lot of details. Now, if we consider the product di × (1 − ti), the more it tends towards 1, the higher the risk is for the action to fail. Therefore, when di is high, it is also necessary that ti is high to compensate the difficulty. Given that di remains unchanged (if the instruction cannot be simplified), the strategy is then to increase ti as much as possible.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">A DOMAIN-DEPENDENT ANALYSIS OF RISKS</head><p>A number of factors of risk are clearly domain-dependent.</p><p>The difficulty is to be able to identify and evaluate risks without any access to a deep semantic analysis of the different actions of the domain at stake since this is seldom available.</p><p>In a first stage, as an exploration, our strategy is to extract from a large corpus of documents of the domain, for each action, the set of warnings associated with it. An action is characterized by a verb and its object argument(s), whatever their position in the instruction. Following argumentation theory, instructions with warnings have the following form: instruction because warning, as in Carefully plug-in the mother card vertically, otherwise you will damage the connectors, where the otherwise section is the support: it indicates the risks of not doing the action correctly. In this work, if the action is 'plug-in the mother card' the risks are the list of those warnings associated with it over the whole corpus.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">PERSPECTIVES</head><p>In this short paper, we presented the main lines of a preliminary approach to risk identification in procedures. This is a huge problem in the industry, to prevent accidents (humans and ecological). We proposed a simple solution to capture domain dependent knowledge acquired from procedure warnings. Obviously, this is just one useful facet of the problem, since a lot of knowledge is implicit and almost never expressed. Our users estimates is that we cover about 40% of the risks using this approach.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: The explanation structure annotated in a procedure</figDesc></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title/>
		<author>
			<persName><surname>References</surname></persName>
		</author>
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Arguments, Dialogue, and Negotiation</title>
		<author>
			<persName><forename type="first">L</forename><surname>Amgoud</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Parsons</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Maudet</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">14th European Conference on Artificial Intelligence</title>
				<meeting><address><addrLine>Berlin</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Pragmatic Overloading in Natural Language Instructions</title>
		<author>
			<persName><forename type="first">B</forename><surname>Di Eugenio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">L</forename><surname>Webber</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal of Expert Systems</title>
		<imprint>
			<date type="published" when="1996">1996</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">Analyzing the explanation structure of procedural texts: dealing with Advices and Warnings, STEP conference</title>
		<author>
			<persName><forename type="first">L</forename><surname>Fontan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Saint-Dizier</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2008-08">August 2008</date>
			<pubPlace>Venice</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Automatic Detection of Arguments in Legal Texts</title>
		<author>
			<persName><forename type="first">M-F</forename><surname>Moens</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Boiy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Mochales Palau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Reed</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Eleventh International Conference on Artificial Intelligence and Law</title>
				<meeting>the Eleventh International Conference on Artificial Intelligence and Law<address><addrLine>NY</addrLine></address></meeting>
		<imprint>
			<publisher>ACM Press</publisher>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">Knowledge and Justification</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">L</forename><surname>Pollock</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1974">1974</date>
			<publisher>Princeton university Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">Generating Arguments in Natural Language</title>
		<author>
			<persName><forename type="first">C</forename><surname>Reed</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1998">1998</date>
			<pubPlace>London</pubPlace>
		</imprint>
		<respStmt>
			<orgName>University College</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">PhD dissertation</note>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">Feature Selection in Categorizing Procedural Expressions</title>
		<author>
			<persName><forename type="first">M</forename><surname>Takechi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Tokunaga</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Matsumoto</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Tanaka</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2003">IRAL2003. 2003</date>
			<biblScope unit="page" from="49" to="56" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m">Argumentation Schemes</title>
				<editor>
			<persName><forename type="first">D</forename><surname>Walton</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Reed</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><surname>Macagno</surname></persName>
		</editor>
		<imprint>
			<publisher>Cambridge University Press</publisher>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
