<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Relaxed Precision and Recall for Ontology Matching</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Marc</forename><surname>Ehrig</surname></persName>
							<email>ehrig@aifb.uni-karlsruhe.de</email>
						</author>
						<author>
							<persName><forename type="first">J</forename><surname>Ér Ôme Euzenat</surname></persName>
							<email>jerome.euzenat@inrialpes.fr</email>
						</author>
						<author>
							<affiliation key="aff0">
								<orgName type="institution">Institute AIFB University of Karlsruhe Karlsruhe</orgName>
								<address>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff1">
								<orgName type="institution">INRIA Rh ône-Alpes 655 avenue de l&apos;Europe. Monbonnot</orgName>
								<address>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff2">
								<address>
									<settlement>Banff, Al-berta</settlement>
									<country key="CA">Canada</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Relaxed Precision and Recall for Ontology Matching</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">7E4DD608A70A17B8DAAE0774A1298FC6</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T09:02+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>D.2.12 [Software]: Interoperability</term>
					<term>I.2.4 [Artificial Intelligence]: Knowledge Representation Formalisms and Methods</term>
					<term>D.2.8 [Software Engineering]: Metrics Measurement, Performance, Experimentation Ontology alignment, evaluation measures, precision, recall</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In order to evaluate the performance of ontology matching algorithms it is necessary to confront them with test ontologies and to compare the results. The most prominent criteria are precision and recall originating from information retrieval. However, it can happen that an alignment be very close to the expected result and another quite remote from it, and they both share the same precision and recall. This is due to the inability of precision and recall to measure the closeness of the results. To overcome this problem, we present a framework for generalizing precision and recall. This framework is instantiated by three different measures and we show in a motivating example that the proposed measures are prone to solve the problem of rigidity of classical precision and recall.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">INTRODUCTION</head><p>Ontology matching is an important problem for which many algorithms (e.g., PROMPT <ref type="bibr" target="#b11">[11]</ref>, GLUE <ref type="bibr" target="#b3">[3]</ref>, Ontrapro <ref type="bibr" target="#b1">[1]</ref>, OLA <ref type="bibr" target="#b7">[7]</ref>, FOAM <ref type="bibr" target="#b4">[4]</ref>) have been provided. In order to evaluate the performance of these algorithms it is necessary to confront them with test ontologies and to compare the results. The most prominent criteria are precision and recall originating from information retrieval and adapted to the ontology matching task. Precision and recall are based on the comparison of the resulting alignment with another standard alignment, effectively comparing which correspondences are found and which are not. These criteria are well understood and widely accepted.</p><p>However, as we have experienced in last year's Ontology Alignment Contest <ref type="bibr" target="#b13">[13]</ref>, they have the drawback to be of the all-or-nothing kind. An alignment may be very close to the expected result and another quite remote from it and both return the same precision and recall. The reason for this is that the criteria only compare two sets of correspondences without considering if these are close or remote to each other: if they are not the same exact correspondences, they score zero. They both score identically low, despite their different quality. It may be helpful for users to know whether the found alignments are close to the expected one and easily repairable or not. It is thus necessary to measure the proximity between alignments instead of their strict equality.</p><p>In this paper we investigate some measures that generalize precision and recall in order to overcome the problems presented above. We first provide the basic definitions of alignments, precision and recall as well as a motivating example ( §2). We then present a framework for generalizing precision and recall ( §3). This framework is instantiated by four different measures (including classical precision and recall) ( §4) and we show on the motivating example that the proposed measures do not exhibit the rigidity of classical precision and recall ( §5).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">FOUNDATIONS 2.1 Alignment</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>DEFINITION 1 (ALIGNMENT, CORRESPONDENCE).</head><p>Given two ontologies O and O , an alignment between O and O is a set of correspondences (i.e., 4-uples):</p><p>e, e , r, n with e ∈ O and e ∈ O being the two matched entities, r being a relationship holding between e and e , and n expressing the level of confidence [0..1] in this correspondence.</p><p>A matching algorithm returns an alignment A which is compared with a reference alignment R.</p><p>Let us illustrate this through a simple example. Figure <ref type="figure">1</ref> presents two ontologies together with two alignments A 1 and R. In this example, for the sake of simplification, the relation is always '=' and the confidence is always 1.0.</p><p>The alignment A 1 is defined as follows:</p><p>&lt;o1:Vehicle,o2:Thing,=,1.0&gt; &lt;o1:Car,o2:Porsche,=,1.0&gt; &lt;o1:hasSpeed,o2:hasProperty,=,1.0&gt; &lt;o1:MotorKA1,o2:MarcsPorsche,=,1.0&gt; &lt;o1:250kmh,o2:fast,=,1.0&gt;</p><p>We present another reasonable alignment A 2 : &lt;o1:Car,o2:Thing,=,1.0&gt; &lt;o1:hasSpeed,o2:hasProperty,=,1.0&gt; &lt;o1:MotorKA1,o2:MarcsPorsche,=,1.0&gt; &lt;o1:250kmh,o2:fast,=,1.0&gt; and an obviously wrong alignment A 3 : &lt;o1:Object,o2:Thing,=,1.0&gt; &lt;o1:Owner,o2:Volkswagen,=,1.0&gt; &lt;o1:Boat,o2:Porsche,=,1.0&gt; &lt;o1:hasOwner,o2:hasMotor,=,1.0&gt; &lt;o1:Marc,o2:fast,=,1.0&gt; Further, we have the following reference alignment (R): &lt;o1:Object,o2:Thing,=,1.0&gt; &lt;o1:Car,o2:Automobile,=,1.0&gt; &lt;o1:Speed,o2:Characteristic,=,1.0&gt; &lt;o1:250kmh,o2:fast,=,1.0&gt; &lt;o1:PorscheKA123,o2:MarcsPorsche,=,1.0&gt;</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Precision and Recall</head><p>The usual approach for evaluating the returned alignments is to consider them as sets of correspondences and check for the overlap of the two sets. This is naturally obtained by applying the classical measure of precision and recall <ref type="bibr" target="#b14">[14]</ref>, which are the ratio of the number of true positive (|R ∩ A|) and retrieved correspondences (|A|) or those to be retrieved (|R|), respectively. DEFINITION 2 (PRECISION, RECALL). Given a reference alignment R, the precision of some alignment A is given by</p><formula xml:id="formula_0">P (A, R) = |R ∩ A| |A|</formula><p>and recall is given by</p><formula xml:id="formula_1">R(A, R) = |R ∩ A| |R| .</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3">Problems with Current Measures</head><p>However, even if the above measurements are easily understandable and widespread, they are often criticized for two reasons: Neither do they discriminate between a totally wrong and an almost correct alignment, nor do they measure user effort to adapt the alignment.</p><p>Indeed, it often makes sense to not only have a decision whether a particular correspondence has been found or not, but measure the proximity of the found alignments. This implies that also "near misses" are taken into consideration instead of only the exact matches. As a matter of example, it will be clear to anybody that among the alignments presented above, A 3 is not a very good alignment and A 1 and A 2 are better alignments. However, they score almost exactly the same in terms of precision (.2) and recall (.2).</p><p>Moreover, the alignments will have to go through user scrutiny and correction before being used. It is worth measuring the effort required by the user for correcting the provided alignment instead of only if some correction is needing. This also calls for a relaxation of precision and recall.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">GENERALIZING PRECISION AND RE-CALL</head><p>Because precision and recall are well-known and easily explained measures, it is good to adhere to them and extend them. It also brings the benefit that measures derived from precision and recall, such as f-measure, can still be computed. For these reasons, we propose to generalize these measures.</p><p>If we want to generalize precision and recall, we should be able to measure the proximity of correspondence sets rather than their strict overlap. Instead of the taking the cardinal of the intersection of the two sets (|R ∩ A|), we measure their proximity (ω).</p><p>DEFINITION 3 (GENERALIZED PRECISION AND RECALL). Given a reference alignment R and an overlap function ω between alignments, the precision of an alignment A is given by</p><formula xml:id="formula_2">P ω (A, R) = ω(A, R) |A|</formula><p>and recall is given by</p><formula xml:id="formula_3">R ω (A, R) = ω(A, R) |R| .</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Basic properties</head><p>In order, for these new measures to be true generalizations, we would like ω to share some properties with |R ∩ A|. In particular, the measure should be positive:</p><formula xml:id="formula_4">∀A, B, ω(A, B) ≥ 0 (positiveness)</formula><p>and not exceeding the minimal size of both sets:</p><formula xml:id="formula_5">∀A, B, ω(A, B) ≤ min(|A|, |B|) (maximality) Concept Relation Instance Correct Alignment R Found Alignment A 1</formula><p>Ontology 1 Ontology 2</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Figure 1: Two Aligned Ontologies</head><p>If we want to preserve precision and recall results, ω should only add more flexibility to the usual precision and recall.</p><p>So their values cannot be worse than the initial evaluation:</p><formula xml:id="formula_6">∀A, B, ω(A, B) ≥ |A ∩ B| (boundedness)</formula><p>Hence, the main constraint faced by the proximity is the following:</p><formula xml:id="formula_7">|A ∩ R| ≤ ω(A, R) ≤ min(|A|, |R|)</formula><p>This is indeed a true generalization because, |A∩R| satisfies all these properties. One more property satisfied by precision and recall that we will not enforce here is symmetry. This guarantees that the precision and recall measures are true normalized similarities.</p><formula xml:id="formula_8">∀A, B, ω(A, B) = ω(B, A)<label>(symmetry)</label></formula><p>We will not require symmetry, especially since A and R are not in symmetrical positions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Designing Overlap Proximity</head><p>There are many different ways to design such a proximity given two sets. We retain here the most obvious one which consists of finding correspondences matching each other and computing the sum of their proximity. This can be defined as an overlap proximity:</p><p>DEFINITION 4 (OVERLAP PROXIMITY). A measure that would generalize precision and recall is:</p><formula xml:id="formula_9">ω(A, R) = a,r ∈M (A,R) σ(a, r)</formula><p>in which M (A, R) is a matching between the correspondences of A and R and σ(a, r) a proximity function between two correspondences.</p><p>Again, the standard overlap |A ∩ R| used in precision and recall is such an overlap proximity.</p><p>There are two tasks to fulfill when designing such an overlap proximity function:</p><p>-the first one consists of finding the correspondences to be compared M . -the second one is to define a proximity measure on correspondences σ;</p><p>We consider these two issues below.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3">Matching Correspondences</head><p>A matching between alignments is a set of correspondence pairs, i.e., M (A, R) ⊆ A × R. However, if we want to keep the analogy with precision and recall, it will be necessary to restrict ourselves to the matchings in which an entity from the ontology does not appear twice. This is compatible with precision and recall for two reasons: (i) in these measures, any correspondence is identified only with itself, and (ii) appearing more than once in the matching would not guarantee an overlap proximity below min(|A|, |R|) .</p><p>There are</p><formula xml:id="formula_10">|A|! (|A|−|R|)! candidate matches (if |A| ≥ |R|)</formula><p>. The natural choice is to select the best match because this guarantees that the function generalizes precision and recall.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>DEFINITION 5 (BEST MATCH).</head><p>The best match M (A, R) between two sets of correspondences A and R, is the subset of A × R which maximizes the overall proximity and in which each element of A (resp. R) belongs to only one pair:</p><formula xml:id="formula_11">M (A, R) ∈ M ax ω(A,R) {M ⊆ A × R}</formula><p>As defined here, this best match may not be unique. This is not a problem, because we only want to find the highest value for ω and any of the best matches will yield the same value.</p><p>Of course, the definitions M and ω are dependent of each other, but this does not prevent us from computing them. They are usually computed together but it is better to present them separately.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4">Correspondence Proximity</head><p>In order to compute ω(A, R), we need to measure the proximity between two matched correspondences (i.e., a, r ∈ M (A, R)) on the basis of how close the result is from the ideal one. Each element in the tuple a = e a , e a , r a , n a will be compared with its counterpart in r = e r , e r , r r , n r . For any two correspondences (the found a and the reference r) we compute three similarities σ pair , σ rel , and σ conf . If elements are identical, proximity has to be one (maximality). If they differ, proximity is lower, always according to the chosen strategy. In contrast to the standard definition of similarity, the mentioned proximity measures do not necessarily have to be symmetric. We will only consider normalized proximities, i.e., measures whose values are within the unit interval [0..1], because this guarantees that</p><formula xml:id="formula_12">ω(A, R) ≤ min(|A|, |R|)</formula><p>The component proximity measure is defined in the following way: σ pair ( e a , e r , e a , e r ): How is one entity pair similar to another entity pair? In ontologies we can in principal follow any relation which exists (e.g., subsumption, instantiation), or which can be derived in a meaningful way. The most important parameters are the relations to follow and their effect on the proximity.</p><p>σ rel (r a , r r ): Often the alignment relations are more complex, e.g., represent subsumption, instantiation, or compositions. Again, one has to assess the similarity between these relations. The two relations of the alignment cell can be compared based on their distance in a conceptual neighborhood structure <ref type="bibr" target="#b6">[6,</ref><ref type="bibr">8]</ref>.</p><p>σ conf (n a , n r ): Finally, one has to decide, what to do with different levels of confidence. The similarity could simply be the difference. Unfortunately, none of the current alignment approaches have an explicit meaning attached to confidence values, which makes it rather difficult in defining an adequate proximity.</p><p>Once these proximities are established, they have to be aggregated. The constraints on the aggregation function (Aggr) are:</p><formula xml:id="formula_13">-normalization preservation (if ∀i, 0 ≤ c i ≤ 1 then 0 ≤ Aggr i c i ≤ 1); -maximality (if ∀i, c i = 1 then Aggr i c i = 1); -local monotonicity (if ∀i = j, c i = c i = c j and c j ≤ c j ≤ c j then Aggr i c i ≤ Aggr i c i ≤ Aggr i c i ).</formula><p>Here, we consider aggregating them through multiplication without further justification. Other aggregations (e.g., weighted sum) are also possible.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>DEFINITION 6 (CORRESPONDENCE PROXIMITY).</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Given two correspondences</head><p>e a , e a , r a , n a and e r , e r , r r , n r , their proximity is:</p><p>σ( e a , e a , r a , n a , e r , e r , r r , n r ) = σ pair ( e a , e r , e a , e r ) × σ rel (r a , r r ) × σ conf (n a , n r )</p><p>We have provided constraints and definitions for M , ω, and σ. We now turn to concrete measures.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">CONCRETE MEASURES</head><p>We consider four cases of relaxed precision and recall measures based on the above definitions. We first give the definition of usual precision and recall within this framework.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1">Standard Precision and Recall</head><p>For standard precision and recall, the value of ω is |A ∩ R|. This is indeed an instance of this framework, if the proximity used is based on the strict equality of the components of correspondences.</p><p>DEFINITION 7 (EQUALITY PROXIMITY). The equality proximity is charaterized by: σ pair ( e a , e a , e r , e r ) =</p><p>1 if e a , e a = e r , e r 0 otherwise</p><formula xml:id="formula_14">σ rel (r a , r r ) = 1 if r a = r r 0 otherwise σ conf (n a , n r ) = 1 if n a = n r 0 otherwise</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">Symmetric Proximity</head><p>The easiest way to relax precision and recall is to have some distance δ on the elements in ontologies and to weight the proximity with the help of this distance: the higher the distance between two entities in the matched correspondences, the lower their proximity. This can be defined as:</p><p>δ(e a , e r ) ≤ δ(e b , e r ) and δ(e a , e r ) ≤ δ(e b , e r ) =⇒ σ( e a , e a , e r , e r ) ≥ σ( e b , e b , e r , e r )</p><p>As a simple example of such a symmetric similarity, we use a distance in which a class is at distance 0 of itself, at distance 0.5 of its direct sub-and superclasses, and at a distance 1 of any other class. This could be further refined by having a similarity inversely proportional to the distance in the subsumption tree. Likewise, this similarity may also be applied to properties and instances (through part-of relationships in the latter case). The similarity between pairs is the complement of these similarities The result is displayed in Table <ref type="table" target="#tab_0">1</ref>. We always mention the assumed alignment and the actual correct alignment. For the confidence distance we simply take the complement of the difference. The final precision is calculated according to the formula presented in the previous section: </p><formula xml:id="formula_15">i = i i partOf i 0.5 i = i i consistsOf i 0.5</formula><p>Table <ref type="table">2</ref>: Similarities based on Relations DEFINITION 8 (SYMMETRIC PROXIMITY). The symmetric proximity is characterized by: σ pair ( e a , e a , e r , e r ) as defined in Table <ref type="table" target="#tab_0">1</ref> σ rel (r a , r r ) as defined in Table <ref type="table">2</ref> </p><formula xml:id="formula_16">σ conf (n a , n r ) = 1 − |n a − n r |.</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3">Measuring Correction Effort</head><p>If users have to check and correct alignments, the quality of alignment algorithms can be measured through the effort required for transforming the obtained alignment into the (correct) reference one <ref type="bibr" target="#b2">[2]</ref>. This measure can be implemented as an edit distance <ref type="bibr" target="#b10">[10]</ref>: an edit distance defines a number of operations by which an object can be corrected (here the the operations on correspondences authorized) and assigns a cost to each of these operations (here the effort required to identify and repair some mistake). The cost of a sequence of operations is the sum of their cost and the distance between two objects is the cost of the less costly sequence of operations that transform one object into the other one. The result can always be normalized in function of the size of the largest object. Such a distance can be turned into a proximity by taking its complement with regard to 1.</p><p>Table <ref type="table" target="#tab_2">3</ref> provides such plausible weights. Usually classes are organized in a taxonomy in which they have less direct super-than subclasses. It is thus easier to correct a class to (one of) its superclass than to one of its subclasses. As a consequence, the proximity is dissymmetric. Such a measure should also add some effort when classes are not directly related, but this has not been considered here.</p><p>The edit distance between relations is relatively easy to design since, generally, changing from one relation to another can be done with just one click. Thus, the relational similarity equals 1 if the relations are the same and 0.5 otherwise.</p><p>In this correction effort measure, the confidence factor does not play an important role: ordering the correspondences can only help the user to know that after some point she will have to discard many correspondences. We thus decided to not take confidence into account and thus, their proximity will always be 1.  The effort-based proximity is charaterized by: σ pair ( e a , e a , e r , e r ) as defined in Table <ref type="table" target="#tab_2">3</ref> σ rel (r a , r r ) = 1 if r a = r r 0.5 otherwise</p><formula xml:id="formula_17">σ conf (n a , n r ) = 1 if n a = 0 and n r = 0 0 otherwise</formula><p>To be accurate, such an effort proximity would have been better aggregated with an additive and normalized aggregation function rather than multiplication.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.4">Precision-and Recall-oriented Measures</head><p>One can also decide to use two different similarities depending on their application for evaluating either precision or recall. We here provide two such measures and justify the given weights. Precision is normally a measure of accuracy i.e., the returned results need to be correct. Every wrong result will therefore entail a penalty. We assume the user poses a query to the system as follows: "return me all instances of e". The system then returns any instance corresponding to the alignment i.e. e . Vice versa, for the relaxed recall we want to avoid missing any correct result. This affects the similarity relations and weights.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.4.1">Relaxed Precision</head><p>In Table <ref type="table" target="#tab_4">4</ref> and 5 we present the precision similarity for pairs and relations. The comments in each line explain the decision for the weights.</p><p>For the distance within the confidence we again use the complement of the difference.</p><p>DEFINITION 10 (PRECISION-ORIENTED PROXIMITY). The precision-recall oriented proximity is characterized by: σ pair ( e a , e a , e r , e r ) as defined in Table <ref type="table" target="#tab_4">4</ref> σ rel (r a , r r ) as defined in  </p><formula xml:id="formula_18">= r r ⊂ r 0.5 r = r r ⊃ r 1 i = i i partOf i 0.5 i = i i consistsOf i 1</formula><p>Table <ref type="table" target="#tab_3">5</ref>: Similarities for Relaxed Precision based on Relations</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.4.2">Relaxed Recall</head><p>In Table <ref type="table" target="#tab_5">6</ref> and 7 we present the recall similarity for pairs and relations. Basically many distances are just mirrored compared to the precision case.  <ref type="figure">c,c</ref> sub(c),c 0.5 returns more specialized instances, misses some r,r r,sup(r ) 0.5 r,r sup(r),r 1 r,r r,sub(r ) 1 r,r sub(r),r 0.5 i,i i,super(i ) 0 returns a more restricted instance, misses correct i,i super(i),i 0.5 returns a broader instance i,i i,sub(i ) 0.5 returns a broader instance i,i sub(i),i 0 returns a more restricted instance, misses correct </p><formula xml:id="formula_19">= r r ⊂ r 0 r = r r ⊃ r 0.5 i = i i partOf i 0 i = i i consistsOf i 0.5</formula><p>Table <ref type="table">7</ref>: Similarities for Relaxed Recall based on Relations</p><p>The final recall is computed as usual:</p><p>DEFINITION 11 (RECALL-ORIENTED PROXIMITY). The recall-oriented proximity is characterized by: σ pair ( e a , e a , e r , e r ) as defined in Table <ref type="table" target="#tab_5">6</ref> σ rel (r a , r r ) as defined in Table <ref type="table">7</ref> σ conf (n a , n r ) = 1 − |n a − n r |.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">EXAMPLE</head><p>In the introduction of this paper we have presented a pair of ontologies, the reference alignment, and three different identified alignments. We will now apply the different proposed precision and recall measures to these example alignments. Please note that they mainly illustrate entity pair similarities, as relations and confidences are always identical. Table <ref type="table" target="#tab_7">8</ref> provides the results. For the oriented measure we assume that the query is given in ontology 1 and the answer has to be retrieved in ontology 2. As the oriented measure is dissymmetric, one has to define this direction beforehand.  The measures which have been introduced address the problems raised in the introduction and fulfill the requirements:</p><formula xml:id="formula_20">ω (R, R) (R, A 1 ) (R, A 2 ) (R</formula><p>-They keep precision and recall untouched for the best alignment (R); -They help discriminating between irrelevant alignments (A 3 ) and not far from target ones (A 1 and A 2 ); -Specialized measures are able to emphasize some characteristics of alignments: ease of modification, correctness or completeness. For instance, let's consider the oriented measures. In our example A 1 has two very near misses, which leads to a relatively high precision. In A 2 however the miss is bigger, but by aligning one concept to its superconcept recall rises relatively to precision.</p><p>These results are based on only one example. They have to be systematized in order to be extensively validated. Our goal is to implement these measures within the Alignment API and to use them on the forthcoming results of the Ontology Alignment Evaluation 2005<ref type="foot" target="#foot_0">1</ref> in order to have real data on which the relevance of the proposed measures can be more openly debated.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">RELATED WORK</head><p>The naturally relevant work is <ref type="bibr" target="#b2">[2]</ref> which has considered precisely the evaluation of schema matching. However, the authors only note the other mentioned problem (having two measures instead of one) and use classical aggregation (overall and F-measure) of precision and recall.</p><p>In computational linguistics, and more precisely multilingual text alignment, <ref type="bibr" target="#b9">[9]</ref> has considered extending precision and recall. Their goal is the same as ours: increasing the discriminating power of the measures. In this work, the mathematical formulation is not changed but the granularity of compared sets changes: instead of comparing sentences in a text, they compare words in sentences in a text. This helps having some contribution to the measures when most of the words are correctly aligned while the sentences are not strictly aligned.</p><p>In the Alignment API <ref type="bibr" target="#b5">[5]</ref>, there is another evaluation measure which directly computes a distance based on a weighted symmetric difference (weights are the confidences of each correspondence in the alignment). This measure could be used in the generalization proposed here (the distance would then be based on confidence difference and would generally satisfy P (A, R) ≤ P (A, R) and R (A, R) ≤ R(A, R).</p><p>The deeper proposal for extending precision and recall comes from hierarchical text categorization in which texts are attached to some category in a taxonomy <ref type="bibr" target="#b12">[12]</ref>. Usually, texts are attached to the leaves, but when algorithms attach them to the intermediate categories, it is useful to discriminate between a category which is irrelevant and a category which is an immediate super category of the expected one.</p><p>For that purpose, they introduce an extension of precision (recall is redefined similarly) such that:</p><formula xml:id="formula_21">P CS = max(0, |A ∩ R| + F pCon + F nCon) |A| + F nCon</formula><p>in which F pCon (resp. F nCon) is the contribution to false positive (resp. false negative), i.e., the way incorrectly classified documents could contribute to its incorrect category anyway. The maximization is necessary to prevent the result from being negative (because the contribution is defined with respect to the average such contribution). The contribution is measured in two ways. The first one is a category similarity that is computed on the features of categories (categories and documents are represented by a vector of features and the membership to some category is based on a distance be-tween these vectors). The second one is based on the distance between categories in the taxonomy.</p><p>This measure does not seem to be a generalization of standard precision and recall as the one presented here. In particular, because the contributions can be negative, this measure can be lower than standard precision and recall. The idea of retracting the contribution from wrongly classified documents is not far from the idea developed here. However, the computation of this contribution with regard to some average and the addition of some contribution to the divisor do not seem justified.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">DISCUSSION</head><p>Evaluation of matching results is often made on the basis of the well-known and well-understood precision and recall measures. However, these measures do not discriminate accurately between methods which do not provide the exact results. In the context where the result of alignments have to be screened by humans, this is an important need.</p><p>We have proposed a framework for generalizing precision and recall when comparing ontology alignments. It keeps the advantages of usual precision and recall but helps discriminating between alignments by identifying for near misses instead of completely wrong correspondences.</p><p>The has been instantiated in three different measures, each one aiming at favoring some particular aspects of alignment utility. We show that these measures indeed avoid the shortcomings of standard evaluation criteria. They should however, be further investigated in order to find better formulations: more discrepancy needs to be considered, more progressive distance (e.g., not direct subclasses) and rationalized design of weights.</p><p>This generalization framework is not the only possible one since we have made a number of choices:</p><p>-on the form of the alignment similarity (Definition 4); -on the kind of alignment matching (Definition 5); -on the form of the correspondence similarity (Definition 6).</p><p>More work has to be done in order to assess the potential of other choices in these functions.</p><p>The most important work is to consider these proposed measures in real evaluation of alignment systems and to identify good measures for further evaluations. We plan to implement these measures within the Alignment API <ref type="bibr" target="#b5">[5]</ref> and process the results of the Ontology Alignment Evaluation 2005.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head></head><label></label><figDesc>instances, includes the correct results c,c c,sub(c ) 1 returns more general instances, includes the correct results</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 :</head><label>1</label><figDesc>Similarities based on Entity Pairs</figDesc><table><row><cell cols="4">found closest correct similarity comment</cell></row><row><cell>e,e</cell><cell>e,e</cell><cell>σ pair</cell><cell></cell></row><row><cell>e,e</cell><cell>e,e</cell><cell>1</cell><cell>correct correspondence</cell></row><row><cell>c,c</cell><cell>c,sup(c )</cell><cell>0.5</cell><cell>returns more specialized instances</cell></row><row><cell>c,c</cell><cell>sup(c),c</cell><cell>0.5</cell><cell>returns more general instances</cell></row><row><cell>c,c</cell><cell>c,sub(c )</cell><cell>0.5</cell><cell>returns more general instances</cell></row><row><cell>c,c</cell><cell>sub(c),c</cell><cell>0.5</cell><cell>returns more specialized instances</cell></row><row><cell>r,r</cell><cell>r,sup(r )</cell><cell>0.5</cell><cell>returns more spec. relation instances</cell></row><row><cell>r,r</cell><cell>sup(r),r</cell><cell>0.5</cell><cell>returns more gen. relation instances</cell></row><row><cell>r,r</cell><cell>r,sub(r )</cell><cell>0.5</cell><cell>returns more gen. relation instances</cell></row><row><cell>r,r</cell><cell>sub(r),r</cell><cell>0.5</cell><cell>returns more spec. relation instances</cell></row><row><cell>i,i</cell><cell>i,super(i )</cell><cell>0.5</cell><cell>returns a more restricted instance</cell></row><row><cell>i,i</cell><cell>super(i),i</cell><cell>0.5</cell><cell>returns a too broad instance</cell></row><row><cell>i,i</cell><cell>i,sub(i )</cell><cell>0.5</cell><cell>returns a too broad instance</cell></row><row><cell>i,i</cell><cell>sub(i),i</cell><cell>0.5</cell><cell>returns a more restricted instance</cell></row><row><cell cols="4">Table 2 consider the proximity between relations. It only</cell></row><row><cell cols="4">presents the similarity between equality (=) and other rela-</cell></row><row><cell>tions.</cell><cell></cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3 :</head><label>3</label><figDesc>Effort-based proximity between Entity Pairs DEFINITION 9 (EFFORT-BASED PROXIMITY).</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 5 σ</head><label>5</label><figDesc>conf (n a , n r ) = 1 − |n a − n r |.</figDesc><table><row><cell cols="4">found closest correct similarity comment</cell></row><row><cell>e,e</cell><cell>e,e</cell><cell>σ pair</cell><cell></cell></row><row><cell>e,e</cell><cell>e,e</cell><cell>1</cell><cell>correct correspondence</cell></row><row><cell>c,c</cell><cell>c,sup(c )</cell><cell>1</cell><cell>returns more specialized instances,</cell></row><row><cell></cell><cell></cell><cell></cell><cell>these are correct</cell></row><row><cell>c,c</cell><cell>sup(c),c</cell><cell>0.5</cell><cell>returns more general instances,</cell></row><row><cell></cell><cell></cell><cell></cell><cell>includes some correct results</cell></row><row><cell>c,c</cell><cell>c,sub(c )</cell><cell>0.5</cell><cell>returns more general instances,</cell></row><row><cell></cell><cell></cell><cell></cell><cell>includes some correct results</cell></row><row><cell>c,c</cell><cell>sub(c),c</cell><cell>1</cell><cell>returns more specialized instances,</cell></row><row><cell></cell><cell></cell><cell></cell><cell>these are correct</cell></row><row><cell>r,r</cell><cell>r,sup(r )</cell><cell>1</cell><cell></cell></row><row><cell>r,r</cell><cell>sup(r),r</cell><cell>0.5</cell><cell></cell></row><row><cell>r,r</cell><cell>r,sub(r )</cell><cell>0.5</cell><cell></cell></row><row><cell>r,r</cell><cell>sub(r),r</cell><cell>1</cell><cell></cell></row><row><cell>i,i</cell><cell>i,super(i )</cell><cell>0.5</cell><cell>returns a more restricted instance</cell></row><row><cell>i,i</cell><cell>super(i),i</cell><cell>0</cell><cell>returns a too broad instance</cell></row><row><cell>i,i</cell><cell>i,sub(i )</cell><cell>0</cell><cell>returns a too broad instance</cell></row><row><cell>i,i</cell><cell>sub(i),i</cell><cell>0.5</cell><cell>returns a more restricted instance</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 4 :</head><label>4</label><figDesc>Similarities for Relaxed Precision based on Entity Pairs</figDesc><table><row><cell>found</cell><cell>correct</cell><cell cols="2">similarity comment</cell></row><row><cell>relation</cell><cell>relation</cell><cell>σ rel</cell><cell></cell></row><row><cell>e = e</cell><cell>e = e</cell><cell>1</cell><cell>correct relation</cell></row><row><cell>c = c</cell><cell>c ⊂ c</cell><cell>0.5</cell><cell>returns more instances than correct</cell></row><row><cell>c = c</cell><cell>c ⊃ c</cell><cell>1</cell><cell>returns less instances than possible,</cell></row><row><cell></cell><cell></cell><cell></cell><cell>but these are correct</cell></row><row><cell>r</cell><cell></cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_5"><head>Table 6 :</head><label>6</label><figDesc>Similarities for Relaxed Recall based on Entity Pairs</figDesc><table><row><cell>found</cell><cell>correct</cell><cell cols="2">similarity comment</cell></row><row><cell>relation</cell><cell>relation</cell><cell>σ rel</cell><cell></cell></row><row><cell>e = e</cell><cell>e = e</cell><cell>0</cell><cell>correct relation</cell></row><row><cell>c = c</cell><cell>c ⊂ c</cell><cell>0</cell><cell>returns more instances than correct</cell></row><row><cell>c = c</cell><cell>c ⊃ c</cell><cell>0.5</cell><cell>returns less instances than possible,</cell></row><row><cell></cell><cell></cell><cell></cell><cell>misses some</cell></row><row><cell>r</cell><cell></cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_7"><head>Table 8 :</head><label>8</label><figDesc>Precision recall result on the alignments of Figure 1</figDesc><table /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">http://oaei.inrialpes.fr/2005/</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgements</head><p>This work has been partially supported by the Knowledge Web European network of excellence (IST-2004-507482). The authors would like to thank Diana Maynard who pointed out the problem addressed here.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title/>
		<author>
			<persName><surname>References</surname></persName>
		</author>
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Ontology translation protocol (ontrapro)</title>
		<author>
			<persName><forename type="first">B</forename><surname>Ashpole</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of Performance Metrics for Intelligent Systems (PerMIS &apos;04)</title>
				<editor>
			<persName><forename type="first">E</forename><surname>Messina</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Meystel</surname></persName>
		</editor>
		<meeting>Performance Metrics for Intelligent Systems (PerMIS &apos;04)<address><addrLine>Gaithersburg, MD, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2004-08">August 2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Comparison of schema matching evaluations</title>
		<author>
			<persName><forename type="first">H.-H</forename><surname>Do</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Melnik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Rahm</surname></persName>
		</author>
		<ptr target="http://dol.uni-leipzig.de/pub/2002-28" />
	</analytic>
	<monogr>
		<title level="m">Proc. GI-Workshop &quot;Web and Databases</title>
				<meeting>GI-Workshop &quot;Web and Databases<address><addrLine>Erfurt (DE)</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Learning to map ontologies on the semantic web</title>
		<author>
			<persName><forename type="first">A.-H</forename><surname>Doan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Madhavan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Dhamankar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Domingos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Halevy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">VLDB journal</title>
		<imprint>
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">QOM -quick ontology mapping</title>
		<author>
			<persName><forename type="first">M</forename><surname>Ehrig</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Staab</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. 3rd ISWC</title>
				<meeting>3rd ISWC<address><addrLine>Hiroshima (JP)</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2004-11">November 2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">An API for ontology alignment</title>
		<author>
			<persName><forename type="first">J</forename><surname>Euzenat</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. 3rd international semantic web conference</title>
				<meeting>3rd international semantic web conference<address><addrLine>Hiroshima (JP</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2004">2004</date>
			<biblScope unit="page" from="698" to="712" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">A semantic framework for multimedia document adaptation</title>
		<author>
			<persName><forename type="first">J</forename><surname>Euzenat</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Layaïda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Dias</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. 18th International Joint Conference on Artificial Intelligence (IJCAI)</title>
				<meeting>18th International Joint Conference on Artificial Intelligence (IJCAI)<address><addrLine>Acapulco (MX)</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2003">2003</date>
			<biblScope unit="page" from="31" to="36" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Similarity-based ontology alignment in OWL-lite</title>
		<author>
			<persName><forename type="first">J</forename><surname>Euzenat</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Valtchev</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. 15th ECAI</title>
				<meeting>15th ECAI<address><addrLine>Valencia (ES)</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2004">2004</date>
			<biblScope unit="page" from="333" to="337" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Temporal reasoning based on semi-intervals</title>
		<author>
			<persName><forename type="first">C</forename><surname>Freksa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Artificial Intelligence</title>
		<imprint>
			<biblScope unit="volume">54</biblScope>
			<biblScope unit="issue">1-2</biblScope>
			<biblScope unit="page" from="199" to="227" />
			<date type="published" when="1992">1992</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Methods and practical issues in evaluating alignment techniques</title>
		<author>
			<persName><forename type="first">P</forename><surname>Langlais</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Véronis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Simard</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. 17th international conference on Computational linguistics</title>
				<meeting>17th international conference on Computational linguistics<address><addrLine>Montréal (CA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="1998">1998</date>
			<biblScope unit="page" from="711" to="717" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Binary codes capable of correcting deletions, insertions, and reversals</title>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">V</forename><surname>Levenshtein</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Cybernetics and Control Theory</title>
				<imprint>
			<date type="published" when="1966">1966</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m" type="main">Smart: Automated support for ontology merging and alignment</title>
		<author>
			<persName><forename type="first">N</forename><surname>Noy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Musen</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Hierarchical text classification and evaluation</title>
		<author>
			<persName><forename type="first">A</forename><surname>Sun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E.-P</forename><surname>Lin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. IEEE international conference on data mining</title>
				<meeting>IEEE international conference on data mining</meeting>
		<imprint>
			<date type="published" when="2001">2001</date>
			<biblScope unit="page" from="521" to="528" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<title level="m">Proceedings of the 3rd Evaluation of Ontology-based tools</title>
				<editor>
			<persName><forename type="first">Y</forename><surname>Sure</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">O</forename><surname>Corcho</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Euzenat</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">T</forename><surname>Hughes</surname></persName>
		</editor>
		<meeting>the 3rd Evaluation of Ontology-based tools</meeting>
		<imprint>
			<publisher>EON</publisher>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<title level="m" type="main">Information retrieval</title>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">J</forename><surname>Van Rijsbergen</surname></persName>
		</author>
		<ptr target="http://www.dcs.gla.ac.uk/Keith/Preface.html" />
		<imprint>
			<date type="published" when="1975">1975</date>
			<publisher>Butterworths</publisher>
			<pubPlace>London (UK</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
