<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Improving Automatically Created Mappings using Logical Reasoning</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Christian</forename><surname>Meilicke</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">University of Mannheim</orgName>
								<address>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Heiner</forename><surname>Stuckenschmidt</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">University of Mannheim</orgName>
								<address>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Andrei</forename><surname>Tamilin</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">ITC-irst</orgName>
								<orgName type="institution">University of Trento</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Improving Automatically Created Mappings using Logical Reasoning</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">8023CE91CE2E7EF61A86C1D6CEFA1DBF</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T06:49+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>A lot of attention has been devoted to heuristic methods for discovering semantic mappings between ontologies. Despite impressive improvements, the mappings created by these automatic matching tools are still far from being perfect. In particular, they often contain wrong and redundant mapping rules. In this paper we present an approach for improving such mappings using logical reasoning in the context of Distributed Description Logics (DDL). Our method is orthogonal to the matching algorithm used and can therefore be used in combination with any matching tool. We explain the general idea of our approach informally using a small example and present the results of experiments conducted on the OntoFarm Benchmark which is part of the Ontology Alignment Evaluation challenge.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Motivation</head><p>The problem of semantic heterogeneity is becoming more and more pressing in many areas of information technologies. The Semantic Web is only one area where the problem of semantic heterogeneity has lead to intensive research on methods for semantic integration. The specific problem of semantic integration on the Semantic Web is the need to not only integrate data and schema information, but to also provide means to integrate ontologies, rich semantic models of a particular domain. There are two lines of work connected to the problem of a semantic integration of ontologies:</p><p>-The (semi-) automatic detection of semantic relations between ontologies (e.g., <ref type="bibr" target="#b8">[9,</ref><ref type="bibr" target="#b5">6,</ref><ref type="bibr" target="#b10">11,</ref><ref type="bibr" target="#b11">12,</ref><ref type="bibr" target="#b6">7]</ref>). -The representation and use of semantic relations for reasoning and query answering (e.g., <ref type="bibr" target="#b13">[14,</ref><ref type="bibr" target="#b9">10,</ref><ref type="bibr" target="#b4">5,</ref><ref type="bibr" target="#b2">3,</ref><ref type="bibr" target="#b1">2]</ref>).</p><p>So far, work on representation of and reasoning with mappings has focussed on mechanisms for answering queries and using mappings to compute subsumption relationships between concepts in the mapped ontologies. These methods always assumed that the mappings used are manually created and of high quality (in particular consistent). In this paper we investigate logical reasoning about mappings that are not assumed to be perfect. In particular, our methods can be used to check (automatically created) mappings for formal and conceptual consistency and determine implied mappings that have not explicitly been represented. We investigate such mappings in the context of Distributed Description Logics <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b12">13]</ref>, an extension of traditional description logics with mappings between concepts in different T-boxes. The functionality described in this paper will become more important in the future because more and more ontologies are created and need to be linked. For larger ontologies the process of mapping will not be done completely by hand, but will rely on or will at least be supported by automatic mapping approaches. We see our work as a contribution to semi-automatic approaches for creating mappings between ontologies where possible mappings are computed automatically and then corrected manually making use of methods for checking the formal and conceptual properties of the mappings.</p><p>In previous work we have proposed a number of formal properties of mappings in Distributed Description Logics that we consider useful for judging the quality of a set of mappings <ref type="bibr" target="#b15">[16]</ref>. In this paper, we refine and extend this work in several directions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Debugging of mappings</head><p>We propose a process for (semi-)automatically debugging automatically created mappings making use of some of the properties mentioned above. In particular we use the notion of mapping consistency to detect problems caused by the mappings. For each potential problem, we determine the minimal set of mapping rules responsible for the problem (minimal conflict set). For each conflict set, we try to identify which mapping rule is incorrect and remove it form the mapping.</p><p>Implementation On top of the DRAGO reasoning system <ref type="bibr" target="#b14">[15]</ref> we built a prototype of mapping debugger for computing minimal conflict sets with respect to an inconsistency caused by a mapping as well as some heuristics for automatic repairing of an inconsistent mapping. We further added a minimization functionality for computing minimal mapping sets from redundant ones.</p><p>Experiments We tested the approach using the OntoFarm data set, a set of several rich OWL ontologies describing the domain of conference management systems <ref type="bibr" target="#b16">[17]</ref>. We used the CtxMatch matching tool to automatically create mappings between each of the ontologies. We further automatically determined problems (in particular unsatisfiable concepts) created by the mapping and tried to fix them automatically using the debugging process proposed in this paper. In the concluding step of the experimental study, we tried to compute for each mapping its logically-equivalent minimal version.</p><p>The structure of the paper is as follows. We start with a brief recall of basic definitions of Distributed Description Logics and explanations of the reasoning mechanisms. Then we describe the intuitions of our debugging/minimization approaches using a small example. Finally, we report on some preliminary experimental evaluation of the techniques proposed in this paper and summarize the results.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Distributed Description Logic</head><p>Distributed Description Logic framework (DDL) is a formal tool for representing and reasoning with multiple ontologies pairwise linked by semantic mappings. In this section, we briefly recall some key definitions and properties of DDL relying on the original studies in <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b12">13]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Syntax and Semantics</head><p>Given a set I of indexes, used to enumerate a set of ontologies, a Distributed Description Logics is then a collection {DL i } i∈I of Description Logics. Each ontology i is formalized by a T-box T i of DL i , so that the initial set of ontologies in DDL corresponds to a family of T-boxes T = {T i } i∈I . To distinguish the descriptions from various T i in the family, DDL utilizes a prefix notation to pin descriptions to ontologies where they are considered in, e.g., i : X, i : X ⊑ Y . Semantic relations between pairs of ontologies a represented in DDL by bridge rules. A bridge rule from i to j is an expression of the following two forms:</p><p>i :</p><formula xml:id="formula_0">X ⊑ −→ j : Y -an into-bridge rule i : X ⊒ −→ j :</formula><p>Y -an onto-bridge rule where X and Y are concepts of ontologies T i and T j respectively. The derived bridge rule i : X ≡ −→ j : Y can be defined as the conjunction of corresponding into-and onto-bridge rule.</p><p>Intuitively, the into-bridge rule i : Bachelor ⊑ −→ j : Student states that, from the j-th point of view the concept Bachelor in i is more specific than its local concept Student. Similarly, the onto-bridge rule i : Scientif icEvent The semantics of DDL is based on the key assumption that each ontology T i in the family is locally interpreted by interpretation I i on its local interpretation domain ∆ Ii . The semantic correspondences between heterogeneous local domains, e.g., the representations of a registration fee in US Dollars and in Euro, are modeled in DDL by a domain relation.</p><p>A domain relation r ij represents a possible way of mapping the elements of ∆ Ii into the domain</p><formula xml:id="formula_1">∆ Ij : r ij ⊆ ∆ Ii × ∆ Ij such that r ij denotes {d ′ ∈ ∆ Ij | d, d ′ ∈ r ij }; for any subset D of ∆ Ii , r ij (D) denotes d∈D r ij (d); and for any R ⊆ ∆ Ii × ∆ Ij r ij (R) denotes d,d ′ ∈R r ij (d) × r ij (d ′ ).</formula><p>For instance, if ∆ I1 and ∆ I2 are the representations of a registration fee in US Dollars and in Euro, then r 12 could be a rate of exchange function, or some other approximation relation.</p><p>A distributed interpretation I = {I i } i∈I , {r ij } i =j∈I of a distributed T-box T = T , B consists of a family of local interpretations I i on local interpretation domains ∆ Ii , one for each T i , and a family of domain relations r ij between these local domains. A distributed interpretation I is said to satisfy a distributed T-box T = T , B , written</p><formula xml:id="formula_2">I |= T, if all T-boxes in T are satisfied I |= T i , if I i |= A ⊑ B for all A ⊑ B ∈ T i</formula><p>and all bridge rules in B are satisfied:</p><formula xml:id="formula_3">I |= i : X ⊑ −→ j : Y, if r ij (X Ii ) ⊆ Y Ij I |= i : X ⊒ −→ j : Y, if r ij (X Ii ) ⊇ Y Ij Given a distributed T-box T = T , B , one can perform some basic Distributed DL inferences. A concept i : C is satisfiable with respect to T if there exist a distributed interpretation I of T such that C Ii = ∅. A concept i : C is subsumed by a concept i : D with respect to T (T |= i : C ⊑ D) if for every distributed interpretation I of T we have that C Ii ⊆ D Ii .</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">DDL Inference Mechanisms</head><p>Although both in DL and Distributed DL the fundamental reasoning services lay in verification of concepts satisfiability/subsumption within a certain ontology, in DDL, besides the ontology itself, the reasoning also depends on other ontologies that affect it through semantic mappings. This affection consist in the ability of bridge rules to propagate the knowledge across ontologies in form of subsumption axioms.</p><p>The simplest case illustrating the knowledge propagation in DDL is the following:</p><formula xml:id="formula_4">i : A ⊑ B, i : A ⊒ −→ j : G, i : B ⊑ −→ j : H j : G ⊑ H<label>(1)</label></formula><p>In languages that support disjunction, the simplest propagation rule can be generalized to the propagation of subsumption between a concept and a disjunction of other concepts in the following way:</p><formula xml:id="formula_5">i : A ⊑ B 1 ⊔ . . . ⊔ B n , i : A ⊒ −→ j : G, i : B k ⊑ −→ j : H k (1 ≤ k ≤ n) j : G ⊑ H 1 ⊔ . . . ⊔ H n<label>(2)</label></formula><p>The important property of the described knowledge propagation is that it is directional, i.e., bridge rules from i to j support knowledge propagation only from i towards j. It has been shown in <ref type="bibr" target="#b12">[13]</ref> that adding the inference pattern (2) to existing DL tableaux reasoning methods lead to a correct and complete method for reasoning in DDL. This method has been implemented in the DRAGO DDL reasoner.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">The Debugging Process</head><p>In this section we will explain the general idea of our approach for improving automatically created mappings based on reasoning about mappings in Distributed Description Logics using a simple example. In particular, we consider two ontologies in the domain of conference management systems, the same domain we did our experiments in. For each ontology, i and j, we only consider a single axiom, namely: i : Author ⊑ P erson and j : P erson ⊑ ¬Authorization</p><p>These simple axioms that describe the concept of a person in two different ontologies -one stating that an author is a special kind of person and the other one stating that the concepts Person and Authorization (to access submitted papers) are disjoint concept -are enough to explain the important features of our approach. The approach consists of the following steps.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Mapping Creation</head><p>In the first step, we use any existing system for matching ontologies to create an initial set of mapping hypotheses. In particular, we are interested in mappings between class names, because these are the kinds of mappings that we can reason about using DDL framework. In order to support automatical repair of inconsistent mappings later on, the matching algorithm chosen should ideally not only return a set of mappings, but also a level of confidence in the correctness of a mapping. For the sake of simplicity, we assume that we use a simple string matching method that compares the overlap in concept names and computes a similarity value that denotes the relative size of the common substring <ref type="foot" target="#foot_0">1</ref> . Mappings are created based on a threshold for this value that we assume to be 1/3. Applying this method to the example will result in the following two mappings with corresponding levels of confidence:</p><formula xml:id="formula_6">i : P erson ≡ −→ j : P erson, 1.00 i : Author ≡ −→ j : Authorization, 0.46</formula><p>We further assume that the mapping method also applies some structural heuristics to derive additional mappings and propagates the levels of confidence accordingly. For instance, the fact that i : P erson is a superconcept of i : Author which is assumed to be equivalent to j : Authorization may be used to derive the following mapping:</p><formula xml:id="formula_7">i : P erson ⊒ −→ j : Authorization, 0.46</formula><p>In the same way, the fact that i : Author is a subconcept of i : P erson and the fact that i : P erson is assumed to be equivalent to j : P erson may be used to the following addition mapping:</p><formula xml:id="formula_8">i : Author ⊑ −→ j : P erson, 1.00</formula><p>We can easily see that the process has produced two incorrect mappings, namely the ones with a confidence of 0.46. It could be argued that it is easy to get rid of these incorrect mappings by raising the threshold to 0.5 for instance. This however is no sustainable solution to the problem, because there might be mappings with a level of confidence below 0.5 that are correct, on the other hand, there might still be incorrect mappings with a confidence of more than 0.5. Instead of relying on artificially set thresholds, we propose to analyze the impact of created mappings on the connected ontologies and to eliminate mappings that have a malicious influence.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Diagnosis</head><p>The mapping set described in the last step now serves as a basis for analyzing the effect of mappings and detecting malicious mappings. This process is similar to the well known concept of model-based diagnosis which has already successfully been applied to the task of detecting wrong axioms in single ontologies. Similar to existing approaches for diagnosing ontologies, our starting point are unsatisfiable concepts which are interpreted as symptoms for which a diagnosis has to be computed. Compared to the general task of diagnosing ontologies, we are in a lucky position, because we have to deal with a much smaller set of potential diagnosis. In particular, we claim that the ontologies connected in the first step do not contain unsatisfiable concepts. If we now observe unsatisfiable concepts in the target ontology <ref type="foot" target="#foot_1">2</ref> and assuming that the ontologies themselves are correct, we know that they have to be caused by some mappings in the mapping set.</p><p>To illustrate this situation, we can have a look at our example again. Using existing techniques for reasoning in DDL, we can derive that the concept Authorization is globally unsatisfiable, i.e., j : Authorization I = ∅, because we have Authorzation ⊑ ¬P erson and at the same time, we can infer Authorization ⊑ P erson. There are two reasons for this, namely: Interpreting the inconsistency of the concept j : Authorization as a symptom, we can now try to identify and repair the cause of this inconsistency. For this purpose, we compute irreducible conflict set for this symptom. Here an irreducible conflict set is a set of mappings that makes the concept unsatisfiable and has the additional property that removing a mapping from the set makes the concept satisfiable again. the arguments above it is easy to see that he have the following irreducible conflict sets: In classical diagnosis, all conflict sets<ref type="foot" target="#foot_2">3</ref> are computed and the diagnosis is computed from these conflict sets using the hitting set algorithm. For the case of diagnosing mappings this is neither computationally feasible nor does it provide the expected result. In our example, the hitting set would consist of the mapping i : P erson ≡ −→ j : P erson which, as we sill see later, is the only mapping that actually carries some correct information.</p><formula xml:id="formula_9">j : Authorization Ij = r ij (i : Author Ii ) ⊆ r ij (i : P erson Ii ) = j : P erson Ij</formula><p>Our solution to the problem is to use an iterative approach that computes an often not minimal hitting set by determining one conflict set at a time and immediately fixing it in the way described in the next section. In our example, the algorithm will first detect the second conflict and fix it, afterwards, the method checks whether the concept j : Authorization is still inconsistent. As this is the case, the second conflict set will be detected and fixed as well removing the problem.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3">Heuristic Debugging</head><p>As mentioned above, the result of the diagnosis step is an irreducible conflict sets, in particular a set of mappings that make a concept unsatisfiable and with the additional property that removing one mapping from this set solves the problem in the sense that the concept becomes satisfiable. The underlying idea of our approach is now that unsatisfiable concepts are the result of wrong mappings. This means that each irreducible conflict set contains at least one mapping rule that does state a correct semantic relation between concepts and therefore should not be in the set of mappings. The goal of the debugging step is now to identify this malicious mapping and remove it from the overall mapping set. If we chose the right mapping for removal the quality of the overall mapping set should be improved, because a wrong mapping has been removed. In the case of our example, the first irreducible conflict set that will be considered consists of the following two mappings one of which we have to remove: There are different ways now, in which a decision about the mapping to remove could be made. The easiest way is to use an interactive approach where the conflict sets are presented to a human user who decides which mapping should be removed. In our case, the user will easily be able to decide that the mapping i : Author ≡ −→ j : Authorization is not correct and should be removed. In the second iteration, the following two mappings will be in the irreducible conflict set: For this set the user will be able to see immediately that the second mapping should be removed, because it is not correct. This approach sound trivial, but in the presence of large mapping sets, providing the user with feedback about potential problems in terms of small conflict sets is of great help and often reveals problems that are hard to see when looking at the complete mapping set.</p><p>We can also try to further automate the debugging process by letting the system decide, which mapping rule to eliminate. In cases where the matching system already provides a measure of confidence, this is again quite simple, as we can simply remove the mapping rule with the lowest degree of confidence. In our case this is again the rule i : Author ≡ −→ j : Authorization and removing it will lead to a better mapping set. It is not always possible, however, to rely on the confidence provided by the matching system, either because the system simply does not provide any or because the levels of confidence provided are not informative. In our experiments, we often had the situation where all mapping even though they were conflicting had a confidence of 100% attached. In this case, we have to think of a new way of ranking mappings. An approach that we used in our experiments that turned out to work quite well is to compute the semantic distance of the concept names involved using WordNet synsets. For the example above it is clear that this heuristic will also lead to an exclusion of the second rule, because the class names in the first rule are equivalent and therefore have the least semantic distance possible. In cases where no distinction can be made using this heuristic, we have to switch back to the interactive mode and ask the user which mapping to remove. In any cases, the debugging step leaves us with a single mapping that does not create any inconsistencies. In order to get a complete set of correct mappings, we can now infer all additional mappings that follow from this one which leads us to the corrected final set of mappings in our case this final set if the following. In summary, the process above is a way to improve the quality of automatically generated mapping sets by means of intelligent post-processing. Using formal properties of mappings and logical reasoning we are able to detect wrong mappings by analyzing their impact and tracking unwanted effects back to the mapping rules that caused them. In this our method is not yet another ontology matching method, but it is actually orthogonal to existing developments in the area of ontology matching as it can be applied to any set of mappings. The approach can be extended in several directions. First of all we can use symptoms other than concept satisfiability as a starting point for diagnosis. Further, we can use the method on joint sets of competing mappings created by different matching algorithms. This will help us to get a better coverage of the actual semantic relations and the trust in the quality of the different matching algorithms provides us with an additional criterion for selecting mappings to be discarded.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4">Minimization</head><p>A further improvement of the debugged mapping can be achieved by removing redundant mappings -mappings that logically follow from other mappings. In <ref type="bibr" target="#b15">[16]</ref> we defined the notion of minimality of a mapping that we use in this context to remove redundant mappings. In the example for instance, the two mappings derived using structural heuristics do not really add new information to the system, because they can be derived from the two equivalence mappings that have been created first. In particular i : P erson ⊒ −→ j : Authorization, is redundant information, because:</p><formula xml:id="formula_10">i : Author Ii ⊆ i : P erson Ii (3) =⇒ r ij (Author Ii ) ⊆ r ij (P erson Ii )<label>(4)</label></formula><formula xml:id="formula_11">r ij (P erson Ii ) = j : P erson I j (5) =⇒ r ij (Author) ⊆ j : P erson Ij<label>(6)</label></formula><p>This means that for reasoning with automatically created mappings, we only have to take into account the equivalence mapping between the person concept in the two ontologies, because it is the basis for inferring the other one. For this reason, we remove all mappings that can be shown to be redundant in the sense that they can be derived from using other mappings from the set of mappings and only continue with the resulting minimal mapping set that still carries all the semantics of the complete set.</p><p>In this section we report on some preliminary experimental evaluation of the mapping debugging/minimization techniques presented in the preceding sections. All the experiments have been conducted on the prototype of the debugger/minimizer implemented on top of the DRAGO DDL reasoner <ref type="bibr" target="#b14">[15]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1">Experimental Setting</head><p>To perform experiments, we used a set of ontologies developed in the OntoFarm project <ref type="bibr" target="#b16">[17]</ref> which are used as a part of Benchmark in Ontology Alignment Evaluation challenge. 4 In particular, we selected several ontologies modeling the domain of conference organization: Given this ontology test set, we apply the following experimental scenario. Using the CtxMatch matching tool <ref type="bibr" target="#b3">[4]</ref>, we automatically compute mappings between pairs of ontologies in the test set. Among the created mappings, we further identify those ones which are capable of producing unsatisfiable classes and therefore need to be debugged first. In the process of debugging, malicious bridge rules in mappings are automatically diagnosed and removed in accordance with the heuristic debugging discussed in Section 3. In the concluding step of the experimental study, we apply the minimization algorithm to compute for each mapping a logically-equivalent minimal set of bridge rules. Note that for those mappings which demand the debugging first the minimization is applied to their repaired descendants.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">Results</head><p>The results of applying the heuristic debugging and minimization techniques to the automatically generated mappings are summarized in Table <ref type="table" target="#tab_2">1 and Table 2</ref>. More information about the test data and results can be obtained visiting the applications section of the DRAGO reasoner web page. 5  During the debugging process we performed the following measurements: the initial amount of bridge rules in the mapping to be debugged, number of classes which become unsatisfiable due to the mapping, and finally the sets of bridge rules which are diagnosed as malicious and are automatically removed by the debugging algorithm. After the removal of malicious bridge rules, a mapping becomes repaired in a sense that it is not capable of producing unsatisfiability anymore. As shown in Table <ref type="table" target="#tab_1">1</ref>, the results of   applying the heuristic debugging approach proposed in Section 3 are quite reassuringall of the mappings automatically removed by our method are actually incorrect ones.</p><p>To estimate minimization rate we measured the initial number of bridge rules and the amount of logically entailed bridge rules discovered by applying the minimization technique. As summarized in Table <ref type="table" target="#tab_2">2</ref>, the amount of the entailed bridge rules in a certain automatically generated mapping varies from 50 to 80% to the initial number of bridge rules in this mapping.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Discussion</head><p>We have presented a method for automatically improving the result of heuristic matching systems using logical reasoning. The basic idea is similar to existing work on debugging ontologies and uses some non-standard inference methods for reasoning about mappings introduced in previous work. The method feeds on the fact that most existing matching algorithms ignore the logical implications of new mappings. This gap is filled by our method that detects malicious impacts of generated mappings and traces them back to their source. As we have shown in the experiments, in almost all cases (in fact in all cases observed in the experiment) the unwanted effects were caused by wrong mappings and we were able to remove them automatically thus improving the correctness of the generated mapping. Actually, the idea of using logical reasoning in the matching process is not new and has been proposed by others (e.g., <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b7">8]</ref>), the way it is used in our work, however, is unique, as it is the only approach that takes the effects of mappings into account. We believe that this additional step can significantly improve the quality of matching methods and should be integrated in existing matching algorithms as far as they are concerned with expressive ontologies that support consistency checking. In fact, the expressiveness of the language used to encode the ontologies to be matched seems to be the only limitation of our approach which can only be applied if the language supports consistency checking. In our experiments, we have seen that we can improve the correctness of matching results by removing wrong mappings. So far, we did not quantify this improvement, this has to be done in future work.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>⊒</head><label></label><figDesc>−→ j : Conf erence expresses the more generality relation. A distributed T-box T = T , B consists of a collection of T-boxes T = {T i } i∈I and a collection of bridge rules B = {B ij } i =j∈I between them.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head></head><label></label><figDesc>and j : Authorization Ij ⊆ r ij (i : P erson Ii ) = j : P erson Ij</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head></head><label></label><figDesc>{i : P erson ≡ −→ j : P erson, i : Author ≡ −→ j : Authorization} and {i : P erson ≡ −→ j : P erson, i : P erson ⊒ −→ j : Authorization}</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>i</head><label></label><figDesc>: P erson ≡ −→ j : P erson, 1.00 i : Author ≡ −→ j : Authorization, 0.46</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>i</head><label></label><figDesc>: P erson ≡ −→ j : P erson, 1.00 i : P erson ⊒ −→ j : Authorization, 0.46</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>i</head><label></label><figDesc>: P erson ≡ −→ j : P erson, 1.00 i : Author ⊒ −→ j : P erson, 1.00</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_6"><head>⊒−→</head><label></label><figDesc>CM T : P rogramCommitteeChair CRS-CONFTOOL 80 27 30 CRS : conf erence ⊑ −→ CON F T OOL : Organization CRS : person ⊑ −→ CON F T OOL : Event CRS : person ⊒ −→ CON F T OOL : P oster CRS : document ⊑ −→ CON F T OOL : Event CRS : author ⊑ −→ CON F T OOL : Event CRS : participant ⊑ −→ CON F T OOL : Event CRS : event ⊒ −→ CON F T OOL : P erson . . . PCS-CONFTOOL 45 5 5 P CS : Conf erence ⊑ −→ CON F T OOL : Organization P CS : Report ⊑ −→ CON F T OOL : Event P CS : Report ⊑ −→ CON F T OOL : Organization P CS : P ERSON ⊒ −→ CON F T OOL : P oster P CS : Accepted paper ⊑ −→ CON F T OOL : Event PCS-EKAW 120 4 5 P CS : P ERSON ⊒ −→ EKAW : F lyer P CS : P ERSON ⊒ −→ EKAW : M ulti − author V olume P CS : P ERSON ⊒ −→ EKAW : P roceedings P CS : W eb site ⊑ −→ EKAW : Event P CS : P ERSON ⊒ −→ EKAW : Conf erence P roceedings SIGKDD-CMT 60 1 1 SIGKDD : P rogram Committee ⊒ −→ CM T : P rogramCommitteeChair SIGKDD-CONFTOOL 72 3 3 SIGKDD : Conf erence ⊑ −→ CON F T OOL : Organization SIGKDD : P erson ⊒ −→ CON F T OOL : P oster SIGKDD : Deadline Author notif ication ⊑ −→ CON F T OOL : P erson SIGKDD-CRS 57 1 1 SIGKDD : Document ⊒ −→ CRS : program SIGKDD-EKAW 127 4 5 SIGKDD : P erson ⊒ −→ EKAW : F lyer SIGKDD : P erson ⊒ −→ EKAW : M ulti − author V olume SIGKDD : P erson ⊒ −→ EKAW : P roceedings SIGKDD : Deadline Author notif ication ⊑ −→ EKAW : P erson SIGKDD : P erson ⊒ −→ EKAW : Conf erence P roceedings</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 1 .</head><label>1</label><figDesc>Debugging results.</figDesc><table><row><cell>Mapping</cell><cell>Bridge rules count</cell><cell>Entailed bridge rules count</cell><cell>Bridge rules reduction rate</cell><cell>Mapping</cell><cell>Bridge rules count</cell><cell>Entailed bridge rules count</cell><cell>Bridge rules reduction rate</cell></row><row><cell>CMT-CONFTOOL  *</cell><cell>45</cell><cell>34</cell><cell>76%</cell><cell>EKAW-CMT</cell><cell>115</cell><cell>96</cell><cell>83%</cell></row><row><cell>CMT-CRS  *</cell><cell>52</cell><cell>38</cell><cell>73%</cell><cell>EKAW-SIGKDD</cell><cell>127</cell><cell>95</cell><cell>75%</cell></row><row><cell>CMT-SIGKDD</cell><cell>59</cell><cell>45</cell><cell>76%</cell><cell>PCS-CONFTOOL  *</cell><cell>40</cell><cell>25</cell><cell>63%</cell></row><row><cell>CMT-EKAW  *</cell><cell>111</cell><cell>94</cell><cell>85%</cell><cell>PCS-CRS</cell><cell>38</cell><cell>21</cell><cell>55%</cell></row><row><cell>CONFTOOL-CMT</cell><cell>48</cell><cell>34</cell><cell>71%</cell><cell>PCS-SIGKDD</cell><cell>56</cell><cell>36</cell><cell>64%</cell></row><row><cell>CONFTOOL-CRS  *</cell><cell>65</cell><cell>40</cell><cell>62%</cell><cell>PCS-CMT</cell><cell>73</cell><cell>58</cell><cell>79%</cell></row><row><cell>CONFTOOL-SIGKDD</cell><cell>75</cell><cell>43</cell><cell>57%</cell><cell>PCS-EKAW  *</cell><cell>115</cell><cell>96</cell><cell>83%</cell></row><row><cell>CONFTOOL-PCS</cell><cell>45</cell><cell>27</cell><cell>60%</cell><cell>SIGKDD-CMT  *</cell><cell>59</cell><cell>45</cell><cell>76%</cell></row><row><cell>CRS-CMT  *</cell><cell>50</cell><cell>34</cell><cell>68%</cell><cell>SIGKDD-CONFTOOL  *</cell><cell>69</cell><cell>41</cell><cell>59%</cell></row><row><cell>CRS-CONFTOOL  *</cell><cell>50</cell><cell>37</cell><cell>74%</cell><cell>SIGKDD-CRS  *</cell><cell>56</cell><cell>34</cell><cell>61%</cell></row><row><cell>CRS-SIGKDD</cell><cell>57</cell><cell>34</cell><cell>60%</cell><cell>SIGKDD-PCS</cell><cell>56</cell><cell>36</cell><cell>64%</cell></row><row><cell>CRS-PCS</cell><cell>38</cell><cell>21</cell><cell>55%</cell><cell>SIGKDD-EKAW  *</cell><cell>122</cell><cell>94</cell><cell>77%</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 2 .</head><label>2</label><figDesc>Minimization results (starred mappings were first repaired applying the debugging).</figDesc><table /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">of course we use more sophisticated methods in the real experiments</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">the formal semantics of DDL guarantees that the addition of mappings cannot lead to unsatisfiable concepts in the source ontology</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2"><ref type="bibr" target="#b2">3</ref> in classical diagnosis often only minimal conflict sets are considered</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgements</head><p>This work was partially supported by the German Science Foundation in the Emmy-Noether Program and by the European Union under grant FP6-507482 (KnowledgeWeb) as part of the T-Rex exchange program.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Distributed description logics: Assimilating information from peer sources</title>
		<author>
			<persName><forename type="first">A</forename><surname>Borgida</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Serafini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Data Semantics</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="153" to="184" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Specification of a common framework for characterizing alignment</title>
		<author>
			<persName><forename type="first">P</forename><surname>Bouquet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Euzenat</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Franconi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Serafini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Stamou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Tessaris</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Deliver</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="issue">4</biblScope>
			<date type="published" when="2004">2004</date>
			<publisher>KnowledgeWeb</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">C-OWL: Contextualizing ontologies</title>
		<author>
			<persName><forename type="first">P</forename><surname>Bouquet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Giunchiglia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Van Harmelen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Serafini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Stuckenschmidt</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2nd International Semantic Web Conference (ISWC-03)</title>
				<meeting>the 2nd International Semantic Web Conference (ISWC-03)</meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2003">2003</date>
			<biblScope unit="volume">2870</biblScope>
			<biblScope unit="page" from="164" to="179" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Semantic coordination: a new approach and an application</title>
		<author>
			<persName><forename type="first">P</forename><surname>Bouquet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Serafini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Zanobini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Second Internatinal Semantic Web Conference</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<meeting>the Second Internatinal Semantic Web Conference</meeting>
		<imprint>
			<publisher>Springer Verlag</publisher>
			<date type="published" when="2003">2003</date>
			<biblScope unit="volume">2870</biblScope>
			<biblScope unit="page" from="130" to="145" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">A framework for ontology integration</title>
		<author>
			<persName><forename type="first">D</forename><surname>Calvanese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">De</forename><surname>Giacomo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lenzerini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Semantic Web Working Symposium</title>
				<meeting>the Semantic Web Working Symposium<address><addrLine>Stanford, CA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2001">2001</date>
			<biblScope unit="page" from="303" to="316" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">QOM -Quick Ontology Mapping</title>
		<author>
			<persName><forename type="first">M</forename><surname>Ehrig</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Staab</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 3rd International Semantic Web Conference (ISWC-04)</title>
				<meeting>the 3rd International Semantic Web Conference (ISWC-04)</meeting>
		<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">S-match: an algorithm and an implementation of semantic matching</title>
		<author>
			<persName><forename type="first">F</forename><surname>Giunchiglia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Shvaiko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Yatskevich</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the European Semantic Web Conference (ESWS-04)</title>
				<meeting>the European Semantic Web Conference (ESWS-04)</meeting>
		<imprint>
			<date type="published" when="2004">2004</date>
			<biblScope unit="page" from="61" to="75" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Efficient semantic matching</title>
		<author>
			<persName><forename type="first">F</forename><surname>Giunchiglia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Yatskevich</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Giunchiglia</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the European Semantic Web Conference (ESWS-05)</title>
				<meeting>the European Semantic Web Conference (ESWS-05)</meeting>
		<imprint>
			<date type="published" when="2005">2005</date>
			<biblScope unit="page" from="272" to="289" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Combining and standardizing largescale, practical ontologies for machine translation and other uses</title>
		<author>
			<persName><forename type="first">E</forename><surname>Hovy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 1st International Conference on Language Resources and Evaluation (LREC)</title>
				<meeting>the 1st International Conference on Language Resources and Evaluation (LREC)</meeting>
		<imprint>
			<date type="published" when="1998">1998</date>
			<biblScope unit="page" from="535" to="542" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Representing and reasoning about mappings between domain models</title>
		<author>
			<persName><forename type="first">J</forename><surname>Madhavan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">A</forename><surname>Bernstein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Domingos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Halevy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 18th National Conference on Artificial Intelligence (AAAI-02)</title>
				<meeting>the 18th National Conference on Artificial Intelligence (AAAI-02)</meeting>
		<imprint>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Similarity flooding: A versatile graph matching algorithm and its application to schema matching</title>
		<author>
			<persName><forename type="first">S</forename><surname>Melnik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Garcia-Molina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Rahm</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 18th International Conference on Data Engineering (ICDE-02)</title>
				<meeting>the 18th International Conference on Data Engineering (ICDE-02)</meeting>
		<imprint>
			<publisher>IEEE Computing Society</publisher>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">The PROMPT suite: Interactive tools for ontology merging and mapping</title>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">F</forename><surname>Noy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Musen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal of Human-Computer Studies</title>
		<imprint>
			<biblScope unit="volume">59</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="983" to="1024" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Aspects of distributed and modular ontology reasoning</title>
		<author>
			<persName><forename type="first">L</forename><surname>Serafini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Borgida</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Tamilin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI-05)</title>
				<meeting>the 19th International Joint Conference on Artificial Intelligence (IJCAI-05)</meeting>
		<imprint>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">A formal investigation of mapping languages for terminological knowledge</title>
		<author>
			<persName><forename type="first">L</forename><surname>Serafini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Stuckenschmidt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Wache</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI-05)</title>
				<meeting>the 19th International Joint Conference on Artificial Intelligence (IJCAI-05)</meeting>
		<imprint>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">DRAGO: Distributed reasoning architecture for the semantic web</title>
		<author>
			<persName><forename type="first">L</forename><surname>Serafini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Tamilin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2nd European Semantic Web Conference (ESWC-05)</title>
				<meeting>the 2nd European Semantic Web Conference (ESWC-05)</meeting>
		<imprint>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Reasoning about ontology mappings</title>
		<author>
			<persName><forename type="first">H</forename><surname>Stuckenschmidt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Wache</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Serafini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the ECAI-06 Workshop on Contextual Representation and Reasoning</title>
				<meeting>the ECAI-06 Workshop on Contextual Representation and Reasoning</meeting>
		<imprint>
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Ontofarm: Towards an experimental collection of parallel ontologies</title>
		<author>
			<persName><forename type="first">O</forename><surname>Svab</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Svatek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Berka</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Rak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Tomasek</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Poster Proceedings of the International Semantic Web Conference 2005 (ISWC-05</title>
				<imprint>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
