COMA++: Results for the Ontology Alignment Contest OAEI 2006

Presentation of the system

COMA++ is an extension of our previous COMA prototype [1]. It is a customizable and generic tool for matching both schemas and ontologies specified in languages such as SQL, XML Schema or OWL [2]. COMA++ offers a GUI and supports the combined use of several match algorithms as well as the reuse of previously confirmed match results [6]. The COMA++ architecture is shown in figure 1. The Repository persistently stores all match-related data, the Model and Mapping Pools manage all schemas, ontologies, and mappings in memory, and the Matching Engine performs the match operations. The GUI provides access to these components and is used to visualize models and to manage the match process and the mappings. The Matching Engine contains different libraries that support many match algorithms and match strategies. The similarity results of individual matchers are maintained and aggregated within a similarity matrix per match task [1]. Match strategies implement workflows to deal with complex match tasks; they enable the reuse of previous results and the decomposition of larger match tasks into smaller ones [3].

State, purpose, general statement

COMA and COMA++ have proven to be very effective for matching database and XML schemas [1,4,6]. Our main reason for taking part in this evaluation was to see how effective a generic matching tool is when dealing with ontologies.

Specific techniques used

An automatic match process in COMA++ consists of several steps. In the first step, the imported schemas and ontologies are transformed into a generic graph representation. The graph nodes represent schema/ontology components such as classes or properties and have attributes like name and data type. All relationships, e.g. aggregations and specializations, are uniformly represented by edges between nodes. In the next step, graph nodes are matched with each other using a match strategy and matchers. No distinction is made between node types, so that, for example, a class can be matched with a property. The similarity values obtained by the individual matchers are aggregated according to a combination strategy (average, etc.). The match candidates are then selected from the aggregated correspondences, e.g. based on a threshold criterion. Finally, the result mapping (RDF alignment) is generated.
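The aggregation and selection steps described above can be illustrated with a minimal sketch. It is not the COMA++ implementation: the function names, the dictionary-based similarity matrix and the treatment of a missing pair as similarity 0.0 are assumptions made for illustration only.

```python
from typing import Dict, List, Tuple

Pair = Tuple[str, str]          # (source node, target node)
SimMatrix = Dict[Pair, float]   # similarity value per node pair (assumed layout)


def aggregate_average(matrices: List[SimMatrix]) -> SimMatrix:
    """Combine several matcher results by averaging per node pair.

    Assumption: a pair that a matcher did not score counts as 0.0.
    """
    pairs = set().union(*(m.keys() for m in matrices))
    return {
        pair: sum(m.get(pair, 0.0) for m in matrices) / len(matrices)
        for pair in pairs
    }


def select_by_threshold(sim: SimMatrix, threshold: float) -> List[Tuple[str, str, float]]:
    """Keep only correspondences whose aggregated similarity reaches the threshold."""
    return sorted((s, t, v) for (s, t), v in sim.items() if v >= threshold)


# Hypothetical results of two matchers for the same match task.
name_sim = {("Book", "Book"): 1.0, ("Book", "Volume"): 0.2}
comment_sim = {("Book", "Book"): 0.8, ("Book", "Volume"): 0.6}

combined = aggregate_average([name_sim, comment_sim])
mapping = select_by_threshold(combined, threshold=0.5)
```

With these made-up values only the pair Book/Book survives the selection, because the averaged similarity of Book/Volume falls below the threshold.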

In addition to the schema-based matchers we used an instance-level matcher which has recently been added to the COMA++ match library.

Adaptations made for the evaluation

Apart from the integration of an instance matcher, only a few changes to COMA++ were necessary to deal with the specifics of the contest. As mentioned, the output mapping was translated into the predefined RDF alignment format. Furthermore, the result of a matcher was ignored if it contained the same similarity value for all entities. This minor adaptation was made because the same strategy had to be used for all tests.
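The rule of ignoring a matcher that returns the same similarity value for all entities can be sketched as follows. This is an illustrative reading of the rule, not the COMA++ code; the names `is_uninformative` and `informative_matrices`, the tolerance and the matrix layout are assumptions.

```python
from typing import Dict, Tuple

SimMatrix = Dict[Tuple[str, str], float]


def is_uninformative(matrix: SimMatrix, tol: float = 1e-9) -> bool:
    """True if the matcher assigned (almost) the same value to every pair.

    Assumption: an empty result is treated as uninformative as well.
    """
    values = list(matrix.values())
    return not values or max(values) - min(values) <= tol


def informative_matrices(results: Dict[str, SimMatrix]) -> Dict[str, SimMatrix]:
    """Drop matcher results that cannot help to discriminate between pairs."""
    return {name: m for name, m in results.items() if not is_uninformative(m)}


# Hypothetical case: the ontologies carry no comments, so the Comment
# matcher yields 0.0 everywhere and is left out of the combination.
results = {
    "NameType": {("a", "x"): 0.9, ("a", "y"): 0.1},
    "Comment": {("a", "x"): 0.0, ("a", "y"): 0.0},
}
kept = informative_matrices(results)
```

Filtering before the averaging step keeps a constant matcher from pulling all aggregated values toward its constant, which matters when one fixed strategy has to cover tests with and without comments or instances.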

Another change was the splitting of huge ontologies into several smaller ones. The results of the smaller match tasks were then merged. Another selection step was applied on the merged results to obtain the final result mapping.
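A rough sketch of this split, match, merge and select procedure is given below. It is written under several assumptions that the text does not confirm: both ontologies are cut into fixed-size blocks, a simple string similarity from Python's `difflib` stands in for the real matchers, and the final selection keeps the best-scoring target per source entity.

```python
from difflib import SequenceMatcher
from itertools import product
from typing import Dict, List, Tuple


def chunks(items: List[str], size: int) -> List[List[str]]:
    """Split a list of entity names into blocks of at most `size` elements."""
    return [items[i:i + size] for i in range(0, len(items), size)]


def match_block(sources: List[str], targets: List[str],
                threshold: float) -> List[Tuple[str, str, float]]:
    """Match one pair of blocks; the string ratio is a stand-in matcher."""
    found = []
    for s, t in product(sources, targets):
        sim = SequenceMatcher(None, s.lower(), t.lower()).ratio()
        if sim >= threshold:
            found.append((s, t, sim))
    return found


def match_in_parts(sources: List[str], targets: List[str],
                   size: int, threshold: float) -> Dict[str, str]:
    """Match block by block, merge the partial results, then select again."""
    merged: List[Tuple[str, str, float]] = []
    for s_block in chunks(sources, size):
        for t_block in chunks(targets, size):
            merged.extend(match_block(s_block, t_block, threshold))
    # Second selection (assumed rule): keep the best target per source entity.
    best: Dict[str, Tuple[str, float]] = {}
    for s, t, sim in merged:
        if s not in best or sim > best[s][1]:
            best[s] = (t, sim)
    return {s: t for s, (t, _) in best.items()}


# Hypothetical entity names, not taken from the contest ontologies.
result = match_in_parts(["heart", "lung", "liver"],
                        ["Heart", "Lungs", "Liver", "River"],
                        size=2, threshold=0.6)
```

In this toy run "liver" is matched with both "Liver" and "River" in separate blocks; only the second selection over the merged results removes the weaker candidate.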

To comply with the rules of the contest, the prototype did not use the synonym and abbreviation lists that can otherwise be supplied to the system. Creating such lists specifically for the contest was not allowed, although it would have been necessary because of the different domains.

Link to the system, parameters file and to the set of provided alignments

The .zip archives of all contest results, as well as the system together with a parameters file, can be downloaded from the following URL:

http://dbs.uni-leipzig.de/Research/coma_oaei.html

Results

The results discussed here were computed with five matchers: NameType, Comment, Parents, Children and Instance. The match results were combined by computing their average value, and correspondences were then selected, e.g. by using a threshold. The best setting was determined by running different configurations on the benchmark and choosing the one with the highest f-measure. The exact parameters can be found in the appendix.
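Choosing a configuration by its f-measure can be sketched as below. The sketch only varies a selection threshold and evaluates against a single reference alignment; the function names, the candidate values and the sample data are assumptions, and the real tuning covered more parameters and the whole benchmark.

```python
from typing import Dict, Iterable, Set, Tuple

Pair = Tuple[str, str]


def evaluate(found: Set[Pair], reference: Set[Pair]) -> Tuple[float, float, float]:
    """Return (precision, recall, f-measure) of a mapping against a reference.

    Undefined ratios (empty mapping or empty reference) are reported as 0.0.
    """
    correct = len(found & reference)
    precision = correct / len(found) if found else 0.0
    recall = correct / len(reference) if reference else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total > 0 else 0.0
    return precision, recall, f_measure


def best_threshold(sim: Dict[Pair, float], reference: Set[Pair],
                   candidates: Iterable[float]) -> float:
    """Pick the candidate threshold whose selected mapping has the highest f-measure."""
    def f_for(threshold: float) -> float:
        selected = {pair for pair, value in sim.items() if value >= threshold}
        return evaluate(selected, reference)[2]
    return max(candidates, key=f_for)


# Hypothetical aggregated similarities and reference alignment.
sim = {("a", "a"): 0.9, ("b", "b"): 0.6, ("a", "b"): 0.5}
reference = {("a", "a"), ("b", "b")}
chosen = best_threshold(sim, reference, candidates=[0.4, 0.55, 0.8])
```

A low threshold admits the wrong pair and costs precision, a high threshold drops a correct pair and costs recall; the f-measure balances the two, which is why it serves as the selection criterion here.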

Benchmark

The benchmark is a systematic test suite containing 50 tests, which can be used to identify the strengths and weaknesses of an algorithm.

The overall score of COMA++ for this task (except 102) is quite good:

          Precision   Recall   F-Measure   Time
Average   0.96        0.82     0.88        7.0 sec

Tests 101-104

The results for tests 101, 103 and 104 are perfect because the classes and properties have the same names, comments and instances. The language restriction and generalization have no influence. The alignment for the irrelevant ontology 102 contains a few false matches between entities with similar names, e.g. "year - yearValue". No matches are expected for this test, so precision and recall are automatically 0.0; we therefore left this test out of the average calculation.

Tests 201-247

The results of these tests differ depending on the information given, because the chosen strategy uses names, data types, comments, structure and instances. If one or more of these kinds of information are missing, only the remaining information can be used.

For tests 202, 209 and 210 the names and the comments differ, so this information cannot be used and the results have a lower recall.

For all other tests of this group the names, the comments or both contain useful information so the results are quite good.

The tests 221-247 even have the same names and comments, whereas the structure is different. Instances are similar but some ontologies don't contain them. The given information is enough to reach very good results.

Tests 248-266

In these tests the names have been substituted with random strings and there are no comments. The algorithm can thus only use the hierarchy and the instances, if given. Instances do not exist for every class and property, so this information helps to find only some of the corresponding entities. The results for these tests are therefore satisfactory.

Tests 301-304 (Real Ontologies)

The real-world ontologies were a more difficult task for COMA++ because they are quite different from the 101 ontology. Three of the four ontologies do not contain instances; only 304 does. Ontologies 302 and 303 do not use comments, the structure is quite different and the names are often dissimilar. The prototype could not find such correspondences because the contest rules did not allow us to use auxiliary information.

Anatomy

For the anatomy task two large ontologies had to be aligned. Because of their huge size, our system had to split the matching task into smaller ones. The partial results were merged and a selection was then applied to them. This selection was necessary because more false matches are found when the matching is split.

Another difficulty was that in the FMA ontology the ids of classes look like "frame_92794" and "frame_51746" and the real information is in the label, whereas the OpenGALEN ontology has meaningful ids and rarely uses labels. These labels or ids are made up of many tokens and sometimes differ only in a few letters, e.g. "fifth" instead of "first". Therefore we expect that more false positives will be found than in the benchmark test.
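The effect of long labels that differ in only a few letters can be illustrated with a small sketch. The two labels are hypothetical examples, not taken from FMA or OpenGALEN, and the character-based ratio from Python's `difflib` is only a stand-in for the name matcher of COMA++.

```python
from difflib import SequenceMatcher


def label_similarity(a: str, b: str) -> float:
    """Character-based similarity of two labels, ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


# Two distinct (hypothetical) anatomical concepts whose labels share
# every token except "fifth" versus "first".
score = label_similarity("Fifth lumbar vertebra", "First lumbar vertebra")
```

Because the shared tokens dominate the label, the score stays high even though the concepts differ, so a purely string-based matcher with a threshold is likely to accept such a pair as a correspondence.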