=Paper=
{{Paper
|id=Vol-2536/oaei19_paper2
|storemode=property
|title=ALIN Results for OAEI 2019
|pdfUrl=https://ceur-ws.org/Vol-2536/oaei19_paper2.pdf
|volume=Vol-2536
|authors=Jomar da Silva,Carla Delgado,Kate Revoredo,Fernanda Baião
|dblpUrl=https://dblp.org/rec/conf/semweb/SilvaDRB19
}}
==ALIN Results for OAEI 2019==
ALIN Results for OAEI 2019 Jomar da Silva1 , Carla Delgado1 , Kate Revoredo1 , and Fernanda Araujo Baião2 1 Graduate Program in Informatics, Federal University of Rio de Janeiro (UFRJ), Brazil 2 Department of Industrial Engineering, Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Brazil jomar.silva@uniriotec.br,carla@ppgi.ufrj.br,katerevoredo@ppgi.ufrj.br, fbaiao@puc-rio.br Abstract. 1 ALIN is an ontology matching system specialized in the interactive ontology matching, and its main characteristic is the use of expert feedback to improve the set of selected mappings, using semantic and structural techniques to make this improvement. This paper de- scribes its configuration for the OAEI 2019 competition and discusses its results. Keywords: ontology matching, Wordnet, interactive ontology match- ing, ontology alignment, interactive ontology alignment 1 Presentation of the System Due to the advances in information and communication technologies, a large amount of data repositories became available. Those repositories, however, are highly semantically heterogeneous, which hinders their integration. Ontology matching has been successfully applied to solve this problem, by discovering mappings between two distinct ontologies which, in turn, conceptually define the data stored in each repository. Among the various ontology matching ap- proaches that exist in the literature, interactive ontology matching includes the participation of domain experts to improve the quality of the final alignment [1]. ALIN is an interactive ontology matching system and has been participating in all OAEI editions since 2016, with improving results. 1.1 State, Purpose and General Statement ALIN is a system for interactive ontology matching that consists of two steps: one non-interactive and one interactive. In the first step, ALIN chooses the first mappings, among which some are directly placed in the alignment and others are presented to the expert. In the 2019 version, ALIN uses new techniques to improve the first step, thus placing more mappings directly in the alignment without having to present them to the expert. 1 Copyright c 2019 for this paper by its authors. Use permitted under Creative Com- mons License Attribution 4.0 International (CC BY 4.0). 1.2 Specific Techniques Used Alin handles three sets of mappings: (i) Accepted, which is a set of mappings definitely to be retained in the alignment; (ii) Selected, which is a set of mappings where each is yet to be decided if it will be included in the alignment; and (iii) Suspended, which is a set of mappings that have been previously selected, but (temporarily or permanently) filtered out of the alignment. Given the previous definitions, Alin procedure follows 5 Steps, described as follows: 1. Select mappings: select the first mappings and automatically accepts some of them. We explain the selection and acceptance process below; 2. Filter mappings: suspend some selected mappings, using lexical criteria for that; 3. Ask expert: accepts or rejects selected mappings, according to expert feed- back 4. Propagate: select new mappings, reject some selected mappings or unsuspend some suspended mappings (depending on newly accepted mappings) 5. Go back to 3 as long as there are undecided selected mappings All versions of ALIN (since its very first OAEI participation) follow this general procedure. In this 2019 version, however, we introduced modifications in Step 1. In previous versions, ALIN automatically accepted only the entities with the same name. In this version, ALIN also automatically accepts the entities whose names are synonyms or with variations in name words. ALIN searches synonyms in the Wordnet. In the Anatomy track, ALIN uses the FMA ontology too. ALIN applies the following techniques: – Line 1. ALIN selects mappings using linguistic similarities between entity names. ALIN uses synonyms and variations in entity name words to auto- matically accept mappings. At this time, ALIN automatically selects and accepts only concept mappings. To do that, ALIN uses linguistic metrics. ALIN uses the Wordnet and domain-specific ontologies (the FMA Ontology in the Anatomy track) to find synonyms between entities. – Line 2. ALIN suspends the selected mappings whose entities have low lexical similarity. We use the Jaccard, Jaro-Wrinkler, and n-gram lexical metrics to calculate the lexical similarity of the selected mappings. We based the process of choosing the similarity metrics used by ALIN on the result of these metrics in assessments [2]. It is important to know that these suspended mappings can be unsuspended later, by structural analysis, as proposed in [3]. – Line 3. At this point, the expert interaction begins. ALIN sorts the selected mappings in a descending order according to the sum of similarity metric values. The sorted selected mappings are submitted to the expert. – Line 4. Initially, the set of selected mappings contains only concept map- pings. At each interaction with the expert, if the expert accepts the mapping, ALIN (i) removes from the set of selected mappings all the mappings that compose the mapping anti-pattern [4][5] (we explain mapping anti-pattern below) with the accepted mappings; (ii) selects data property (like [6]) and object property mappings related to the accepted concept mappings; (iii) unsuspends all concept mappings whose both entities are subconcepts of the concept of an accepted mapping, following a similar technique proposed in our previous work [3]. – Line 5. The interaction phase continues until there are no selected mappings. An ontology may have construction constraints, such as a concept cannot be equivalent to its superconcept. An alignment may have other constraints like, for example, an entity of ontology O cannot be equivalent to two entities of the ontology O0 . A mapping anti-pattern is a combination of mappings that generates a problematic alignment, i.e., a logical inconsistency or a violated constraint. 1.3 Link to the System and Parameters File ALIN is available 2 as a package to be run through the SEALS client. 2 Results Interactive ontology matching is the focus of the ALIN system. Comparing its results in the 2019 campaign to its previous participations (Table 5), ALIN improvements include an expressive reduction on the number of interactions with the expert and the increase of the quality of the generated alignment. 2.1 Comments on the Participation of the ALIN in Non-Interactive Tracks ALIN used new techniques to automatically accept mappings. These techniques led to an increase in the F-Measure of non-interactively generated alignment, which shows the effectiveness of the techniques. (Table 1 and Table 2). Confer- ence track, unlike the Anatomy track, has relationship mappings and attribute mappings that ALIN does not automatically accept, thus making the F-Measure on the Conference track, although higher than last year, still low. Table 1. Participation of ALIN in Anatomy Non-Interactive Track - OAEI 2018[7]/2019[8] Year Precision Recall F-measure 2018 0.998 0.611 0.758 2019 0.974 0.698 0.813 2 https : //drive.google.com/f ile/d/1SxJL6f LRV qI84epm8DbAM lcscEoGbgZ/view?usp = sharing Table 2. Participation of ALIN in Conference Non-Interactive Track - OAEI 2018/2019[9] Year Precision Recall F-measure 2018 0.81 0.42 0.55 2019 0.82 0.43 0.56 2.2 Comments on the Participation of the ALIN in Interactive Tracks In the Anatomy track, ALIN was tied for second in quality (F-Measure) with slightly lower total requests (Table 3). In the Conference track, ALIN was tied for first in quality with a slightly higher total request (Table 4). Table 3. Participation of ALIN in Anatomy Interactive Track - Error Rate 0.0[10] Tool Precision Recall F-measure Total Requests ALIN 0.979 0.85 0.91 365 AML 0.968 0.948 0.958 236 LogMap 0.982 0.846 0.909 388 Table 4. Participation of ALIN in Conference Interactive Track - Error Rate 0.0[10] Tool Precision Recall F-measure Total Requests ALIN 0.914 0.695 0.79 228 AML 0.91 0.698 0.79 221 LogMap 0.886 0.61 0.723 82 Interactive Anatomy Track In this track, ALIN has had a decrease in the number of expert interactions and an increase in the quality of the generated alignment, showing that the new techniques used to automatically accept correct mappings are effective (Table 5). ALIN used the FMA ontology to help find synonyms between the two on- tologies of the Anatomy track. The Foundational Model of Anatomy Ontology (FMA) is a reference ontology for the domain of Human anatomy 3 . 3 “Foundational Model of Anatomy Ontology”. Available at http://sig.biostr.washington.edu/projects/fm/AboutFM.html Last accessed on Oct, 11, 2019. Interactive Conference Track In this track, ALIN has had a decrease in the number of expert interactions keeping a good quality of the generated alignment (Table 7). 2.3 Comparison of the Participation of ALIN in OAEI 2019 with his Participation in OAEI 2018 In this version, ALIN uses new techniques to automatically accept mappings. These techniques use synonyms and word variations to find equal entities be- tween the two ontologies. ALIN also started to use FMA ontology as an external resource. The use of the new techniques proved to be effective as it reduced the number of interactions while keeping a good level of quality. The new techniques also increased the quality of the alignment generated in Anatomy interactive tracking, where ALIN used the FMA ontology. It is not always possible to use an external resource to find synonyms between entities of two ontologies, but when this is possible, the results showed that it is worth it. The quality of the alignment generated by ALIN is dependent on the correct expert feedback, as expert responses are used to select new mappings. When ALIN selects wrong mappings, the quality of the generated alignment tends to decrease. But if we compare this year’s quality decline with last year’s, we see that this fall is less sharp (Table 6 and Table 8). The less sharp decline in quality is because we need less user interaction as we are automatically accepting more mappings. The organization of FMA ontology in memory and the search for synonyms and word variations led to longer run time (Table 9 and Table 10) Table 5. Participation of ALIN in Anatomy Interactive Track - OAEI 2016[11]/2017[12]/2018[7]/2019[10] - Error Rate 0.0 Year Precision Recall F-measure Total Requests 2016 0.993 0.749 0.854 803 2017 0.993 0.794 0.882 939 2018 0.994 0.826 0.902 602 2019 0.979 0.85 0.91 365 3 General Comments Evaluating the results, we can see that the system has improved, although it can improve even further, towards: – handling user error rate; Table 6. F-Measure of ALIN in Anatomy Interactive Track - OAEI /2018[7]/2019[10] - with Different Error Rates Year Error rate 0.0 Error rate 0.1 2018 0.902 0.854 2019 0.91 0.889 Table 7. Participation of ALIN in Conference Interactive Track - OAEI 2016[11]/2017[12]/2018[7]/2019[10] - Error Rate 0.0 Year Precision Recall F-measure Total Requests 2016 0.957 0.735 0.831 326 2017 0.957 0.731 0.829 329 2018 0.921 0.721 0.809 276 2019 0.914 0.695 0.79 228 Table 8. F-Measure of ALIN in Conference Interactive Track - OAEI /2018[7]/2019[10] - with Different Error Rates Year Error rate 0.0 Error rate 0.1 2018 0.809 0.705 2019 0.79 0.725 Table 9. Run Time (sec) in Anatomy Interactive Track - OAEI /2018[13]//2019[10] Tool 2018 2019 ALIN 317 2132 AML 48 82 LogMap 23 29 Table 10. Run Time (sec) in Conference interactive track - OAEI /2018[13]/2019[10] Tool 2018 2019 ALIN 106 397 AML 35 34 LogMap 37 37 – generating a higher quality initial alignment in its non-interactive phase; – reducing the number of interactions with the expert; And there was a worsening run time, where we could improve too. 3.1 Conclusions ALIN used new techniques to automatically accept new mappings. They have been effective in reducing the number of interactions, while also keeping good quality in the generated alignment. In the case of the Anatomy track, these new techniques both decreased the number of interactions and increased the quality of the generated alignment. We can explain this quality improvement in this track by the use of the FMA ontology as a new external resource. With the use of the new techniques in both Anatomy and Conference tracks, there has been a less sharp drop in quality as the expert makes mistakes. Nevertheless, ALIN had an increase in run time due to the use of the new techniques, which may be addressed in future work. References 1. Paulheim, H., Hertling, S., Ritze, D.: Towards Evaluating Interactive Ontology Matching Tools. Lecture Notes in Computer Science 7882 (2013) 31–45 2. Cheatham, M., Hitzler, P.: String similarity metrics for ontology alignment. In: Proceedings of the 12th International Semantic Web Conference - Part II. ISWC ’13, New York, NY, USA, Springer-Verlag New York, Inc. (2013) 294–309 3. Silva, J., Baião, F., Revoredo, K., Euzenat, J.: Semantic interactive ontology matching: Synergistic combination of techniques to improve the set of candidate correspondences. In: OM-2017: Proceedings of the Twelfth International Workshop on Ontology Matching. Volume 2032. (2017) 13–24 4. Guedes, A., Baião, F., Shivaprabhu, Revoredo, R.: On the Identification and Rep- resentation of Ontology Correspondence Antipatterns. In: Proc. 5th Int. Conf. Ontol. Semant. Web Patterns (WOP’14), CEUR Work. Proc. (2014) 5. Guedes, A., Baião, F., Revoredo, K.: Digging Ontology Correspondence Antipat- terns. In: Proceeding WOP’14 Proc. 5th Int. Conf. Ontol. Semant. Web Patterns. Volume 1032. (2014) 38—-48 6. Silva, J., Revoredo, K., Baião, F.A., Euzenat, J.: Interactive Ontology Matching: Using Expert Feedback to Select Attribute Mappings. (2018) 7. Silva, J., Baião, F., Revoredo, K.: Alin results for oaei 2018. In: Ontology Matching: OM-2018: Proceedings of the ISWC Workshop. OM’18 (2018) 117–124 8. : Results for oaei 2019 - anatomy track. http://oaei.ontologymatching.org/ 2019/results/anatomy/ Accessed: 2019-10-11. 9. : Results of evaluation for the conference track within oaei 2019. http://oaei. ontologymatching.org/2019/results/conference/index.html Accessed: 2019- 10-11. 10. : Results for oaei 2019 - interactive track. http://oaei.ontologymatching.org/ 2019/results/interactive/ Accessed: 2019-10-11. 11. Silva, J., Baião, F., Revoredo, K.: Alin results for oaei 2016. In: OM-2016: Pro- ceedings of the Eleventh International Workshop on Ontology Matching. OM’16 (2016) 130–137 12. Silva, J., Baião, F., Revoredo, K.: Alin results for oaei 2017. In: OM-2017: Pro- ceedings of the Twelfth International Workshop on Ontology Matching. OM’17 (2017) 114–121 13. : Results for oaei 2018 - interactive track. http://oaei.ontologymatching.org/ 2018/results/interactive/index.html Accessed: 2019-10-11.