<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>ALIN Results for OAEI 2020</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jomar da Silva</string-name>
          <email>jomar.silva@uniriotec.br</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Carla Delgado</string-name>
          <email>carla@ppgi.ufrj.br</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kate Revoredo</string-name>
          <email>kate.revoredo@wu.ac.at</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fernanda Araujo Bai~ao</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Industrial Engineering Ponti cal Catholic University of Rio de Janeiro (PUC-Rio)</institution>
          ,
          <country country="BR">Brazil</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Graduate Program in Informatics Federal University of Rio de Janeiro (UFRJ)</institution>
          ,
          <country country="BR">Brazil</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Vienna University of Economics and Business</institution>
          ,
          <addr-line>Vienna</addr-line>
          ,
          <country country="AT">Austria</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>4 Alin is a system for interactive ontology matching. The ALIN version participating in OAEI 2020 applies natural language processing techniques (NLP) to standardize the concept names of the ontologies that participate in the matching process. As Alin selects through semantic and lexical metrics many of the mappings that the domain expert evaluates, we hope that the standardization of the concept names will improve the selection of the mappings and thus the generated alignment. This article describes the participation of Alin at OAEI 2020 and discusses its results.</p>
      </abstract>
      <kwd-group>
        <kwd>ontology matching</kwd>
        <kwd>Wordnet</kwd>
        <kwd>interactive ontology matching</kwd>
        <kwd>ontology alignment</kwd>
        <kwd>interactive ontology alignment</kwd>
        <kwd>natural language processing</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        however, this strategy has achieved results that are superior to automatic
(noninteractive) strategies. Nevertheless, there is still room for improvements [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], as
evidenced by the most recent results from the evaluation of interactive tools in
the OAEI5 (Ontology Alignment Evaluation Initiative). Alin [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] is a system for
interactive ontology matching which has been participating in all OAEI editions
since 2016, with increasingly improved results.
1.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>State, Purpose and General statement</title>
      <p>Interactive ontology matching systems select mappings for domain expert
evaluates. Alin selects many of these mappings through semantic and lexical
metrics. As the concept names of the ontologies are not standardized, these metrics
may return lower values than would be the case if they were standardized. This
smaller metric may cause Alin not to select these mappings for evaluation by the
domain expert. In its 2020 version, Alin proposes Natural Language
Processing (NLP) techniques such as the development of regular grammars (in reality
its equivalent regular expressions) and context free grammars along with their
respective lexical analyzers (scanners) and syntax analyzers (parsers), for the
concept names of the ontologies to be matched. The use of these NLP resources
(scanners and parsers) makes it possible to translate di erent patterns used in
the two ontologies into a unique one. This standardization allows Alin to select
better mappings for the domain expert to evaluate.</p>
      <p>To do the standardization, Alin will have a new phase before the execution
of the program. In this phase, an NLP expert develops, manually, grammars to
the concept names of the ontologies and their respective scanners and parsers.
Alin uses these scanners and parsers during the execution of the program. This
new phase is possible in an interactive ontology matching system because:
1. We know before the program runs which ontologies it will match, as we need
to look for experts in the domain of ontologies to interact with the program;
2. The process of searching, meeting, and scheduling a day available for the
expert to participate in the process can take a long time, probably a few
days.</p>
      <p>We can use this time of a few days until the execution of the program to
develop the necessary grammars, scanners, and parsers for the ontologies. In this
version of Alin, the authors of this paper played the role of the NLP expert.
1.2</p>
    </sec>
    <sec id="sec-3">
      <title>Speci c techniques used</title>
      <p>During its matching process, Alin handles three sets of mappings: (i) Accepted,
which is a set of mappings de nitely to be retained in the alignment; (ii) Selected,
which is a set of mappings where each is yet to be decided if it will be included
in the alignment; and (iii) Suspended, which is a set of mappings that have
5 Available at http://oaei.ontologymatching.org/2020/results/interactive/index.html,
last accessed on Oct, 23, 2020.
been previously selected, but (temporarily or permanently) ltered out of the
alignment.</p>
      <p>Given the previous de nitions, Alin procedure follows 5 Steps, described as
follows:
1. Select mappings: select the rst mappings and automatically accepts some
of them. We explain the selection and acceptance process below;
2. Filter mappings: suspend some selected mappings, using lexical criteria for
that;
3. Ask domain expert: accepts or rejects selected mappings, according to
domain expert feedback
4. Propagate: select new mappings, reject some selected mappings or unsuspend
some suspended mappings (depending on newly accepted mappings)
5. Go back to 3 as long as there are undecided selected mappings</p>
      <p>
        All versions of Alin (since its very rst OAEI participation) follow this
general procedure. In this 2020 version, Alin includes a new step where an
NLP expert develops grammars, and their respective scanners, and parsers to
the concept names of the ontologies. Alin uses these scanners and parsers to
standardize the concept names of the ontologies and thus improve the generated
alignment. The new step can lead to, for example, correcting spelling errors and
unifying di erent spellings for the same concept name. More detailed examples
of possible standardization of concept names are presented in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Alin uses the
developed scanners and parsers in step 1 of the program.
      </p>
      <p>
        Alin applies the following techniques:
{ Step 1. Alin runs the scanners and the parsers for each concept name of the
ontologies, modifying it and standardizing it. Alin uses a blocking strategy
where it discards all data properties and object properties of the
ontologies. So, in this step, Alin selects only concept mappings, using linguistic
similarities between the concept names. Alin automatically accepts concept
mappings whose names are synonyms. Alin uses the Wordnet and
domainspeci c ontologies (the FMA Ontology in the Anatomy track) to nd
synonyms between entities.
{ Step 2. Alin suspends the selected mappings whose entities have low lexical
similarity. We use the Jaccard, Jaro-Wrinkler, and n-gram lexical metrics
to calculate the lexical similarity of the selected mappings. We based the
process of choosing the similarity metrics used by ALIN on the result of
these metrics in assessments [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. It is relevant to know that these suspended
mappings can be further unsuspended later, as proposed in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
{ Step 3. At this point, the domain expert interaction begins. Alin sorts the
selected mappings in a descending order according to the sum of similarity
metric values. The sorted selected mappings are submitted to the domain
expert.
{ Step 4. Initially, the set of selected mappings contains only concept
mappings. At each interaction with the domain expert, if s/he accepts the
mapping, Alin (i) removes from the set of selected mappings all the mappings
that compose an instantiation of a mapping anti-pattern [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ][
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] (we explain
mapping anti-patterns below) with the accepted mappings; (ii) selects data
property (like [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]) and object property mappings related to the accepted
concept mappings; (iii) unsuspends all concept mappings whose both
entities are subconcepts of the concept of an accepted mapping, following a
similar technique proposed in our previous work [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
{ Step 5. The interaction phase continues until there are no selected mappings.
      </p>
      <p>There are logical constraints which should apply to several ontologies. For
example, an ontology may have construction constraints, such as a concept
cannot be equivalent to its superconcept. An alignment may have other constraints
like, for example, an entity of ontology O cannot be equivalent to two
entities of the ontology O0. A mapping anti-pattern is a combination of mappings
that generates a problematic alignment, i.e., a logical inconsistency or a violated
constraint.
1.3</p>
    </sec>
    <sec id="sec-4">
      <title>Link to the system and parameters le</title>
      <p>To this version, Alin used the scanners and the parsers we developed for the
ontologies of the conference and anatomy tracks.</p>
      <p>Alin is available 6 as a package to be run through the SEALS client.
2</p>
      <p>Results
Interactive ontology matching is the focus of the Alin system. If you compare
the participation of Alin in 2020 and 2019 (Table 4), you will see an
improvement in the quality of the generated alignment, showing the e ectiveness of the
techniques used.
2.1</p>
    </sec>
    <sec id="sec-5">
      <title>Comments on the participation of ALIN in non-interactive tracks</title>
      <p>The use of NLP techniques led to an increase in the F-Measure of non-interactively
generated alignments in the Anatomy track but stability on the Conference track
(Table 1).
2.2</p>
    </sec>
    <sec id="sec-6">
      <title>Comments on the participation of ALIN in interactive tracks</title>
      <p>In the Anatomy track, Alin was better than LogMap in both quality (F-Measure)
and total requests, but worse in both aspects than AML (Table 2). In the
Conference track, Alin was rst in quality and third in total requests (Table 3).
6 https : ==drive:google:com=f ile=d=1ZM 3g0aOgU ha
9V ptU bqk9nmnkF Cl7L=view?usp = sharing
Interactive Anatomy Track In this track, Alin had a decrease in the number
of interactions with the domain expert and an increase in the quality of the
generated alignment, showing that the use of the NLP techniques are e ective
for this track (Table 4).</p>
      <p>Interactive Conference Track In this track, Alin had an increase in the
quality of the generated alignment but an increase in the number of domain
expert interactions (Table 5).
2.3</p>
    </sec>
    <sec id="sec-7">
      <title>Comparison of the participation of ALIN in OAEI 2020 with its participation in OAEI 2019</title>
      <p>The quality of the alignment generated by Alin depends on the correct feedback
from the domain expert, as Alin uses this feedback to select new mappings.
When Alin selects wrong mappings, the quality of the generated alignment
tends to decrease. If we compare this year's quality decline with last year's, we
see that this fall is more sharp (Table 6).</p>
      <p>The run time of Alin this year was shorter than last year (Table 7). In an
Intel I5 with 10Gb reserved to Alin, Alin has run 20% faster this year than last
year. The execution in OAEI had a reduction in the run time, but other systems
also had this reduction. So this di erence may be due both to modi cations
made in Alin and to changes in the computational environment.</p>
      <p>Year Precision Recall F-measure Total Requests</p>
      <p>Year Precision Recall F-measure Total Requests
2016
2017
2018
2019
2020
Evaluating the OAEI 2020 results, Alin has improved the quality of the
generated alignment in the interactive track. However, an increase in the user error
rate led to a slight worse alignment. Finally, the number of interactions with
the expert was relatively stable since last year, with a slight increase (from 228
to 233 requests) in the Conference track and a slight decrease (from 365 to 360
requests) in the Anatomy track.</p>
      <p>Another consideration is that this version of Alin generates the need for a
new expert involved in the process, to develop artifacts (scanner, parser) required
for scanning and parsing the name of the concepts. This NLP expert may not
always be available, but if he is, the results have shown that his work can improve
the quality of the generated alignment.
Alin 2020 used NLP techniques to improve the standardization of the concept
names of the ontologies to be matched. They have been e ective in increasing the
quality of the generated alignment while being relatively stable with regard to
the number of requests to the user. Alin had a decrease in run time but a more
sharp fall in the alignment quality when the domain expert makes mistakes. An
assumption that Alin now assumes with the inclusion of NLP techniques is the
need of a scanner and a parser for the ontologies involved in the matching.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Euzenat</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shvaiko</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <string-name>
            <surname>Ontology Matching - Second Edition</surname>
          </string-name>
          . Springer-Verlag (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dragisic</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Faria</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ivanova</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jimenez-Ruiz</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lambrix</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pesquita</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>User validation in ontology alignment: functional assessment and impact</article-title>
          .
          <source>The Knowledge Engineering Review</source>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Da</given-names>
            <surname>Silva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Revoredo</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          , Baia~o,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Euzenat</surname>
          </string-name>
          , J.:
          <article-title>Alin: improving interactive ontology matching by interactively revising mapping suggestions</article-title>
          .
          <source>The Knowledge Engineering Review</source>
          <volume>35</volume>
          (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Real</surname>
            ,
            <given-names>F.J.Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bella</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McNeill</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bundy</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Using domain lexicon and grammar for ontology matching</article-title>
          . (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Cheatham</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hitzler</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>String similarity metrics for ontology alignment</article-title>
          .
          <source>In: Proceedings of the 12th International Semantic Web Conference - Part II. ISWC '13</source>
          , New York, NY, USA, Springer-Verlag New York, Inc. (
          <year>2013</year>
          )
          <volume>294</volume>
          {
          <fpage>309</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Baia~o,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Revoredo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Euzenat</surname>
          </string-name>
          , J.:
          <article-title>Semantic interactive ontology matching: Synergistic combination of techniques to improve the set of candidate correspondences</article-title>
          . In: OM-2017
          <source>: Proceedings of the Twelfth International Workshop on Ontology Matching. Volume</source>
          <year>2032</year>
          . (
          <year>2017</year>
          )
          <volume>13</volume>
          {
          <fpage>24</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Guedes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , Baia~o,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Shivaprabhu</surname>
          </string-name>
          , Revoredo, R.:
          <article-title>On the Identi cation and Representation of Ontology Correspondence Antipatterns</article-title>
          .
          <source>In: Proc. 5th Int. Conf. Ontol. Semant. Web Patterns (WOP'14)</source>
          ,
          <source>CEUR Work. Proc. (</source>
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Guedes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , Baia~o,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Revoredo</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          :
          <article-title>Digging Ontology Correspondence Antipatterns</article-title>
          .
          <source>In: Proceeding WOP'14 Proc. 5th Int. Conf. Ontol. Semant. Web Patterns</source>
          . Volume
          <volume>1032</volume>
          . (
          <year>2014</year>
          )
          <volume>38</volume>
          |-
          <fpage>48</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Revoredo</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , Baia~o,
          <string-name>
            <given-names>F.A.</given-names>
            ,
            <surname>Euzenat</surname>
          </string-name>
          , J.:
          <article-title>Interactive Ontology Matching: Using Expert Feedback to Select Attribute Mappings</article-title>
          .
          <source>In: CEUR Workshop Proceedings</source>
          . Volume
          <volume>2288</volume>
          . (
          <year>2018</year>
          )
          <volume>25</volume>
          {
          <fpage>36</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Delgado</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Revoredo</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , Baia~o,
          <string-name>
            <surname>F.</surname>
          </string-name>
          :
          <article-title>Alin results for oaei 2019</article-title>
          .
          <source>In: Proceedings of the 14th International Workshop on Ontology Matching. OM'19</source>
          (
          <year>2019</year>
          )
          <volume>94</volume>
          {
          <fpage>100</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11. :
          <article-title>Results for oaei 2020 - anatomy track</article-title>
          . http://oaei.ontologymatching.org/ 2020/results/anatomy/ Accessed:
          <fpage>2020</fpage>
          -10-23.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12. :
          <article-title>Results of evaluation for the conference track within oaei 2020</article-title>
          . http://oaei. ontologymatching.org/2020/results/conference/index.html Accessed:
          <fpage>2020</fpage>
          - 10-23.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13. :
          <article-title>Results for oaei 2020 - interactive track</article-title>
          . http://oaei.ontologymatching.org/ 2020/results/interactive/index.html Accessed:
          <fpage>2020</fpage>
          -10-23.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Baia~o,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Revoredo</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          :
          <article-title>Alin results for oaei 2016</article-title>
          . In: OM-2016
          <source>: Proceedings of the Eleventh International Workshop on Ontology Matching. OM'16</source>
          (
          <year>2016</year>
          )
          <volume>130</volume>
          {
          <fpage>137</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Baia~o,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Revoredo</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          :
          <article-title>Alin results for oaei 2017</article-title>
          . In: OM-2017
          <source>: Proceedings of the Twelfth International Workshop on Ontology Matching. OM'17</source>
          (
          <year>2017</year>
          )
          <volume>114</volume>
          {
          <fpage>121</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Baia~o,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Revoredo</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          :
          <article-title>Alin results for oaei 2018</article-title>
          . In: Ontology Matching: OM-2018
          <source>: Proceedings of the ISWC Workshop</source>
          . OM'
          <volume>18</volume>
          (
          <year>2018</year>
          )
          <volume>117</volume>
          {
          <fpage>124</fpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>