<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards Building a Link Set Backed by Domain Experts using the Alignment Tool</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ondrˇej Zamazal</string-name>
          <email>ondrej.zamazal@vse.cz</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sotirios Karampatakis</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Charalampos Bratsas</string-name>
          <email>cbratsas@math.auth.grg</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Aristotle University of Thessaloniki, School of Mathematics</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Open Knowledge</institution>
          <country country="GR">Greece</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Economics, Prague, Dept. Information and Knowledge Engineering</institution>
        </aff>
      </contrib-group>
      <fpage>2</fpage>
      <lpage>3</lpage>
      <abstract>
        <p>Discovering semantic relations between entities (entity linking) is one of the most important activity for both semantic web and linked data areas. Either we need link sets of instances or concepts we can rely on automatic systems only to a certain extent. As a result, an automatic linking is accompanied with a user interaction which enables to increase the quality of resulted link sets. Often, in order to reach as much quality of link set as possible the user should be a domain expert for an area of linking task [1]. This user specifics should be considered by designers of interactive entity linking tools. This work presents an experience from an experiment of building a link set for two fiscal code lists where domain experts have been involved. The experiment has been done using the Alignment tool.45</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        While the Alignment tool is now a general linking tool, it has originally been developed
in order to facilitate linking heterogeneous fiscal code lists in the OpenBudgets project.
It is a web application for online, collaborative, system aided manual entity linking.
The tool can be used to manually create link sets between two knowledge graphs or to
validate already existing link sets. It further offers a number of utilities to aid the linking
such as a graph visualization as a tree, a search bar, an entity description and finally
suggestions based on linking algorithms provided by Silk [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] or by other automated
linking tools. Multiple users can work on the same linking project simultaneously, thus
enabling crowdsourcing of a link set creation and reducing required time and effort.
      </p>
      <p>The user can select a semantic meaning of the link by selecting from a number of
predefined link types (e.g skos:related,6 skos:broadMatch, owl:sameAs etc.) or provide
a custom one. The tool can also be used to crowdsource a link validation using a voting
system. You can upload links produced by an automated procedure or the tool itself
and create polls to check eligibility. Finally, link sets can be exported in various RDF
formats and CSV.
4 http://alignment.okfn.gr/
5 https://github.com/okgreece/Alignment/
6 Skos prefix refers to http://www.w3.org/2004/02/skos/core# namespace.</p>
    </sec>
    <sec id="sec-2">
      <title>Building a Link Set by Involving Domain Experts</title>
      <p>European union countries often apply their own different categorization systems for
funded projects. As a consequence, this hinders straightforward fiscal analyses. Since
there is already integrated European categorization system for funded projects, one of
possible solutions to enhance fiscal analyses is to interlink categorization systems of
individual EU countries to the European one. For improving this situation we started
with building the one link set, the Czech code list (44 items) to the European one (142
items).7 In order to ensure the quality of the link set we involved two domain experts
and we used the Alignment tool. Thus this work enabled us testing the Alignment tool
in action and examining the task of interlinking code lists with domain experts.</p>
      <p>Our two domain experts worked separately. They followed detail guidelines8 where
they were informed about the goal of correctly interlinking as many source items to
target items as possible. The guidelines also includes a brief manual how to use the
Alignment tool and the instruction that experts should prefer certain types of links more,
i.e. there was the following preference skos:exactMatch, then skos:narrowMatch and
skos:broadMatch and then the others.</p>
      <p>Both experts interlinked 32 same items where expert 1 linked 84% (37) items from
the source code list and expert 2 linked 82% (36) items from the source code list. While
the expert 1 employed all skos link types (out of all 53 links) more or less uniformly
(21 skos:narrowMatch, 11 skos:closeMatch, 9 skos:exactMatch, 8 skos:relatedMatch,
4 skos:broadMatch), the expert 2 created mainly skos:narrowMatch links (116),
additionally 8 skos:exactMatch and 1 skos:broadMatch, out of all 125 links. Both experts
managed 32 times to linked the same two entities in one link and, more importantly, they
managed to create the very same link 23 times where there were 7 skos:exactMatch, 1
skos:broadMatch and 15 skos:narrowMatch.9 The resulted link set of 23 links
represents the nucleus of the reference link set. Since there are many links created by only
one expert (57% in the case of expert 1 and 82% in the case of expert 2) we further plan
to let experts discuss those not agreed links to extend the current reference link set.</p>
      <p>During the interlinking by experts we continually received a feedback in terms of
bugs and improvement suggestions for the Alignment tool as also reflected via GitHub.
Acknowledgments
We thank to experts. The work has been supported by the H2020 project no. 645833.
7 Both code lists are extracted from existing data sets.
8 The English translation is available at https://goo.gl/vRYc5r
9 The further information is available at http://owl.vse.cz:8080/OM2017/</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Dragisic</surname>
            <given-names>Z</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ivanova</surname>
            <given-names>V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lambrix</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Faria</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jimnez-Ruiz</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pesquita</surname>
            <given-names>C.</given-names>
          </string-name>
          <article-title>User validation in ontology alignment</article-title>
          .
          <source>In: International Semantic Web Conference 2016</source>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Volz</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bizer</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gaedke</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kobilarov</surname>
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Silk-A Link Discovery</surname>
          </string-name>
          <article-title>Framework for the Web of Data</article-title>
          .
          <source>LDOW</source>
          .
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>