<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>KG-Agnostic Entity Linking Orchestration</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Noullet K.</string-name>
          <email>kristian.noullet@kit.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Karlsruhe Institute of Technology</institution>
          ,
          <addr-line>Kaiserstr. 89, 76133 Karlsruhe</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The domain of Entity Linking (EL) has been researched thoroughly, resulting in numerous approaches and a level of maturity in the field. In this dissertation, we aim at (1) creating an orchestrated, continuously-improving system through addition of approaches both existing and ones that are yet to come. On the same note, our framework shall (2) allow for Knowledge Graph (KG)- and potentially languageagnostic EL processes through cross-KG techniques, enabling further exploration of KG-dependence in the domain of EL. Finally, we will boost the ease of (3) reproducibility and (4) comparison between EL systems, their underlying techniques, as well as orchestration frameworks as a whole.</p>
      </abstract>
      <kwd-group>
        <kwd>NLP</kwd>
        <kwd>Entity Linking</kwd>
        <kwd>Orchestration</kwd>
        <kwd>KG-Agnosticity</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>In the world of EL, there exist a plethora of approaches attempting to solve
the issue of detecting and correctly disambiguating mentions from plain text
documents. Many of these approaches are - at least primarily - bound to specific
KGs, making system comparison problematic and portability of results to other
knowledge bases questionable. Additionally, - with a few exceptions - utilizing
only specific parts of a given system’s pipeline to further be processed by other
systems is relatively hard, consequently increasing a researcher’s load even when
intending to only research specific steps. Thus, proper system comparability and
step1-related result explanation become potentially tedious tasks, for instance
when attempting to solely improve Entity Disambiguation (ED), whilst
intending to keep existing Mention Detection (MD) and Candidate Generation (CG)
techniques constant.</p>
      <p>
        Inter-System vs. Intra-System We broadly di↵erentiate between two types of
systems utilised for our orchestration: inter- and intra-system. The former refers
to orchestration across multiple full-fledged EL systems, applying End-to-End
EL systems’ results and combining them in specific ways. Among these are
approaches, such as Babelfy [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], DBpedia Spotlight [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], DoSeR [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ] and MAG [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]
- to name a few. These frameworks loosely follow the steps of MD, CG and ED,
meaning they may be split into modules for use in intra-system orchestration.
In contrast to the inter-system approach, intra-system orchestration refers to
the application of sub-system components interacting in order to improve
intermediary results, potentially yielding an overall qualitatively further developed
final output. The advantage of accessing specific processing steps prior to final
result output lies in allowing for further dynamic access and potentially utilizing
highly-developed techniques without having to re-compute nor re-develop
certain approaches. As intermediary steps are improved upon, result performance
shall ultimately increase as well. Among others, with the rise of neural EL
approaches, not every EL system may be regarded as a succession of steps of MD,
CG and ED. For instance, Broscheit [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and Kolitsas et al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] have developed
joint-EL techniques, making splitting these into encapsulated stand-alone
subsystems dicult.
      </p>
      <p>As such, we define the following problems our dissertation intends to alleviate
and potentially resolve:
P1 Multitude of ’Closed’ Heterogeneous Systems.</p>
      <p>P2 Lack of Cross-KG and Robust KG-Agnostic EL Orchestration.
P3 EL System Evaluation (Comparability, Result Reproducibility and
Explainability).
1.1</p>
    </sec>
    <sec id="sec-2">
      <title>Multitude of ’Closed’ Heterogeneous Systems.</title>
      <p>
        While there exist a multitude of systems to choose from - each with their own
set of peculiarities -, to the best of our knowledge, a centralised way of utilising
existing approaches’ underlying steps without considerable development e↵ort
has yet to be released. In the case of open source projects, development and
adjustments may yield wanted results, but the accompanying e↵ort and required
underlying data structures render the task tedious, if not infeasible. Further,
taking into account licensing as well as privacy concerns - prohibiting
distribution - regarding potentially-utilised knowledge sources, makes it apparent where
limits of system, as well as result reproducibility may be reached. Therefore, our
goal is to create an orchestration framework, allowing for ease of research and
development in the domain of EL without having to worry about every step.
Alike GERBIL [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], we intend to centralise and simplify the use of existing
approaches, while our goal is majorly contrasted by the aim of using concurrent
systems to a higher level of granularity, not for evaluation, but for combined
inter- as well as intra-system orchestration. Consequently, as a major byproduct
of this PhD. proposal, we will provide interfaces for ease of implementation, a
REST API-based endpoint to execute existing (and register novel) approaches,
additionally to supplying a training &amp; testing platform in order to boost existing
EL research, as well as to apply various techniques for a holistic orchestration
approach.
      </p>
      <p>
        Lack of Cross-KG and Robust KG-Agnostic EL Orchestration.
Although there exist KG-Agnostic EL frameworks, to the best of our knowledge
existing orchestration approaches do not approach the domain of working with
multiple KGs simultaneously (Cross-KG) nor work for potentially ”any” wanted
knowledge base (KG-Agnostic). The lack of system portability between KGs can
be problematic for evaluation, maintenance as also ensuring the use of a system
in the long run - either due to lack of community interest for a given KG or
changes in regards to licensing. In the past, EL approaches have strongly relied on
Wikipedia/DBpedia [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], but more recently, research has shifted slightly in favor
of developing systems for Wikidata [
        <xref ref-type="bibr" rid="ref17 ref21 ref6 ref8">8, 21, 6, 17</xref>
        ], among others. Initially, we will
be working with the baseline of translating systems’ annotations to a common
basis through owl:sameAs predicate links, allowing for orchestration through
supervised learning approaches with help of gold standards. Further down the
line, our intention is to have similar entity detection across KGs through entity
alignment techniques [
        <xref ref-type="bibr" rid="ref23 ref25">25, 23</xref>
        ].
1.3
      </p>
    </sec>
    <sec id="sec-3">
      <title>EL System Evaluation.</title>
      <p>
        GERBIL [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] has greatly contributed to facilitating evaluation of EL systems,
as well as comparing their result metrics to one another. Unfortunately, it does
not allow for checking results directly - unless an endpoint is set up as a relay in
a man-in-the-middle type of approach -, but it serves well its purpose of
properly allowing overall performance comparison between systems. Spiritually, we
follow a similar route for system evaluation, but rather go on a more
granular level of the common EL pipeline for the purpose of combining a multitude
of (sub-)approaches and learning optimal orchestration settings and potential
configurations for specific domains. Unfortunately, to the best of our knowledge,
existing approaches do not allow for the ease of result reproducibility to
take place for MD, CG and ED separately. For the purpose of boosting ease of
research focusing solely on specific parts of existing pipelines, we wish to
remediate this lack with our system by defining common protocols for each of these
steps. Additionally, we shall do so for their combinations as well, in order to
further allow highly configurable variations in future systems to arise. Allowing for
common entity linking pipelines’ modularization further allows for easier result
explainability by exploring output variations due to sub-system mechanism
swaps. Further, thanks to our approach aiming at being KG-Agnostic, we shall
facilitate domain-dependent explainability greatly.
2
      </p>
      <sec id="sec-3-1">
        <title>Research Questions &amp; Hypotheses</title>
        <p>RQ1 How can we leverage multiple KG systems’ advantages for various settings?
RQ2 When does which system(s) yield optimal results?
RQ3 To what extent do domains impact EL results?
RQ4 How to utilise KG-bound EL systems for agnostic tasks?</p>
        <p>RQ5 How can we achieve and evaluate the level of KG-Agnosticity of various
systems?
Consequent to our research questions, we make the following hypotheses:
H1 Di↵erent linking systems work to di↵erent extents based on target and
underlying KGs.</p>
        <p>H2 Domains of plain texts a↵ect the quality of results strongly (e.g. political vs.</p>
        <p>geographical, see Obama2 and Obama3).</p>
        <p>H3 Constraining a given system to a specific KG potentially lowers quality of
results.</p>
        <p>H4 EL research is dampened by a development overhead, rendering system
comparability and reproducibility dicult, especially when varying between
underlying knowledge bases.
3</p>
      </sec>
      <sec id="sec-3-2">
        <title>Related Work</title>
        <p>
          As can be seen in GERBIL[
          <xref ref-type="bibr" rid="ref19 ref2">19, 2</xref>
          ] and [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ], EL is a mature area of research,
including a multitude of approaches. These are based on various techniques.
We broadly di↵erentiate between graph-based and machine learning-based EL
frameworks. On one hand, you have graph-based approaches, such as Babelfy [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]
and MAG [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. These make use of graph algorithms to find contextually
similar entities with the assumption that (candidate) entities which are located in
close proximity to each other within a KG, increases the likelihood of appearing
together within texts. As such, Babelfy applies a densest subgraph algorithm,
whereas MAG employs a breadth-first search style of search for entities within a
defined KG. Other types of EL approaches include machine learning-based and
neural techniques. Among their ranks may be listed [
          <xref ref-type="bibr" rid="ref10 ref13 ref24 ref26 ref3 ref5">3, 5, 10, 13, 24, 26</xref>
          ]. In [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ],
Broscheit utilizes some of the explosively-popular BERT [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] models to achieve
neural end-to-end EL, ideologically similarly to Kolitsas et al.’s [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] joint neural
EL approach. Main contributions relating to our area of EL system orchestration
were brought by [
          <xref ref-type="bibr" rid="ref12 ref20 ref4 ref7">4, 7, 20, 12</xref>
          ]. In [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ], Jo˜ao et al. present an end-to-end supervised
learning approach to orchestration on 3 systems (Babelfy [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], TagMe [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] and
Ambiverse [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]), analyzing performance for di↵erent settings, such as binary and
multi-label classification. In contrast, Canale et al’s. [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] developed system applies
a corpus-learnt voting scheme to annotators, while Corcoglioniti et al. employ
a more constant majority-based voting approach with MicroNeel [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. The
former learn weights for their voting application, whereas the latter predominantly
focuses on short documents, resolving potential conflicts with predetermined
priorities. Finally, we consider GERBIL [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ] to be a spiritual predecessor to a
certain extent due to (1) its goal of greatly simplifying processes relating to EL;
(2) providing interfaces and code templates for ease of implementation; (3)
aggregating a multitude of approaches; and (4) unifying approaches (for evaluation),
therewith improving system comparison.
        </p>
        <sec id="sec-3-2-1">
          <title>2 https://en.wikipedia.org/wiki/Obama, Fukui 3 https://en.wikipedia.org/wiki/Barack Obama</title>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>Approach</title>
        <p>In order to achieve a sensible and highly extensible orchestration system, we have
started by implementing our own KG-Agnostic embeddings-based EL approach,
codenamed Agnos. Through its use of multiple KGs and various mechanisms for
MD, CG and ED, we identified the degree of generalization, data structures and
processes must provide in order to allow for maximal inter-compatibility with
existing systems, whilst simultaneously minimizing the loss of linker-specific
information. Both inter-system and intra-system compatibility is warranted and an
initial REST API is provided. The code base for our EL approach is accessible at
https://git.scc.kit.edu/wf7467/agnos mini, additionally to our project
including initial orchestration capabilities at https://github.com/kmdn/Agnos mini.
Each step from mention detection to pruning is handled by individual interfaces,
allowing for definition of REST API-based connectivity for modularity of each
desired step.</p>
        <p>Once we have a relatively mature system set up, we will initially apply
supervised learning techniques for our cross-KG approach. These will be supported
by owl:sameAs predicates, allowing for a transparent translation layer of
interconnections between KGs. Further down the line, we will additionally shift to
exploiting KG alignment techniques to further facilitate the use of a plethora
of KGs - both commonly available, as well as custom and potentially private
ones. Due to our learning approach, we will further subdivide our experiments
into various domains in order to properly assess their relevance and influence on
result quality.
5</p>
      </sec>
      <sec id="sec-3-4">
        <title>Evaluation</title>
        <p>
          We shall assess our work from an EL point of view by using GERBIL along with
its performance metrics for evaluation - if data sets fulfill required parameters,
such as that of size and domain relevance for our experiments. Unfortunately,
GERBIL requires a reference knowledge base for data sets which may increase
diculty for a KG-Agnostic evaluation. Therefore, we will either develop other
data sets or use existing ones (e.g. [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]) for cross-KG evaluation, depending on
whether appropriate ones may be located. Further, we will proceed with testing
our orchestration system while taking into consideration its own orchestration
with other end-to-end frameworks to explore whether major improvements may
be achieved on an even more meta-level. Whilst our primary objective is to
further boost EL result quality, it is important to not only evaluate our system as
an EL framework, but also as a platform. Therefore, we will conduct user studies
to evaluate useability as well as potential time gain achieved through our system,
along the lines of https://github.com/dice-group/gerbil/blob/master/documentation/survey.csv.
Finally, we will attempt to reach more detailed conclusions in terms of di↵erent
frameworks’ domain dependence as well as analyse areas of both opportunity and
potential system limitations due to our enabled enhanced system granularity.
        </p>
      </sec>
      <sec id="sec-3-5">
        <title>Future Work</title>
        <p>In terms of further research to come, we will develop our framework for
purposes of maximising intercompatibility of EL systems through usage of the NLP
Interchange Format (NIF4). Additionally to applying supervised classification
techniques, we will attempt to additionally remodel the problem in order to
tackle it from di↵erent angles, perchance enabling the use of large corpora
without the necessity of tedious creation of silver and gold standards. We will further
refine our framework, publishing the finalised interfaces and code templates, once
we consider them mature enough. Alongside framework developments, we will
set up an openly-available REST API, along with a variety of configurations to
choose from for execution of specific EL systems, as well as orchestration
baselines.</p>
        <p>For evaluation purposes, we would like to increase the range of data sets applied
to our task. We will be targetting various gold standards of sucient size (e.g.
CoNLL2011), but would like to contribute to the community by creating our
own, as well as separating these into various domains to enable the impact of
domain to the field of research.</p>
        <p>On a similar note, we will be training our system - among others - on specific
domains and evaluating them on the source domain, as well as various target
domains in an attempt to evaluate domain dependency.</p>
        <p>Finally, as our project progresses, we will switch from relatively simple owl:sameAs
link usage to additionally making use of alignment techniques, increasing
system agnosticity and likely yielding advantages from a multitude of approaches,
aggregated into a single framework.</p>
        <sec id="sec-3-5-1">
          <title>4 http://aksw.org/Projects/NIF.html</title>
        </sec>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>AmbiverseNLU: A Natural Language</surname>
          </string-name>
          <article-title>Understanding suite by Max Planck Institute for Informatics</article-title>
          . https://www.mpi-inf.mpg.de/ambiverse-nlu/, accessed:
          <fpage>2020</fpage>
          -05- 30
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>GERBIL</given-names>
            <surname>Experiment</surname>
          </string-name>
          <article-title>Configuration</article-title>
          . http://gerbil.aksw.org/gerbil/config, accessed:
          <fpage>2020</fpage>
          -05-30
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Broscheit</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Investigating entity knowledge in BERT with simple neural end-to-end entity linking</article-title>
          . CoRR abs/
          <year>2003</year>
          .05473 (
          <year>2020</year>
          ), https://arxiv.org/abs/
          <year>2003</year>
          .05473
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Canale</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lisena</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Troncy</surname>
            ,
            <given-names>R.:</given-names>
          </string-name>
          <article-title>A novel ensemble method for named entity recognition and disambiguation based on neural network</article-title>
          .
          <source>In: International Semantic Web Conference</source>
          . pp.
          <fpage>91</fpage>
          -
          <lpage>107</lpage>
          . Springer (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Cao</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hou</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>Neural collective entity linking</article-title>
          . In: Bender,
          <string-name>
            <given-names>E.M.</given-names>
            ,
            <surname>Derczynski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Isabelle</surname>
          </string-name>
          , P. (eds.)
          <source>Proceedings of the 27th International Conference on Computational Linguistics</source>
          ,
          <string-name>
            <surname>COLING</surname>
          </string-name>
          <year>2018</year>
          ,
          <string-name>
            <given-names>Santa</given-names>
            <surname>Fe</surname>
          </string-name>
          , New Mexico, USA,
          <year>August</year>
          20-
          <issue>26</issue>
          ,
          <year>2018</year>
          . pp.
          <fpage>675</fpage>
          -
          <lpage>686</lpage>
          . Association for Computational Linguistics (
          <year>2018</year>
          ), https://www.aclweb.org/anthology/C18-1057/
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Cetoli</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bragaglia</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>O</given-names>
            <surname>'Harney</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.D.</given-names>
            ,
            <surname>Sloan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Akbari</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.:</surname>
          </string-name>
          <article-title>A neural approach to entity linking on wikidata</article-title>
          . In: Azzopardi,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Stein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Fuhr</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            ,
            <surname>Mayr</surname>
          </string-name>
          ,
          <string-name>
            <surname>P.</surname>
          </string-name>
          , Hau↵,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Hiemstra</surname>
          </string-name>
          ,
          <string-name>
            <surname>D</surname>
          </string-name>
          . (eds.)
          <source>Advances in Information Retrieval - 41st European Conference on IR Research</source>
          , ECIR
          <year>2019</year>
          , Cologne, Germany, April 14-
          <issue>18</issue>
          ,
          <year>2019</year>
          , Proceedings,
          <source>Part II. Lecture Notes in Computer Science</source>
          , vol.
          <volume>11438</volume>
          , pp.
          <fpage>78</fpage>
          -
          <lpage>86</lpage>
          . Springer (
          <year>2019</year>
          ). https://doi.org/10.1007/978-3-
          <fpage>030</fpage>
          -15719-7 10, https://doi.org/10.1007/978-3-
          <fpage>030</fpage>
          -15719-7 10
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Corcoglioniti</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aprosio</surname>
            ,
            <given-names>A.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nechaev</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Giuliano</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>MicroNeel: Combining NLP Tools to Perform Named Entity Detection and Linking on Microposts</article-title>
          . In: Basile,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Corazza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Cutugno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Montemagni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Nissim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Patti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Semeraro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Sprugnoli</surname>
          </string-name>
          ,
          <string-name>
            <surname>R</surname>
          </string-name>
          . (eds.)
          <source>Proceedings of Third Italian Conference on Computational Linguistics</source>
          (CLiC-it
          <year>2016</year>
          ) &amp;
          <article-title>Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian</article-title>
          .
          <source>Final Workshop (EVALITA</source>
          <year>2016</year>
          ), Napoli, Italy, December 5-
          <issue>7</issue>
          ,
          <year>2016</year>
          .
          <source>CEUR Workshop Proceedings</source>
          , vol.
          <volume>1749</volume>
          .
          <string-name>
            <surname>CEUR-WS.org</surname>
          </string-name>
          (
          <year>2016</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>1749</volume>
          /paper 010.pdf
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Delpeuch</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Opentapioca: Lightweight entity linking for wikidata</article-title>
          . CoRR abs/
          <year>1904</year>
          .09131 (
          <year>2019</year>
          ), http://arxiv.org/abs/
          <year>1904</year>
          .09131
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Devlin</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chang</surname>
            ,
            <given-names>M.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Toutanova</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Bert: Pre-training of deep bidirectional transformers for language understanding</article-title>
          . arXiv preprint arXiv:
          <year>1810</year>
          .
          <volume>04805</volume>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Fang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cao</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Joint entity linking with deep reinforcement learning</article-title>
          .
          <source>CoRR abs/1902</source>
          .00330 (
          <year>2019</year>
          ), http://arxiv.org/abs/
          <year>1902</year>
          .00330
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Ferragina</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scaiella</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          : TAGME:
          <article-title>on-the-fly annotation of short text fragments (by wikipedia entities)</article-title>
          . In: Huang,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Koudas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            ,
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.J.F.</given-names>
            ,
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            ,
            <surname>Collins-Thompson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>An</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . (eds.)
          <source>Proceedings of the 19th ACM Conference on Information and Knowledge Management</source>
          ,
          <string-name>
            <surname>CIKM</surname>
          </string-name>
          <year>2010</year>
          , Toronto, Ontario, Canada,
          <source>October 26-30</source>
          ,
          <year>2010</year>
          . pp.
          <fpage>1625</fpage>
          -
          <lpage>1628</lpage>
          . ACM (
          <year>2010</year>
          ). https://doi.org/10.1145/1871437.1871689
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12. Jo˜ao, R.S.,
          <string-name>
            <surname>Fafalios</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dietze</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Better together: an ensemble learner for combining the results of ready-made entity linking systems</article-title>
          . In: Hung,
          <string-name>
            <surname>C.</surname>
          </string-name>
          , Cerny´,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Shin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Bechini</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . (eds.)
          <source>SAC '20: The 35th ACM/SIGAPP Symposium on Applied Computing</source>
          , online event, [Brno, Czech Republic],
          <source>March 30 - April 3</source>
          ,
          <year>2020</year>
          . pp.
          <fpage>851</fpage>
          -
          <lpage>858</lpage>
          . ACM (
          <year>2020</year>
          ). https://doi.org/10.1145/3341105.3373883
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Kolitsas</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ganea</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hofmann</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>End-to-end neural entity linking</article-title>
          . In: Korhonen,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Titov</surname>
          </string-name>
          , I. (eds.)
          <source>Proceedings of the 22nd Conference on Computational Natural Language Learning</source>
          ,
          <source>CoNLL 2018</source>
          , Brussels, Belgium,
          <source>October 31 - November 1</source>
          ,
          <year>2018</year>
          . pp.
          <fpage>519</fpage>
          -
          <lpage>529</lpage>
          . Association for Computational Linguistics (
          <year>2018</year>
          ). https://doi.org/10.18653/v1/k18-
          <fpage>1050</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Mendes</surname>
            ,
            <given-names>P.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jakob</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <article-title>Garc´ıa-</article-title>
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Dbpedia spotlight: shedding light on the web of documents</article-title>
          . In: Ghidini,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Ngomo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.N.</given-names>
            ,
            <surname>Lindstaedt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.N.</given-names>
            ,
            <surname>Pellegrini</surname>
          </string-name>
          , T. (eds.)
          <source>Proceedings the 7th International Conference on Semantic Systems, I-SEMANTICS</source>
          <year>2011</year>
          , Graz, Austria, September 7-
          <issue>9</issue>
          ,
          <year>2011</year>
          . pp.
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          . ACM International Conference Proceeding Series, ACM (
          <year>2011</year>
          ). https://doi.org/10.1145/2063518.2063519
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Moro</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raganato</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Navigli</surname>
          </string-name>
          , R.:
          <article-title>Entity linking meets word sense disambiguation: a unified approach</article-title>
          .
          <source>Transactions of the Association for Computational Linguistics</source>
          <volume>2</volume>
          ,
          <fpage>231</fpage>
          -
          <lpage>244</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Moussallem</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Usbeck</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , Ro¨der,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Ngomo</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.N.:</surname>
          </string-name>
          <article-title>MAG: A multilingual, knowledge-base agnostic and deterministic entity linking approach</article-title>
          . In: Corcho,
          <string-name>
            <given-names>O´.</given-names>
            ,
            <surname>Janowicz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Rizzo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Tiddi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Garijo</surname>
          </string-name>
          ,
          <string-name>
            <surname>D</surname>
          </string-name>
          . (eds.)
          <source>Proceedings of the Knowledge Capture Conference, K-CAP</source>
          <year>2017</year>
          , Austin, TX, USA, December 4-
          <issue>6</issue>
          ,
          <year>2017</year>
          . pp.
          <volume>9</volume>
          :
          <fpage>1</fpage>
          -
          <issue>9</issue>
          :
          <fpage>8</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2017</year>
          ). https://doi.org/10.1145/3148011.3148024
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Mulang</surname>
            ,
            <given-names>I.O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vyas</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shekarpour</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sakor</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vidal</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Auer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
          </string-name>
          , J.:
          <article-title>Context-aware entity linking with attentive neural networks on wikidata knowledge graph</article-title>
          . CoRR abs/
          <year>1912</year>
          .06214 (
          <year>2019</year>
          ), http://arxiv.org/abs/
          <year>1912</year>
          .06214
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Noullet</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mix</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , Fa¨rber, M.: KORE 50dywc:
          <article-title>An evaluation data set for entity linking based on dbpedia, yago, wikidata, and crunchbase</article-title>
          . In: Calzolari,
          <string-name>
            <surname>N.</surname>
          </string-name>
          , B´echet,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Blache</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Choukri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Cieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Declerck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Goggi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Isahara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Maegaard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Mariani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Mazo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Moreno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Odijk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Piperidis</surname>
          </string-name>
          , S. (eds.)
          <source>Proceedings of The 12th Language Resources and Evaluation Conference</source>
          ,
          <string-name>
            <surname>LREC</surname>
          </string-name>
          <year>2020</year>
          , Marseille, France, May
          <volume>11</volume>
          -16,
          <year>2020</year>
          . pp.
          <fpage>2389</fpage>
          -
          <lpage>2395</lpage>
          . European Language Resources Association (
          <year>2020</year>
          ), https://www.aclweb.org/anthology/2020.lrec-
          <volume>1</volume>
          .291/
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19. R¨oder,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Usbeck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Ngomo</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.N.</surname>
          </string-name>
          :
          <article-title>GERBIL - benchmarking named entity recognition and linking consistently</article-title>
          .
          <source>Semantic Web</source>
          <volume>9</volume>
          (
          <issue>5</issue>
          ),
          <fpage>605</fpage>
          -
          <lpage>625</lpage>
          (
          <year>2018</year>
          ). https://doi.org/10.3233/SW-170286
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Ruiz</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Poibeau</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Combining open source annotators for entity linking through weighted voting</article-title>
          .
          <source>In: Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics</source>
          . pp.
          <fpage>211</fpage>
          -
          <lpage>215</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Sakor</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Patel</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vidal</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>FALCON 2.0: An entity and relation linking tool over wikidata</article-title>
          . CoRR abs/
          <year>1912</year>
          .11270 (
          <year>2019</year>
          ), http://arxiv.org/abs/
          <year>1912</year>
          .11270
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>J</given-names>
            ., Han, J
          </string-name>
          .:
          <article-title>Entity linking with a knowledge base: Issues, techniques, and solutions</article-title>
          .
          <source>IEEE Trans. Knowl. Data Eng</source>
          .
          <volume>27</volume>
          (
          <issue>2</issue>
          ),
          <fpage>443</fpage>
          -
          <lpage>460</lpage>
          (
          <year>2015</year>
          ). https://doi.org/10.1109/TKDE.
          <year>2014</year>
          .2327028
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          , Zhang,
          <string-name>
            <given-names>Q.</given-names>
            ,
            <surname>Qu</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y.</surname>
          </string-name>
          :
          <article-title>Bootstrapping entity alignment with knowledge graph embedding</article-title>
          .
          <source>In: IJCAI</source>
          . pp.
          <fpage>4396</fpage>
          -
          <lpage>4402</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Xue</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cai</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Su</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Song</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ge</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Neural collective entity linking based on recurrent random walk network learning</article-title>
          . In: Kraus,
          <string-name>
            <surname>S</surname>
          </string-name>
          . (ed.)
          <source>Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI</source>
          <year>2019</year>
          , Macao, China,
          <source>August 10-16</source>
          ,
          <year>2019</year>
          . pp.
          <fpage>5327</fpage>
          -
          <lpage>5333</lpage>
          . ijcai.
          <source>org</source>
          (
          <year>2019</year>
          ). https://doi.org/10.24963/ijcai.
          <year>2019</year>
          /740
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xie</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Iterative entity alignment via joint knowledge embeddings</article-title>
          .
          <source>In: IJCAI</source>
          . pp.
          <fpage>4258</fpage>
          -
          <lpage>4264</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Zwicklbauer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Seifert</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Granitzer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Doser - A knowledge-base-agnostic framework for entity disambiguation using semantic embeddings</article-title>
          . In: Sack,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Blomqvist</surname>
          </string-name>
          , E.,
          <string-name>
            <surname>d'Aquin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ghidini</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ponzetto</surname>
            ,
            <given-names>S.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lange</surname>
          </string-name>
          , C. (eds.)
          <article-title>The Semantic Web</article-title>
          .
          <source>Latest Advances and New Domains - 13th International Conference, ESWC</source>
          <year>2016</year>
          , Heraklion, Crete, Greece, May 29 - June 2,
          <year>2016</year>
          ,
          <source>Proceedings. Lecture Notes in Computer Science</source>
          , vol.
          <volume>9678</volume>
          , pp.
          <fpage>182</fpage>
          -
          <lpage>198</lpage>
          . Springer (
          <year>2016</year>
          ). https://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          -34129-3 12, https://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          -34129-3 12
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>