<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Enhancing Entity Alignment Between Wikidata and ArtGraph Using LLMs</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Anna Sofia Lippolis</string-name>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Antonis Klironomos</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Daniela F. Milon-Flores</string-name>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Heng Zheng</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alexane Jouglar</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ebrahim Norouzi</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Aidan Hogan</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Bosch Center for Artificial Intelligence</institution>
          ,
          <addr-line>BCAI</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Computer science faculty, Université de Namur</institution>
          ,
          <country country="BE">Belgium</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Department of Computer Science, University of Chile</institution>
          ,
          <country country="CL">Chile</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>FIZ Karlsruhe - Leibniz Institute for Information Infrastructure</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>School of Information Sciences, University of Illinois Urbana-Champaign</institution>
          ,
          <country country="US">United States</country>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>Univ. Grenoble Alpes</institution>
          ,
          <addr-line>CNRS, Grenoble INP, LIG, Grenoble</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff6">
          <label>6</label>
          <institution>University of Bologna and ISTC-CNR</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff7">
          <label>7</label>
          <institution>University of Mannheim</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Knowledge graphs (KGs) are used in a wide variety of applications, including within the cultural heritage domain. An important prerequisite of such applications is the quality and completeness of the data. Using a single KG might not be enough to fulfill this requirement. The absence of connections between KGs complicates taking advantage of the complementary data they can provide. This paper focuses on the Wikidata and A G ℎ KGs, which exhibit gaps in content that can be filled by enriching one with data from the other. Entity alignment can help to combine data from KGs by connecting entities that refer to the same real-world entities. However, entity alignment in art-domain knowledge graphs remains under-explored. In the pursuit of entity alignment between A G ℎ and Wikidata, a hybrid approach is proposed. The first part, which we call WES (Wikidata Entity Search), utilizes traditional Wikidata SPARQL queries and is followed by a supplementary sequence-to-sequence large language model (LLM) pipeline that we denote as pArtLink. The combined approach successfully aligned artworks and artists, with WES identifying entities for 14,982 artworks and 2,029 artists, and pArtLink further aligning 76 additional artists, thus enhancing the alignment process beyond WES' capabilities.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Entity alignment</kwd>
        <kwd>Wikidata</kwd>
        <kwd>ArtGraph</kwd>
        <kwd>Knowledge-graphs</kwd>
        <kwd>Large Language Models</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Knowledge graphs (KGs) help organize and analyze information. In particular, cultural heritage
KGs play a vital role in depicting and comprehending the significant relationships between
individuals, objects, customs, events, and legacies, illuminating the tapestry of our collective
heritage. Recently, such KGs have been used in several applications that aim to automatically
analyze art [1, 2], investigate the connections between visual elements [3], represent traditional
crafts [4], and improve the explainability of neuro-symbolic methods [5].</p>
      <p>Rich information plays a vital role in applications related to cultural heritage. For example,
discerning the emotional response elicited by an image in viewers is challenging due to the
intricate and subjective nature of human emotions and the abstract nature of visual arts [1]. To
achieve this recognition of emotions, comprehensive background knowledge is necessary. A
single KG that is either general-purpose or domain-specific might not be suficient. Multiple
KGs are often relevant to an application, but entity links identifying which nodes in these KGs
refer to the same real-world entity are essential to integrate and leverage their data fully. Entity
alignment is fundamental for integrating knowledge between diferent KGs by identifying
those missing entity links. It ensures that the same real-world entities from the input graphs
are aligned, even when using diferent identifiers in each graph. While entity alignment is
widespread, its application in art-related KGs has yet to be explored.</p>
      <p>
        In this work, we focus on computing entity alignments for two KGs: (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) Wikidata [6], which
is a well-known community-driven KG representing general knowledge, and (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) A G ℎ [2],
which is a recently proposed KG in the art domain. Multiple information gaps in these two
KGs could be filled by identifying the KG nodes that refer to the same real-world entities. For
instance, among the artworks mapped from A G ℎ, all of which possess an image, 5,356
on Wikidata lack the P18 property image, crucial for describing artworks. Another example is
the P136 genre property, which is complete for most A G ℎ entities, but 8,830 artwork
entities are without such a value on Wikidata. Other information that can enhance art-related
data on Wikidata can be information about an artist or an artwork belonging to a movement
or identifying art patrons as related to artists. Another notable observation is the disparity in
information between WikiArt, from which A G ℎ is derived, and Wikidata. For instance,
while WikiArt identifies Mario Cesariny as an artist, Wikidata highlights his role as a writer (ID:
Q1360242) . Linking both KGs would ofer a more complete picture of the artists and artworks
they describe.
      </p>
      <p>Thus, there is an opportunity to fill information gaps and solve inconsistencies in both of the
mentioned KGs to enable more robust applications. In this paper, we study entity alignment
between Wikidata and A G ℎ. In particular, we aim to identify missing entity links between
Wikidata and A G ℎ, focusing on the artists and artworks (Figure 1).</p>
      <p>We propose a hybrid approach to perform entity alignment between A G ℎ and Wikidata.
The first part of the approach, named WES (Wikidata Entity Search), uses traditional techniques
such as Wikidata SPARQL queries (similar to [7]). This method is followed by pArtLink: a
supplementary pipeline that uses sequence-to-sequence large language models (LLMs), such as
Llama 2 [8], to enhance the results of the previous method. For aligning the artworks, we used
WES, and for the artists, we first executed WES and then pArtLink to process artists that were
not aligned using WES. We manually evaluate the correctness of the results. Using WES, we
retrieved Wikidata entities for 14,982 artworks and 2,029 artists in A G ℎ. With pArtLink,
we found Wikidata entities for 76 additional artists that WES left unaligned. We provide the
code, the data, and the results in a public GitHub repository 1.</p>
      <p>The outline of the paper is the following: Section 2 introduces the background; Section 3
presents our proposed approach; Section 4 includes the results and evaluation of the approach;
Section 5 discusses the approach; Section 6 concludes the paper2.</p>
      <sec id="sec-1-1">
        <title>1https://github.com/AntonisKl/Entity-Alignment-for-Art.git</title>
        <p>2Author contributions: A.K. and A.S.L. conceived, implemented, and tested the pArtLink and WES methods,
respectively. D.F.M-F. worked on the reliability of the evaluation agreement and the respective implementation.
E.N. and A.S.L. conducted data exploration. A.J., H.Z. conducted the literature review and contributed to the paper</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. Background</title>
      <p>Knowledge graphs have been widely used to describe artworks and other cultural heritage
artifacts, as discussed by Baroncini et al. [9]. For instance, ArCo, an Italian cultural heritage
knowledge graph, collects 169 million triples describing 820 thousand cultural entities [10].
ArCo provides links to around 18.7K entities in external Linked Open Data (LOD) datasets, such
as DBpedia, Wikidata, and Geonames. Europeana [11] is another key project for collecting
and structuring data relating to cultural heritage across Europe. Works have looked at ways to
link Europeana with external knowledge bases, such as WikiArt (see Task 4.3.1 in [12]). Other
art-related KGs include Zeri for photos that document artworks [13], Nomisma for numismatic
data (https://nomisma.org/), and PaintKG describing paintings for Chinese audiences [14].</p>
      <p>A G ℎ is a KG that encapsulates concepts related to works of art. It was populated using
WikiArt and DBpedia starting with the most popular genres and styles in WikiArt. A G ℎ
is made up of 135,038 resources, including artworks, artists, genres, styles, movements, and
more. In contrast with the aforementioned art-focused graphs, the information of A G ℎ is
neither bound to a specific geographic region nor restricted by a specific type of visual artwork.
In addition to its broad scope, WikiArt, which A G ℎ is based on, is built and maintained
by Ukrainian developers [15], where taking into account the ongoing situation in the country,
we are interested in supporting the project by connecting this KG to an external one, namely
Wikidata.</p>
      <p>A G ℎ includes Wikipedia URLs for the artists, but not for artworks. However, at the
time of writing this paper, we noticed that there were artists who were missing a Wikipedia
page URL (e.g. Ivan Vladimirov) and others that had an invalid one (e.g. Dana Levin). So we
opted to use the Wikipedia URLs only as a first step of two in WES, as a basis to find a first
batch of artists, and then execute pArtLink as an attempt to align the artists with missing or
invalid Wikipedia URLs.</p>
      <p>Regarding the entity alignment task, a survey by Zeng et al. [16] summarizes the relevant
methods developed in recent years. Entity alignment has been applied in diverse scenarios, such
as for geographic knowledge bases [17], for large-scale multilingual knowledge bases (DBpedia)
[18], and for electric power marketing [19]. To the best of our knowledge, the alignment of
entities within KGs related to the domain of art has not been investigated.</p>
      <p>Our goal is to provide a hybrid solution for aligning entities between A G ℎ and Wikidata.
We present the steps of our proposed solution in the following section.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Proposed approach</title>
      <p>Our proposed approach uses two methods: WES and pArtLink. WES was executed and evaluated
ifrst for aligning artists and artworks. Afterwards, pArtLink was used to improve the results
of WES in terms of artists’ alignment. Thus, these two methods are used in a complementary
manner. Before testing our approach, we examined the KGs (i.e., Wikidata and A G ℎ) to
estimate their quality and extracted the data we needed as input for our methods. Both the data
and the internals of our two methods are presented in this section.
writing. A.H. supervised the research. All authors provided critical feedback and helped shape the research.</p>
      <sec id="sec-3-1">
        <title>3.1. Analysis of data resources</title>
        <p>Building on the foundation laid out previously, this section explores the data resources integral
to our approach. Central to this is the A G ℎ KG, which describes 135,038 resources and
is accessible on Zenodo. Following our initial review of the A G ℎ KG, we conducted a
content coverage analysis and identified any missing information.</p>
        <p>Initial observations. Our preliminary observations revealed several issues. For instance,
there is a discrepancy between the published KG and the ontology, specifically regarding the
belongtofield attribute being used in one place and belongstofield in another. It is also
necessary to consider the presence of alternative names or titles for the artworks concerning
Wikidata. For example, alternative names may include middle names or alternative (translated)
titles such as “Ezechiel’s Vision” versus “Vision of Ezechiel” or “Poppy Field” versus “Field with
Poppies”. SPARQL queries were created to explore these issues to understand the extent of
specific gaps concerning properties related to the entities of interest.</p>
        <p>
          Artists exploration. Several information gaps were discovered for artists. The gender
property is missing for 57 artists. While this may seem small, understanding gender can provide
insights into the representation and diversity of artists in the dataset. Also, birth and death
details are crucial for historical context. Yet, birth_date is missing for 20 artists, death_date
for 425, birth_place for 1,341, and death_place for 1,566. Only 144 artists have information
on hasPatron. This could be indicative of the challenges in tracing patronage historically.
Similarly, high numbers of missing values in trainedBy (
          <xref ref-type="bibr" rid="ref2">2,460</xref>
          ), belongsToField (
          <xref ref-type="bibr" rid="ref1">1,917</xref>
          ),
relatedToSchool (
          <xref ref-type="bibr" rid="ref2">2,170</xref>
          ), and belongsToMovement (
          <xref ref-type="bibr" rid="ref1">1,850</xref>
          ) suggest gaps in the professional
and educational backgrounds of the artists.
        </p>
        <p>Artworks exploration. Regarding artworks, similarly, numerous gaps in information were
identified. The location of artworks is a significant gap, with locatedIn missing for 96,943
pieces. This could hinder eforts to trace the provenance or current location of the artworks.
Furthermore, the vast majority, 108,153 artworks, are missing the partOf attribute, which
could provide insights into collections or series they belong to. It’s worth noting that some
attributes like name, createdBy, hasGenre, image_url, and hasStyle for artworks are
fully populated, suggesting that while there are significant gaps in some areas, others are
well-documented.</p>
        <p>To conduct entity aligning experiments, we extracted the names of the artists and their
artworks in a 2-column CSV file, which was given as input for the proposed methods described
in the following section.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Methodology</title>
        <p>This section will introduce a two-step methodology designed to align artworks and artists on
A G ℎ with the corresponding entities on Wikidata. For the artworks, we used WES (the
ifrst step of our approach) as a stand-alone method. For artists, we integrate our two strategies
(i.e., WES and pArtLink) aiming to enhance both the accuracy and coverage of links. Given the
limitations of Wikidata Entity Search and the absence of contextual knowledge in the input
data, incorporating a complementary method based on state-of-the-art research would allow
for identifying additional links and enable comparison with WES.
3.2.1. Method 1 — WES: Wikidata Entity Search
The first method, called WES for short, uses the names of artists and the titles of their artworks
from A G ℎ to search for the corresponding entities in Wikidata. By matching entity
labels between the two KGs, WES also provided a hint on the quality of Wikidata in terms of
population completeness according to the definition proposed by [ 20]. This method was used
to align both artworks and artists, as explained in this section.</p>
        <p>Entity retrieval and linking to Wikidata for artworks. First, a SPARQL query using the
SPARQLWrapper library in Python is executed to align the artwork’s title and creator with the
corresponding title, description, and/or creator data present on Wikidata. This query employs
EntitySearch, a practical method underpinned by ElasticSearch that is used to perform full-text
searches on Wikidata pages, even in image captions. We made the word “The” optional before
the title of an artwork to enhance the query. This ensures it was impossible to miss any titles that
might not include it. As a limitation, it is not possible to make too many requests simultaneously,
which is challenging for this case, as there is a large amount of data to process. To circumvent
this, we have introduced a timeout mechanism. The methodology yields results for over 14,000
categorized artworks from an initial query set of more than 100,000 artworks.
Entity retrieval and linking to Wikidata for artists. Within the A G ℎ, hyperlinks
point to artist profiles on Wikipedia. Given that Wikipedia entries are linked to from Wikidata,
we have extracted these Wikipedia URLs to pinpoint the precise artist page title. This step
is useful in avoiding disambiguation, as the Wikipedia page will point to the right entity in
case of homonymy. Subsequently, the Python library pywikibot was employed to derive the
corresponding Wikidata item. There were cases of artists that could not be identified, perhaps
because of missing Wikipedia pages or invalid Wikipedia URLs (as discussed in Section 2). So,
Spacy OpenTapioca 3 was used on the remaining artists’ names directly.</p>
        <sec id="sec-3-2-1">
          <title>3https://spacy.io/universe/project/spacyopentapioca</title>
          <p>Legend
Sequence-to-sequence language models
Tool for mapping Wikipedia article title to
Wikidata entity ID</p>
          <p>Output
Input + Wikidata IDs
Context prompt
(static)</p>
          <p>Main prompt
(dynamic)</p>
          <p>Artists and Artworks
Input
Artists and Artworks</p>
          <p>Prompt creation</p>
          <p>Sentence
generation</p>
          <p>Entity retrieval and
linking to Wikipedia</p>
          <p>Wikipedia-to-Wikidata
mapping</p>
          <p>After using this method for linking the artists, we observed that 423 of the 2,452 total artists
were not linked. So, we proceeded towards a second complementary method using Natural
Language Processing (NLP) and LLMs to fill the gaps for artists that were not linked.
3.2.2. Method 2 — pArtLink: Pipeline for entity retrieval and linking using LLMs
The second method, which we call pArtLink, complements WES with NLP techniques for
mapping the artist names to Wikidata entity IDs after mapping them to the most likely Wikipedia
article titles. The motivation behind developing this method was to find links for the more
complex cases of artists missed by WES, for whom little context is often provided, i.e., the
artists’ names and titles of their artworks. Our idea was to leverage modern LLMs’ retrieval
and generative capabilities to enrich the context available for entity alignment.</p>
          <p>The main processes involved are prompt generation, textual sequence generation, and entity
identification and linking. The pipeline takes as input a CSV file with the artists and their
artworks and outputs a CSV file with the artists and their Wikidata IDs. The workflow can be
divided into the following modules that are arranged sequentially (Figure 2).
Prompt creation. This initial module of the pipeline creates one prompt for each artist and
one of his/her artwork (arbitrarily chosen), which will then be given as input to an LLM. The
prompt aims to make the LLM produce informative sentences for the artist. Another goal of the
prompt is to yield answers with a specific format in order to be parsed later in an automated way.
Furthermore, the answer from the LLM should not contain any additional redundant text (e.g.,
introductory remarks that rephrase parts of the prompt). With these goals as requirements, the
constructed prompt template is: You are a helpful AI assistant for finding information about artists.
Your answer must contain only one line. Your answer must follow this template: “artist_name:
info_about_artist”. Your answer must not include any other text than the answer itself. Write a
short sentence about the artist {artist} who created {artwork}.</p>
          <p>Sentence generation. This module is responsible for generating for each artist a natural
language sentence that describes them. It accomplishes this by feeding the prompts created by
the previous module to a sequence-to-sequence LLM called Llama 2 [8]. For the purposes of
this work, a pre-trained version of the model called Llama-2-Chat with 7 billion parameters
was used via a Python interface provided by GPT4ALL project [21]. Llama-2-Chat models are
open-source models developed by Meta specifically for dialogue use cases. In comparison to
open-source chat models, according to the original paper, these fine-tuned LLMs perform better
on most benchmarks and are on par with some popular closed-source models like ChatGPT and
PaLM in terms of helpfulness and safety, as per the human evaluations they conducted.
Entity retrieval and linking to Wikipedia. This pipeline step aims to identify the entities
representing the artists in the given sentences and link them to Wikipedia. Specifically, for each
sentence, this module retrieves the title of the Wikipedia article that corresponds to the artist
mentioned in that sentence. If there is more than one possible title, all of them are retrieved for
disambiguation purposes. This process is achieved by utilizing a sequence-to-sequence language
model called GENRE (Generative ENtity REtrieval) [22]. In this work, we used an instance of
the pre-trained model on End-to-End Entity Linking from sentences to Wikipedia. The GENRE
system employs a fine-tuned BART architecture to perform entity retrieval, connecting input
text to distinct entities. It generates a unique entity name from the input text using constrained
beam search to maintain the validity of the generated identifiers.</p>
          <p>Wikipedia-to-Wikidata mapping. The final module of the workflow has the role of finding
the correspondence between Wikipedia article titles and Wikidata entity IDs. A Python library
called Wikimapper4 was used for performing this mapping. This tool creates an index of
mappings from a Wikipedia SQL dump to accomplish this. Before the execution of the pipeline,
a recent dump of the English Wikipedia was retrieved, and then the index was generated from it
by Wikimapper. The coding interface of GENRE provides a way of specifying a custom Python
method that functions as a converter from the Wikipedia article title to an ID. In this way,
Wikimapper’s mapping functionality was injected in GENRE to perform the mapping internally
before GENRE produces its output.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results and evaluation</title>
      <p>The results were separated depending on the category of the entities, i.e., artists and artworks
(Figure 1). A sample of entities from each category was evaluated manually by the authors
of this paper. Later, by using the Fleiss Kappa measure, we calculated the agreement of the
evaluations for the result samples that were collectively judged.</p>
      <sec id="sec-4-1">
        <title>4.1. Artworks</title>
        <p>Artworks were linked using the WES method, generating 14,982 links to Wikidata. We then
evaluated 100 artwork links randomly chosen from the results and concluded that 99 of these
links were correct.</p>
        <p>During the evaluation, an artwork was considered correctly linked if there was a convergence
between the source WikiArt entity and the target Wikidata entity with regard to the title, author,
image, and date properties. If all the mentioned fields except for the title matched, a slightly
diferent title was accepted.</p>
        <p>During the evaluation, we observed that if two or more artworks by the same author have a
similar title, such as The Drinkers and Drinkers in a Tavern by Adriaen van Ostade, especially in
the case one title is a subset of the other, then if one of them isn’t listed on Wikidata, the other
will often be incorrectly displayed as a result. These are particularly challenging cases.</p>
        <p>We did not use our pArtLink method to link any artwork because we found some caveats that
might influence with a greater chance the performance of pArtLink for artworks, than artists. In
4https://github.com/jcklie/wikimapper
particular, the state of Wikipedia in which GENRE was pre-trained (several years ago) is not the
same as the current one. Given the higher number of artworks than artists, it is more probable
that new artwork pages have been created on Wikipedia since then. Also, entity linking models
are often trained for people, places, and organizations, and thus may not be suitable for titles
of works, which may be verbose and contain common words like “Drinker” that are dificult
to disambiguate. For these reasons, GENRE might not be able to perform entity linking for
artworks with the same accuracy as for artists, though it would be interesting to explore this in
future work.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Artists</title>
        <p>Regarding the artists, our WES method was executed first to extract links for 2,452 artists.
Wikidata entity IDs were found for 2,029 artists, out of which 100 were manually evaluated by
the authors of this work; all were found to be correctly linked to Wikidata.</p>
        <p>However, using WES led to 423 artists being left unlinked. We saw this as an opportunity
to use our pArtLink method with the prospect that the abilities of LLMs will prove fruitful. In
order to test pArtLink, we used as input 700 artists in total: the 423 unlinked by WES and 277
of the rest. This method found Wikidata entity IDs for 332 artists. We manually checked the
correctness of these entities. Out of these artists, 117 were left unlinked by WES (i.e., 215 were
also linked by WES) and 252 were correctly linked to Wikidata. Also, 76 of the correctly linked
artists were artists that WES could not link.</p>
        <p>Each entity link that was evaluated was considered to be correct if the following conditions
were met: The retrieved Wikidata entity represents a person that has a name, nationality, and
profession that match with those in the artist’s WikiArt page (searching by the artist’s artwork).</p>
        <p>During evaluation, we detected some cases of erroneous and incomplete entity linking. Some
artists, like Erol Akyavaş, do not exist in the English Wikipedia. Since Wikimapper is based on
the English version of Wikipedia as a source for the mapping to Wikidata, such artists were
either not linked to Wikidata or incorrectly linked (25 out of the evaluated 332). Other artists
may have names that are common among diferent Wikidata entities (e.g. James Weeks). GENRE
is used to disambiguate using the produced titles. Except for 19 manually disambiguated artists,
a single Wikidata entity ID was produced by pArtLink for each of the others. Also, for 7 of
the 332 evaluated artists, the LLM-generated sentence described another person with a similar
name. As a result, pArtLink produced the wrong Wikidata entity ID. One example is Joaquim
Rodrigo (painter), for which the LLM answered with a sentence describing Joaquín Rodrigo
(music composer).</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Reliability of evaluation agreement</title>
        <p>
          After compiling the results of our two methods, WES and pArtLink, we performed a manual
evaluation to measure the level of agreement between reviewers. To do this, we created a
sample of 100 random artwork/artist entities. The task was to confirm whether the links to their
Wikidata ID were correct. The evaluation was limited to two variables:‘correct (
          <xref ref-type="bibr" rid="ref1">1</xref>
          )’ or ‘incorrect
(0)’. Five reviewers individually examined the same sample to test agreement among reviewers.
An example of the sample file generated can be seen in Table 2.
        </p>
        <p>Entities
Artworks</p>
        <p>Artists</p>
        <p>Methods</p>
        <p>WES
pArtLink</p>
        <p>WES
pArtLink</p>
        <p>Total</p>
        <p>Our process began by examining the artwork samples. After the first round of evaluations,
we decided to apply the well-known Fleiss Kappa (FK) measure to assess the level of agreement
among the reviewers. Specifically, FK allows evaluating the ‘inter-rater reliability’ of nominal
data (e.g., correct/incorrect) in situations with 3 or more raters [23]. Predefined values allow
interpreting the score obtained with FK. For example, if the score is less than 0, the agreement is
poor; if it is between 0.01 and 0.2, the agreement is slight; between 0.61 and 0.8, it is a substantial
agreement; and if the score is between 0.8 and 1, the agreement is almost perfect. The resulting
FK for the first evaluation of artwork samples was 0.011, indicating a ‘slight agreement’. After
some discussion among the reviewers, it was discovered that most of the disagreements were
due to special cases on the Wikidata page: cases where the language of the artwork title in
the sample (e.g., La Pergola) diverged from that of the Wikidata page (e.g., After lunch), cases
where the artist’s name was abbreviated or slightly modified, e.g., Ivan Aivazovsky (sample)
and Iwan Aiwasowski (Wikidata), and situations where the entity label was exclusively within
the caption of the illustration (e.g., Q62116516). Therefore, we decided to perform a second
round of evaluations for the artwork sample after defining the criteria mentioned in sections
4.1 and 4.2. For the second round, the obtained FK was 0.018, indicating a ‘slight agreement’.
However, despite the low concordance, we estimated that more than 90% of the entities in
the sample were marked as correct by the majority of the reviewers. During our research,
we found that a limitation of the FK score is that it is afected by the no-variation/bias in the
reviewed data. In situations like this, it is important to incorporate other measures to get a
more comprehensive view of the agreement. In the study of [24], the authors suggested using
the Percentage of Agreement (PA) measure alongside FK. Similarly, in the work of [25], the
researchers introduced a modified version of the FK measure called Free Marginal Multirater
Kappa (K) to address issues related to prevalence and unbalanced classes. By employing
both approaches, we computed the PA and K values for our artwork sample, resulting in
the following outcomes: a PA of 95% and a K of 0.9. This time both results were higher than
the obtained FK measure.</p>
        <p>Subsequently, we extended our evaluation to the artist entities. For this case, we performed
only one round of evaluation with the same pre-defined criteria as in Subsection 4.2, i.e., if the
name of the artist, its occupation, and nationality in the Wikidata ID corresponded to the one
on the sample, regardless of the languages and slight modifications, the sample was marked as
correct. As anticipated, there was a distinction between the artist and the artwork evaluation
due to the utilization of diferent methods. Consequently, the FK value of 0.88 indicated a
‘substantial agreement’ reflecting less prevalence in the data. The other metrics also exhibited
consistency with the FK score, with PA remaining at 95% and K at 0.89.</p>
        <p>To conclude with the evaluation of concordance, we can confirm that low FK values do not
always reflect inter-rater reliability when the sample is unbalanced towards a particular label.
Conversely, we can observe that both scores of artwork and artist, calculated with the K
and PA metrics, are high. This result allows us to verify that for the reviewers, the entities were
correctly associated with their Wikidata ID in most of the cases.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Limitations</title>
      <p>In this work, we discuss a framework to fill information gaps and inconsistencies about artists
and artworks for two knowledge bases, comprised of two methods: the WES method and the
pArtLink method. These approaches, while successfully aligning artworks and artists, encounter
some limitations. In the case of artworks with similar names for the same artists, the WES
method may find false positives. Moreover, the results of the pArtLink method heavily rely on
the quality of the sentences that get generated by an LLM. It could be that some artists were
not present in the training dataset of the LLM. In this case, the LLM may not be able to produce
an informative enough sentence so that GENRE can successfully link the entities mentioned in
the text to Wikipedia. Furthermore, to the best of our knowledge, there is no perfect metric to
measure agreement among reviewers. Readers should be aware that using the FK metric alone
may result in low scores due to unbalanced assessments. A feasible solution is to incorporate
complementary metrics for a more complete analysis and broaden the assessment categories
beyond correct or incorrect.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion and future work</title>
      <p>In this paper, we introduced two methods for entity alignment and applied them to align entities
between the A G ℎ and Wikidata KGs. On the one hand, WES uses traditional querying
techniques. On the other hand, pArtLink uses recently developed LLMs. Our evaluations
demonstrate that these methods are complementary and can yield accurate results. With the
entities aligned in this work, the information gaps of both A G ℎ and Wikidata can be
iflled by completing missing data in a subsequent endeavor.</p>
      <p>In this research, we investigated the task of entity alignment in art-related knowledge bases.
However, there are still avenues for further exploration and enhancement. One of the immediate
steps forward in terms of practical impact is to establish a collaboration with Wikidata by
uploading the evaluated and corrected datasets. Additionally, to validate the versatility and
scalability of our framework, it would be beneficial to test its eficacy on other art-related
knowledge bases. Finally, we plan to expand our pArtLink method on artworks, and to link
entities with Wikipedia articles in languages other than English.</p>
      <p>
        Acknowledgments Heng Zheng thanks the grants, United States Institute of Museum and Library Services
RE-250162-OLS-21, Alfred P. Sloan Foundation G-2022-19409, the University of Illinois Urbana Champaign, and the
University of Groningen for supporting me in participating in the 2023 International Semantic Web Summer School
which led to this contribution. Antonis Klironomos is partially afiliated with the EU project Graph Massiviser (GA
101093202). Anna Sofia Lippolis is supported through the WHOW project (EU CEF programme - grant agreement
no. INEA/CEF/ICT/A2019/2063229). Ebrahim Norouzi thanks the Information Service Engineering group at FIZ
Karlsruhe for supporting me in participating in the International Semantic Web Summer School 2023. Daniela
F. Milon-Flores is part of the TRACES project (http://traces-anr-fns.imag.fr/) funded by ANR: Agence Nationale
de la Recherche (France) and FNS: Fonds National Suisse (Swiss). Alexane Jouglar is part of the ARIAC project
(https://www.digitalwallonia.be/fr/publications/trail/).
landscape on iconography and iconology statements of knowledge graphs in the semantic
web, Journal of Documentation 79 (
        <xref ref-type="bibr" rid="ref7">7</xref>
        ) (2023) 115–136.
[10] V. A. Carriero, A. Gangemi, M. L. Mancinelli, L. Marinucci, A. G. Nuzzolese, V. Presutti,
C. Veninata, Arco: The italian cultural heritage knowledge graph, in: C. Ghidini, O. Hartig,
M. Maleshkova, V. Svátek, I. Cruz, A. Hogan, J. Song, M. Lefrançois, F. Gandon (Eds.), The
Semantic Web – ISWC 2019, Springer International Publishing, Cham, 2019, pp. 36–52.
[11] A. Isaac, B. Haslhofer, Europeana linked open data –data.europeana.eu, Semantic Web
4 (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) (2013) 291–297.
[12] M. Leferts, C. Concordia, L. Anastasiou, A. Jahnke, M. Kittelmann, H. Manguinhas,
Europeana cloud - deliverable 4.3 - a report and a plan on future directions for improving
metadata in the europeana cloud, Tech. rep., Project report, Europeana Cloud, Deliverable
D4.3, 2016 (2016).
[13] M. Daquino, F. Mambelli, S. Peroni, F. Tomasi, F. Vitali, Enhancing semantic expressivity
in the cultural heritage domain: Exposing the zeri photo archive as linked open data, J.
      </p>
      <p>
        Comput. Cult. Herit. 10 (
        <xref ref-type="bibr" rid="ref4">4</xref>
        ) (jul 2017).
[14] H. Wu, S. Y. Liu, W. Zheng, Y. Yang, H. Gao, Paintkg: the painting knowledge graph
using bilstm-crf, in: 2020 International Conference on Information Science and Education
(ICISE-IE), 2020, pp. 412–417.
[15] Wikiart visual encyclopaedia blocked in russia, https://shorturl.at/cxCU8 (4 2022).
[16] K. Zeng, C. Li, L. Hou, J. Li, L. Feng, A comprehensive survey of entity alignment for
knowledge graphs, AI Open 2 (2021) 1–13.
[17] K. Sun, Y. Zhu, J. Song, Progress and challenges on entity alignment of geographic
knowledge bases, ISPRS International Journal of Geo-Information 8 (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) (2019).
[18] J. Lehmann, R. Isele, M. Jakob, A. Jentzsch, D. Kontokostas, P. N. Mendes, S. Hellmann,
M. Morsey, P. van Kleef, S. Auer, C. Bizer, DBpedia –a large-scale, multilingual knowledge
base extracted from wikipedia, Semantic Web 6 (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) (2015) 167–195.
[19] W. Meng, D. Zhang, T. Guo, Z. Zong, Y. Liu, Y. Wang, J. Li, W. Zhu, Research on the typical
application of knowledge graph in power marketing, in: 2021 International Conference on
Computer Engineering and Application (ICCEA), 2021, pp. 318–321.
[20] C. Brando, N. Abadie, F. Frontini, Linked data quality for domain-specific named-entity
linking, in: Conférence Extraction et Gestion des Connaissances (EGC) Atelier Qualité des
Données du Web (QLOD), Reims, France, 2016, hal-0239918.
[21] Y. Anand, Z. Nussbaum, B. Duderstadt, B. Schmidt, A. Mulyar, Gpt4all: Training an
assistant-style chatbot with large scale data distillation from gpt-3.5-turbo, https://github.
com/nomic-ai/gpt4all (2023).
[22] N. De Cao, G. Izacard, S. Riedel, F. Petroni, Autoregressive entity retrieval, in: 9th
International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May
3-7, 2021, OpenReview.net, 2021.
[23] J. L. Fleiss, Measuring nominal scale agreement among many raters., Psychological bulletin
76 (
        <xref ref-type="bibr" rid="ref5">5</xref>
        ) (1971) 378.
[24] M. L. McHugh, Interrater reliability: the kappa statistic, Biochemia medica 22 (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) (2012)
276–282.
[25] J. J. Randolph, Free-marginal multirater kappa (multirater k [free]): An alternative to fleiss’
ifxed-marginal multirater kappa., Online submission (2005).
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Aslan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Castellano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Digeno</surname>
          </string-name>
          , G. Migailo,
          <string-name>
            <given-names>R.</given-names>
            <surname>Scaringi</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>Vessio, Recognizing the emotions evoked by artworks through visual features and knowledge graph-embeddings</article-title>
          , in: P. L.
          <string-name>
            <surname>Mazzeo</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Frontoni</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Sclarof</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          Distante (Eds.),
          <source>Image Analysis and Processing. ICIAP 2022 Workshops</source>
          , Springer International Publishing,
          <year>2022</year>
          , pp.
          <fpage>129</fpage>
          -
          <lpage>140</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>G.</given-names>
            <surname>Castellano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Digeno</surname>
          </string-name>
          , G. Sansaro, G. Vessio,
          <article-title>Leveraging Knowledge Graphs and Deep Learning for automatic art analysis</article-title>
          ,
          <source>Knowledge-Based Systems</source>
          <volume>248</volume>
          (
          <year>2022</year>
          )
          <fpage>108859</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kouretsis</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Varlamis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Limniati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Pergantis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Giannakoulopoulos</surname>
          </string-name>
          ,
          <article-title>Mapping art to a knowledge graph: Using data for exploring the relations among visual objects in renaissance art</article-title>
          ,
          <source>Future Internet</source>
          <volume>14</volume>
          (
          <issue>7</issue>
          ) (
          <year>2022</year>
          )
          <fpage>206</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>N.</given-names>
            <surname>Partarakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Doulgeraki</surname>
          </string-name>
          , E. Karuzaki,
          <string-name>
            <given-names>G.</given-names>
            <surname>Galanakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zabulis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Meghini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Bartalesi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Metilli</surname>
          </string-name>
          ,
          <article-title>A web-based platform for traditional craft documentation</article-title>
          ,
          <source>Multimodal Technologies and Interaction</source>
          <volume>6</volume>
          (
          <issue>5</issue>
          ) (
          <year>2022</year>
          )
          <fpage>37</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>N.</given-names>
            <surname>Díaz-Rodríguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lamas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Sanchez</surname>
          </string-name>
          , G. Franchi, I. Donadello,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tabik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Filliat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Cruz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Montes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Herrera</surname>
          </string-name>
          ,
          <article-title>Explainable neural-symbolic learning (x-nesyl) methodology to fuse deep learning representations with expert knowledge graphs: The monumai cultural heritage use case</article-title>
          ,
          <source>Information Fusion</source>
          <volume>79</volume>
          (
          <year>2022</year>
          )
          <fpage>58</fpage>
          -
          <lpage>83</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>D.</given-names>
            <surname>Vrandecic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Krötzsch</surname>
          </string-name>
          ,
          <article-title>Wikidata: a free collaborative knowledgebase</article-title>
          ,
          <source>Commun. ACM</source>
          <volume>57</volume>
          (
          <issue>10</issue>
          ) (
          <year>2014</year>
          )
          <fpage>78</fpage>
          -
          <lpage>85</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A.</given-names>
            <surname>Waagmeester</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. L.</given-names>
            <surname>Willighagen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. I.</given-names>
            <surname>Su</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kutmon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. E. L.</given-names>
            <surname>Gayo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Fernández-Álvarez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Groom</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. J.</given-names>
            <surname>Schaap</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. M.</given-names>
            <surname>Verhagen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Koehorst</surname>
          </string-name>
          ,
          <article-title>A protocol for adding knowledge to wikidata: aligning resources on human coronaviruses</article-title>
          ,
          <source>BMC Biology 19 (1)</source>
          (
          <year>2021</year>
          )
          <fpage>12</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>H.</given-names>
            <surname>Touvron</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Martin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Stone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Albert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Almahairi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Babaei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Bashlykov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Batra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bhargava</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bhosale</surname>
          </string-name>
          , et al.,
          <source>Llama</source>
          <volume>2</volume>
          :
          <article-title>Open foundation and fine-tuned chat models</article-title>
          ,
          <source>arXiv preprint arXiv:2307.09288</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S.</given-names>
            <surname>Baroncini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Sartini</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. Van Erp</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Tomasi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Gangemi</surname>
          </string-name>
          , Is dc:subject enough? a
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>