<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>App to Gather Expert Knowledge on Inclusive Cultural Heritage Metadata</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Andrei Nesterov</string-name>
          <email>nesterov@cwi.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Laura Hollink</string-name>
          <email>hollink@cwi.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jacco van Ossenbruggen</string-name>
          <email>jacco.van.ossenbruggen@vu.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Workshop</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Centrum Wiskunde &amp; Informatica</institution>
          ,
          <addr-line>Science Park 123, 1098 XG Amsterdam</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Metadata</institution>
          ,
          <addr-line>Cultural Heritage, Inclusivity, Annotation, Domain Expert Knowledge, Web Application</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Vrije Universiteit Amsterdam</institution>
          ,
          <addr-line>De Boelelaan 1105, 1081 HV Amsterdam</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <fpage>26</fpage>
      <lpage>28</lpage>
      <abstract>
        <p>In this demo paper, we present the web application Alter Heritage. Its goal is to support researchers in collecting domain experts' knowledge about inclusive cultural heritage metadata. While stakeholders in the cultural sector have been formulating strategies to mitigate biases in metadata, there is little empirical knowledge on how the metadata can be revised for inclusivity by users. With Alter Heritage, researchers can study what kind of alterations users make in artefacts' metadata to make it inclusive, how these alterations difer per artefacts and users. We develop the app's core metadata editing functionality based on requirements elicited from existing practices and guidelines in the cultural heritage domain. The data gathered via Alter Heritage will contribute to understanding of the specific alterations that make digital artefacts' metadata more inclusive.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        What users learn about cultural heritage on the websites of GLAMs (galleries, libraries, archives, and
museums) largely depends on metadata. Thanks to the metadata, users can search through artefacts and
get contextual information about them: dates, places, keywords, textual descriptions, and relations to
other artefacts. Apart from useful information, metadata can also transmit cultural biases. For example,
subject terms from controlled vocabularies may be ofensive towards marginalised communities.[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
Titles and descriptions assigned by archivists are not free of prejudiced views either, which users might
perceive as racist, sexist, homophobic, or ableist.[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]
      </p>
      <p>
        To address biases in the metadata of artefacts, stakeholders in the cultural heritage domain have
been developing strategies.[
        <xref ref-type="bibr" rid="ref3 ref4 ref5">3, 4, 5</xref>
        ] Their eforts have resulted in diferent documents: policies of
cultural heritage institutions, academic research papers, or guidelines from formal and informal groups
advocating for better representation of their heritage (such as Anti-Racist Description Working Group
[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]). Despite numerous documents, there is little empirical knowledge about how experts can change
metadata to reduce biases. This led us to to develop Alter Heritage, a web app for research purposes.
The tool allows annotators to make changes in (biased) metadata and captures their input: for example,
whether they edit text in certain metadata fields, hide them from the view or remove them entirely, add
new fields, provide explanation notes or warnings to signal about biases. Using the app, we can study
how annotators alter metadata of selected cultural heritage artefacts, whether there are diferences
between annotators, as well as between alterations of diferent artefacts.
      </p>
      <p>We select 14 domain documents, such as the guidelines and strategies mentioned above. From these
documents, we elicit requirements: which functionality the web app must ofer to annotators in order
to realise the suggestions on making the metadata inclusive. We implement the functionality for 38 user
actions extracted from the documents, design the web app interface, and run a pilot with three users.</p>
      <p>CEUR</p>
      <p>ceur-ws.org</p>
      <p>
        While multiple tools have been developed specifically for the cultural heritage domain to gather
annotations (see related work in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]), Alter Heritage is a research tool to study how annotators change
metadata with the goal of increasing its inclusiveness. Apart from some common functionalities of
existing tools, such as adding new tags or comments to artefacts, Alter Heritage incorporates other
functionalities that primarily relate to dealing with ofensive content: for example, hiding parts of the
metadata on a web page or adding content warnings. To annotators, Alter Heritage ofers an interactive
environment for editing digital artefacts’ metadata. For researchers, Alter Heritage allows setting up
annotation studies with any type of artefacts, as long as their metadata can be put into the app’s data
scheme. The data collected with Alter Heritage will contribute to a deeper understanding of what
makes cultural heritage metadata inclusive.
      </p>
      <p>The Alter Heritage demo is available online1 as well as its code with open licence and documentation.2</p>
    </sec>
    <sec id="sec-2">
      <title>2. Requirements Elicitation</title>
      <p>
        We use the domain analysis approach to elicit the web app requirements from the domain documents.[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]
      </p>
      <p>First, we select domain documents about inclusive metadata strategies. We find them with keywords 3
using the Web search engines, academic repositories, and applying the snowballing method: following
the references in the initial set of documents. While some documents are broad and general, others
contain concrete actions. We targeted the latter ones and selected 14 documents in total (see the list in
the documentation). Our aim was not to collect an exhaustive list of documents, but to collect a diverse
set of actions of how metadata can be altered to increase its inclusivity.</p>
      <p>
        Second, from the documents, we extract phrases with actions on making metadata inclusive. In total,
we identified 48 actions. For each action, we formulate use cases: which user performs an action and
how. For these use cases, we have only one type of user – an annotator. We specify use cases with 41
scenarios that illustrate how annotators might perform actions in practice using the web app.[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]
      </p>
      <p>Third, covering all the scenarios, we outline the web app functionality. In Table 1, we provide an
example of how we adapted suggestions from the domain documents into the web app functionality.
The full table with actions, use cases, scenarios, and the functionality based on them is available in the
documentation.</p>
      <p>The requirements elicited from the domain documents concern only annotators editing metadata.
Apart from them, we formulated additional requirements associated with how researchers would
manage the annotation task: loading and displaying metadata, saving and retrieving annotators’ input.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Functionality and Implementation</title>
      <p>When annotators access Alter Heritage, they begin with a consent screen showing their privacy terms.
If a consent is given, annotators see the main screen, which includes instructions, an artefact (an image)
and its metadata on the left, navigation and questionnaire on the right (see Figure 1). Annotators can
perform multiple actions editing metadata. These actions are applicable to both textual metadata fields</p>
      <sec id="sec-3-1">
        <title>1https://alterheritage.project.cwi.nl/#ekaw24demo</title>
        <p>2Documentation: https://github.com/cultural-ai/AlterHeritage
3For example, one of the queries we used was “cultural heritage” AND metadata AND (inclusive OR bias)
and subject terms: adding a custom field or a subject term, removing, hiding/showing, and adding a
note to them. To textual fields, annotators can also add a warning indicating, for example, ofensive
content. The button “Restore” allows annotators to restore the original metadata.</p>
        <p>After completing their edits, annotators click “Submit” to send their changes to the server. Unsaved
changes are stored in annotators’ browser to allow navigation across artefacts preserving edits. The
app notifies annotators when they submit edits and complete the task. Annotators can make more
changes even after submission and resubmit them.</p>
        <p>For researchers, the annotation tasks can be managed by creating JSON-files. There are 5 JSON-files
linked to an annotator: consent, original metadata, edits, responses to questionnaire, and the submission
progress. Researchers set ID for each annotator. This ID is used as a hash value in the web app link, for
example, #ekaw24demo. The web app checks the hash value to read and write into corresponding files
sufixed with the same ID. This way, researchers can distribute diferent sets of artefacts’ metadata to
diferent annotators. The initial metadata of artefacts should be put in a JSON-schema, so that the web
app could parse and display it. Researchers can indicate explicitly in a JSON-file which metadata fields
will be available for editing by annotators. The annotators’ edits are stored separately from the original
metadata in the same JSON-schema to ease the before-and-after comparison.</p>
        <p>In this demo version, we chose not to implement such functionalities as image manipulation (e.g.,
cropping) and connecting external datasets (e.g., thesauri for pre-defined subject terms) that would
make the annotation process more complex. These features could be realised in future development.</p>
        <p>Alter Heritage is built with HTML, CSS, and JavaScript, running on a Node.js4 server. It utilises the
Fetch API to parse JSON-files and POST requests to save annotators’ edits.</p>
        <p>To test the functionality and interface, we ran a pilot with three users. They edited the metadata of
six artefacts, completed the questionnaire, and gave comments during the task. The pilot helped us
eliminate errors in interface interactions and understand users’ behaviour. As a result, we made the
interface more intuitive based on users’ feedback.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Demo Use Case</title>
      <p>During the demonstration of the app, users will have the opportunity to test all the functionality we
implemented and see how the data is being processed in real time. The app will load the metadata of
six cultural heritage objects we selected for users to annotate. Originally, we developed and tested the
interface in Dutch. For this demo, we will provide an English version.</p>
      <p>
        There are various types of biases that can be found in the cultural heritage metadata. In our demo, we
focus on one use case: (non)inclusive representation of LGBTQ+ people. We obtained artefacts’ metadata
that contains outdated and stereotyping terminology referring to LGBTQ+ people. First, we identified
outdated and ofensive LGBTQ+ terminology in specialised sources, such as Homosaurus 5 and a practical
guide for searching LGBTQ+ related objects in archives.[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] Second, we queried this terminology in the
metadata of artefacts from the European cultural heritage institutions using Europeana API6. We selected
six objects with potentially non-iclusive metadata. For example, the historical term “sodomites” used
in the title of one of the selected objects could be considered pejorative.7 We expect participants to alter
metadata using our app if they find the usage of such terms inappropriate to make the representation
of LGBTQ+ related artefacts more inclusive.
      </p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions and Future Work</title>
      <p>In the cultural heritage domain, it is a known problem that cultural biases are transmitted through digital
artefacts’ metadata. To mitigate the risks, the stakeholders in this domain have published multiple
documents with various strategies. However, to the metadata developers, there is little empirical
knowledge available on how metadata can be altered to reduce biases and represent heritage inclusively.
We developed a web application Alter Heritage intended for research purposes. The tool’s aim is
to gather knowledge on how domain experts alter metadata of cultural heritage artefacts to make
their representation inclusive. The core functionality of Alter Heritage was developed according to
the requirements extracted from the domain documents, such as policies and guidelines. In Alter
Heritage, users can make changes in (biased) metadata: edit, hide from the view, remove, or add content.
Researchers can collect these changes to analyse which alterations contribute to making metadata
inclusive, how changes difer per artefact or user.</p>
      <p>In future work, Alter Heritage could be updated to support more metadata editing functionality,
which we left out: images manipulation and support of external thesauri for auto filling of subject terms.
Another additional feature could be a user-friendly admin panel for researchers, in which they can set
up the desired components in the app interface (such as optional consent screen or questionnaire) and
import/export metadata and annotations.</p>
      <sec id="sec-5-1">
        <title>4https://nodejs.org/en 5https://homosaurus.org 6https://apis.europeana.eu/en 7https://homosaurus.org/v3/homoit0000227</title>
        <p>This work is a part of the “Culturally Aware AI” project funded by NWO (Dutch Research Council). We
thank the pilot users Thomas Schoegje (Utrecht University), Lynda Hardman (CWI), and Dirk Meijer
(CWI) for their input and feedback. Thank you, Dirk, for helping us with the app installation on the
server as well.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Nesterov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Hollink</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. van Ossenbruggen</surname>
          </string-name>
          ,
          <article-title>How Contentious Terms About People and Cultures are Used in Linked Open Data</article-title>
          ,
          <source>in: Proceedings of the ACM Web Conference</source>
          <year>2024</year>
          , WWW '24,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2024</year>
          , p.
          <fpage>4523</fpage>
          -
          <lpage>4533</lpage>
          . doi:
          <volume>10</volume>
          .1145/ 3589334.3648140.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Rinn</surname>
          </string-name>
          ,
          <article-title>Nineteenth-Century Depictions of Disabilities and Modern Metadata: A Consideration of Material in the P. T. Barnum Digital Collection</article-title>
          ,
          <source>Journal of Contemporary Archival Studies</source>
          <volume>5</volume>
          (
          <year>2018</year>
          ). URL: https://elischolar.library.yale.edu/jcas/vol5/iss1/1.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Chilcott</surname>
          </string-name>
          ,
          <article-title>Towards protocols for describing racially ofensive language in UK public archives</article-title>
          ,
          <source>Archival Science</source>
          <volume>19</volume>
          (
          <year>2019</year>
          )
          <fpage>359</fpage>
          -
          <lpage>376</lpage>
          . doi:
          <volume>10</volume>
          .1007/s10502- 019- 09314- y.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>J.</given-names>
            <surname>Guiliano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Heitman</surname>
          </string-name>
          ,
          <article-title>Dificult Heritage and the Complexities of Indigenous Data</article-title>
          ,
          <source>Journal of Cultural Analytics</source>
          <volume>4</volume>
          (
          <year>2019</year>
          ). doi:
          <volume>10</volume>
          .22148/16.044.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>K.</given-names>
            <surname>Wright</surname>
          </string-name>
          ,
          <article-title>Archival interventions and the language we use</article-title>
          ,
          <source>Archival Science</source>
          <volume>19</volume>
          (
          <year>2019</year>
          )
          <fpage>331</fpage>
          -
          <lpage>348</lpage>
          . doi:
          <volume>10</volume>
          .1007/s10502- 019- 09306- y.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Antracoli</surname>
          </string-name>
          <string-name>
            <surname>A.</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Berdini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Bolding</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Charlton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ferrara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Johnson</surname>
          </string-name>
          , K. Rawdon,
          <article-title>Archives for Black Lives in Philadelphia: Anti-Racist Description Resources</article-title>
          .,
          <year>2020</year>
          . URL: https://github.com/ a4blip/A4BLiP.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>E.</given-names>
            <surname>Kaldeli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Menis-Mastromichalakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bekiaris</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ralli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Tzouvaras</surname>
          </string-name>
          , G. Stamou,
          <article-title>CrowdHeritage: Crowdsourcing for Improving the Quality of Cultural Heritage Metadata</article-title>
          ,
          <source>Information</source>
          <volume>12</volume>
          (
          <year>2021</year>
          ). doi:
          <volume>10</volume>
          .3390/info12020064.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>The</given-names>
            <surname>Trans Metadata Collective</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Burns</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Cronquist</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Murphy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Rawson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Schaefer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Simons</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. M.</given-names>
            <surname>Watson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Williams</surname>
          </string-name>
          ,
          <source>Metadata Best Practices for Trans and Gender Diverse Resources</source>
          ,
          <year>2023</year>
          . doi:
          <volume>10</volume>
          .5281/zenodo.10076876.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>D.</given-names>
            <surname>Zowghi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Coulin</surname>
          </string-name>
          , Requirements Elicitation:
          <article-title>A Survey of Techniques, Approaches</article-title>
          , and Tools, Springer Berlin Heidelberg, Berlin, Heidelberg,
          <year>2005</year>
          , pp.
          <fpage>19</fpage>
          -
          <lpage>46</lpage>
          . doi:
          <volume>10</volume>
          .1007/3- 540- 28244-
          <issue>0</issue>
          _
          <fpage>2</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>H.</given-names>
            <surname>Holbrook</surname>
          </string-name>
          ,
          <article-title>A scenario-based methodology for conducting requirements elicitation</article-title>
          ,
          <source>SIGSOFT Softw. Eng. Notes</source>
          <volume>15</volume>
          (
          <year>1990</year>
          )
          <fpage>95</fpage>
          -
          <lpage>104</lpage>
          . doi:
          <volume>10</volume>
          .1145/382294.382725.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>N.</given-names>
            <surname>Shopland</surname>
          </string-name>
          ,
          <string-name>
            <surname>A Practical</surname>
          </string-name>
          <article-title>Guide to Searching LGBTQIA Historical Records, LGBTQ histories</article-title>
          , Routledge, Taylor &amp; Francis Group, Abingdon, Oxon,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>