<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Proceedings of the First International Workshop of Semantic Digital Humanities co-located with the Extended Semantic Web Conference 2024</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Oleksandra Bruns</string-name>
          <email>oleksandra.bruns@fiz-karlsruhe.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
          <xref ref-type="aff" rid="aff7">7</xref>
          <xref ref-type="aff" rid="aff8">8</xref>
          <xref ref-type="aff" rid="aff9">9</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrea Poltronieri</string-name>
          <email>andrea.poltronieri2@unibo.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
          <xref ref-type="aff" rid="aff7">7</xref>
          <xref ref-type="aff" rid="aff8">8</xref>
          <xref ref-type="aff" rid="aff9">9</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lise Stork</string-name>
          <email>l.stork@uva.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
          <xref ref-type="aff" rid="aff7">7</xref>
          <xref ref-type="aff" rid="aff8">8</xref>
          <xref ref-type="aff" rid="aff9">9</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tabea Tietz</string-name>
          <email>tabea.tietz@fiz-karlsruhe.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
          <xref ref-type="aff" rid="aff7">7</xref>
          <xref ref-type="aff" rid="aff8">8</xref>
          <xref ref-type="aff" rid="aff9">9</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Eggenstein-Leopoldshafen, Germany</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science and Engineering, University of Bologna</institution>
          ,
          <addr-line>Mura Anteo Zamboni 7, 40126 Bologna</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Digital Humanities</institution>
          ,
          <addr-line>Knowledge Graphs, Semantic Representation, Large Language Models, Cultural</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>FIZ Karlsruhe - Leibniz Institute for Information Infrastructure</institution>
          ,
          <addr-line>Hermann-von-Helmholtz-Platz 1, 76344</addr-line>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Informatics Institute, University of Amsterdam</institution>
          ,
          <addr-line>Science Park 900, 1098 XH Amsterdam</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Karlsruhe Institute of Technology (AIFB)</institution>
          ,
          <addr-line>Kaiserstr. 89, 76133 Karlsruhe</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>Andrea Poltronieri, University of Bologna</institution>
        </aff>
        <aff id="aff6">
          <label>6</label>
          <institution>Harald Sack, FIZ Karlsruhe, Karlsruhe Institute of Technology</institution>
        </aff>
        <aff id="aff7">
          <label>7</label>
          <institution>Lise Stork, University of Amsterdam</institution>
        </aff>
        <aff id="aff8">
          <label>8</label>
          <institution>Oleksandra Bruns, FIZ-Karlsruhe, Karlsruhe Institute of Technology</institution>
        </aff>
        <aff id="aff9">
          <label>9</label>
          <institution>Tabea Tietz, FIZ-Karlsruhe, Karlsruhe Institute of Technology</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <abstract>
        <p>Exploration, analysis, and preservation of the rich cultural and historical tapestry of the world are essential for our understanding of humanity's past and shaping our future. In recent years, there has been increased interest in the creation and application of Ontologies, Knowledge Graphs, and other Semantic Web Technologies within cultural heritage (CH) and digital humanities (DH). However, to date, the distinct areas of expertise, methodologies and traditions across the fields have led to a noticeable gap between tech solutions and humanities' needs. The aim of the International Workshop of Semantic Digital Humanities (SemDH) was to bridge this division and encourage closer collaboration and networking across diverse fields.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR
ceur-ws.org</p>
    </sec>
    <sec id="sec-2">
      <title>2. Steering Committee</title>
      <p>LGOBE
∗Corresponding author.
• Torsten Schrade, Academy of Sciences and Literature Mainz
• Stefan Schlobach, Vrije Universiteit Amsterdam
• Valentina Presutti, University of Bologna
• Catherine Faron, Université Côte d’Azur
• Eero Hyvönen, Aalto University
• Francesca Tomasi, University of Bologna
• Mareike König, German Historical Institute Paris
• Laura Hollink, Centrum Wiskunde Informatica
3. Program Committee
• Valentina Carriero, Cefriel
• Stefano De Giorgis, University of Bologna
• Daniil Dobriy, Vienna University of Economics and Business
• Catherine Faron, Université Côte d’Azur
• Ariana Graciotti, University of Bologna
• Ivan Heibi, University of Bologna
• Eero Hyvönen, Aalto University and University of Helsinki
• Nitisha Jain, King’s College London
• Mareike König, German Historical Institute Paris
• Nicolas Lazzari, University of Bologna
• Delfina Sol Martinez Pandiani, Centrum Wiskunde &amp; Informatica
• Andrei Nesterov, Centrum Wiskunde, Informatica
• Harald Sack, FIZ Karlsruhe, Karlsruhe Institute of Technology
• Stefan Schlobach, Vrije Universiteit Amsterdam
• Torsten Schrade, Academy of Sciences and Literature Mainz
• Sarah Shoilee, Vrije Universiteit Amsterdam
• Francesca Tomasi, University of Bologna
• Mary Ann Tan, FIZ Karlsruhe, Karlsruhe Institute of Technology
• Mahsa Vafaie, FIZ Karlsruhe, Karlsruhe Institute of Technology
• Shenghui Wang, University of Twente
• Andreas Weber, University of Twente
• Viktor de Boer, Vrije Universiteit Amsterdam
• Jacco van Ossenbruggen, Vrije Universiteit Amsterdam</p>
    </sec>
    <sec id="sec-3">
      <title>4. Preface</title>
      <p>The International Workshop of Semantic Digital Humanities (SemDH’24)1 invited members of
the Semantic Web, CH, and DH communities that were actively involved in the development of
systematic approaches and advanced technologies for handling heterogeneous, diverse, and
challenging humanities data. It provided a platform to researchers to share and discuss their
challenges and showcase research findings. Moreover, it encouraged closer collaboration and
networking across these diverse fields.</p>
      <p>The proposed topics included, but were not limited to, constructing and utilizing knowledge
graphs for the humanities, extracting and representing knowledge from historical texts, data
linking across disciplines, enriching the semantics of historical records and biographies,
analyzing social networks, and adopting, extending, and evaluating ontologies for specific domains.
The participants were also encouraged to submit works using a combination of Semantic
Technologies with more recent technologies, such as Large Language Models (LLMs), to tackle the
diverse challenges associated with working with historical data.</p>
      <p>There were 16 papers submitted for peer-review to this workshop. Out of these, 12 papers
were accepted for this volume, 6 as regular papers and 6 as short papers. The range of accepted
papers demonstrated a focus not only on technological endeavors, but also on the analysis
of CH and DH data landscapes from diverse perspectives. Full research and resource papers
were presented in two full paper presentation sessions, while work-in-progress contributions
were discussed in a dedicated poster session. Additionally, the workshop featured a keynote by
Marieke van Erp titled ”Knowing What You Don’t Know - On Gaps, Quality Issues, and Other
Challenges in DH Use Cases.” Following the keynote was a panel discussion with Harald Sack,
Marieke van Erp, Enrico Daga, Lise Stork, Victor de Boer, and Eljas Oksanen.</p>
      <p>The workshop’s success was demonstrated not only by the high number of submissions but
also by the level of engagement from participants, the quality of discussions, and the breadth
of topics covered, showing the importance of the workshop in fostering collaboration and
advancing the integration of semantic technologies within the domains of cultural heritage and
digital humanities.</p>
      <p>June, 2024</p>
      <sec id="sec-3-1">
        <title>Oleksandra Bruns, Andrea Poltronieri, Lise Stork, and Tabea Tietz</title>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>5. Contents</title>
      <sec id="sec-4-1">
        <title>Full Paper Session I</title>
        <p>Bistìris Ontology: Towards a Structured Representation of Sardinian Traditional Female
Costumes.</p>
        <p>Giorgio Corona, Dario Guidotti, Laura Pandolfo and Luca Pulina
Exploring Prosopographical Information in thex Virtual Record Treasury of Ireland’s Knowledge
Graph for Irish History.</p>
        <p>Beyza Yaman, Lucy McKenna, Alex Randles, Lynn Kilgallon, Peter Crooks and Declan O’Sullivan
Publishing Numismatic Public Finds on the Semantic Web for Digital Humanities Research –
CoinSampo Linked Open Data Service and Semantic Portal.</p>
        <p>Heikki Rantala, Eljas Oksanen, Frida Ehrnsten and Eero Hyvönen</p>
      </sec>
      <sec id="sec-4-2">
        <title>Full Paper Session II</title>
        <p>Towards a semantic representation of Egyptian demonology: requirements and benchmark
study.</p>
        <p>Bruno Sartini and Rita Lucarelli
eXtreme Design for Ontological Engineering in the Digital Humanities with Viewsari, a
Knowledge Graph of Giorgio Vasari’s The Lives.</p>
        <p>Sarah Rebecca Ondraszek, Grischka Petri, Ulrike Blumenthal, Lisa Dieckmann, Etienne Posthumus
and Harald Sack
A Corpus of Biblical Names in the Greek New Testament to Study the Additions, Omissions,
and Variations across diferent Manuscripts.</p>
        <p>Christoph Werner, Zacharias Shoukry, Soham Al-Suadi and Frank Krüger</p>
      </sec>
      <sec id="sec-4-3">
        <title>Keynote: “Knowing what you don’t know - on gaps, quality issues and other challenges in DH use cases”</title>
        <p>Dr. Marieke van Erp
Working with historical data is not for the faint of heart: data can be incomplete, warped
through digitisation artefacts, or dificult to understand as language and society have changed.
However, it also provides a window on the past that can help us understand today’s society
better. In this talk, I will give examples of DH use cases that take on gaps in data, quality issues
and other challenges head-first to make computational methods better suited to them.</p>
      </sec>
      <sec id="sec-4-4">
        <title>Panel Discussion: “Bridging the Gap: Semantic Web and Digital Humanities”</title>
        <p>Prof. Dr. Harald Sack, Dr. Marieke van Erp, Dr. Enrico Daga, Dr. Lise Stork, Dr. Victor de Boer,
and Dr. Eljas Oksanen.</p>
        <p>The panel discussion dived into the intersections of Semantic Web and Digital Humanities,
exploring challenges, perspectives, and strategies for fostering collaboration between these fields.
Experts from both domains shared insights on collaborative research projects, digital methods,
and interdisciplinary education. Key themes included the need for responsible AI, user-centric
design, and the importance of aligning technological solutions with the needs of humanities
researchers. The panel emphasized the necessity of mutual learning, interdisciplinary education,
and ongoing dialogue to bridge the gap between Semantic Web and Digital Humanities, aiming
to enhance the integration of semantic technologies within humanities research.</p>
      </sec>
      <sec id="sec-4-5">
        <title>Poster Session for Short and Position Papers</title>
        <p>PaleOrdia: Semantically Describing (Cuneiform) Paleography using Paleographic Linked Open
Data.</p>
        <p>Timo Homburg
Towards LLM-based Semantic Analysis of Historical Legal Documents.</p>
        <p>Tania Litaina, Andreas Soularidis, Georgios Bouchouras, Konstantinos Kotis and Evangelia Kavakli</p>
        <sec id="sec-4-5-1">
          <title>Sustainable Semantics for Sustainable Research Data.</title>
          <p>Stefen Hennicke, Pascal Belouin, Hassan El Hajj, Matthew Fielding, Robert Casties and Kim Pham</p>
        </sec>
        <sec id="sec-4-5-2">
          <title>Non-Canonical Acts and their Topical Distribution.</title>
          <p>Christian Vrangbaek, Eva Vrangbaek, Márton Kardos, Kristofer Nielbo and Jacob Mortensen
Digitalisation Workflows in the Age of Transformer Models: A Case Study in Digital Cultural
Heritage.</p>
          <p>Mahsa Vafaie, Mary Ann Tan and Harald Sack</p>
        </sec>
        <sec id="sec-4-5-3">
          <title>FAIR Paper: Applying FAIR to Academic Publishing.</title>
          <p>Wouter Beek, Rick Maurits and Auke Rijpma</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>6. Summary of the Panel “Bridging the Gap: Semantic Web and</title>
    </sec>
    <sec id="sec-6">
      <title>Digital Humanities”</title>
      <p>The panel brought together six distinguished experts, Harald Sack, Marieke van Erp, Enrico
Daga, Lise Stork, Victor de Boer, and Eljas Oksanen. With these representatives from both,
the Semantic Web and (Digital) Humanities fields the goal was to to discuss the challenges,
gaps, perspectives, and future steps for a closer collaboration between these domains. In a
two-minute statement, each panelist provided an outline of their position on the intersection
of Semantic Web and Digital Humanities, sharing their experiences, and identifying the main
challenges they acquired. Afterwards, the panelists were directed with concrete questions, e.g.
their views on the efectiveness of Semantic Web solutions in addressing the needs of (Digital)
Humanities; on main diferences between the fields, and proposing strategies to bridge the gap.
Additionally, the audience was actively encouraged to engage with the panelists by posing their
own questions, which ranged from the reasons on possible reluctance of humanists to adopt
Semantic Web technologies, e.g. Knowledge Graphs, to the dificulty of teaching SPARQL to
humanists, and possible strategies for dealing with data quality and incompleteness in Digital
Humanities research. The results of the discussion can be summarized in the following key
challenges and the proposals on addressing them:
• Diferences in Methodologies. The panelists emphasized the critical evaluation
required when using digital methods in the humanities, acknowledging the imperfections
and complexities of historical data. Possible Solution: A more user-centered approach,
as well as the adoption of responsible AI and fostering mutual learning between fields.
• Challenges in Digital Methods and Data Ownership. On example of historical videos,
issues such as intellectual property rights and ownership of data post-project completion
were mentioned. Possible Solution: Clear guidelines on data ownership and rights
management in digital projects were highlighted.
• Research Bubbles. According to the discussion, there are growing concerns about
whether fields such as Digital Humanities and the Semantic Web are efectively
addressing the problems of the Humanities. The focus of Digital Humanities has shifted towards
self-fulfillment rather than directly meeting the specific needs of the field. Nonetheless,
they provide important tools that foster novel approaches to evidence interpretation, and
humanists require knowledge of how to use these tools. Similarly, concerns have been
raised about Semantic Web methods in Humanities, particularly regarding whether
ontologies, often simplified representations, truly address the needs of humanities researchers.
To address this, computer scientists require to possess a fundamental understanding of
humanities principles and perspectives. Possible Solution: Interdisciplinary education
and mutual learning, the development of tools that align with the needs of humanists.
• Diferences in Research Paces. The difering research paces between Semantic Web
and Humanities projects and the importance of aligning research questions with project
goals were mentioned. Possible Solution: Closer collaboration and communication to
synchronize research timelines and objectives.
• Training and Mutual Learning. The solution to many of the highlighted challenges
lies in training humanists in computer science and fostering interdisciplinary learning.
There is an ongoing debate about whether humanists should be trained in technical skills
like SPARQL queries and ontology development, and how to efectively implement this
training. Additionally, it was pointed out that computer scientists should also learn more
about history and humanities methods. The core challenge is how to efectively provide
this kind of training and develop a curriculum that bridges the gap between the fields while
motivating experts to engage in mutual learning. Possible solution: To efectively train
humanists in technical approaches, it is crucial to focus on familiarizing them with digital
methods and tools, rather than requiring coding skills. In countries like Finland, digital
methods are already integrated into humanists’ curriculum, and many Digital Humanities
students are established historians. For more experienced researchers, workshops, short
courses, and clear communication emphasizing practical applications of digital tools
are key. Fostering interdisciplinary collaboration through joint projects enables mutual
learning between humanists and computer scientists. Developing a curriculum that
showcases the practical benefits of digital methods can motivate humanists to embrace
these new approaches.
• Data Quality and Provenance: Understanding data quality and provenance poses a
significant challenge in Digital Humanities research. Without proper mechanisms in
place, ensuring the accuracy and reliability of data becomes dificult. Possible Solution:
Utilizing machine learning techniques can aid in identifying gaps in data, helping to
enhance data quality. Additionally, employing SPARQL queries with proper provenance
is crucial for maintaining the accuracy and reliability of data sources.</p>
      <p>Overall, the panel underscored the need for deeper interdisciplinary collaboration, mutual
education, and the development of tools and methodologies that truly meet the needs of both
Semantic Web and Digital Humanities researchers.
We want to thank the contributors and the steering and program committee members for their
work. This work has been supported the by Deutsche Forschungsgemeinschaft (DFG), project
NFDI4Culture (project number 441958017), and EU’s Horizon Europe research and innovation
programme, the MUHAI project (grant agreement no. 951846).</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>