<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Proceedings of the 3rd International Workshop on Semantic Technologies and Deep Learning Models for Scientific, Technical and Legal Data co-located with the Extended Semantic Web Conference 2025</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Rima Dessi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Joy Jeenu</string-name>
          <email>joy.jeenu@fiz-karlsruhe.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Danilo Dessi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Francesco Osborne</string-name>
          <email>francesco.osborne@open.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hidir Aras</string-name>
          <email>hidir.aras@fiz-karlsruhe.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>- Rima Dessi, Higher Colleges of Technology</institution>
          ,
          <addr-line>Sharjah, UAE. - Joy Jeenu, FIZ Karlsruhe, Germany. - Danilo Dessi</addr-line>
          ,
          <institution>College of Computing and Informatics, University of Sharjah</institution>
          ,
          <addr-line>Sharjah, UAE. - Francesco Osborne</addr-line>
          ,
          <institution>The Open University</institution>
          ,
          <addr-line>Milton Keynes</addr-line>
          ,
          <country country="UK">United Kingdom.</country>
          <addr-line>- Hidir Aras, FIZ Karlsruhe</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>College of Computing and Informatics, University of Sharjah</institution>
          ,
          <addr-line>Sharjah, UAE</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>FIZ Karlsruhe - Leibniz Institute for Information Infrastructure</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Higher Colleges of Technology</institution>
          ,
          <addr-line>Sharjah, UAE</addr-line>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Knowledge Media Institute, The Open University</institution>
          ,
          <addr-line>Milton Keynes</addr-line>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
      </contrib-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Committee
The rapid expansion of publicly available scientific, technical, and legal documents, such as patents,
reports, and research articles, has made large-scale processing and analysis increasingly vital. Every
day, researchers, patent analysts, inventors, and other professionals (e.g., information scientists,
legal experts) contribute to this growing body of data through publications, technical reports, and
patent filings. However, analyzing and navigating these documents presents significant challenges</p>
      <p>Rima Dessi, Joy Jeenu, Danilo Dessi, Francesco Osborne, and Hidir Aras
due to their considerable length, the use of specialized vocabulary, and their coverage of diverse
scientific domains. These documents are often semi-structured, combining unstructured text with
structured elements such as tables, mathematical expressions, diagrams, and domain-specific content
like chemical compounds or biological sequences. This combination of formats and domain complexity
significantly increases the dificulty of processing them efectively.</p>
      <p>To benefit from the scientific and technical knowledge present in such documents, such as for
decision-making or professional search and analytics, there is an urgent need to analyze, enrich, and
link this data by employing state-of-the-art Semantic Web technologies and AI methods. However,
as they are heterogeneous and are written using domain-specific terminology, applying the existing
semantic technologies is not straightforward. To address the challenges mentioned above, Semantic
Web Technologies, Natural Language Processing (NLP) techniques, and Deep Neural Networks
(DNN) must be leveraged to provide eficient and efective solutions for creating easily accessible
and machine-understandable knowledge of science and industry.</p>
      <p>To this end, the goal of the SemTech4STLD workshop5 was to provide a meeting forum for
academics and industry professionals to come together and discuss topics such as the application of
Semantic Web Technologies and Deep Learning Models to scientific, technical, and legal data.
Further, the primary objective of the workshop was to promote collaboration among the participants
and exchange ideas. The workshop started with a keynote entitled “Evaluation Challenges in Using
Generative AI for Science and Technical Contents” by Prof. Dr. Paul Groth.</p>
      <p>Overall, the workshop’s success can be demonstrated by the high number of participants. Further,
during the workshop, many participants joined the discussions, asked questions, and exchanged ideas
about the application of Semantic Web Technologies and Machine Learning models on Scientific,
Technical, and Legal Data. We believe this workshop has helped participants build a new network
and has encouraged future projects. We definitely plan to organize the 4th edition of this workshop.
Keynote Talk Prof. Dr. Paul Groth
Keynote on Evaluation Challenges in Using Generative AI for Science &amp; Technical
Content .</p>
    </sec>
    <sec id="sec-2">
      <title>Keynote Abstract:</title>
      <p>Foundation Models show impressive results in a wide range of tasks on scientific and legal content
from information extraction to question answering and even literature synthesis. However, standard
evaluation approaches (e.g., comparing to ground truth) often do not seem to work. Qualitatively,
the results look great, but quantitative scores do not align with these observations. In this talk, I
discuss the challenges we have faced in our lab in evaluation. I then outline potential routes forward.
Papers presented at SemTech4STLD
Evaluating LLMs for Named Entity Recognition in Scientific Domain with Fine-Tuning
and Few-Shot Learning. Davide Buscaldi, Danilo Desı,s` Francesco Osborne, Davide Piras and
Diego Reforgiato Recupero
5 https://semtech4stld.github.io/
June 2025
Contents</p>
    </sec>
    <sec id="sec-3">
      <title>Rima Desı,s` Joy Jeenu, Danilo Desı,s` Francesco Osborne, and Hidir Aras</title>
      <p>Taming Hallucinations: A Semantic Matching Evaluation Framework for LLM-Generated
Ontologies. Nadeen Fathallah, Stefen Staab and Alsayed Algergawy
Benchmarking Large Language Models for Sustainable Development Goals
Classification: Evaluating In-Context Learning and Fine-Tuning Strategies. Andrea Cadeddu,
Alessandro Chessa, Vincenzo De Leo, Gianni Fenu, Enrico Motta, Francesco Osborne, Diego
Reforgiato Recupero, Angelo Salatino and Luca Secchi
Enabling Natural Language Access to BIM Models with AI and Knowledge Graphs.
Andrea Ibba, Ruebn´ Alonso and Diego Reforgiato Recupero</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>