<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Lernen, (LWDA) Wissen, Daten, Analysen Conference Proceedings</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>LWDA'</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>CEUR Workshop Proceedings</string-name>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>I
© 2023 for the individual papers by the papers’ authors. Copying permitted for private and academic purposes. This volume
is published and copyrighted by its editors.</p>
    </sec>
    <sec id="sec-2">
      <title>Preface</title>
      <p>LWDA 2023 conference provides a joint forum for experienced and young researchers, to
bring insights to recent trends, technologies and applications and to promote interaction in the
research field of big data and beyond.</p>
      <p>The acronym LWDA expands in German to “Lernen. Wissen. Daten. Analysen.” (English:
“Learning. Knowledge. Data. Analytics.”). Recent research in the field is presented and
discussed from the viewpoint of machine learning, data mining, knowledge extraction,
knowledge management, information retrieval, personalization, database management,
information systems, big data management and big data analytics to name a few.</p>
      <p>The LWDA conference series comprises the workshops BIA, DB, FGWM, IR and KDML
which are organized by the respective special interest groups within the German Computer
Science Society:
–
–
–
–
–</p>
      <p>FG-BIA 2023 – Business Intelligence &amp; Analytics
FG-DB 2023 - Database Systems
FG-FGWM 2023 - Knowledge Management
FG-IR 2023 - Information Retrieval</p>
      <p>FG-KDML 2023 - Knowledge Discovery, Data Mining and Machine Learning
The papers published in LWDA 2023 proceedings have been selected by independent
program committees from the respective fields. The program consists of four invited keynotes
and two joint research sessions as well as the community meetings of the special interest
groups. In addition to these joint sessions, there are five parallel research sessions for each of
the workshops focusing on more specific topics. A joint poster session gives all presenters the
opportunity to discuss their work in a broader context. This year’s social program includes a
city tour for further interaction on the second evening.</p>
      <p>Our distinguished keynote speakers are:
–
–
–
–</p>
      <p>Prof. Dr. Hannes Mühleisen - Radboud Universiteit Nijmegen
Prof. Dr. Erhard Rahm – University of Leipzig
Prof. Dr. Michael Granitzer – University of Passau</p>
      <p>Dr. Dietrich Alexander Herberg</p>
      <p>The working group for Digitization &amp; Process Management at the Philipps-University of
Marburg is proud to host the LWDA 2023 conference. For the technical program the organizer
would like to thank the workshop chairs and their programme committees for their hard work
as well as the keynote speakers for their inspiring talks. We hope the participants will keep the
venue as an inspiring event with fruitful discussions in mind and the readers will enjoy
studying the scientific contributions in this proceedings volume. The proceedings are published with
CEUR and can be found here. http://ceur-ws.org/Vol-1917
Michael Leyer</p>
      <sec id="sec-2-1">
        <title>Program Chairs</title>
        <p>Tanja Auge
Henning Baars
Andreas Henrich
Thomas Mandl
Thorsten Papenbrock
Pascal Reuss
Jakob Schönborn
Helge Spieker
Felix Stamm</p>
      </sec>
      <sec id="sec-2-2">
        <title>Program Committee</title>
        <p>Bernhard Seeger
Hazar Harmouch
Uta Störl
Stefan Schulte
Fabian Panse
Benjamin Hättasch
Annett Ungethüm
Marina Tropmann-Frick
Hannes Grunert
Carsten Felden
Ralf Finger
Sebastian Olbrich
Martin Atzmueller
Christian Bauckhage
Ulf Brefeld
Mirko Bunse
Dennis Groß
Andreas Hotho
Eyke Hüllermeier
Robert Jäschke
Christian Kühnert
Thomas Liebig
Thomas Seidl</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Conference Organization</title>
      <p>Philipps-University of Marburg/Queensland University of Technology
University of Regensburg
University of Stuttgart
Otto-Friedrich-University of Bamberg
University of Hildesheim
Philipps-University of Marburg
University of Hildesheim
University of Hildesheim
Simula Research Laboratory
Rheinisch-Westfälische Technische Hochschule Aachen
Philipps-University of Marburg
Hasso-Plattner-Institute
Fernuniversität Hagen
TUHH Institute for Data Engineering
Hasso-Plattner-Institute
TU Darmstadt
TU Dresden
HAW Hamburg
University of Rostock
TU Bergakademie Freiberg
Information Works
Deloitte
University of Osnabrück
Fraunhofer IAIS / University of Bonn
Leuphana University of Lüneburg
TU Dortmund
Radboud University
University of Würzburg
LMU Munich
HU Berlin
Fraunhofer IOSB
TU Dortmund
LMU Munich
TU Wien
Fraunhofer IAIS / University of Bonn
Norwegian University of Science and Technology
University of Würzburg
University of Trier
Universiy of Applied Sciences of Hanover
University of Hildesheim
University of Trier
University of Applied Sciences Neu-Ulm
University of Nuremberg-Erlangen
University of Applied Sciences of St. Gallen
University of Hildesheim
DFKI
denkbares
Philipps-University of Marburg
University of Frankfurt
University of Hildesheim
HU Berlin
University of Applied Sciences of Saarbrücken
University of Duisburg-Essen
Christian-Albrechts-University of Kiel
Technical Applied University of Cologne
Applied University of Coburg
HAW Hamburg
Technical Applied University of Cologne
University of Trier</p>
      <sec id="sec-3-1">
        <title>Maike Holtkemper, Maria Potanin, Alexander Oberst and Christian Beecks</title>
      </sec>
      <sec id="sec-3-2">
        <title>Till Carlo Schelhorn, Jonas Gunklach and Alexander Maedche</title>
        <p>Designing an Analytical Control Chart System with ML-predicted Quality Characteristics............................ 14</p>
      </sec>
      <sec id="sec-3-3">
        <title>Benedikt Augenstein and Darjan Salaj</title>
      </sec>
      <sec id="sec-3-4">
        <title>Paul Herbst and Henning Baars</title>
      </sec>
      <sec id="sec-3-5">
        <title>Maximilian Werling, Patrick Weber and Heiner Lasi</title>
      </sec>
      <sec id="sec-3-6">
        <title>Jens F. Lachenmaier, Maximilian Werling and Dominik Morar</title>
      </sec>
      <sec id="sec-3-7">
        <title>Christoph Großmann and Johannes Schildgen</title>
        <p>Governance of Artificial Intelligence – A Framework Towards Ethical AI Applications ................................ 63
Integrating Machine Learning into SQL with Exasol ....................................................................................... 73</p>
      </sec>
      <sec id="sec-3-8">
        <title>Maximilian Langohr, Tim Vogler and Klaus Meyer-Wegener</title>
      </sec>
      <sec id="sec-3-9">
        <title>Chimi Wangmo and Lena Wiese</title>
        <p>Database and Workflow Optimizations for Spatial-Geometric Queries in GeoMine ....................................... 86</p>
      </sec>
      <sec id="sec-3-10">
        <title>Martin Poppinga, Joel Graef, Konrad Diedrich, Matthias Rarey and Norbert Ritter</title>
        <p>SKYSHARK: A Benchmark with Real-world Data for Line-rate Stream Processing with FPGAs ................. 98
SuMExplorer: Summarisation-based Frequent Subgraph Mining for Visual Exploratory Subgraph Searching 110
Enhancing Data Acquisition and Fault Analysis for Large-Scale Facilities: A Case Study on the
Laser-Based Synchronization System at the European X-Ray Free-Electron Laser......................................... 121</p>
        <p>Arne Grünhagen, Maximilian Schütte, Annika Eichler, Marina Tropmann-Frick and Görschwin Fey
Heterogeneity in NoSQL Databases —Challenges of Handling schema-less Data .......................................... 134</p>
        <p>Mark Lukas Möller, Dominique Hausler, Sebastian Strasser, Tanja Auge and Meike Klettke
Pythagoras: Semantic Type Detection of Numerical Data Using Graph Neural Networks .............................. 146</p>
      </sec>
      <sec id="sec-3-11">
        <title>Sven Langenecker, Christoph Sturm, Christian Schalles and Carsten Binnig</title>
        <p>Patient trajectory visualization for FHIR healthcare data: A use case on melanoma patients .......................... 153
Meijie Li, Wolfgang Galetzka, Bahadir Eryilmaz, Georg Christian Lodde, Elisabeth Livingstone,</p>
      </sec>
      <sec id="sec-3-12">
        <title>Jörg Schlötterer and Christin Seifert</title>
        <p>Comparative Survey of German Hate Speech Datasets: Background, Characteristics and Biases ................... 207</p>
      </sec>
      <sec id="sec-3-13">
        <title>Martin Bullin and Andreas Henrich</title>
      </sec>
      <sec id="sec-3-14">
        <title>Sebastian Diem and Thomas Mandl</title>
      </sec>
      <sec id="sec-3-15">
        <title>Markus Bertram, Johannes Schäfer and Thomas Mandl</title>
      </sec>
      <sec id="sec-3-16">
        <title>Philipp Schaer, Svetlana Myshkina and Jüri Keller</title>
      </sec>
      <sec id="sec-3-17">
        <title>Leon Martin and Andreas Henrich</title>
      </sec>
      <sec id="sec-3-18">
        <title>Tobias Hirmer, Michaela Ochs and Andreas Henrich</title>
      </sec>
      <sec id="sec-3-19">
        <title>Joachim Baumeister and Valentin Roß</title>
      </sec>
      <sec id="sec-3-20">
        <title>Andreas Korger and Joachim Baumeister</title>
        <p>Vertical Search Scenarios within a Digital Study Planning Assistant............................................................... 239
Integrating BDI Agents with the MATSim Traffic Simulation for Autonomous Mobility on Demand ........... 247</p>
      </sec>
      <sec id="sec-3-21">
        <title>Marcel Mauri, Ömer Ibrahim Erduran, Thu Pham Dieu Anh and Mirjam Minor</title>
        <p>KIRETT: Knowledge-Graph-Based Smart Treatment Assistant for Intelligent Rescue Operations................. 259</p>
      </sec>
      <sec id="sec-3-22">
        <title>Mubaris Nadeem, Johannes Zenkert, Lisa Bender, Christian Weber and Madjid Fathi</title>
        <p>SKOS-Utils: Developing and Checking SKOS Knowledge Graphs (Tool Presentation) ................................. 271
CLEARNESS: Coreference Resolution for Generating and Ranking Arguments Extracted
from Debate Portals for Queries ....................................................................................................................... 161</p>
      </sec>
      <sec id="sec-3-23">
        <title>Johannes Weidmann, Lorik Dumani and Ralf Schenkel</title>
      </sec>
      <sec id="sec-3-24">
        <title>Maik Fröbe, Jan Heinrich Reimer, Sean MacAvaney, Niklas Deckers, Janek Bevendorff,</title>
      </sec>
      <sec id="sec-3-25">
        <title>Benno Stein, Matthias Hagen and Martin Potthast</title>
        <p>Andreas Lommatzsch, Brandon Llanque, Vinay Srinath Rosenberg, Syed Ali Murad Tahir,</p>
      </sec>
      <sec id="sec-3-26">
        <title>Hristo Dimitrov Boyadzhiev and Maurice Walny</title>
        <p>The Data Dilemma: Google Analytics’ Untapped Potential and Web Data Literacy ....................................... 311</p>
      </sec>
      <sec id="sec-3-27">
        <title>Tom Alby</title>
        <p>Bridging the Gap: Examining the trust dimensions of smart contracts using supply chain applications .......... 325</p>
      </sec>
      <sec id="sec-3-28">
        <title>Wieland Müller and Michael Leyer</title>
        <p>A Feature-wise Comparative Assessment of the CBR-based Methodologies FLEA and SEASALT .............. 339</p>
        <p>Viktor Eisenstadt, Jessica Bielski, Christoph Langenhan, Klaus-Dieter Althoff and Andreas Dengel
Comparative Analysis of Text-Based CBR Algorithms for Cybercrime Profiling Investigations.................... 347</p>
      </sec>
      <sec id="sec-3-29">
        <title>Marc Krüger</title>
        <p>Cover Song Identification in Practice with Multimodal Co-Training ............................................................... 359</p>
      </sec>
      <sec id="sec-3-30">
        <title>Simon Hachmeier and Robert Jäschke</title>
        <p>Preprocessing Ground-Based Hyperspectral Image Data for Improving CNN-based Classification ............... 399</p>
      </sec>
      <sec id="sec-3-31">
        <title>Andreas Schliebitz, Heiko Tapken and Martin Atzmueller</title>
        <p>Free-Energy Advantage Functions for Policy Transfer to Noisy Environments with Safety Constraints ........ 414</p>
      </sec>
      <sec id="sec-3-32">
        <title>Pierre Haritz and Thomas Liebig</title>
        <p>Automatic Speech Detection on a Smart Beehive’s Raspberry Pi .................................................................... 424</p>
      </sec>
      <sec id="sec-3-33">
        <title>Pascal Janetzky, Philip Lissmann, Andreas Hotho and Anna Krause</title>
        <p>Comparing Humans and Algorithms in Feature Ranking: A Case-Study in the Medical Domain ................... 430</p>
        <p>Jonas Hanselle, Jaroslaw Kornowicz, Stefan Heid, Kirsten Thommes and Eyke Hüllermeier
Biomedical Event Extraction with Generative Language Models .................................................................... 442</p>
      </sec>
      <sec id="sec-3-34">
        <title>Fabio Barth, Leon Weber-Genzel and Ulf Leser</title>
        <p>Liquor-HGNN: A heterogeneous graph neural network for leakage detection in water distribution networks 454</p>
        <p>Melanie Schaller, Michael Steininger, Andrzej Dulny, Daniel Schlör and Andreas Hotho
A Document Tagging Support System for Nursing Care Experts..................................................................... 470</p>
      </sec>
      <sec id="sec-3-35">
        <title>Beat Tödtli1, Sebastian Müller, Melanie Rickenmann, Janine Vetsch and Simon Haug</title>
        <p>Efficient Light Source Placement using Quantum Computing ......................................................................... 478</p>
      </sec>
      <sec id="sec-3-36">
        <title>Sascha Mücke and Thore Gerlach</title>
        <p>Contextual Preselection Methods in Pool-based Realtime Algorithm Configuration....................................... 492</p>
        <p>Jasmin Brandt, Elias Schede, Shivam Sharma, Viktor Bengs, Eyke Hüllermeier and Kevin Tierney
A Few Models to Rule Them All: Aggregating Machine Learning Models..................................................... 506</p>
      </sec>
      <sec id="sec-3-37">
        <title>Florian Siepe, Phillip Wenig and Thorsten Papenbrock</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1
          <string-name>
            <given-names>Exploiting</given-names>
            <surname>Foundation</surname>
          </string-name>
          <string-name>
            <surname>Models for Spoken Language Identification ................................................................ 28</surname>
          </string-name>
          <article-title>Accelerating literature screening for systematic literature reviews with Large Language Models - development, application, and first evaluation of a solution</article-title>
          <string-name>
            <surname>.......................................................................... 41</surname>
          </string-name>
          <article-title>Datengenossenschaften als Datentreuhänder - Eine qualitative Analyse von</article-title>
          <string-name>
            <surname>Pilotprojekten........................... 52</surname>
          </string-name>
          <article-title>Preliminary Results of a Scientometric Analysis of the German Information</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Retrieval</given-names>
            <surname>Community</surname>
          </string-name>
          2020
          <article-title>-2023</article-title>
          <string-name>
            <surname>...................................................................................................................... 222</surname>
          </string-name>
          <article-title>A Testbed for Dual-Entity Knowledge</article-title>
          <string-name>
            <given-names>Panels .................................................................................................. 231 The</given-names>
            <surname>Information</surname>
          </string-name>
          <string-name>
            <surname>Retrieval Experiment Platform .............................................................................................. 175</surname>
          </string-name>
          <article-title>Applied Face Recognition in the</article-title>
          <string-name>
            <surname>Humanities ................................................................................................... 179</surname>
          </string-name>
          <article-title>Automatic Classification of Portraits: Application of Transformer and CNN Based Models for an Art Historic</article-title>
          <string-name>
            <surname>Dataset ................................................................................................................................</surname>
          </string-name>
          <article-title>192 Case-Based Sample Generation using Multi-Armed</article-title>
          <string-name>
            <surname>Bandits .......................................................................... 282</surname>
          </string-name>
          <article-title>Combining Information Retrieval and Large Language Models for a Chatbot that</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Generates</given-names>
            <surname>Reliable</surname>
          </string-name>
          ,
          <article-title>Natural-style</article-title>
          <string-name>
            <given-names>Answers ...................................................................................................... 298</given-names>
            <surname>Higher-Order</surname>
          </string-name>
          <string-name>
            <surname>DeepTrails</surname>
          </string-name>
          : Unified Approach to *
          <source>Trails</source>
          <string-name>
            <given-names>................................................................................... 372 Tobias</given-names>
            <surname>Koopmann</surname>
          </string-name>
          , Jan Pfister, André Markus, Astrid Carolus,
          <article-title>Carolin Wienrich and Andreas Hotho Fast k-Nearest-</article-title>
          <string-name>
            <surname>Neighbor-Consistent</surname>
            <given-names>Clustering ............................................................................................... 387 Lars</given-names>
          </string-name>
          <string-name>
            <surname>Lenssen</surname>
          </string-name>
          , Niklas Strahmann and Erich Schubert
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>