<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Toward Advanced Query Processing in Dataspaces</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Christoph Quix</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Fraunhofer Institute for Applied Information Technology FIT</institution>
          ,
          <addr-line>Sankt Augustin</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Hochschule Niederrhein University of Applied Sciences</institution>
          ,
          <addr-line>Krefeld</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Dataspaces aim at enabling inter-organizational data exchange, emphasizing interoperability and data sovereignty of data assets. While current implementations focus on providing a foundational framework to enable secure, standards-based data sharing and sovereignty, they lack the robust query processing features needed to address emerging demands in distributed and federated data ecosystems. We present a vision for advancing dataspace technology by incorporating sophisticated query processing mechanisms and integrating features that ensure data sovereignty within traditional data management platforms such as data lakes.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;dataspaces</kwd>
        <kwd>data integration</kwd>
        <kwd>federated query processing</kwd>
        <kwd>data sovereignty</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. The Need for Advanced Query</title>
    </sec>
    <sec id="sec-2">
      <title>Processing in Dataspaces</title>
      <p>
        The original idea of dataspaces as envisioned by Franklin
et al. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] emphasized lightweight data integration and
incremental development of an integrated, linked personal
dataspace. Dataspaces should provide basic data access and
interoperability between heterogeneous data sources while
progressively enhancing integration through user-driven
refinement and automated techniques. This approach
features flexibility and usability, allowing users to interact with
partially integrated data while supporting iterative
improvements in data organization and querying capabilities.
      </p>
      <p>
        In 2015, Fraunhofer in Germany started the Industrial
Dataspace (IDS) initiative [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] which created a new view on
dataspaces. Dataspaces were envisioned as a multi-sided
platform for secure and trusted data exchange, guaranteeing
data sovereignty with a decentralized architecture [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The
development is governed by an institutionalized alliance
of diverse stakeholders, i.e., the International Data Spaces
Association (IDSA)1. First ideas of the IDS outline a
dataspace as platform or market for data and services, in which
data is described semantically in (central) metadata
repositories. Data consumers can search the metadata for relevant
datasets, invoke data services to integrate, transform or
enrich data as desired, and finally use the data according to
defined usage policies [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. However, the work in the IDS
project focused on the deployment of a trusted and secure
connector framework [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        Gaia-X has evolved from the IDS concept by extending
its focus on data sharing and sovereignty into a broader
framework that integrates cloud services, edge computing,
and data ecosystems through standardized frameworks and
governance mechanisms [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Gaia-X is a European initiative
aimed at creating a secure, federated, and interoperable data
infrastructure. It builds on IDS principles, such as trust and
compliance, while expanding the ecosystem to include
decentralized, federated infrastructures and a strong emphasis
on transparency, openness, and digital sovereignty.
      </p>
      <p>
        The priority for trust and data sovereignty is a significant
strength, it also imposes limitations on the ability to
support data processing across a distributed data ecosystem [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
These limitations become particularly evident in use cases
requiring:
(a) Federated Query Processing: The capability to process
queries across multiple, independently managed datasets
without compromising performance or accuracy.
(b) Semantic Enrichment: Leveraging metadata and
domain-specific ontologies to enable more precise and
meaningful query results.
(c) Granular Data Sovereignty: Enforcing fine-grained
access control policies that align with legal and organizational
requirements.
      </p>
      <p>A lack of these features constrains the practical utility of
dataspaces in scenarios where data-driven decision-making
depends on seamless and secure integration of data in a
distributed data ecosystem.</p>
    </sec>
    <sec id="sec-3">
      <title>2. Integrating Dataspace Features into Modern Data Architectures</title>
      <p>
        The evolution from data lakes to data meshes and data
fabrics reflects a significant transformation in how
organizations approach data management to address issues of
scalability, governance, and accessibility [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Data lakes,
originally designed to store large volumes of structured
and unstructured data in a centralized repository [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ],
often faced challenges related to governance and usability.
Without robust management and accessibility frameworks,
many data lakes devolved into ‘data swamps’, where finding
meaningful insights became increasingly dificult [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>Data meshes emerged to solve these issues by
decentralizing data governance. This paradigm treats data as a
product, where responsibility for the quality, usability, and
governance of data lies with domain-specific teams. This
domain-driven ownership model ensures scalability while
addressing the shortcomings of centralized approaches, such
as those found in traditional data lakes.</p>
      <p>In parallel, data fabrics focus on creating an
interconnected layer that integrates metadata across disparate
systems. By employing technologies such as AI, automation,
and knowledge graphs, data fabrics enable seamless data
discovery, improved lineage tracking, and enhanced
integration across an organization’s diverse data landscape.
This approach prioritizes metadata-driven governance and
context-aware connectivity, providing a more dynamic and
agile data ecosystem that supports advanced analytics and
decision-making processes.</p>
      <p>The concept of modern dataspaces aligns with these
paradigms. Similar to the ‘data as a product’ philosophy in
data meshes, dataspaces emphasize treating data assets as
shared, governed resources designed for collaboration and
interoperability. In both cases, the focus is on ensuring the
quality, contextual relevance, and accessibility of data for
specific stakeholders or applications. This shared
emphasis underlines the mutual goal of enabling robust,
domainaware data collaboration across organizational boundaries.</p>
      <p>
        However, sharing data products in a dataspace does not
imply centralized governance or uniform data quality
control. Similar to data meshes, data governance should be
organized in a decentralized manner. Therefore, each participant
should manage their own policies through self-governing
data products. This approach aligns with the dataspace
architecture, where data owners define and enforce usage
policies for their data assets [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        Additionally, dataspaces as well as data fabrics rely on
semantic models to support semantic interoperability. In data
fabrics, knowledge graphs serve as a foundational tool for
modeling relationships and enriching metadata, allowing
for enhanced data discovery, integration, and query
capabilities. Similarly, dataspaces employ semantic models to
achieve interoperability among heterogeneous data sources
and domains (e.g., the IDS information model [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]). These
models provide a shared understanding of data structures
and relationships, which is essential for enabling meaningful
cross-domain analytics and collaboration.
      </p>
      <p>The interplay between these paradigms suggests a path
toward convergence, where dataspaces could incorporate the
principles of both data meshes and data fabrics. By
blending the domain-centric ownership and product-oriented
data management of data meshes with the semantic and
automation-driven integration of data fabrics, dataspaces
could emerge as a comprehensive framework for addressing
modern data challenges. This evolution reflects a growing
recognition of the need for distributed, interoperable, and
semantically enriched data ecosystems capable of supporting
diverse organizational and cross-domain needs.</p>
      <p>
        The challenge lies in finding the optimal balance between
unified semantic models for describing data assets and
decentralized governance. In many dataspace projects, we
have observed that a centralized approach to defining the
core information model significantly slows down the
bootstrapping process. A decentralized approach, as envisioned
in data meshes, could accelerate this process but comes with
the risk of diverging semantics. To mitigate these issues,
collaborative ontology engineering methodologies need to
be applied [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <p>
        Enhancing data lakes with dataspace-inspired features
can bridge the gap between centralized data repositories
and the decentralized nature of dataspaces. Specifically,
integrating features for data sovereignty and advanced query
processing can yield transformative capabilities. By
incorporating mechanisms like usage policies [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], a data lake can
enforce access control, data provenance, and compliance
policies. Databricks has proposed Delta Sharing2, a protocol
for sharing datasets between data lakes, or even between
organizations. Although the protocol supports fine-grained
access control, usage policies to support data sovereignty
as in dataspaces are not yet covered.
      </p>
      <p>
        On the other hand, enhancing dataspace frameworks,
such as the Eclipse Dataspace Components3, with more
sophisticated federated query processing for heterogeneous
datasets could ofer a better usability of dataspaces. Data
scientists require easy solutions for creating a Pandas data
frame over heterogeneous data: an API as provided in
Apache Spark, combined with Delta Sharing, and enriched
with usage policies, could facilitate a true sovereign data
science framework that integrates heterogeneous data
access, data integration, and machine learning. Although it
might be still challenging to merge all these features into
one platform, we can leverage large-language models to
support users in executing their tasks in such a platform
[
        <xref ref-type="bibr" rid="ref14 ref15">14, 15</xref>
        ]. SEDAR, as an open-source data lake platform,
offers a concrete foundation for these integrations [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ].
Enhancing SEDAR with dataspace features could demonstrate
the feasibility of such extensions and provide insights into
performance trade-ofs and usability.
      </p>
    </sec>
    <sec id="sec-4">
      <title>3. Future Research Directions</title>
      <p>We advocate for a paradigm shift in dataspace technology
by prioritizing advanced query processing and seamless
integration with traditional data management platforms.
Leveraging existing innovations, such as the Delta Sharing
Protocol, and extending platforms like SEDAR, can help
realize the vision of a unified, sovereignty-aware data
management ecosystem. However, several research challenges
must still be addressed to fully enable this vision.</p>
      <p>Policy-aware query execution requires embedding data
sovereignty rules directly into query execution plans.
Queries should be executed in compliance with access
restrictions, data-sharing agreements, and computational
constraints defined by data owners. User-centric interfaces
should allow non-expert users to interact efectively with
the dataspace. Many existing dataspaces sufer from poor
usability, limiting their adoption and accessibility.
Furthermore, usage policies must be extended beyond basic access
control to include restrictions on query processing itself.
Finally, data quality management remains a significant
challenge in many dataspaces. A decentralized data quality
framework, incorporating objective and standardized
quality metrics, could help assess and improve data reliability
while allowing participants to retain autonomy over their
data assets.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This work has been sponsored by the German Federal
Ministry of Education and Research in the funding
program “Forschung an Fachhochschulen”, project I2DACH
(grant no. 13FH557KX0) and in the funding program
“KIAnwendungshub Kunststofverpackungen – nachhaltige
Kreislaufwirtschaft durch Künstliche Intelligenz”, project
KIOptiPack (grant no. 033KI111).</p>
      <p>AI Disclosure Statement During the preparation of this
work, the author used ChatGPT 4o in order to improve
writing style, check grammar, and spelling. After using
this tool, the author reviewed and edited the content as
needed and takes full responsibility for the content of the
publication.</p>
      <sec id="sec-5-1">
        <title>2https://github.com/delta-io/delta-sharing/</title>
      </sec>
      <sec id="sec-5-2">
        <title>3https://projects.eclipse.org/projects/technology.edc</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Franklin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Halevy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Maier</surname>
          </string-name>
          ,
          <article-title>From databases to dataspaces: a new abstraction for information management</article-title>
          ,
          <source>SIGMOD Rec</source>
          .
          <volume>34</volume>
          (
          <year>2005</year>
          )
          <fpage>27</fpage>
          -
          <lpage>33</lpage>
          . URL: https://doi.org/10.1145/1107499.1107502. doi:
          <volume>10</volume>
          .1145/1107499.1107502.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>B.</given-names>
            <surname>Otto</surname>
          </string-name>
          , Jürjens,
          <string-name>
            <given-names>J.</given-names>
            <surname>Schon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Menz</surname>
          </string-name>
          , S. Wenzel, J. Cirullies,
          <article-title>Industrial Data Space - Digital Sovereignity over Data, Whitepaper, Fraunhofer-</article-title>
          <string-name>
            <surname>Gesellschaft</surname>
          </string-name>
          ,
          <year>2016</year>
          . URL: https://www.fraunhofer.de/content/ dam/zv/de/Forschungsfelder/industrial
          <article-title>-data-space/ Industrial-Data-Space_whitepaper</article-title>
          .pdf.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>B.</given-names>
            <surname>Otto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jarke</surname>
          </string-name>
          ,
          <article-title>Designing a multi-sided data platform: findings from the international data spaces case</article-title>
          ,
          <source>Electron. Mark</source>
          .
          <volume>29</volume>
          (
          <year>2019</year>
          )
          <fpage>561</fpage>
          -
          <lpage>580</lpage>
          . URL: https: //doi.org/10.1007/s12525-019-00362-x. doi:
          <volume>10</volume>
          .1007/ S12525-019-00362-X.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>C.</given-names>
            <surname>Quix</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Chakrabarti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Klef</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pullmann</surname>
          </string-name>
          ,
          <article-title>Business process modelling for a data exchange platform</article-title>
          ,
          <source>in: Proceedings of the Forum and Doctoral Consortium Papers Presented at the 29th International Conference on Advanced Information Systems Engineering, CAiSE</source>
          <year>2017</year>
          , Essen, Germany, June 12-16,
          <year>2017</year>
          , volume
          <volume>1848</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>153</fpage>
          -
          <lpage>160</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-1848/ CAiSE2017_Forum_Paper20.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>H.</given-names>
            <surname>Pettenpohl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Spiekermann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. R.</given-names>
            <surname>Both</surname>
          </string-name>
          ,
          <article-title>International data spaces in a nutshell, in: Designing Data Spaces: The Ecosystem Approach</article-title>
          to Competitive Advantage, Springer,
          <year>2022</year>
          , pp.
          <fpage>29</fpage>
          -
          <lpage>40</lpage>
          . URL: https: //doi.org/10.1007/978-3-
          <fpage>030</fpage>
          -93975-
          <issue>5</issue>
          _3. doi:
          <volume>10</volume>
          .1007/ 978-3-
          <fpage>030</fpage>
          -93975-5\_3.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>H.</given-names>
            <surname>Tardieu</surname>
          </string-name>
          ,
          <article-title>Role of Gaia-X in the european data space ecosystem, in: Designing Data Spaces: The Ecosystem Approach</article-title>
          to Competitive Advantage, Springer,
          <year>2022</year>
          , pp.
          <fpage>41</fpage>
          -
          <lpage>59</lpage>
          . URL: https: //doi.org/10.1007/978-3-
          <fpage>030</fpage>
          -93975-
          <issue>5</issue>
          _4. doi:
          <volume>10</volume>
          .1007/ 978-3-
          <fpage>030</fpage>
          -93975-5\_4.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Geisler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Vidal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Cappiello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. F.</given-names>
            <surname>Lóscio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jarke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Missier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Otto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Paja</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Pernici</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Rehof</surname>
          </string-name>
          ,
          <article-title>Knowledge-driven data ecosystems toward data transparency</article-title>
          ,
          <source>ACM J. Data Inf. Qual</source>
          .
          <volume>14</volume>
          (
          <year>2022</year>
          ) 3:
          <fpage>1</fpage>
          -
          <lpage>3</lpage>
          :
          <fpage>12</fpage>
          . URL: https://doi.org/10.1145/ 3467022. doi:
          <volume>10</volume>
          .1145/3467022.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>I.</given-names>
            <surname>Blohm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Wortmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Legner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Köbler</surname>
          </string-name>
          ,
          <article-title>Data products, data mesh, and data fabric</article-title>
          ,
          <source>Bus. Inf. Syst. Eng</source>
          .
          <volume>66</volume>
          (
          <year>2024</year>
          )
          <fpage>643</fpage>
          -
          <lpage>652</lpage>
          . URL: https:// doi.org/10.1007/s12599-024-00876-5. doi:
          <volume>10</volume>
          .1007/ S12599-024-00876-5.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>R.</given-names>
            <surname>Hai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Koutras</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Quix</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jarke</surname>
          </string-name>
          ,
          <article-title>Data lakes: A survey of functions and systems</article-title>
          ,
          <source>IEEE Trans. Knowl. Data Eng</source>
          .
          <volume>35</volume>
          (
          <year>2023</year>
          )
          <fpage>12571</fpage>
          -
          <lpage>12590</lpage>
          . URL: https: //doi.org/10.1109/TKDE.
          <year>2023</year>
          .
          <volume>3270101</volume>
          . doi:
          <volume>10</volume>
          .1109/ TKDE.
          <year>2023</year>
          .
          <volume>3270101</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>R.</given-names>
            <surname>Hai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Geisler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Quix</surname>
          </string-name>
          ,
          <string-name>
            <surname>Constance:</surname>
          </string-name>
          <article-title>An intelligent data lake system</article-title>
          ,
          <source>in: Proceedings of the 2016 International Conference on Management of Data, SIGMOD Conference</source>
          <year>2016</year>
          , San Francisco, CA, USA, June 26 - July 01,
          <year>2016</year>
          , ACM,
          <year>2016</year>
          , pp.
          <fpage>2097</fpage>
          -
          <lpage>2100</lpage>
          . URL: https://doi.org/10.1145/2882903.2899389. doi:
          <volume>10</volume>
          .1145/2882903.2899389.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S. R.</given-names>
            <surname>Bader</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pullmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Mader</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tramp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Quix</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. W.</given-names>
            <surname>Müller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Akyürek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Böckmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. T.</given-names>
            <surname>Imbusch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lipp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Geisler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Lange</surname>
          </string-name>
          ,
          <article-title>The international data spaces information model - an ontology for sovereign exchange of digital content</article-title>
          ,
          <source>in: The Semantic Web - ISWC 2020 - 19th International Semantic Web Conference</source>
          , Athens, Greece, November 2-
          <issue>6</issue>
          ,
          <year>2020</year>
          , Proceedings,
          <string-name>
            <surname>Part</surname>
            <given-names>II</given-names>
          </string-name>
          , volume
          <volume>12507</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2020</year>
          , pp.
          <fpage>176</fpage>
          -
          <lpage>192</lpage>
          . URL: https://doi.org/10.1007/978-3-
          <fpage>030</fpage>
          -62466-8_
          <fpage>12</fpage>
          . doi:
          <volume>10</volume>
          . 1007/978-3-
          <fpage>030</fpage>
          -62466-8\_
          <fpage>12</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>K. I.</given-names>
            <surname>Kotis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. A.</given-names>
            <surname>Vouros</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Spiliotopoulos</surname>
          </string-name>
          ,
          <article-title>Ontology engineering methodologies for the evolution of living and reused ontologies: status, trends, findings and recommendations</article-title>
          ,
          <source>Knowl. Eng. Rev</source>
          .
          <volume>35</volume>
          (
          <year>2020</year>
          )
          <article-title>e4</article-title>
          . URL: https://doi.org/10.1017/S0269888920000065. doi:
          <volume>10</volume>
          .1017/S0269888920000065.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>D. M. Mustafa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Nadgeri</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Collarana</surname>
            ,
            <given-names>B. T.</given-names>
          </string-name>
          <string-name>
            <surname>Arnold</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Quix</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Lange</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Decker</surname>
          </string-name>
          ,
          <article-title>From instructions to ODRL usage policies: An ontology guided approach</article-title>
          ,
          <source>in: Proceedings of Workshops at the 50th International Conference on Very Large Data Bases, VLDB</source>
          <year>2024</year>
          , Guangzhou, China,
          <source>August 26-30</source>
          ,
          <year>2024</year>
          , VLDB.org,
          <year>2024</year>
          . URL: https://vldb.org/workshops/ 2024/proceedings/LLM+KG/LLM+KG-15.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>S.</given-names>
            <surname>Hoseini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Burgdorf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Paulus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Meisen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Quix</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pomp</surname>
          </string-name>
          ,
          <article-title>Towards llm-augmented creation of semantic models for dataspaces</article-title>
          ,
          <source>in: Proceedings of the Second International Workshop on Semantics in Dataspaces (SDS</source>
          <year>2024</year>
          )
          <article-title>co-located with the 21st Extended Semantic Web Conference (ESWC</article-title>
          <year>2024</year>
          ), Hersonissos, Greece, May
          <volume>26</volume>
          ,
          <year>2024</year>
          , volume
          <volume>3705</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2024</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3705</volume>
          /paper03.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>S.</given-names>
            <surname>Hoseini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ibbels</surname>
          </string-name>
          ,
          <string-name>
            <surname>C. Quix,</surname>
          </string-name>
          <article-title>Enhancing machine learning capabilities in data lakes with automl and llms</article-title>
          ,
          <source>in: Advances in Databases and Information Systems - 28th European Conference, ADBIS</source>
          <year>2024</year>
          , Bayonne, France,
          <source>August 28-31</source>
          ,
          <year>2024</year>
          , Proceedings, volume
          <volume>14918</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2024</year>
          , pp.
          <fpage>184</fpage>
          -
          <lpage>198</lpage>
          . URL: https:// doi.org/10.1007/978-3-
          <fpage>031</fpage>
          -70626-4_
          <fpage>13</fpage>
          . doi:
          <volume>10</volume>
          .1007/ 978-3-
          <fpage>031</fpage>
          -70626-4\_
          <fpage>13</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>S.</given-names>
            <surname>Hoseini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Shaker</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          <article-title>Quix, SEDAR: A semantic data reservoir for heterogeneous datasets</article-title>
          ,
          <source>in: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management</source>
          ,
          <string-name>
            <surname>CIKM</surname>
          </string-name>
          <year>2023</year>
          , Birmingham, United Kingdom,
          <source>October 21-25</source>
          ,
          <year>2023</year>
          , ACM,
          <year>2023</year>
          , pp.
          <fpage>5056</fpage>
          -
          <lpage>5060</lpage>
          . URL: https://doi.org/10.1145/3583780.3614753. doi:
          <volume>10</volume>
          .1145/3583780.3614753.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>