<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Graf von Data: A Knowledge Graph Question Answering Agent for Organisational Usage</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Rene Dorsch</string-name>
          <email>rene.dorsch@iis.fraunhofer.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Daniel Henselmann</string-name>
          <email>daniel.henselmann@iis.fraunhofer.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andreas Harth</string-name>
          <email>andreas.harth@iis.fraunhofer.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Fraunhofer IIS</institution>
          ,
          <addr-line>Nordostpark 84, 90411 Nuremberg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Friedrich-Alexander-Universität Erlangen-Nürnberg</institution>
          ,
          <addr-line>Lange Gasse 20, 90403 Nuremberg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <abstract>
        <p>Large Language Model-based (LLM) systems have advanced Knowledge Graph-based Question Answering (KGQA) but overlook organisational demands for transparency, control, and flexible deployment. We present Graf von Data (GvD), an LLM-augmented agent for KGQA, designed to meet key organisational demands. GvD translates natural language questions into SPARQL queries by iteratively exploring the knowledge graph (KG) using the ReAct framework and a set of three actions - search, describe, and query - requiring only a SPARQL endpoint with text search capabilities. Unlike other approaches, GvD is graph-agnostic and uses an open-weight LLM. We evaluate GvD in the Text2SPARQL challenge, demonstrating near state-of-the-art performance on the Corporate KG benchmark. In addition to quantitative results, we conduct a qualitative comparison against the challenge competitors, showing that GvD thrives under the organisational demands.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Large Language Model</kwd>
        <kwd>RDF</kwd>
        <kwd>Knowledge Graph</kwd>
        <kwd>Question Answering</kwd>
        <kwd>Knowledge Graph Question Answering</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Connecting data across various sources is a crucial task for many organisations, enabling process
optimisation and informed decision-making. Industrial organisations [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1, 2, 3</xref>
        ], technology
organisations [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], and government institutions [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] employ knowledge graphs (KGs) due to their flexible
structure and schema to connect data, thereby simplifying data integration.
      </p>
      <p>
        However, accessing and querying KGs remains challenging for non-technical users in
organisations [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] due to the specialised expertise required for crafting SPARQL queries. For non-technical
users, Knowledge Graph Question Answering (KGQA) approaches have been developed to ofer an
interface for natural language questions. Current approaches rely on Large Language Models (LLMs)
to interact with the user and retrieve answers from the KG.
      </p>
      <p>
        Nevertheless, existing LLM-based approaches (see Section 2) for KGQA are not developed with the
demands of organisations in mind. The demands are [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ]:
      </p>
      <p>To satisfy the additional demands of organisations, this paper introduces Graf von Data (GvD), an
LLM-augmented agent. Users can ask questions in natural language, which GvD uses to explore and
retrieve answers from the KG and then provide an answer. Users not only receive natural language
answers but also corresponding executable SPARQL queries, supporting transparency. GvD’s
agentbased approach enables iterative gathering and reasoning over KG information stored within a trace
that is available to users, thereby also enhancing the transparency of query generation. Furthermore,
GvD employs a KG-agnostic approach, requiring only a SPARQL endpoint and no prior assumption
about the KG. Thus, it supports flexible deployments. By prioritising medium-sized, open-weight
models, GvD supports deployment within trusted environments (on-premise or in the cloud), reduces
operational costs, and avoids vendor lock-in.</p>
      <p>GvD achieved the highest F1 score (44.4%) for the Corporate KG benchmark in the quantitative
evaluation of the Text2SPARQL challenge1. A qualitative comparison with the competitors also shows
that GvD is the only competitor addressing all organisational demands.</p>
      <p>The remainder of this paper is structured as follows. Section 2 presents related work on
LLMaugmented KGQA. Section 3 introduces GvD. Section 4 presents the experimental setup to evaluate
GvD within the Text2SPARQL challenge. Section 5 introduces the competitors of the Text2SPARQL
challenge. Section 6 discusses the results from the experiments and highlights diferences between
our approach and the other participants of the Text2SPARQL challenge. Section 7 concludes the paper
with limitations and directions for future research.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        The emergence of LLMs significantly impacted KG question answering as LLM-augmented approaches
outperformed the results of prior approaches (e.g., [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]), becoming state-of-the-art. Multiple strategies
for LLM-augmented systems for KG question answering exist, which we distinguish in the following.
We consider approaches [
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref14 ref15 ref16 ref17 ref18 ref19 ref20 ref21 ref22 ref9">9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22</xref>
        ] that generate a SPARQL query
for a natural language question and find the question’s answer by executing the query, resulting in
comprehensible answers based on facts.
      </p>
      <p>
        Fine-tuning trains an LLM on a specific KG, inherently adding its content to the LLM’s knowledge.
Approaches using fine-tuning [
        <xref ref-type="bibr" rid="ref14 ref19">14, 19</xref>
        ] require expensive resources to adjust the weights, are optimized
for individual KGs, and require fine-tuning after each update of the KG.
      </p>
      <p>
        Approaches [
        <xref ref-type="bibr" rid="ref14 ref16 ref17 ref21 ref22">14, 16, 17, 21, 22</xref>
        ] implementing few-shot learning [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] provide example
questionsquery pairs to the LLM.
      </p>
      <p>
        Retrieval-Augmented Generation (RAG) [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] allows an LLM to consider (up-to-date) information
that is not inherent to the model by retrieving and considering additional resources after a question
is asked. Actions, often referred to as tools, can be used to incorporate various resources. An action
usually corresponds to an API call, providing information based on the question and receiving context
information from the KG. Actions that implement RAG retrieve context information from an external
knowledge source out of a given, finite set of information items. RAG actions in related work are
entity/relation linking (retrieving IDs out of all IDs in the KG) [
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref14 ref15 ref16 ref20 ref21 ref22 ref9">9, 10, 11, 12, 13, 14, 15, 16, 20, 21,
22</xref>
        ], subgraph retrieval for an entity (out of the entire KG) [
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref20 ref21">10, 11, 12, 13, 20, 21</xref>
        ], subgraph retrieval
specifically for a property (out of the entire KG) [
        <xref ref-type="bibr" rid="ref10 ref20">10, 20</xref>
        ], path discovery (out of all paths in the KG) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ],
and SPARQL query execution (retrieving a combination of entities/relations from the KG) [
        <xref ref-type="bibr" rid="ref10 ref11 ref12">10, 11, 12</xref>
        ].
      </p>
      <p>
        Other actions return context information that is not retrieved from an external knowledge source but
instead generated based on the question’s information. Such actions in related work are entity/relation
extraction from the question [
        <xref ref-type="bibr" rid="ref14 ref15 ref16 ref9">9, 14, 15, 16</xref>
        ], shape generation [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], SPARQL query generation [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], and
SPARQL query validation [
        <xref ref-type="bibr" rid="ref15 ref16 ref17 ref22">15, 16, 17, 22</xref>
        ].
      </p>
      <p>
        Besides actions, calling the LLM is a potential step in the question-answering process. In addition
to the task of generating the SPARQL query, intermediate tasks to translate the question [
        <xref ref-type="bibr" rid="ref18 ref21">18, 21</xref>
        ],
extract entities/relations [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], identify relevant classes and properties from the KG [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], filter retrieved
      </p>
      <sec id="sec-2-1">
        <title>1See https://text2sparql.aksw.org/</title>
        <p>
          subgraphs [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], and select the best SPARQL query among several candidates [
          <xref ref-type="bibr" rid="ref11 ref9">9, 11</xref>
          ] have been used.
        </p>
        <p>
          Action and LLM calls may be combined in diferent ways. Figure 1 shows four strategies.
Approaches [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ] that find an answer directly execute a single LLM call (see Figure 1a) without considering
up-to-date context information from the KG. Workflow approaches [
          <xref ref-type="bibr" rid="ref11 ref14 ref15 ref16 ref17 ref18 ref21 ref9">9, 11, 14, 15, 16, 17, 18, 21</xref>
          ] execute
a predefined sequence of action and LLM calls (see Figure 1b). Plan-and-solve [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ] approaches [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]
initially call the LLM to draft a planned sequence of then executed action/LLM calls (see Figure 1c).
ReAct [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ] approaches [
          <xref ref-type="bibr" rid="ref10 ref12 ref13 ref20">10, 12, 13, 20</xref>
          ] iteratively alternate between calling the LLM to select the next
action and executing the selected action, until the LLM decides to terminate, usually because the SPARQL
query was found (see Figure 1d).
        </p>
        <p>(a) Direct answer.</p>
        <p>
          (b) Workflow.
(c) Plan-and-solve [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ].
        </p>
        <p>
          (d) ReAct [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ].
        </p>
        <p>
          The current state-of-the-art for question answering on Wikidata is held by SPINACH [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] using a
ReAct approach with actions for entity/relation linking, subgraph retrieval for an entity (enhanced with
an LLM call to filter retrieved subgraphs), subgraph retrieval specifically for a property, and SPARQL
query execution. SPINACH was developed explicitly for Wikidata, using three APIs provided by
Wikidata and profiting from the LLM’s inherent knowledge on Wikidata entities from its training. In
contrast, GvD requires only a SPARQL endpoint (with search extension) as API, supporting KG-agnostic
operation.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Graf von Data Architecture</title>
      <p>We provide a high-level overview of the GvD system and a description of its behaviour based on an
example in Section 3.1. Afterwards, we provide a detailed description of the actions, prompts, and LLM
selection for the agent in Section 3.2.</p>
      <sec id="sec-3-1">
        <title>3.1. High Level Overview</title>
        <p>GvD operates within a structured environment composed of two elements (see Figure 2): the agent
and the environment. These components interact iteratively to process natural language questions
and produce answers based on knowledge graph data.</p>
        <p>Prompt</p>
        <p>LLM
Available Actions</p>
        <p>Search
Describe
Query
Success
Fail</p>
        <p>Sensor
Actuator</p>
        <p>A
g
e
n
t</p>
        <p>S
P
A
R
Q
L
E
n
d
p
o
ti
n</p>
        <p>Triple Store
Keyword Search</p>
        <p>Plugin</p>
        <p>Answer
(SPARQL Query and
Natural Language</p>
        <p>Answer)</p>
        <p>User</p>
        <p>E
n
v
i
r
o
n
m
e
n
t</p>
        <p>• Agent: The agent orchestrates the question-answering process. It consists of three
subcomponents: (i) an LLM, which processes input to select actions; (ii) a sensor, which parses and appends
input data to the prompt; and (iii) an actuator, which executes the selected actions.
• Environment: The environment includes a User and a SPARQL endpoint. The SPARQL
endpoint serves as the interface between the agent and the KG. The endpoint provides a standardised
access point to the triple store and the keyword search, enabling GvD to work with diferent KGs
without modification or additional infrastructure.
1. Initial Search (Step 1): The agent identifies the key term “K367 Strain Encoder” from the user
question. Afterwards, the agent calls the search action for the term to identify related URIs from
the KG.
2. Entity Expansion (Step 2-3): Once the relevant URIs are located, the agent
retrieves their RDF description to identify compatible products and associated relationships
(ns2:compatibleProduct).
3. Intermediate Steps (Step ...): The agent extends the search using additional keywords and/or
explores connected entities to discover a solution path.
4. Final Query Step (Step n−1): After gathering all relevant data (compatible products and their
suppliers), the agent composes and executes a SPARQL query to count the number of unique
suppliers.
5. Completion step (Step n): The agent receives the final result (6 suppliers) and returns both
the SPARQL query and the natural language answer to the user, terminating.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Agent Implementation</title>
        <p>The agent implements the ReAct framework to iteratively interact with its environment. Within each
interaction loop, the agent can execute multiple actions in parallel, allowing eficient and flexible graph
exploration. The agent consists of three components: the sensor, the actuator, and the LLM.</p>
        <p>The sensor is responsible for perceiving and parsing all incoming information, either from the user’s
initial question or as feedback from the environment (search results, RDF descriptions, query outputs,
User: How many suppliers can deliver alternative compatible products for the K367 Strain Encoder?
Graf von Data (Trace)
Step 1
Think: The user is asking for the number of suppliers that can deliver alternative compatible products for a specific item, the K367 Strain Encoder. The first step is to
sAecat:rcshefaorrcthh(e"KK336677 SSttrraaiinn EEnnccooddeer.r")
Observation: The search for K367 Strain Encoder returned the following matches.
http://ld.company.org/prod-instances/hw-K367-1320550 5.6152534
http://ld.company.org/prod-instances/bom-part-9-K367-1320550 3.1478124
...</p>
        <p>Step 2
Observation: The description of http://ld.company.org/prod-instances/hw-K367-1320550 is as follows:
@prefix ns1: &lt;http://ld.company.org/prod-vocab/&gt; .
@prefix ns2: &lt;http://ld.company.org/prod-instances/&gt; .
@prefix rdf: &lt;http://www.w3.org/2000/01/22-rdf-syntax-ns&gt; .
@prefix rdfs: &lt;http://www.w3.org/2000/01/rdf-schema&gt; .
@prefix xsd: &lt;http://www.w3.org/2001/XMLSchema#&gt; .
ns2:hw-K367-1320550 ns1:compatibleProduct ns2:hw-C119-5354812, ns2:hw-K845-4116844,
ns2:hw-Q248-7597886, ns2:hw-R658-8902629, ns2:hw-S271-9518696,
ns2:hw-T808-9704137 ;
ns1:depth_mm 16 ;
ns1:hasCategory ns2:prod-cat-Encoder, ns2:prod-cat-Strain ;
ns1:hasProductManager ns2:empl-Dietlinde.Boehme%2540company.org ;
ns1:hasSupplier ns2:suppl-7c4b3772-ee27-4915-8166-359d068db153 ;
ns1:height_mm 62 ;
ns1:id "K367-1320550" ;
ns1:name "Strain Encoder" ;
ns1:price ns2:price-hw-K367-1320550-EUR ;
ns1:reliabilityIndex 0.929 ;
ns1:weight_g 8 ;
ns1:width_mm 27 ;0
rdf:type ns1:Hardware ;
rdfs:label "K367-1320550 - Strain Encoder" .</p>
        <p>Think: The search results include several URIs that seem related to the K367 Strain Encoder. The first URI, `http://ld.company.org/prod-instances/hw-K367-1320550`,
appears to be the most relevant as it directly mentions the K367 Strain Encoder.</p>
        <p>Think: The description of the K367 Strain Encoder includes a list of compatible products. To find out how many suppliers can deliver these alternative compatible
products, I need to describe each of the compatible products to find their suppliers.</p>
        <p>Act: describe("http://ld.company.org/prod-instances/hw-C119-5354812") | describe("http://ld.company.org/prod-instances/hw-K845-4116844")
Step 3
Observation: ...</p>
        <p>Step ...</p>
        <p>Think: ...</p>
        <p>Act: ...</p>
        <p>Observation: ...</p>
        <p>Step n-1
Step n
Think: Now that I have the descriptions of all the compatible products for the K367 Strain Encoder, I can see that each product has a `hasSupplier` property that points to
a specific supplier URI.</p>
        <p>Act: query("PREFIX ns1: &lt;http://ld.company.org/prod-vocab/&gt; SELECT (COUNT(DISTINCT ?supplier) AS ?numSuppliers) WHERE {
&lt;http://ld.company.org/prodinstances/hw-K367-1320550&gt; ns1:compatibleProduct ?compatibleProduct . ?compatibleProduct ns1:hasSupplier ?supplier . }")
Observation: The query "PREFIX ns1: &lt;http://ld.company.org/prod-vocab/&gt; SELECT (COUNT(DISTINCT ?supplier) AS ?supplierCount) WHERE {
&lt;http://ld.company.org/prod-instances/hw-K367-1320550&gt; ns1:compatibleProduct ?compatibleProduct .
?compatibleProduct ns1:hasSupplier ?supplier . }" returned the following results:
[{'supplierCount': '"6"^^http://www.w3.org/2001/XMLSchema#integer'}]
Think: The query has confirmed that there are 6 unique suppliers that can deliver alternative compatible products for the K367 Strain Encoder. This matches the
information I gathered from the descriptions of the compatible products. I have now successfully determined the number of suppliers that can deliver alternative
compatible products for the K367 Strain Encoder.</p>
        <p>Act: success("There are 6 suppliers that can deliver alternative compatible products for the K367 Strain Encoder.")
Graf von Data (Answer)
User Response: For the question "How many suppliers can deliver alternative compatible products for the K367 Strain Encoder?" we used the following SPARQL
query:
PREFIX ns1: &lt;http://ld.company.org/prod-vocab/&gt;
SELECT (COUNT(DISTINCT ?supplier) AS ?supplierCount)
WHERE {
&lt;http://ld.company.org/prod-instances/hw-K367-1320550&gt; ns1:compatibleProduct ?compatibleProduct .</p>
        <p>?compatibleProduct ns1:hasSupplier ?supplier .
}
and found that: There are 6 suppliers that can deliver alternative compatible products for the K367 Strain Encoder
or error messages). The sensor transforms this information into structured text and appends it to the
prompt, extending the history. The updated prompt serves as the internal state for the LLM’s
decisionmaking.</p>
        <p>The prompt is the primary mechanism to direct the LLM towards robust and eficient query
generation. GvD implements a zero-shot prompt that only includes high-level instructions. The initial
prompt consists of the following five sections:
1. Role Definition: This section instructs the model to act as an autonomous agent tasked with
answering user questions by exploring the KG and generating appropriate SPARQL queries.
2. Loop Definition: The loop definition outlines the strategy based on the ReAct framework,
allowing the agent to alternate between processing environmental feedback and taking actions,
facilitating controlled and explainable behaviour. The loop is composed of three components:
(i) Think to formulate the decision-making of the agent, (ii) Act, in which the agent selects and
executes the following action, and (iii) Observation to perceive data from the environment.
3. Actions: Each action is described in terms of its input requirements, its intended purpose, and
the expected output format. The actions are keyword search, entity description, SPARQL query
execution, and termination (success or failure). By explicitly defining these actions, the prompt
provides the agent with a clear operational vocabulary for interacting with the KG.
4. Stage Description: The stage description suggests an ”explore and expand” approach, in which
the agent begins with a coarse-grained search and incrementally builds upon the results of
previous actions.
5. General Guidelines: The guidelines provide recommendations that support correct and
eficient system behaviour. For example, the agent is advised to avoid relying on known
namespaces (such as DBpedia), always to use fully expanded URIs when composing SPARQL queries,
and to be aware of the maximum number of interaction loops. The heuristics help the agent to
remain compatible with diferent KGs and prevent common failure modes.</p>
        <p>
          The agent can interact with its environment through the actuator. The actuator parses the textual
action call and executes the action on the environment. The actuator has five actions available. Two
actions specify termination conditions (success and failure) to enable the agent to stop the query
generation. The success condition allows the agent to signal a found query based on the information from
the graph. The fail condition indicates that the agent found either no or an unsatisfactory query. The
other three actions enable the agent to interact with any KG:
1. Search action: Is an entity linking action that matches entity URIs from natural language
keywords by the similarity score of the keyword and entity textual descriptions. The action uses
keyword search plugins that are widely available for RDF triple stores2, to enable search based
on text similarity within the KG through a SPARQL SELECT query. The similarity score can be
specified on any literal. The search extension creates an index, such as a Lucene Index, using
the literals to facilitate faster retrieval.
2. Describe action: Is a subgraph retrieval method that returns a subgraph for a provided entity
URI according to defined entity boundaries. The boundaries vary based on the entity’s type.
The specific boundaries for class entities [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ] are shown in Figure 5 and expressed by a SPARQL
query with the union of the graph patterns in Listings 1, 3, 4, and 5 from Appendix A. The
specific boundaries for object property entities [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ] are shown in Figure 6 and expressed by a
SPARQL query with the union of the graph patterns in Listings 1, 7, 8, and 6 from Appendix A.
The entity boundaries for all other entities are shown in Figure 4 and expressed by a SPARQL
query with the union of the graph patterns in Listings 1 and 2 from Appendix A. For all outgoing
(see Listing 1) and incoming (see Listing 2) triples, we prune property occurrences that exist over
ten times (i.e., remove additional triples with that property) to limit the total number of triples
in the subset.
3. Query action: Executes a SPARQL query specified by the LLM. The results are returned as a
        </p>
        <p>JSON result set.
2See e.g., for Apache Jena: http://loopasam.github.io/jena-doc/documentation/query/text-query.html, GraphDB: https:
//graphdb.ontotext.com/documentation/11.0/full-text-search.html, Blazegraph: https://github.com/blazegraph/database/
wiki/FullTextSearch</p>
        <p>
          The actuator executes actions (including termination conditions) selected by the LLM. A previous
study [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ] found that selected actions should be evaluated before execution to prevent failures,
unnecessary repetitions, and hallucinations. Thus, the actuator includes a function to validate each action
before execution and provides, in the case of a failure, feedback to the sensor. The control function
ensures that only syntactically valid and non-repeated actions are executed against the KG.
        </p>
        <p>
          For the LLM selection, we prioritised diferent LLMs according to the organisational demands.Thus,
we evaluated diferent open-weight models considering three key metrics: performance, robustness,
and model size. We employed SupplyBench3 for the evaluation of LLMs. SupplyBench is a benchmark
for enterprise-focused KGQA systems, featuring a KG with question/query pairs of varying complexity.
Our experimental findings revealed distinct performance patterns across model sizes. Small LLMs with
fewer than 10 billion parameters showed unstable agent loop behaviour. Large LLMs with more than
100 billion parameters showed higher performance but required prohibitively higher computational
resources. Medium-sized LLMs with approximately 70 billion parameters have achieved the optimal
balance between performance and eficiency. Specifically, current state-of-the-art medium-sized
models, including Llama 3.3 70B Instruct and Qwen 2.5 72B Instruct, successfully managed the robustness
requirements for efective agent loops and parallel action planning. LLM-KG-Bench-3 [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ] confirmed
our findings, as Llama 3.3 70B and Qwen 2.5 72B demonstrated strong performance across various
KG-specific tasks.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Experimental Setup</title>
      <p>We participated in the Text2SPARQL challenge for the evaluation of GvD. For the Text2SPARQL
challenge, GvD and the associated Text2SPARQL APIs were self-hosted among two Apache Jena Fuseki4
servers extended by Jena Full Text Search5. The two APIs provided the corporate and DBpedia SPARQL
endpoints to GvD. Jena Full Text Search used Lucene indexes based on rdfs:label values in the
respective KGs and the default analyser and tokeniser.</p>
      <p>The configuration of GvD limited the number of interaction loops to eight and the response times
for the search action to 10 seconds, describe action to 20 seconds, and query action to 30 seconds.</p>
      <p>
        We submitted two versions of GvD to the Text2SPARQL challenge, which difered only in the LLM
used. One used Llama 3.3 70B Instruct6 and the other Qwen 2.5 72B Instruct7. We used both models
hosted at Chat AI8 [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ]. Regarding model parameters, the temperature (influencing the output
creativity) was set to 0 (lowest creativity), and the top_p (influencing the output variety) was set to 1 (highest
variety).
      </p>
      <p>All experiments were conducted with one run by the Text2SPARQL organizers. The organizers
evaluated GvD with the metrics precision, recall, F1, and for SPARQL queries with sorted results, NDCG
(Normalised Discounted Cumulative Gain). The organizers did not diferentiate between diferent
types of queries (e.g. ASK, SELECT). We were not able to reproduce the results from the organizers
with our experimental setup9. This may be caused by changed or unknown configuration parameters
of the organizers or Chat AI.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Competitor Profiles</title>
      <p>Nine groups participated in the Text2SPARQL challenge with twelve competing implementations.
Three competitors withdrew from the competition after the results were revealed. Therefore, we
excluded them from the evaluation. Table 1 lists the remaining participants with their KGQA strategy.
3See https://github.com/wintechis/supplybench
4See https://jena.apache.org/documentation/fuseki2/index.html
5See https://jena.apache.org/documentation/query/text-query.html
6See https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct
7See https://huggingface.co/Qwen/Qwen2.5-72B-Instruct
8See https://docs.hpc.gwdg.de/services/chat-ai/index.html
9https://github.com/AKSW/text2sparql.aksw.org/issues/101</p>
      <p>
        Organisation
Institute of Applied Informatics and Formal Description Methods (AIFB) [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]
Artificial Intelligence Research Institute (AIRI) [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]
DBpedia Group (DBpedia) [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]
Fraunhofer IIS (IIS)
ETi @ Institute for Applied Informatics (InfAI) [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]
WSE Research Group @ Leipzig University of Applied Sciences (WSE) [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]
KGQA Approach
Workflow
Workflow
Direct
ReAct
ReAct
Plan-and-Solve
      </p>
      <p>
        The Institute of Applied Informations and Formal Description Methods (AIFB) of the Karlsruher
Institute of Technology developed a four-step workflow approach [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] involving language translation,
entity extraction, ShEx shape generation via shexer [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ], and finally SPARQL generation.
      </p>
      <p>
        The Artificial Intelligence Research Institute (AIRI) proposed Text-to-RDF Executor (TREx) [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], a
workflow-based approach that begins with language translation but then diverges into KG-specific
processing pipelines. The DBpedia KG pipeline includes additional steps for query rewriting, entity
linking through additional dependencies, and query refinement through question-query examples
provided through the LCQuad10 and QALD-911 datasets to improve question generation performance.
      </p>
      <p>
        The DBpedia Group’s approaches [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] fine-tuned three open-weight models, CodeGen 2.5 350M [
        <xref ref-type="bibr" rid="ref33">33</xref>
        ]
(DBpedia-CG), StarCoder 1B [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ] (DBpedia-SC), and CodeLlama 7b [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ] (DBpedia-CL) using the NSpM
data set12 containing 8 million natural language question-SPARQL pairs.
      </p>
      <p>
        The Institute for Applied Informatics (INFAI) developed a ReAct agent approach [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. While sharing
GvD’s loop definition, the system relies on a vector database as an additional infrastructure. The
approach has access to six actions. It has three actions - entity search, property search and class search
- for entity and relation linking, two actions - property example and knowledge graph entry - for
subgraph retrieval and a query action.
      </p>
      <p>
        The Web &amp; Software Engineering (WSE) Research Group of the Leipzig University of Applied
Sciences developed the mKGQAgent [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. The mKGQAgent is a plan-and-solve approach comprising both
ofline and online phases. The ofline phase creates an experience pool from training data to generate
successful sequences of actions, while the online phase uses this pool to guide question analysis and
step-by-step action execution with GPT-4o. The experience pool is stored within a vector database,
using the multilingual e5 text embeddings [
        <xref ref-type="bibr" rid="ref36">36</xref>
        ] to store successful plans.
      </p>
    </sec>
    <sec id="sec-6">
      <title>6. Results and Discussion</title>
      <p>This section evaluates GvD‘s performance in three parts: a quantitative evaluation based on the
Text2SPARQL challenge (Sec. 6.1), a qualitative comparison with Text2SPARQL challenge
competitors regarding usability criteria in an organisational context, and a qualitative investigation of system
errors of GvD (Sec. 6.3).</p>
      <sec id="sec-6-1">
        <title>6.1. Performance Evaluation</title>
        <p>We evaluate GvD’s efectiveness through quantitative analysis of the Text2SPARQL challenge
performance results, examining results across the Corporate KG and DBpedia benchmarks to assess system
performance in SPARQL query generation from natural language questions.</p>
        <p>Table 2 shows that the ReAct agent approaches achieve the best performance on the Corporate KG
benchmark. GvD with the Qwen 2.5 72b Instruct model (IIS-Q) achieved the highest recall (0.452) and
F1-score (0.444). The approach from INFAI achieved the highest performance in terms of NDCG score
(1.0) and precision (0.542).
10https://github.com/AskNowQA/LC-QuAD
11https://github.com/ag-sc/QALD/tree/master
12https://github.com/LiberAI/NSpM/tree/master</p>
        <p>GvD with the Llama 3.3 70b Instruct model (IIS-L) followed behind the other two competitors. While
individual metric leadership varies across approaches, the consistent dominance of ReAct agent
approach over the other approaches suggests that flexible exploration capabilities provide advantages for
KGQA applications without prior knowledge.</p>
      </sec>
      <sec id="sec-6-2">
        <title>6.2. Comparison to Organisational Demands</title>
        <p>To evaluate GvD’s suitability for organisational usage, we compare the Text2SPARQL challenge
competitors across three demands derived from organisational needs: transparent answer generation,
organisational control, and flexible deployment. Table 5 summarises the evaluation of GvD and the
competitor systems.</p>
        <p>The AIFB team employed a modular four-step workflow that supports generalisation across KGs.
However, their dependence on the closed-weight GPT-4o model from OpenAI limits organisational
control and may introduce privacy concerns. Additionally, their use of ShEx shapes to provide schema
information about the KG during query construction may obscure how answers are generated,
reducing transparency and making error analysis dificult for end users.</p>
        <p>The AIRI group created specialised pipelines for each KG. This approach limits the flexibility to apply
the system easily in new domains and increases configuration overhead. Additionally, their reliance
on the closed-weight GPT-4 model further restricts organisational control and prevents on-premise
usage.</p>
        <p>The DBpedia Group uses fine-tuned, open-weight models, which support organisational control
by enabling local hosting and avoiding third-party dependencies. However, the model operates in
a static inference mode solely based on training data rather than considering up-to-date information,
which limits the transparency of the answer generation. The reliance on small language models lowers
deployment demands, but enforces continuous retraining if the KG changes.</p>
        <p>The mKGQAgent from the WSE group relies on a precomputed experience pool to support plan
generation. This ofline component helps with generating more transparent answers, as plans are
composed of verifiable steps. However, the need for training data that resembles the target domain
reduces the system’s ability to generalise, thereby increasing deployment overhead. The usage of
GPT4o again limits organisational control due to the reliance on a closed-weight LLM.</p>
        <p>The INFAI system adopts a ReAct agent approach similar to GvD, but relies on a vector database for
resolving entity identifiers. This dependency increases the deployment efort, particularly for
organisations aiming to reduce infrastructure complexity. The usage of OpenAI’s GPT-4.1 mini model reduces
organisational control, introducing external dependencies and may raise data privacy concerns.
Trustworthiness was not explicitly reported, but the reasoning traces may be available, showing how the
agent came up with a result.</p>
        <p>In contrast, GvD is explicitly designed to address all three organisational demands. It ensures
transparent answers by generating structured reasoning traces that show how a query was derived. It
supports full organisational control by using open-weight LLMs that can be deployed locally, avoiding
external dependencies and supporting data privacy. Furthermore, GvD has a high deployment
flexibility, as it is not optimised for a specific KG or requires infrastructure beyond a SPARQL endpoint with
a keyword search plugin.</p>
      </sec>
      <sec id="sec-6-3">
        <title>6.3. Error Analysis</title>
        <p>We conducted an error analysis to identify current system limitations and opportunities for
improvement. We created a dataset of 40 problematic cases by examining the execution logs from questions
generating at least three errors. The error distribution revealed language and benchmark
dependencies. Only 3 (7.5%) cases were caused by the Corporate KG benchmark compared to 37 (92.5%) from
the DBpedia benchmark. Furthermore, 23 of 37 (62%) of the errors from the DBpedia benchmark were
caused by Spanish questions.</p>
        <p>In the analysis of the errors, we made three observations:
• Data Parsing Failures: Unsupported formatting in SPARQL queries (through the query action)
and RDF responses (through the describe action) are in 95% (38/40) cases. These failures stem
from missing namespace definitions for standard vocabularies (e.g., missing rdfs: prefix) and
inadequate URI escaping (e.g., dbr:New_York_(City) vs. dbr:New_York_\(City\)).
• Function Repetition Errors: The LLM repeatedly tried to execute identical functions with
identical arguments in 77.5% of cases (31/40), suggesting currently insuficient feedback
mechanisms to prevent unproductive exploration loops.
• Stable Agentic Loop: We observed no case with a loss of the agentic loop control or failure to
execute a proper action call, indicating that ReAct agent approaches may be adequate for KGQA.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusion</title>
      <p>This paper tackles the accessibility and query challenges of KGs within organisations for non-technical
users. We introduced Graf von Data (GvD), an LLM-augmented agent that bridges this accessibility
gap by translating natural language questions into executable SPARQL queries while fulfilling
organisational usage demands.</p>
      <p>GvD’s innovation lies in its knowledge graph-agnostic approach, which operates exclusively
through a SPARQL endpoints with keyword search plugin, enabling seamless integration across
diverse KG environments without further customisation. This approach, combined with our use of
openweight models, distinguishes GvD among the Text2SPARQL challenge competitors. Despite this focus
on organisational demands, GvD achieved the highest F1-score on the Corporate KG benchmark and
competitive results in the DBpedia benchmark.</p>
      <p>An evaluation of error cases presents opportunities for future enhancement. Future research
directions may include expanding the actions of GvD through additional SPARQL interface mappings,
integrating specialised entity recognition capabilities, and exploring hybrid approaches combining
keyword search with embedding-based indexing for improved entity linking.</p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgments</title>
      <p>This work was partially funded by the German Federal Ministry for Economic Afairs and Climate
Action (BMWK) through the Antrieb 4.0 project (Grant No. 13IK015B).</p>
      <p>The authors gratefully acknowledge the scientific support and HPC resources provided by the
Erlangen National High Performance Computing Center (NHR@FAU). The hardware is funded by the
German Research Foundation (DFG).</p>
    </sec>
    <sec id="sec-9">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the author(s) used Grammarly to: Grammar and spelling check.
After using these tool(s)/service(s), the authors reviewed and edited the content as needed and take
full responsibility for the publication’s content.</p>
    </sec>
    <sec id="sec-10">
      <title>A. Graph Patterns for Entity Boundaries</title>
      <p>&lt;{uri}&gt; ?p ?o .
?s ?p &lt;{uri}&gt; .</p>
      <sec id="sec-10-1">
        <title>Listing 1: Graph pattern: Outgoing triples of the focal entity.</title>
      </sec>
      <sec id="sec-10-2">
        <title>Listing 2: Graph pattern: Incoming triples of the focal entity.</title>
      </sec>
      <sec id="sec-10-3">
        <title>Listing 3: Graph pattern: Subclasses of the focal class.</title>
        <p>?subclass rdfs:subClassOf &lt;{uri}&gt; .</p>
        <p>OPTIONAL { ?subclass rdf:type ?type . }
OPTIONAL { ?subclass rdfs:label ?label . }
&lt;{uri}&gt; rdfs:subClassOf ?parentclass .
OPTIONAL { ?parentclass rdf:type ?type . }
OPTIONAL { ?parentclass rdfs:label ?label . }</p>
      </sec>
      <sec id="sec-10-4">
        <title>Listing 4: Graph pattern: Parent classes of the focal class. Listing 5: Graph pattern: Properties having the focal class as domain or range plus classes in domain or range of such properties.</title>
        <p>?property ?domainrange &lt;{uri}&gt; .</p>
        <p>FILTER ( ?domainrange IN ( rdfs:domain , rdfs:range ))
OPTIONAL { ?property rdf:type ?p_type . }
OPTIONAL { ?property rdfs:label ?p_label . }
OPTIONAL {
?property ?rangedomain ?class .</p>
        <p>FILTER ( ?rangedomain IN ( rdfs:domain , rdfs:range ))
OPTIONAL { ?class rdfs:label ?c_label . }
OPTIONAL { ?class rdf:type ?c_type . }
}</p>
        <p>Listing 6: Graph pattern: Classes in domain or range of the focal object property.
&lt;{uri}&gt; ?domainrange ?class .</p>
        <p>FILTER ( ?domainrange IN ( rdfs:domain , rdfs:range ))
OPTIONAL { ?class rdfs:label ?label . }
OPTIONAL { ?class rdf:type ?label . }
Listing 7: Graph pattern: Subproperties of the focal object property plus classes in domain or range of
subproperties.
?subproperty rdfs:subPropertyOf &lt;{uri}&gt; .</p>
        <p>OPTIONAL { ?subproperty rdf:type ?p_type . }
OPTIONAL { ?subproperty rdfs:label ?p_label . }
OPTIONAL {
?subproperty ?rangedomain ?class .</p>
        <p>FILTER ( ?rangedomain IN ( rdfs:domain , rdfs:range ))
OPTIONAL { ?class rdf:type ?c_type . }
OPTIONAL { ?class rdfs:label ?c_label . }
}
&lt;{uri}&gt; rdfs:subPropertyOf ?parentproperty .</p>
        <p>OPTIONAL { ?parentproperty rdf:type ?p_type . }
OPTIONAL { ?parentproperty rdfs:label ?p_label . }
OPTIONAL {
?parentproperty ?rangedomain ?class .</p>
        <p>FILTER ( ?rangedomain IN ( rdfs:domain , rdfs:range ))
OPTIONAL { ?class rdf:type ?c_type . }
OPTIONAL { ?class rdfs:label ?c_label . }
}
Listing 8: Graph pattern: Parent properties of the focal object property plus classes in domain or range
of parent properties.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>E. G.</given-names>
            <surname>Kalaycı</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. Grangel</given-names>
            <surname>González</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Lösch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Xiao</surname>
          </string-name>
          , A.
          <string-name>
            <surname>ul-Mehdi</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Kharlamov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <article-title>Semantic Integration of Bosch Manufacturing Data Using Virtual Knowledge Graphs</article-title>
          ,
          <source>in: The Semantic Web - ISWC</source>
          <year>2020</year>
          , volume
          <volume>12507</volume>
          <source>of LNCS</source>
          , Springer International Publishing,
          <year>2020</year>
          , pp.
          <fpage>464</fpage>
          -
          <lpage>481</lpage>
          . doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>030</fpage>
          - 62466- 8_
          <fpage>29</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>T.</given-names>
            <surname>Hubauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lamparter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Haase</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Herzig</surname>
          </string-name>
          ,
          <article-title>Use Cases of the Industrial Knowledge Graph at Siemens, in: ISWC 2018 Posters &amp; Demonstrations, Industry and Blue Sky Ideas Tracks</article-title>
          , Monterey, USA,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T.</given-names>
            <surname>Liebig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Maisenbacher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Opitz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. R.</given-names>
            <surname>Seyler</surname>
          </string-name>
          , G. Sudra,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wissmann</surname>
          </string-name>
          ,
          <article-title>Building a Knowledge Graph for Products and Solutions in the Automation Industry</article-title>
          ,
          <source>in: Joint Proceedings of the 1st International Workshop on Knowledge Graph Building and 1st International Workshop on Large Scale RDF Analytics co-located with 16th Extended Semantic Web Conference (ESWC</source>
          <year>2019</year>
          ),
          <year>2019</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2489</volume>
          /paper2.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>N.</given-names>
            <surname>Noy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Jain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Narayanan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Patterson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Taylor</surname>
          </string-name>
          ,
          <article-title>Industry-scale Knowledge Graphs: Lessons and Challenges</article-title>
          ,
          <source>ACM Queue 17</source>
          (
          <year>2019</year>
          )
          <fpage>48</fpage>
          -
          <lpage>75</lpage>
          . doi:
          <volume>10</volume>
          .1145/3329781.3332266.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D.</given-names>
            <surname>Diefenbach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. D.</given-names>
            <surname>Wilde</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Alipio</surname>
          </string-name>
          ,
          <article-title>Wikibase as an Infrastructure for Knowledge Graphs: The EU Knowledge Graph</article-title>
          ,
          <source>in: The Semantic Web - ISWC</source>
          <year>2021</year>
          , volume
          <volume>12922</volume>
          <source>of LNCS</source>
          , Springer International Publishing,
          <year>2021</year>
          , pp.
          <fpage>631</fpage>
          -
          <lpage>647</lpage>
          . doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>030</fpage>
          - 88361- 4_
          <fpage>37</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>E.</given-names>
            <surname>Kaufmann</surname>
          </string-name>
          , A. Bernstein,
          <article-title>How Useful Are Natural Language Interfaces to the Semantic Web for Casual End-Users?</article-title>
          ,
          <source>in: The Semantic Web</source>
          , volume
          <volume>4825</volume>
          , Springer,
          <year>2007</year>
          , pp.
          <fpage>281</fpage>
          -
          <lpage>294</lpage>
          . doi:
          <volume>10</volume>
          . 1007/978- 3-
          <fpage>540</fpage>
          - 76298- 0_
          <fpage>21</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>T.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gascó-Hernandez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Esteve</surname>
          </string-name>
          ,
          <source>The Adoption and Implementation of Artificial Intelligence Chatbots in Public Organizations: Evidence from U.S. State Governments</source>
          ,
          <source>The American Review of Public Administration</source>
          <volume>54</volume>
          (
          <year>2024</year>
          )
          <fpage>255</fpage>
          -
          <lpage>270</lpage>
          . doi:
          <volume>10</volume>
          .1177/02750740231200522.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>T.</given-names>
            <surname>Bruckhaus</surname>
          </string-name>
          , RAG Does Not Work for Enterprises,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv.2406.04369. arXiv:
          <volume>2406</volume>
          .
          <fpage>04369</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>D. M.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. F.</given-names>
            <surname>Elahi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Cimiano</surname>
          </string-name>
          , Lexicalization Is All You Need:
          <article-title>Examining the Impact of Lexical Knowledge in a Compositional QALD System</article-title>
          ,
          <source>in: Knowledge Engineering and Knowledge Management</source>
          , Springer Nature,
          <year>2025</year>
          , pp.
          <fpage>102</fpage>
          -
          <lpage>122</lpage>
          . doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>031</fpage>
          - 77792-
          <issue>9</issue>
          _
          <fpage>7</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>S.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Semnani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Triedman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. D.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Lam, SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions, in: Findings of the Association for Computational Linguistics: EMNLP 2024, Association for Computational Linguistics</article-title>
          ,
          <year>2024</year>
          , pp.
          <fpage>15977</fpage>
          -
          <lpage>16001</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <year>2024</year>
          .findings- emnlp.938.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>C. V. S.</given-names>
            <surname>Avila</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Casanova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. M. P.</given-names>
            <surname>Vidal</surname>
          </string-name>
          ,
          <article-title>A Framework for Question Answering on Knowledge Graphs Using Large Language Models</article-title>
          , in: The Semantic Web:
          <article-title>ESWC 2024 Satellite Events</article-title>
          , volume
          <volume>15344</volume>
          <source>of LNCS</source>
          , Springer,
          <year>2025</year>
          , pp.
          <fpage>168</fpage>
          -
          <lpage>172</lpage>
          . doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>031</fpage>
          - 78952- 6_
          <fpage>20</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>G.</given-names>
            <surname>Xiong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <article-title>Interactive-KBQA: Multi-Turn Interactions for Knowledge Base Question Answering with Large Language Models</article-title>
          ,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv.2402.15131. arXiv:
          <volume>2402</volume>
          .
          <fpage>15131</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>L.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Tao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Arakawa</surname>
          </string-name>
          , ODA:
          <article-title>Observation-Driven Agent for integrating LLMs</article-title>
          and
          <string-name>
            <surname>Knowledge Graphs</surname>
          </string-name>
          ,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv.2404.07677. arXiv:
          <volume>2404</volume>
          .
          <fpage>07677</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>S.</given-names>
            <surname>Xu</surname>
          </string-name>
          , S. Liu,
          <string-name>
            <given-names>T.</given-names>
            <surname>Culhane</surname>
          </string-name>
          , E. Pertseva,
          <string-name>
            <surname>M.-H. Wu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Semnani</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Lam</surname>
          </string-name>
          ,
          <article-title>Fine-tuned LLMs Know More, Hallucinate Less with Few-Shot Sequence-to-Sequence Semantic Parsing over Wikidata</article-title>
          ,
          <source>in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics</source>
          ,
          <year>2023</year>
          , pp.
          <fpage>5778</fpage>
          -
          <lpage>5791</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <year>2023</year>
          . emnlp- main.353.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>G.</given-names>
            <surname>Piao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mountantonakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Papadakos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Sonawane</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          <article-title>OMahony, Toward Exploring Knowledge Graphs with LLMs</article-title>
          ,
          <source>in: Joint Proceedings of Posters, Demos, Workshops, and Tutorials of the 20th International Conference on Semantic Systems (SEMANTiCS</source>
          <year>2024</year>
          ), volume Vol-
          <volume>3759</volume>
          , CEUR-WS, Amsterdam, The Netherlands,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>H. M. Zahera</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Ali</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Sherif</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Moussallem</surname>
          </string-name>
          , A.-C.
          <article-title>Ngonga Ngomo, Generating SPARQL from Natural Language Using Chain-of-Thoughts Prompting, in: Knowledge Graphs in the Age of Language Models</article-title>
          and
          <string-name>
            <surname>Neuro-Symbolic</surname>
            <given-names>AI</given-names>
          </string-name>
          , volume
          <volume>60</volume>
          of
          <article-title>Studies on the Semantic Web</article-title>
          , IOS Press, Amsterdam,
          <year>2024</year>
          , pp.
          <fpage>353</fpage>
          -
          <lpage>368</lpage>
          . doi:
          <volume>10</volume>
          .3233/SSW240028.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>L.</given-names>
            <surname>Kovriguina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Teucher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Radyush</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Mouromtsev</surname>
          </string-name>
          , SPARQLGEN:
          <article-title>One-Shot Prompt-based Approach for SPARQL Query Generation</article-title>
          ,
          <source>in: Proceedings of the Posters and Demo Track of the 19th International Conference on Semantic Systems (SEMANTiCS</source>
          <year>2023</year>
          ), volume Vol-
          <volume>3526</volume>
          , CEUR, Leipzig, Germany,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>J.</given-names>
            <surname>Wardenga</surname>
          </string-name>
          , T. Käfer, Challenge submission: AIFB, https://text2sparql.aksw.org/assets/talks/ 2
          <article-title>-AIFB-tobias-kaefer-kit</article-title>
          .pdf,
          <year>2025</year>
          . Accessed 2025-
          <volume>06</volume>
          -13.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>M.</given-names>
            <surname>Shahinmoghadam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Soru</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Josh</surname>
          </string-name>
          , S. Tiwari, DBpedia Group at the 1st
          <source>Text2SPARQL Challenge</source>
          , https://text2sparql.aksw.org/assets/talks/3
          <article-title>-DBPEDIA-tommaso-soru</article-title>
          .pdf,
          <year>2025</year>
          . Accessed 2025-
          <volume>06</volume>
          -13.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>D.</given-names>
            <surname>Gerber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bühmann</surname>
          </string-name>
          , L.-P. Meyer, C. Stadler, Felix Brei,
          <string-name>
            <given-names>J.</given-names>
            <surname>Frey</surname>
          </string-name>
          , K. Bulert, ETi / InfAI: Text2Sparql Agent, https://text2sparql.aksw.org/assets/talks/5-INFAI-lars
          <string-name>
            <surname>-</surname>
          </string-name>
          peter-meyer.pdf,
          <year>2025</year>
          . Accessed 2025-
          <volume>06</volume>
          -13.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>O.</given-names>
            <surname>Somov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Berezin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Avdeev</surname>
          </string-name>
          ,
          <article-title>TEXT2SPARQL: TREx (Text-to-RDF Executor)</article-title>
          , https: //text2sparql.aksw.org/assets/talks/6
          <article-title>-MIPT-oleg-somov-daniil-berezin-roman-avdeev-airi</article-title>
          .pdf,
          <year>2025</year>
          . Accessed 2025-
          <volume>06</volume>
          -13.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>A.</given-names>
            <surname>Perevalov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Both</surname>
          </string-name>
          , mKGQAgent:
          <article-title>Multilingual Human-inspired Question Answering over Language Graphs</article-title>
          , https://text2sparql.aksw.org/assets/talks/7
          <article-title>-WSE-andreas-both-htwk</article-title>
          .
          <year>mp4</year>
          ,
          <year>2025</year>
          . Accessed 2025-
          <volume>06</volume>
          -13.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>L.</given-names>
            <surname>Fei-Fei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fergus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Perona</surname>
          </string-name>
          ,
          <article-title>One-shot learning of object categories</article-title>
          ,
          <source>IEEE Transactions on Pattern Analysis and Machine Intelligence</source>
          <volume>28</volume>
          (
          <year>2006</year>
          )
          <fpage>594</fpage>
          -
          <lpage>611</lpage>
          . doi:
          <volume>10</volume>
          .1109/TPAMI.
          <year>2006</year>
          .
          <volume>79</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>P.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Perez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Piktus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Petroni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Karpukhin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Goyal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Küttler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lewis</surname>
          </string-name>
          , W.-t. Yih,
          <string-name>
            <given-names>T.</given-names>
            <surname>Rocktäschel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Riedel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kiela</surname>
          </string-name>
          ,
          <article-title>Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks</article-title>
          ,
          <source>in: NIPS'20: Proceedings of the 34th International Conference on Neural Information Processing Systems</source>
          , Curran Associates Inc.,
          <string-name>
            <surname>Red</surname>
            <given-names>Hook</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>NY</given-names>
            ,
            <surname>United</surname>
          </string-name>
          <string-name>
            <surname>States</surname>
          </string-name>
          ,
          <year>2020</year>
          , pp.
          <fpage>9459</fpage>
          -
          <lpage>9474</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>L.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Hu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lan</surname>
          </string-name>
          ,
          <string-name>
            <surname>R. K.-W. Lee</surname>
            ,
            <given-names>E.-P.</given-names>
          </string-name>
          <string-name>
            <surname>Lim</surname>
          </string-name>
          ,
          <string-name>
            <surname>Plan-</surname>
          </string-name>
          and
          <string-name>
            <surname>-Solve Prompting</surname>
          </string-name>
          :
          <article-title>Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models</article-title>
          ,
          <source>in: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics</source>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>S.</given-names>
            <surname>Yao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Du</surname>
          </string-name>
          , I. Shafran,
          <string-name>
            <given-names>K.</given-names>
            <surname>Narasimhan</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y. Cao,</surname>
          </string-name>
          <article-title>ReAct: Synergizing Reasoning and Acting in Language Models</article-title>
          ,
          <source>in: The Eleventh International Conference on Learning Representations ICLR</source>
          <year>2023</year>
          , Kigali, Rwanda,
          <year>2023</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv.2210.03629.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>OMG</given-names>
            <surname>Unified Modeling</surname>
          </string-name>
          <article-title>Language (OMG UML)</article-title>
          ,
          <source>Version 2.5.1</source>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>D.</given-names>
            <surname>Henselmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Dorsch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Harth</surname>
          </string-name>
          ,
          <article-title>Impact of Knowledge Graph Representations on Question Answering with Language Models</article-title>
          ,
          <source>in: Advanced Information Systems Engineering Workshops</source>
          , volume
          <volume>556</volume>
          <source>of LNBIP</source>
          , Springer, Vienna, Austria,
          <year>2025</year>
          , pp.
          <fpage>81</fpage>
          -
          <lpage>92</lpage>
          . doi:
          <volume>10</volume>
          .1007/ 978- 3-
          <fpage>031</fpage>
          - 94931-
          <issue>9</issue>
          _
          <fpage>7</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>S.</given-names>
            <surname>Schmid</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Freund</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Harth</surname>
          </string-name>
          ,
          <article-title>Adaptive Planning on the Web: Using LLMs and Afordances for Web Agents</article-title>
          ,
          <source>in: Knowledge Graphs and Semantic Web</source>
          , volume
          <volume>15459</volume>
          <source>of LNCS</source>
          , Springer Nature Switzerland,
          <year>2025</year>
          , pp.
          <fpage>93</fpage>
          -
          <lpage>108</lpage>
          . doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>031</fpage>
          - 81221-
          <issue>7</issue>
          _
          <fpage>7</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>L.-P.</given-names>
            <surname>Meyer</surname>
          </string-name>
          , J.
          <string-name>
            <surname>Frey</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Heim</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Brei</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Stadler</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Junghanns</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <article-title>Martin, LLM-KG-Bench 3.0: A Compass for Semantic Technology Capabilities in the Ocean of LLMs</article-title>
          , volume
          <volume>15719</volume>
          <source>of LNCS</source>
          ,
          <year>2025</year>
          , pp.
          <fpage>280</fpage>
          -
          <lpage>296</lpage>
          . doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>031</fpage>
          - 94578- 6_
          <fpage>16</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>A.</given-names>
            <surname>Doosthosseini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Decker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Nolte</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Kunkel</surname>
          </string-name>
          ,
          <string-name>
            <surname>Chat</surname>
            <given-names>AI</given-names>
          </string-name>
          :
          <string-name>
            <given-names>A Seamless</given-names>
            <surname>Slurm-Native Solution for HPC-Based</surname>
          </string-name>
          <string-name>
            <surname>Services</surname>
          </string-name>
          ,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv.2407.00110.
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>D.</given-names>
            <surname>Fernandez-Álvarez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. E.</given-names>
            <surname>Labra-Gayo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Gayo-Avello</surname>
          </string-name>
          ,
          <article-title>Automatic extraction of shapes using sheXer, Knowledge-Based Systems 238 (</article-title>
          <year>2022</year>
          ). doi:
          <volume>10</volume>
          .1016/j.knosys.
          <year>2021</year>
          .
          <volume>107975</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>E.</given-names>
            <surname>Nijkamp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Hayashi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Xiong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Savarese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <article-title>CodeGen2: Lessons for Training LLMs on Programming</article-title>
          and
          <source>Natural Languages</source>
          ,
          <year>2023</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv.2305.02309.
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>R.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. B.</given-names>
            <surname>Allal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Muennighof</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kocetkov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Mou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Marone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Akiki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Chim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Liu</surname>
          </string-name>
          , E. Zheltonozhskii,
          <string-name>
            <given-names>T. Y.</given-names>
            <surname>Zhuo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Dehaene</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Davaadorj</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lamy-Poirier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Monteiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Shliazhko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Gontier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Meade</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Zebaze</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.-H. Yee</surname>
            ,
            <given-names>L. K.</given-names>
          </string-name>
          <string-name>
            <surname>Umapathi</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Lipkin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Oblokulov</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Murthy</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Stillerman</surname>
            ,
            <given-names>S. S.</given-names>
          </string-name>
          <string-name>
            <surname>Patel</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Abulkhanov</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Zocca</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Dey</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Fahmy</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          <string-name>
            <surname>Bhattacharyya</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Luccioni</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Villegas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Kunakov</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Zhdanov</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Romero</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Timor</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Ding</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Schlesinger</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Schoelkopf</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Ebert</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Dao</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Mishra</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Gu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Robinson</surname>
          </string-name>
          ,
          <string-name>
            <surname>C. J. Anderson</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Dolan-Gavitt</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Contractor</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Reddy</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Fried</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Bahdanau</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Jernite</surname>
            ,
            <given-names>C. M.</given-names>
          </string-name>
          <string-name>
            <surname>Ferrandis</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Hughes</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Wolf</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Guha</surname>
          </string-name>
          , L. von
          <string-name>
            <surname>Werra</surname>
          </string-name>
          , H. de Vries,
          <source>StarCoder: May the source be with you!</source>
          ,
          <year>2023</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv.2305.06161.
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>B.</given-names>
            <surname>Rozière</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gehring</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Gloeckle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sootla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Gat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X. E.</given-names>
            <surname>Tan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Adi</surname>
          </string-name>
          , J. Liu,
          <string-name>
            <given-names>R.</given-names>
            <surname>Sauvestre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Remez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Rapin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kozhevnikov</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Evtimov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bitton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Bhatt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. C.</given-names>
            <surname>Ferrer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Grattafiori</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Xiong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Défossez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Copet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Azhar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Touvron</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Martin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Usunier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Scialom</surname>
          </string-name>
          , G. Synnaeve, Code Llama: Open Foundation Models for Code,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv.2308.12950.
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>L.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Majumder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <source>Multilingual E5 Text Embeddings: A Technical Report</source>
          ,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv.2402.05672.
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>A.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Hui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Dang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Bao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Xue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Men</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Tang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Xia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Ren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Ren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Fan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Su</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Cui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Qiu</surname>
          </string-name>
          ,
          <source>Qwen2.5 Technical Report</source>
          ,
          <year>2025</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv. 2412.15115.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>