<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Agentic AI, Context Engineering and Knowledge Graphs: Current Approaches, Challenges and Opportunities</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Niraj Karki</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Manjila Pandey</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sanju Tiwari</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nandana Mihindukulasooriya</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sven Groppe</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Daniel Dobriy</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IBM Research</institution>
          ,
          <addr-line>NYC</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Pulchowk Engineering Campus</institution>
          ,
          <country country="NP">Nepal</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Sharda University</institution>
          ,
          <addr-line>Delhi-NCR</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Universität zu Lübeck, Germany</institution>
          ,
          <addr-line>&amp; TU Bergakademie Freiberg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>WU Vienna</institution>
          ,
          <addr-line>Austria, &amp; Dobriy AI</addr-line>
          ,
          <country country="AT">Austria</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2026</year>
      </pub-date>
      <abstract>
        <p>With the recent advancements in Large Language Models (LLMs) and Agentic AI, Context Engineering (CE) has emerged as a novel research area. CE aims to fill the prompts for LLM Agents with relevant contextual knowledge required to perform complex tasks, where the quality of this context is paramount for reliability. Knowledge Graphs (KGs) ofer a promising approach to integrate diverse contextual knowledge based on Semantic Web and Knowledge Representation approaches. In this paper, we study current approaches to identify challenges and opportunities for utilising KGs in CE and explore their limitations and strategic future research directions. The findings illustrate inconsistencies in methodologies and limited understanding of scalability and quality assurance challenges, which slow down the development of robust, context-aware AI systems capable of dealing with real-world complexity and multi-domain reasoning tasks.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Context Engineering</kwd>
        <kwd>Knowledge Graphs</kwd>
        <kwd>Large Language Models</kwd>
        <kwd>Ontology</kwd>
        <kwd>Knowledge Representation</kwd>
        <kwd>Quality</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Large Language Models (LLMs) have demonstrated impressive performance across a wide variety of
natural language tasks, including machine translation [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], question answering [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], summarization [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ],
and dialogue generation [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Their growing influence spans domains such as cybersecurity, education,
and healthcare due to their ability to generalise well on language tasks [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. However, the performance
and eficiency of these models are fundamentally governed by the context they receive. They still
face significant challenges such as dificulty in handling structured knowledge, especially in the case
of smaller models [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], lack of explicit memory, and hallucinations, where models produce plausible
sounding but factually incorrect responses [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. These limitations directly afect the quality, factual
reliability, and robustness of LLM-based agentic systems. To address these shortcomings, the emerging
domain of Context Engineering (CE) focuses on providing high-quality context to guide LLMs more
efectively [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Within this landscape, Knowledge Graphs (KGs) provide a promising solution to tackle
many persistent challenges in LLM-based agentic systems [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. As structured representations of entities
and their interrelations, KGs help bridge the gap between unstructured language and symbolic reasoning
[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. KGs encode factual information about real-world objects in a machine-readable format [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] and
overcome LLM limitations by grounding context, using multi-hop reasoning and serving as a validator
for LLM outputs and response quality [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <sec id="sec-1-1">
        <title>Current Approaches for Integrating Knowledge Graph in</title>
      </sec>
      <sec id="sec-1-2">
        <title>Context Engineering for Agentic AI</title>
        <p>K-BERT:
Concept
Former:
Injecting KG
into LLM:
Think on Graph:
Knowledge</p>
        <p>Graph</p>
        <p>Knowledge</p>
        <p>Graph</p>
        <sec id="sec-1-2-1">
          <title>KG Injection</title>
          <p>Enhanced
KnoLawyleedrge w0 w1 rr112 w2 ww112..wi r.i1.wn-1 wi1 wn SenIntpenutces RQeAsapLnoLdnMsNeEiRn ZEP:
n-concept
vector
Star-topology generation
subgraph
KGE Model EmbKeGdding eFmrobIzenedpnduiLtnLgMin</p>
          <p>Input
embedding in
Frozen LLM</p>
          <p>Factually
Enhanced
Response
Check if all the details
required for answering
the quesiton is obtained.</p>
          <p>LLM</p>
          <p>Challenges and Limitations
Limited availability of high-quality KG
Limited Generalizability
Knowledge noise and incomplete graph
Computationally heavy construction
and Integration
Struggle with real time and
evolving data
Dependent on inconsistent
task and automated metrics</p>
        </sec>
      </sec>
      <sec id="sec-1-3">
        <title>RGerespatoenrsFea, cletuasatl Temporally aware</title>
        <p>cost KG engine
Improved
explainability of</p>
        <p>Response</p>
        <sec id="sec-1-3-1">
          <title>KG Augmentation</title>
          <p>HOLMES:
Unstructured
Text</p>
        </sec>
        <sec id="sec-1-3-2">
          <title>Continuous Knowledge Update</title>
          <p>Transform selected nodes
edges into formattedtext</p>
          <p>context
Identifying relevant
nodes and edges</p>
          <p>Re-ranking
results
Memory Retreival</p>
          <p>LLM
Answer</p>
          <p>Generation
Pruning KG to
Knowledge
Schema</p>
          <p>LLM</p>
          <p>Highly
queryfocused
context</p>
          <p>aware KG
Hyper-relational</p>
          <p>KG</p>
          <p>Future Research Directions
Integration of symbolic reasoning with neural capabilities of LLM
for multi-hop question answering
Dynamic KG integration
Autonomous graph construction
Multimodal LLMs for knowledge alignment and Information
extraction</p>
          <p>Therefore, we aim to explore the current landscape of KG-augmented LLM methods from a CE
perspective, addressing the following three research questions:
• RQ1: What are the recent approaches for integrating KGs in CE for Agentic AI?
• RQ2: What are the experienced challenges and limitations in KG-enhanced CE?
• RQ3: What are the current gaps to advance the interdisciplinary field?</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. Methodology of the Literature Study</title>
      <p>For RQ 1/2, we conducted the following steps for recent articles published between 2020 and 2025:</p>
      <p>Initial Search: We use 3 research databases, IEEE Xplore (accessed on 10 August 2025), ACM Digital
Library (accessed on 10 August 2025), and arXiv (accessed on 12 August 2025) to access relevant articles.
Afterwards, the search engine Google Scholar is also used for article search. As CE and KG-LLM
integration are rapidly evolving fields, we include methodologies that are currently available as
preprints but have not completed the peer-review cycle yet. We use particular keywords to probe journals
and proceedings that focus on CE, ontology, and the use of KGs to improve the quality, performance,
and eficiency of LLMs and agentic systems. Specifically, we probe articles whose titles and abstracts (i)
matches “Context Engineering”, “In-Context Learning”, “Chain-of-thought” or “Retrieval-Augmented
Generation (RAG)”; and (ii) matches either “Knowledge Graph”, “Structured Knowledge”, “Knowledge
Base” or “Ontology”; and (iii) matches either “Agentic AI”, “Agentic Systems”, “LLM” or “Generative AI”.
From the initial search, we collect 436 articles that are potentially related to the topics of our study.</p>
      <p>Selection and Inclusion: In the initial selection step, we exclude 340 and select 96 articles based
on the title and abstract that are not related to CE in AI systems using KGs. In the 2nd selection step,
we preliminarily read the remaining articles and filter out the articles based on methods and reported
quality-related outcomes - narrowing down the number of articles to 52. In the last selection step, we
fully read each article and exclude theoretical articles that purely discuss KG with no LLM components.
Finally, we select 35 articles that discuss CE for Agentic AI, propose novel KG-LLM integration methods,
and contribute to enhancing LLM performance as well as output quality through better context.</p>
      <p>Categorization and Taxonomy: We classify the KG-LLM integration approaches into categories
according to the stage at which the KG is integrated into the model, and discuss it in section 4. The specific
taxonomy is chosen because it systematically covers knowledge integration into the model lifecycle
and its impact on model performance and reliability.We identify five categories of KG integration:
Pretraining, Post-training, KG-based Augmentation, Inference-time Integration, and Continuous Update.</p>
      <p>Analysis and Synthesis: We extract and analyze model name, core methodology, key contributions,
integration type, datasets used, relative performance, and limitations of each article (see table 1), which
is the basis for the identification of common challenges (section 4) and future research directions
(section 5).</p>
    </sec>
    <sec id="sec-3">
      <title>3. Background</title>
      <p>
        Although LLMs have revolutionized AI applications, their efectiveness remains dependent on the
quality and structure of contextual information provided to them [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Traditional prompt
engineering approaches are often insuficient, so CE focuses on managing contextual information, addressing
the hallucinations and factual inaccuracies [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. KGs can add structured, well-organized
information with well-defined semantics to LLM, reduce hallucinations, and increase the factual precision of
responses [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
3.1. Prompt Engineering
A prompt in GenAI is a textual input that enhances the model output, ranging from simple text to
specific information [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. Diferent prompting techniques like descriptive prompts in image generation
models like DALL-E 3 and simple queries to complex problem statements in GPT-5 [17] are used in
practice. They may guide LLMs for logical reasoning with simple queries and advanced techniques
like chain-of-thought prompting [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. So, prompt engineering crafts the optimal prompt to achieve
a specific goal and get desired domain-specific output [ 18]. It requires a blend of domain knowledge,
an understanding of the underlying behavior of the model, and careful adaptation to the specificities
of the chosen LLM, such as its instruction-tuning regime, context window limitations, and response
calibration mechanisms. [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ].
3.2. Retrieval-augmented generation (RAG)
Retrieval Augmented Generation (RAG) improves the capability of LLM in knowledge-intensive tasks,
continuous knowledge updates, and provides domain-specific information [ 19]. RAG addresses LLM
limitations such as hallucination and short context window by providing important contextual
information, including knowledge from external databases [20, 21]. Naive RAG focused on basic chunk
similarity and an incomplete understanding of queries [21] whereas advanced RAG introduced
hierarchical indexing and reranking. Modular RAG introduced a task-specific, flexible, and modular
architecture [21]. Recently, GraphRAG [22] excels in capturing relational knowledge for more
accurate and context-aware retrieval by relying on KGs to support multi-hop traversal and entity-relation
matching, thereby integrating various degrees of reasoning over structured data.
3.3. Agentic AI
Agentic AI is presented as an evolution of GenAI applications, enhancing systems to operate
independently, perform broader aspects rather than isolated tasks, and execute complex activities [
        <xref ref-type="bibr" rid="ref25">44</xref>
        ].
Historically proposed as foundational part of the Semantic Web ecosystem [
        <xref ref-type="bibr" rid="ref26">45</xref>
        ], modern AI Agents
extended the capabilities of LLMs by leveraging external tools, function calling, and workflows, enabling
them to perform more complex processes through planning, tool selection, and feedback loops. The
Agentic AI paradigm further extends this autonomy by designing systems that consist of multiple agents
that coordinate and communicate with each other as well as perform tasks collaboratively, adapting to
dynamic conditions to achieve a broader goal [
        <xref ref-type="bibr" rid="ref27 ref28">46, 47</xref>
        ].
3.4. Context Engineering
      </p>
      <p>
        Addressing the shortcomings of traditional prompt engineering, CE enhances LLM capabilities by
systematically designing, filtering, and structuring input information. At the core of this framework
lies the information architecture, which structures context in a hierarchical order and groups related
concepts to reduce processing load [
        <xref ref-type="bibr" rid="ref29">48</xref>
        ]. Furthermore, contextual relevance and filtering optimise limited
context windows by applying query-context alignment, ranking details by importance, and reducing
redundancy to ensure more accurate and useful responses [
        <xref ref-type="bibr" rid="ref30 ref31">49, 50</xref>
        ]. Multimodal integration further
boosts these capabilities by incorporating diverse modalities (text, visual, and temporal information),
which enable complex cross-modal reasoning.
      </p>
      <p>
        Figure 2 shows key components of modern AI systems, highlighting the role of CE to enable more
capable and goal-directed Agentic AI systems: Diferent sources of information, incl. KGs, long-term
memory, and past state or history, are used together to create, process, and manage useful context. This
context helps to guide AI systems to act more intelligently and produce better structured results.
3.5. Knowledge Graphs: Representation and Reasoning
KGs provide structured, machine-interpretable representations of knowledge through
subject-predicateobject triples, enabling semantic understanding and automated reasoning across complex,
interconnected datasets [
        <xref ref-type="bibr" rid="ref32 ref33">51, 52</xref>
        ]. They are widely used in domains such as search, recommendation systems,
information retrieval, and data integration [53]. Their graph-based structure, comprising nodes (entities,
literals) and edges (relations), presents a rich semantic representation [
        <xref ref-type="bibr" rid="ref33">52</xref>
        ]. Furthermore, bidirectional
integration of KGs with LLMs has created new opportunities for contextual engineering, where KGs
enhance context ingestion and query enrichment while LLMs contribute to KG construction and
relationship prediction [54]. Modern KGs capture factual information and deeper relationships between
concepts [55] such as hierarchies and causal links that are beyond surface level association [
        <xref ref-type="bibr" rid="ref32">51, 56</xref>
        ]. In
addition to semantic richness, KGs support multi-hop and type-based reasoning, which supports tasks
such as classification, generalization and logical reasoning [ 57]. Moreover, the dynamic nature of KGs
allows updates that add new entities without requiring reconstruction of full graph [58].
4. Approaches for Integrating KGs into Context Engineering and Their
      </p>
      <p>Limitations
KG and LLM have recently gained increased attention, as both technologies are highly complementary
in their capabilities [59]. LLMs excel at natural language understanding and generation, while KGs ofer
structured, semantically rich information that enhances LLM efectiveness and interpretability. In this
section, we explore several methodologies to integrate KGs into CE for Agentic AI and the limitations
they sufer. We also explore diferent datasets used to advance KGs, CE, and Agentic AI.
4.1. Knowledge Integration Methodologies
We categorize KG–LLM integration methods into pre-training, post-training, KG-based augmentation,
inference time integration and continuous knowledge updates based on the type of integration of
KG into the LLM pipeline. Figure 3 illustrates the interaction of each method with model training or
inference, while Figure 4 contains the taxonomy of KG integration methodologies in the literature.</p>
      <p>Pre-training integration involves using structured knowledge into the model’s embedding or
representation layer even before training. As indicated in figure 3a, KG is used to shape embeddings before the
main training in pre-training integration approaches. Early injection-based works such as K-BERT [23]
inject KG triples directly into input sentences through a knowledge layer as structured “sentence trees,”
controlled by soft-position embeddings and visible matrices. It is used for expert reasoning in tasks like
question answering (Q&amp;A) and Named Entity Recognition (NER). A similar approach is employed in
ConceptFormer [25], which injects KG-derived concept vectors into the LLM embedding space without
retraining. Both methods efectively ground local context and improve domain precision but remain
limited by the quality and completeness of curated graphs. Building on previous injection-based
strategies, Graph-Token uses embeddings to integrate KG representations directly into a frozen LLM that
enables knowledge-aware reasoning without additional fine-tuning [ 26]. Although these approaches
demonstrate significant improvement in generating grounded responses, current evaluations focus
mainly on node-level reasoning tasks such as existence, counting, and identification. More complex
tasks, for example, edge-centric and graph-level reasoning, remain underexplored.</p>
      <p>On the contrary, post-training integration adds new knowledge to an already trained model (see
ifgure 3a). Building on this direction, Lavrinovics et al. [ 31] propose a KG-based hallucination mitigation
framework, where knowledge from KGs is integrated at multiple stages of the LLM pipeline. Their
method extends conventional factual injection approaches by enabling autonomous fact, self-correction,
and reasoning via KG-based memory and hybrid mitigation strategies. However, the approach remains
limited by static graph updates and modular complexity.</p>
      <p>
        In KG-based augmentation, we augment the model with data from KGs for enhanced model
performance in terms of accuracy and depth. Inference time integration (figure 3b) does not require LLMs
to store KGs; instead, they dynamically query and use KGs at the moment of answering a user query.
Hence, the aim of this approach is to create a grounding mechanism that supports LLMs in generating
responses that are explainable, factually grounded, and consistent. Furthermore, continuous update
involves repetitively monitoring new data and continuously updating knowledge; hence, the model
is ever-evolving and can adapt to new insights (figure 3a). Techniques use KGs during inference
to boost the capabilities of LLMs using semantic and structural relationships without requiring
embedding during pre-training. For example, THINK-ON-GRAPH [32] enables KGs and LLMs to work
(a) Pre-training, post-training, and KG-based augmentation
(b) Inference-time integration and continuous knowledge update
together through a beam search algorithm to dynamically explore multiple reasoning paths within a
KG, improving decision-making and explainability. HOLMES [
        <xref ref-type="bibr" rid="ref23">42</xref>
        ] enhances interpretability through
hyper-relational schemas and controlled multi-hop BFS expansion to identify missing facts and retrieve
supporting evidence. Their performance depends on the graph quality and completeness, where missing
or noisy triples can disrupt reasoning chains. Large-scale traversal of graphs such as Wikidata remains
computationally demanding.
      </p>
      <p>Recent research uses KGs as structural scafolds to enhance retrieval and context building in
retrievals
d
o
h
t
e
M
n
o
it
a
r
g
e
t
n
I
G
K</p>
      <p>Pre-training
Integration
Post-training
Integration</p>
      <p>KG-based
Augmentation</p>
      <p>Hybrid
Approaches
Inference time
Integration</p>
      <p>
        Interactive-KBQA [
        <xref ref-type="bibr" rid="ref18">37</xref>
        ]
GraphReader [64]
      </p>
      <p>Multi-turn SPARQL generation</p>
      <p>Constructs KG from long texts during inference</p>
      <p>
        Continuous
Knowledge Update
augmented generation (RAG) systems. For example, KG-FiD [24] integrates semantic passage graphs
into Fusion-in-Decoder architectures that enable more precise reranking and improve the extraction of
answer-relevant text. This yields notable gains in exact-match accuracy and reduces computational cost.
Similarly, frameworks such as Evidence-Focused Fact Summarization (EFSUM) [
        <xref ref-type="bibr" rid="ref17">36</xref>
        ] and TrumorGPT [33]
employ KG-based subgraph construction, OpenIE-derived triples, or graph-guided document filtering
to enhance faithfulness and mitigate hallucinations. These retrieval methods are vulnerable to
entitylinking and subgraph errors, often yielding confident but wrong answers in fast-changing domains.
4.2. Emerging Paradigms: Dynamic Memory, Hybrid Systems, and Open Challenges
The field is rapidly evolving toward incorporating KGs into complex systems, giving rise to dynamic
agent memory architectures and hybrid symbolic-neural models. ZEP introduces a temporally aware
KG that unifies episodic, semantic, and community subgraphs for agent memory, enabling accurate,
low-latency long-term memory for real applications. KARMA [30] applies a multi-agent LLM system to
independently ingest, segment, align, and validate new knowledge for expanding KG coverage. However,
they also have issues concerning quality assurance, validation, and reliability, as automatically generated
triples can introduce factual inconsistencies, requiring human oversight to maintain reliability.
      </p>
      <p>
        Other approaches adopt hybrid architectures that combine symbolic structure with LLM generation.
Systems such as FOLK, SURGE [
        <xref ref-type="bibr" rid="ref20">39</xref>
        ], Interactive-KBQA [
        <xref ref-type="bibr" rid="ref18">37</xref>
        ], and KERE [29] leverage logical
representations, subgraph retrieval, and ontological constraints to enable explainable, entity-level reasoning
across tasks including dialogue and relation extraction. While these methods can enhance faithfulness,
consistency, and interpretability, they also introduce system-level complexities, as errors in retrieval,
linking, or component interaction can propagate and undermine symbolic guarantees. More broadly,
recurring challenges emerge around balancing structural precision with neural adaptability: graph-based
methods improve factual consistency but depend on graph quality and face scalability limits, and KGs
are evolving from static background resources to dynamic, agent-driven memory systems for multi-turn,
multimodal, temporally extended reasoning [
        <xref ref-type="bibr" rid="ref18 ref19">38, 30, 37</xref>
        ]. Current methods span from structural KG
injection (e.g., K-BERT [23]) to dynamic, agent-based architectures (e.g., ZEP, KARMA [30]) (see Table 1),
reflecting a shift toward more context-aware and explainable reasoning systems.
      </p>
      <p>These studies highlight the rapid advancement of LLM-KG integration with the need of establishing
a consistent evaluation framework and addressing the challenges of large-scale, dynamically evolving
graphs. With operational use, balancing adaptability with reliability will remain a major challenge.
4.3. Datasets and diferent evaluation metrices
This section highlights datasets from papers advancing Agentic AI, CE, and KGs, focusing on those
most relevant to our analysis. From the survey, HotpotQA is the most frequently used dataset due to its
robust framework for multi-hop question answering. WebQSP, CWQ, DocRED, Synthetic, PolitiFact,
and DBpedia are also frequently used for question answering, relation extraction, and fact checking.</p>
      <p>Table 2 summarizes the datasets and their usages in literature in three research areas: Knowledge
Graphs (KG), Context Engineering (CE), and Agentic AI (AA).</p>
    </sec>
    <sec id="sec-4">
      <title>5. Future Research Direction</title>
      <p>We identified several research directions based on our literature study to address fundamental limitations
in knowledge correctness and reliability while opening new directions for future innovation.</p>
      <p>
        Neuro Symbolic Integration: An emerging direction for advancing CE is the integration of symbolic
reasoning from KGs with the neural capabilities of LLMs [69]. Approaches like ConceptFormer [25] and
HOLMES [
        <xref ref-type="bibr" rid="ref23">42</xref>
        ] inject KG-derived knowledge into LLMs, while THINK-ON-GRAPH [32] demonstrates
how symbolic reasoning and neural generation can work together in multi-hop question answering.
However, most methods rely on static alignments or task-specific pipelines, limiting scalability and
dynamic updates. Future research should focus on seamlessly aligning structured KGs and unstructured
LLM outputs, integrating neural and symbolic reasoning to improve interpretability [? ]. This will
enhance the development of trustworthy, real-time, context-aware agentic AI systems [? ].
      </p>
      <p>
        Autonomous and Self-Updating KGs: Most current systems like KG-FiD [24], KGAT [27], and
EFSUM [
        <xref ref-type="bibr" rid="ref17">36</xref>
        ] rely on static KGs, which struggle with evolving data contexts. Self-updating KGs address
this by enabling real-time integration of facts and relationships without system retraining. It still faces
challenges with static and outdated information due to manual validation and query accuracy [70].
Systems like ZEP [
        <xref ref-type="bibr" rid="ref19">38</xref>
        ] and KARMA [30], demonstrate the potential for autonomous construction.
Future systems should update their KGs autonomously by monitoring external data, identifying missing
information, and making necessary adjustments to maintain a consistent knowledge structure.
      </p>
      <p>
        Multimodal Context Grounding for Real-World Understanding: KGs should integrate text,
images, audio, and video to better model the real world and improve CE. Systems like VisDoMRAG [
        <xref ref-type="bibr" rid="ref22">41</xref>
        ]
show 12–20% gains on visually rich datasets but still face visual bias and cross-modal alignment issues.
Current multimodal methods (e.g., image labeling and symbol grounding with datasets like MSCOCO
[71]) reach only 43% accuracy in detecting misclassified objects [ 71] and fail to capture abstract or
emotional concepts. Utilizing multimodal LLMs to extract information from sources like medical
scans and verify data consistency, along with attention mechanisms for live context updates, has great
potential. This will enhance healthcare, robotics, and chat systems [72].
      </p>
      <p>Quality: With the rise of Agentic AI and hybrid LLM-KG systems, ensuring quality becomes more
complex. While it’s important to assess individual component quality to improve the overall system, it’s
also crucial to explore how small quality issues in components may accumulate into larger problems.
Investigating the addition of monitoring components to mitigate such issues seems promising. We
encourage research into quality-by-design software architectures to address potential quality challenges
in future hybrid Agentic AI systems.
Description
Random graphs used for node-related reasoning tasks with KG embeddings in LLMs
Molecular graphs for anti-HIV activity, used for KG reasoning and context embedding
Nitroaromatic compound graphs, used for KG reasoning and context embedding
Molecular graphs for solubility, used for KG reasoning and context embedding
Biomedical texts used for automated KG enrichment with multi-agent LLMs
KG-based QA dataset with Freebase/Wikidata, used for reasoning and context [32,
construction 36]
GrailQA [68] KG-based QA dataset with Freebase/Wikidata, used for reasoning and context [32]</p>
      <p>construction
CWQ [32] Complex KG-based QA dataset with Wikidata, used for reasoning and context [32,</p>
      <p>construction 37]
BioRel [29] For sentence-level relation extraction [29]
DocRED [29] Document-level relation extraction dataset with Wikidata, used for KG-based ex- [29]</p>
      <p>
        traction
HotpotQA Multi-hop QA dataset with Wikipedia-based KGs, used for reasoning and context [
        <xref ref-type="bibr" rid="ref23">42</xref>
        ]
      </p>
      <p>
        construction
MuSiQue [
        <xref ref-type="bibr" rid="ref23">42</xref>
        ] Multi-hop QA dataset with complex reasoning, used for KG construction and rea- [
        <xref ref-type="bibr" rid="ref23">42</xref>
        ]
      </p>
      <p>
        soning
2WikiMultiHopQA [M43u]lti-hop QA dataset with Wikidataand complex
MetaQA [
        <xref ref-type="bibr" rid="ref18">37</xref>
        ] KG-based QA dataset with multi-hop queries, used for reasoning and interactions
Natural Ques- Open-domain QA dataset with Wikipedia-based KGs, used for fact summarization
tions (NQ) [24]
TriviaQA [24] Trivia QA dataset with Wikipedia-based KGs, used for fact summarization
Mintaka [
        <xref ref-type="bibr" rid="ref17">36</xref>
        ] Multilingual QA dataset with Wikipedia-based KGs, used for fact summarization
Chnsenticorp [23] A hotel review dataset for single-sentence sentiment classification
MedicalKG [23] A self-developed Chinese medical concept KG
CN- A large open-domain encyclopedic Chinese KG
DBpedia [23]
HowNet [23]
DialogRE [28]
TACRED [28]
VisDoM [
        <xref ref-type="bibr" rid="ref22">41</xref>
        ]
HOVER [
        <xref ref-type="bibr" rid="ref21">40</xref>
        ]
      </p>
    </sec>
    <sec id="sec-5">
      <title>6. Conclusion</title>
      <p>In conclusion, this study provides a thorough analysis of KG-based CE in Agentic AI systems,
highlighting key research questions and integration strategies. We identified limitations and proposed
future directions for improving context-aware models, with a focus on enhancing the quality and
reliability of contextual knowledge. Beyond conventional integration methods, we also explored areas
like continuous knowledge updates, neurosymbolic integration, self-updating KGs, and multimodal CE,
all of which must address critical challenges of knowledge quality, consistency, and trustworthiness.</p>
    </sec>
    <sec id="sec-6">
      <title>7. Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used Grammarly to improve grammar, check spelling,
and reword. After using these tool(s)/service(s), the authors reviewed and edited the content as needed
and take full responsibility for the publication’s content.
[17] L. Giray, Prompt engineering with chatgpt: a guide for academic writers, Annals of biomedical
engineering 51 (2023) 2629–2633.
[18] Q. Ye, M. Ahmed, R. Pryzant, F. Khani, Prompt engineering a prompt engineer, in: Findings of the</p>
      <p>Association for Computational Linguistics: ACL 2024, 2024, pp. 355–385.
[19] Y. Gao, Y. Xiong, X. Gao, K. Jia, J. Pan, Y. Bi, Y. Dai, J. Sun, M. Wang, H. Wang, Retrieval-augmented
generation for large language models: A survey, 2024. URL: https://arxiv.org/abs/2312.10997.
arXiv:2312.10997.
[20] L. Mei, J. Yao, Y. Ge, Y. Wang, B. Bi, Y. Cai, J. Liu, M. Li, Z.-Z. Li, D. Zhang, C. Zhou, J. Mao,
T. Xia, J. Guo, S. Liu, A survey of context engineering for large language models, 2025. URL:
https://arxiv.org/abs/2507.13334. arXiv:2507.13334.
[21] Y. Gao, Y. Xiong, M. Wang, H. Wang, Modular rag: Transforming rag systems into lego-like
reconfigurable frameworks, 2024. URL: https://arxiv.org/abs/2407.21059. arXiv:2407.21059.
[22] B. Peng, Y. Zhu, Y. Liu, X. Bo, H. Shi, C. Hong, Y. Zhang, S. Tang, Graph retrieval-augmented
generation: A survey, 2025.
[23] W. Liu, P. Zhou, Z. Zhao, Z. Wang, Q. Ju, H. Deng, P. Wang, K-bert: Enabling language
representation with knowledge graph, in: Proceedings of the AAAI conference on artificial intelligence,
volume 34, 2020, pp. 2901–2908.
[24] D. Yu, C. Zhu, Y. Fang, W. Yu, S. Wang, Y. Xu, X. Ren, Y. Yang, M. Zeng, Kg-fid: Infusing knowledge
graph in fusion-in-decoder for open-domain question answering, in: Proceedings of the 60th
annual meeting of the association for computational linguistics (volume 1: long papers), 2022, pp.
4961–4974.
[25] J. Barmettler, A. Bernstein, L. Rossetto, Conceptformer: Towards eficient use of
knowledgegraph embeddings in large language models, 2025. URL: https://arxiv.org/abs/2504.07624.
arXiv:2504.07624.
[26] E. Coppolillo, Injecting knowledge graphs into large language models, arXiv preprint
arXiv:2505.07554 (2025).
[27] R. Kumar, H. Kumar, K. Shalini, Detecting and mitigating bias in llms through knowledge
graph-augmented training, in: 2025 International Conference on Artificial Intelligence and Data
Engineering (AIDE), IEEE, 2025, pp. 608–613.
[28] X. Chen, N. Zhang, X. Xie, S. Deng, Y. Yao, C. Tan, F. Huang, L. Si, H. Chen, Knowprompt:
Knowledge-aware prompt-tuning with synergistic optimization for relation extraction, in:
Proceedings of the ACM Web Conference 2022, WWW ’22, Association for Computing
Machinery, New York, NY, USA, 2022, p. 2778–2788. URL: https://doi.org/10.1145/3485447.3511998.
doi:10.1145/3485447.3511998.
[29] M. Jain, Knowledge enabled relation extraction, in: Companion Proceedings of the ACM Web</p>
      <p>Conference 2024, 2024, pp. 1210–1213.
[30] Y. Lu, J. Wang, Karma: Leveraging multi-agent llms for automated knowledge graph enrichment,
2025. URL: https://arxiv.org/abs/2502.06472. arXiv:2502.06472.
[31] E. Lavrinovics, R. Biswas, J. Bjerva, K. Hose, Knowledge graphs, large language models, and
hallucinations: An nlp perspective, Journal of Web Semantics 85 (2025) 100844.
[32] J. Sun, C. Xu, L. Tang, S. Wang, C. Lin, Y. Gong, L. M. Ni, H.-Y. Shum, J. Guo, Think-on-graph:
Deep and responsible reasoning of large language model on knowledge graph, arXiv preprint
arXiv:2307.07697 (2023).
[33] C. N. Hang, P.-D. Yu, C. W. Tan, Trumorgpt: Query optimization and semantic reasoning over
networks for automated fact-checking, in: 2024 58th Annual Conference on Information Sciences
and Systems (CISS), 2024, pp. 1–6. doi:10.1109/CISS59072.2024.10480162.
[34] C. N. Hang, P.-D. Yu, C. W. Tan, Trumorgpt: Graph-based retrieval-augmented large language
model for fact-checking, IEEE Transactions on Artificial Intelligence (2025) 1–15. URL: http:
//dx.doi.org/10.1109/TAI.2025.3567369. doi:10.1109/tai.2025.3567369.
[35] A. Martin, H. F. Witschel, M. Mandl, M. Stockhecke, Semantic verification in large language
model-based retrieval augmented generation, in: Proceedings of the AAAI Symposium Series,
volume 3, 2024, pp. 188–192.
494–514.
[53] J. Jiaxin, X. Huang, B. Choi, J. Xu, S. S Bhowmick, L. Xu, Ppkws: An eficient framework for keyword
search on public-private networks, 2020, pp. 457–468. doi:10.1109/ICDE48307.2020.00046.
[54] A. Kau, X. He, A. Nambissan, A. Astudillo, H. Yin, A. Aryani, Combining knowledge graphs and
large language models, 2024. URL: https://arxiv.org/abs/2407.06564. arXiv:2407.06564.
[55] M. Yahya, J. G. Breslin, M. I. Ali, Semantic web and knowledge graphs for industry 4.0, Applied</p>
      <p>Sciences 11 (2021) 5110.
[56] F. N. Al-Aswadi, H. Y. Chan, K. H. Gan, From ontology to knowledge graph trend: ontology as
foundation layer for knowledge graph, in: Iberoamerican Knowledge Graphs and Semantic Web
Conference, Springer, 2022, pp. 330–340.
[57] K. Liang, L. Meng, M. Liu, Y. Liu, W. Tu, S. Wang, S. Zhou, X. Liu, F. Sun, K. He, A survey of
knowledge graph reasoning on graph types: Static, dynamic, and multi-modal, IEEE Transactions
on Pattern Analysis and Machine Intelligence 46 (2024) 9456–9478.
[58] S. M. Mohamed, S. Farah, A. M. Lotfy, K. A. Rizk, A. Y. Saeed, S. H. Mohamed, G. Khoriba, T. Arafa,
Knowledge graphs: The future of data integration and insightful discovery, in: Advanced Research
Trends in Sustainable Solutions, Data Analytics, and Security, IGI Global Scientific Publishing,
2025, pp. 99–146.
[59] H. Khorashadizadeh, F. Z. Amara, M. Ezzabady, F. Ieng, S. Tiwari, N. Mihindukulasooriya, J. Groppe,
S. Sahri, F. Benamara, S. Groppe, Research trends for the interplay between large language models
and knowledge graphs, Proceedings of the VLDB Endowment. ISSN 2150 (2024) 8097.
[60] J. Jiang, K. Zhou, W. X. Zhao, Y. Song, C. Zhu, H. Zhu, J.-R. Wen, Kg-agent: An eficient autonomous
agent framework for complex reasoning over knowledge graph, in: Proceedings of the 63rd Annual
Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025, pp. 9505–
9523.
[61] J. Liu, X. Huang, Z. Chen, Y. Fang, Drak: Unlocking molecular insights withnbsp;domain-specific
retrieval-augmented knowledge innbsp;llms, in: Natural Language Processing and Chinese
Computing: 13th National CCF Conference, NLPCC 2024, Hangzhou, China, November 1–3,
2024, Proceedings, Part II, Springer-Verlag, Berlin, Heidelberg, 2024, p. 255–267. URL: https:
//doi.org/10.1007/978-981-97-9434-8_20. doi:10.1007/978-981-97-9434-8_20.
[62] J. Baek, A. F. Aji, A. Safari, Knowledge-augmented language model prompting for zero-shot
knowledge graph question answering, Proceedings of the First Workshop on Matching From
Unstructured and Structured Data (MATCHING 2023) (2023). URL: https://api.semanticscholar.
org/CorpusID:260063238.
[63] Y. Wei, Q. Huang, Y. Zhang, J. Kwok, KICGPT: Large language model with knowledge in context for
knowledge graph completion, in: H. Bouamor, J. Pino, K. Bali (Eds.), Findings of the Association for
Computational Linguistics: EMNLP 2023, Association for Computational Linguistics, Singapore,
2023, pp. 8667–8683. URL: https://aclanthology.org/2023.findings-emnlp.580/. doi: 10.18653/v1/
2023.findings-emnlp.580.
[64] S. Li, Y. He, H. Guo, X. Bu, G. Bai, J. Liu, J. Liu, X. Qu, Y. Li, W. Ouyang, W. Su, B. Zheng,
GraphReader: Building graph-based agent to enhance long-context abilities of large language
models, in: Y. Al-Onaizan, M. Bansal, Y.-N. Chen (Eds.), Findings of the Association for Computational
Linguistics: EMNLP 2024, Association for Computational Linguistics, Miami, Florida, USA, 2024,
pp. 12758–12786. doi:10.18653/v1/2024.findings-emnlp.746.
[65] Y. Zhang, K. Chen, X. Bai, Z. Kang, Q. Guo, M. Zhang, Question-guided knowledge graph
rescoring and injection for knowledge graph question answering, in: Findings of the Association for
Computational Linguistics: EMNLP 2024, 2024, pp. 8972–8985.
[66] Y. Su, X. Han, Z. Zhang, Y. Lin, P. Li, Z. Liu, J. Zhou, M. Sun, Cokebert: Contextual knowledge
selection and embedding towards enhanced pre-trained language models, AI Open 2 (2021)
127–134.
[67] Z. Zhang, X. Han, Z. Liu, X. Jiang, M. Sun, Q. Liu, ERNIE: Enhanced language representation
with informative entities, in: A. Korhonen, D. Traum, L. Màrquez (Eds.), Proceedings of the 57th
Annual Meeting of the Association for Computational Linguistics, Association for Computational
Linguistics, Florence, Italy, 2019, pp. 1441–1451. URL: https://aclanthology.org/P19-1139/. doi:10.
18653/v1/P19-1139.
[68] Y. Gu, S. Kase, M. Vanni, B. M. Sadler, P. Liang, X. Yan, Y. Su, Beyond I.I.D.: three levels of
generalization for question answering on knowledge bases, in: J. Leskovec, M. Grobelnik, M. Najork,
J. Tang, L. Zia (Eds.), WWW ’21: The Web Conference 2021, Virtual Event / Ljubljana, Slovenia,
April 19-23, 2021, ACM / IW3C2, 2021, pp. 3477–3488. URL: https://doi.org/10.1145/3442381.3449992.
doi:10.1145/3442381.3449992.
[69] B. P. Bhuyan, A. Ramdane-Cherif, R. Tomar, T. Singh, Neuro-symbolic artificial intelligence: a
survey, Neural Computing and Applications 36 (2024) 12809–12844.
[70] S. Hatem, G. Khoriba, M. H. Gad-Elrab, M. ElHelw, Up to date: Automatic updating knowledge
graphs using llms, Procedia Computer Science 244 (2024) 327–334. URL: https://www.sciencedirect.
com/science/article/pii/S1877050924030072. doi:https://doi.org/10.1016/j.procs.2024.
10.206, 6th International Conference on AI in Computational Linguistics.
[71] X. Zhu, Z. Li, X. Wang, X. Jiang, P. Sun, X. Wang, Y. Xiao, N. J. Yuan, Multi-modal knowledge graph
construction and application: A survey, IEEE Transactions on Knowledge and Data Engineering
36 (2024) 715–735. URL: http://dx.doi.org/10.1109/TKDE.2022.3224228. doi:10.1109/tkde.2022.
3224228.
[72] X. Wang, B. Meng, H. Chen, Y. Meng, K. Lv, W. Zhu, Tiva-kg: A multimodal knowledge graph
with text, image, video and audio, in: Proceedings of the 31st ACM International Conference
on Multimedia, MM ’23, Association for Computing Machinery, New York, NY, USA, 2023, p.
2391–2399. URL: https://doi.org/10.1145/3581783.3612266. doi:10.1145/3581783.3612266.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>D.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Dai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Jin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Ning</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Xuan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Cai</surname>
          </string-name>
          , et al.,
          <article-title>Llms-based machine translation for e-commerce</article-title>
          ,
          <source>Expert Systems with Applications</source>
          <volume>258</volume>
          (
          <year>2024</year>
          )
          <fpage>125087</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhuang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <surname>C. Zhang,</surname>
          </string-name>
          <article-title>Toolqa: A dataset for llm question answering with external tools</article-title>
          , in: A.
          <string-name>
            <surname>Oh</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Naumann</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Globerson</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Saenko</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Hardt</surname>
          </string-name>
          , S. Levine (Eds.),
          <source>Advances in Neural Information Processing Systems</source>
          , volume
          <volume>36</volume>
          ,
          <string-name>
            <surname>Curran</surname>
            <given-names>Associates</given-names>
          </string-name>
          , Inc.,
          <year>2023</year>
          , pp.
          <fpage>50117</fpage>
          -
          <lpage>50143</lpage>
          . URL: https://proceedings.neurips.cc/paper_files/paper/2023/file/ 9cb2a7495900f8b602cb10159246a016-Paper-Datasets_and_Benchmarks.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>H.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. S.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. Zhang,</surname>
          </string-name>
          <article-title>A systematic survey of text summarization: From statistical methods to large language models</article-title>
          ,
          <source>ACM Comput. Surv</source>
          .
          <volume>57</volume>
          (
          <year>2025</year>
          ). URL: https://doi.org/10.1145/3731445. doi:
          <volume>10</volume>
          .1145/3731445.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>C.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Xie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <article-title>Speak from heart: An emotion-guided llm-based multimodal method for emotional dialogue generation</article-title>
          ,
          <source>in: Proceedings of the 2024 International Conference on Multimedia Retrieval</source>
          , ICMR '24,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2024</year>
          , p.
          <fpage>533</fpage>
          -
          <lpage>542</lpage>
          . URL: https://doi.org/10.1145/3652583.3658104. doi:
          <volume>10</volume>
          .1145/3652583.3658104.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>H. F.</given-names>
            <surname>Atlam</surname>
          </string-name>
          ,
          <article-title>Llms in cyber security: Bridging practice and education</article-title>
          ,
          <source>Big Data and Cognitive Computing</source>
          <volume>9</volume>
          (
          <year>2025</year>
          )
          <fpage>184</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>D.</given-names>
            <surname>Dobriy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Bauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Azzam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Banerjee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Polleres</surname>
          </string-name>
          ,
          <string-name>
            <surname>Agentic</surname>
            <given-names>SPARQL</given-names>
          </string-name>
          :
          <article-title>Evaluating SPARQL-MCP-powered Intelligent Agents on the Federated KGQA Benchmark</article-title>
          ,
          <source>Technical Report, Vienna Unversity of Economics and Business</source>
          ,
          <year>2026</year>
          . doi:
          <volume>10</volume>
          .57938/ 83c86964-2d48
          <string-name>
            <surname>-</surname>
          </string-name>
          46f1
          <string-name>
            <surname>-</surname>
          </string-name>
          b655-5bef78c1a837.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Shi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Ma</surname>
          </string-name>
          , W. Zhong,
          <string-name>
            <given-names>J.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <article-title>Llm hallucinations in practical code generation: Phenomena, mechanism, and mitigation</article-title>
          ,
          <source>Proceedings of the ACM on Software Engineering</source>
          <volume>2</volume>
          (
          <year>2025</year>
          )
          <fpage>481</fpage>
          -
          <lpage>503</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>R.</given-names>
            <surname>Shankar</surname>
          </string-name>
          ,
          <article-title>Context is king: From prompt engineering to context engineering in healthcare ai</article-title>
          ,
          <source>Available at SSRN</source>
          <volume>5365971</volume>
          (
          <year>2025</year>
          ). URL: http://dx.doi.org/10.2139/ssrn.5365971.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>H.</given-names>
            <surname>Abu-Rasheed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Weber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fathi</surname>
          </string-name>
          ,
          <article-title>Knowledge graphs as context sources for llm-based explanations of learning recommendations</article-title>
          ,
          <source>in: 2024 IEEE Global Engineering Education Conference (EDUCON)</source>
          , IEEE,
          <year>2024</year>
          , p.
          <fpage>1</fpage>
          -
          <lpage>5</lpage>
          . URL: http://dx.doi.org/10.1109/EDUCON60312.
          <year>2024</year>
          .
          <volume>10578654</volume>
          . doi:
          <volume>10</volume>
          .1109/educon60312.
          <year>2024</year>
          .
          <volume>10578654</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>W.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Qi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Ji</surname>
          </string-name>
          ,
          <article-title>Hybrid reasoning in knowledge graphs: Combing symbolic reasoning and statistical reasoning</article-title>
          ,
          <source>Semantic Web</source>
          <volume>11</volume>
          (
          <year>2020</year>
          )
          <fpage>53</fpage>
          -
          <lpage>62</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Luo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <article-title>Unifying large language models and knowledge graphs: A roadmap</article-title>
          ,
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          <volume>36</volume>
          (
          <year>2024</year>
          )
          <fpage>3580</fpage>
          -
          <lpage>3599</lpage>
          . URL: http://dx.doi.org/10.1109/TKDE.
          <year>2024</year>
          .
          <volume>3352100</volume>
          . doi:
          <volume>10</volume>
          .1109/tkde.
          <year>2024</year>
          .
          <volume>3352100</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>H.</given-names>
            <surname>Ren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Dai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Dai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Leskovec</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Schuurmans</surname>
          </string-name>
          ,
          <article-title>Smore: Knowledge graph completion and multi-hop reasoning in massive knowledge graphs</article-title>
          ,
          <source>in: Proceedings of the 28th ACM SIGKDD conference on knowledge discovery and data mining</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>1472</fpage>
          -
          <lpage>1482</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S.</given-names>
            <surname>An</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Ma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Zheng</surname>
          </string-name>
          , J.-G. Lou, W. Chen,
          <article-title>Make your llm fully utilize the context</article-title>
          ,
          <source>Advances in Neural Information Processing Systems</source>
          <volume>37</volume>
          (
          <year>2024</year>
          )
          <fpage>62160</fpage>
          -
          <lpage>62188</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Xia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>McAuley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <article-title>Beyond chain-of-thought: A survey of chain-of-X paradigms for LLMs</article-title>
          , in: O.
          <string-name>
            <surname>Rambow</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Wanner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Apidianaki</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Al-Khalifa</surname>
            ,
            <given-names>B. D.</given-names>
          </string-name>
          <string-name>
            <surname>Eugenio</surname>
          </string-name>
          , S. Schockaert (Eds.),
          <source>Proceedings of the 31st International Conference on Computational Linguistics</source>
          , Association for Computational Linguistics, Abu Dhabi,
          <string-name>
            <surname>UAE</surname>
          </string-name>
          ,
          <year>2025</year>
          , pp.
          <fpage>10795</fpage>
          -
          <lpage>10809</lpage>
          . URL: https://aclanthology.org/
          <year>2025</year>
          .coling-main.
          <volume>719</volume>
          /.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>L.</given-names>
            <surname>Cai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Fu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <article-title>Practices, opportunities and challenges in the fusion of knowledge graphs and large language models</article-title>
          ,
          <source>Frontiers in Computer Science</source>
          <volume>7</volume>
          (
          <year>2025</year>
          )
          <fpage>1590632</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>T. F.</given-names>
            <surname>Heston</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Khun</surname>
          </string-name>
          , Prompt engineering in medical education,
          <source>International Medical Education</source>
          <volume>2</volume>
          (
          <year>2023</year>
          )
          <fpage>198</fpage>
          -
          <lpage>205</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>S.</given-names>
            <surname>Ko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Cho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Chae</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yeo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <article-title>Evidence-focused fact summarization for knowledgeaugmented zero-shot question answering</article-title>
          , in: Y.
          <string-name>
            <surname>Al-Onaizan</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Bansal</surname>
            ,
            <given-names>Y.-N.</given-names>
          </string-name>
          <string-name>
            <surname>Chen</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing</source>
          , Association for Computational Linguistics, Miami, Florida, USA,
          <year>2024</year>
          , pp.
          <fpage>10636</fpage>
          -
          <lpage>10651</lpage>
          . URL: https://aclanthology.org/
          <year>2024</year>
          .emnlp-main.
          <volume>594</volume>
          /. doi:
          <volume>10</volume>
          .18653/v1/
          <year>2024</year>
          .emnlp-main.
          <volume>594</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>G.</given-names>
            <surname>Xiong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <article-title>Interactive-kbqa: Multi-turn interactions for knowledge base question answering with large language models</article-title>
          ,
          <source>arXiv preprint arXiv:2402.15131</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>P.</given-names>
            <surname>Rasmussen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Paliychuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Beauvais</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ryan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Chalef</surname>
          </string-name>
          ,
          <article-title>Zep: A temporal knowledge graph architecture for agent memory</article-title>
          ,
          <year>2025</year>
          . URL: https://arxiv.org/abs/2501.13956. arXiv:
          <volume>2501</volume>
          .
          <fpage>13956</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Kwak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Baek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. J.</given-names>
            <surname>Hwang</surname>
          </string-name>
          ,
          <article-title>Knowledge graph-augmented language models for knowledge-grounded dialogue generation</article-title>
          ,
          <year>2023</year>
          . URL: https://arxiv.org/abs/2305.18846. arXiv:
          <volume>2305</volume>
          .
          <fpage>18846</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Shu</surname>
          </string-name>
          ,
          <article-title>Explainable claim verification via knowledge-grounded reasoning with large language models</article-title>
          , in: H.
          <string-name>
            <surname>Bouamor</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Pino</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          Bali (Eds.),
          <source>Findings of the Association for Computational Linguistics: EMNLP</source>
          <year>2023</year>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          , Singapore,
          <year>2023</year>
          , pp.
          <fpage>6288</fpage>
          -
          <lpage>6304</lpage>
          . URL: https://aclanthology.org/
          <year>2023</year>
          .findings-emnlp.
          <volume>416</volume>
          /. doi:
          <volume>10</volume>
          .18653/v1/
          <year>2023</year>
          . findings-emnlp.
          <volume>416</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [41]
          <string-name>
            <given-names>M.</given-names>
            <surname>Suri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mathur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Dernoncourt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Goswami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Rossi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Manocha</surname>
          </string-name>
          ,
          <article-title>Visdom: Multi-document qa with visually rich elements using multimodal retrieval-augmented generation, in: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers</article-title>
          ),
          <year>2025</year>
          , pp.
          <fpage>6088</fpage>
          -
          <lpage>6109</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [42]
          <string-name>
            <given-names>P.</given-names>
            <surname>Panda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Agarwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Devaguptapu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kaul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Ap</surname>
          </string-name>
          , Holmes:
          <article-title>Hyper-relational knowledge graphs for multi-hop question answering using llms, in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics</article-title>
          (Volume
          <volume>1</volume>
          :
          <string-name>
            <surname>Long</surname>
            <given-names>Papers)</given-names>
          </string-name>
          ,
          <year>2024</year>
          , pp.
          <fpage>13263</fpage>
          -
          <lpage>13282</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [43]
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Hu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Z.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <article-title>Learning to plan for retrieval-augmented large language models from knowledge graphs</article-title>
          , in: Y.
          <string-name>
            <surname>Al-Onaizan</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Bansal</surname>
            ,
            <given-names>Y.-N.</given-names>
          </string-name>
          <string-name>
            <surname>Chen</surname>
          </string-name>
          (Eds.),
          <source>Findings of the Association for Computational Linguistics: EMNLP</source>
          <year>2024</year>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          , Miami, Florida, USA,
          <year>2024</year>
          , pp.
          <fpage>7813</fpage>
          -
          <lpage>7835</lpage>
          . URL: https://aclanthology.org/
          <year>2024</year>
          .findings-emnlp.
          <volume>459</volume>
          /. doi:
          <volume>10</volume>
          .18653/v1/
          <year>2024</year>
          . findings-emnlp.
          <volume>459</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [44]
          <string-name>
            <given-names>J.</given-names>
            <surname>Schneider</surname>
          </string-name>
          ,
          <article-title>Generative to agentic ai: Survey, conceptualization, and challenges</article-title>
          ,
          <source>arXiv preprint arXiv:2504.18875</source>
          (
          <year>2025</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [45]
          <string-name>
            <given-names>A.</given-names>
            <surname>Polleres</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Bauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Dobriy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Käfer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Kubelka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Harth</surname>
          </string-name>
          , T. Wehr,
          <article-title>On the historic roots of Agentic AI in Semantic Web Services</article-title>
          ,
          <source>in: The ACM Web Conference</source>
          <year>2026</year>
          ,
          <year>2026</year>
          . URL: http: //polleres.net/publications/poll-etal2026WebConfHoW.
          <article-title>pdf, to appear, extended abstract (invited, History of the Web track</article-title>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [46]
          <string-name>
            <given-names>R.</given-names>
            <surname>Sapkota</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. I.</given-names>
            <surname>Roumeliotis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Karkee</surname>
          </string-name>
          ,
          <article-title>Ai agents vs. agentic ai: A conceptual taxonomy, applications and challenges</article-title>
          , Information
          <string-name>
            <surname>Fusion</surname>
          </string-name>
          (
          <year>2025</year>
          )
          <fpage>103599</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [47]
          <string-name>
            <surname>D. B. Acharya</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Kuppan</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Divya</surname>
          </string-name>
          , Agentic ai:
          <article-title>Autonomous intelligence for complex goals-a comprehensive survey</article-title>
          ,
          <source>IEEe Access</source>
          <volume>13</volume>
          (
          <year>2025</year>
          )
          <fpage>18912</fpage>
          -
          <lpage>18936</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [48]
          <string-name>
            <given-names>P.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Perez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Piktus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Petroni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Karpukhin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Goyal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Küttler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lewis</surname>
          </string-name>
          , W.-t. Yih,
          <string-name>
            <given-names>T.</given-names>
            <surname>Rocktäschel</surname>
          </string-name>
          , et al.,
          <article-title>Retrieval-augmented generation for knowledge-intensive nlp tasks</article-title>
          ,
          <source>Advances in neural information processing systems</source>
          <volume>33</volume>
          (
          <year>2020</year>
          )
          <fpage>9459</fpage>
          -
          <lpage>9474</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [49]
          <string-name>
            <given-names>L.</given-names>
            <surname>Brehme</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ströhle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Breu</surname>
          </string-name>
          ,
          <article-title>Can llms be trusted for evaluating rag systems? a survey of methods and datasets (</article-title>
          <year>2025</year>
          )
          <fpage>16</fpage>
          -
          <lpage>23</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [50]
          <string-name>
            <given-names>V.</given-names>
            <surname>Lapov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Laurent</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Araya</surname>
          </string-name>
          , G. Ortiz,
          <string-name>
            <given-names>S.</given-names>
            <surname>Albrecht</surname>
          </string-name>
          ,
          <article-title>Dynamic context integration in large language models using a novel progressive layering framework</article-title>
          , Research Square Preprints (
          <year>2024</year>
          ). URL: https://doi.org/10.21203/rs.3.rs-
          <volume>5357232</volume>
          /v1.
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [51]
          <string-name>
            <given-names>A.</given-names>
            <surname>Hogan</surname>
          </string-name>
          , E. Blomqvist,
          <string-name>
            <given-names>M.</given-names>
            <surname>Cochez</surname>
          </string-name>
          , C. d'Amato,
          <string-name>
            <given-names>G. D.</given-names>
            <surname>Melo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Gutierrez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kirrane</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. E. L.</given-names>
            <surname>Gayo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Navigli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Neumaier</surname>
          </string-name>
          , et al.,
          <string-name>
            <surname>Knowledge</surname>
            <given-names>graphs</given-names>
          </string-name>
          ,
          <source>ACM Computing Surveys (Csur) 54</source>
          (
          <year>2021</year>
          )
          <fpage>1</fpage>
          -
          <lpage>37</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [52]
          <string-name>
            <given-names>S.</given-names>
            <surname>Ji</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Pan</surname>
          </string-name>
          , E. Cambria,
          <string-name>
            <given-names>P.</given-names>
            <surname>Marttinen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. S.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <article-title>A survey on knowledge graphs: Representation, acquisition, and applications</article-title>
          ,
          <source>IEEE transactions on neural networks and learning systems 33</source>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>