<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Nara, Japan
$ antoine.dupuy@irit.fr (A. Dupuy); nathalie.aussenac-gilles@irit.fr (N. Aussenac-Gilles); christophe.baehr@meteo.fr
(C. Baehr); cassia.trojahn-dos-santos@univ-grenoble-alpes.fr (C. Trojahn)</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Interpreting User Needs with LLMs-based Conversational Agents and Knowledge Graphs: An Earth Observation Use Case</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Antoine Dupuy</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nathalie Aussenac-Gilles</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Christophe Baehr</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cassia Trojahn</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>CNRM UMR-3589, Université de Toulouse, Météo-France</institution>
          ,
          <addr-line>CNRS, Toulouse</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>IRIT, Université de Toulouse</institution>
          ,
          <addr-line>UT2, CNRS, Toulouse</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Univ. Grenoble Alpes</institution>
          ,
          <addr-line>Inria, CNRS, Grenoble INP, LIG, Grenoble</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0003</lpage>
      <abstract>
        <p>Open Science has broadened access to scientific datasets. However, identifying relevant ones to specific user needs remains challenging due to the volume, diversity and poor metadata. This paper proposes to integrate semantically enriched metadata with LLM agents to interpret user natural language queries, to extract user intent and to generate justifications for retrieved results. Experiments with diferent LLMs highlight the potential of such approach for scientific dataset retrieval.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Public and research institutions have increasingly promoted open access to scientific data, leading to a
proliferation of repositories ofering environmental, geospatial and sensor-based datasets. However,
openness alone does not guarantee usability. Many portals remain dificult to navigate and often
lack suficient context for users—especially non-experts—to assess data relevance [1]. A key barrier is
the limited availability and heterogeneity of metadata [2, 3]. In Earth Observation (EO), this issue is
amplified by the data’s multifaceted spatial, temporal, thematic and technical dimensions, emphasizing
the necessity for structured and semantically enriched metadata.</p>
      <p>Research in information retrieval, semantic web and natural language interfaces has improved
metadata access and query formulation [4, 5, 6, 7], with more recently LLMs enabling intuitive user
input interpretation and readable responses. Yet, many systems rely on rigid and heterogeneous
templates, assume domain knowledge expertise and lack explanations for results. Recent work calls for
better alignment of user intent with metadata and transparent, explainable retrieval [8, 9]. Close to
ours, in [10], ontologies are injected into language models — a strategy we adopt with the DATA-FW
[11] ontology – to structure notions such as datasets, users and data quality. In [12], combining a LLM
with an ontology yields more relevant results than relying solely on a traditional relational data source.</p>
      <p>
        This paper proposes to integrate LLM-based conversational agents with a domain-specific knowledge
graph to retrieve datasets. The approach builds upon previous work in several key respects: (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) a
knowledge graph to represent EO datasets metadata; (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) LLM-based agents to interpret user queries
and retrieve suitable datasets; (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) natural language interaction, including iterative query refinement,
enabling non-expert users to progressively articulate complex information needs; (
        <xref ref-type="bibr" rid="ref4">4</xref>
        ) a justification
mechanism, summarizing the search criteria used, the selected datasets and explaining why they are
relevant to the query. The performance of four LLMs (LLaMA 3.3 70B, Mistral Saba 24B, Deepseek-R1
and Qwen 32B) is evaluated across scenarios in the EO domain, using the Deepeval framework [13, 14]
with metrics such as relevance, precision, recall and faithfulness.
      </p>
      <p>The rest of the paper is organized as follows. Section 2 introduces the proposed architecture. Section 3
discusses evaluation and results. Section 4 concludes with a synthesis of our findings and outlines
perspectives for future work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. K2K: An approach Based on LLM Agents and KG</title>
      <sec id="sec-2-1">
        <title>2.1. Main steps of the approach</title>
        <p>K2K (Knowledge-To-Knowledge) addresses the barriers that (non-expert) users encounter when
exploring datasets from diferent domains. By integrating LLM-based agents, relevant contextual knowledge
and an ontology representing datasets metadata, it enables users to express their information needs in
natural language and receive results with justifications (Figure 1). The code and the evaluation datasets 1,
as well as the prototype2 are available online.</p>
        <p>
          As depicted in Figure 1, K2K processes user queries through a modular architecture that integrates
domain-specific data, semantic metadata and LLM-based agents modules. Each of these modules is
responsible for a well-delimited functional role. When a user submits a query (
          <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
          ), the selected domain
(e.g., Meteorology, Earth Observation) and the current conversation determine the contextual data
scope and the historical dialogue memory to be retrieved. The domain context (
          <xref ref-type="bibr" rid="ref3">3</xref>
          ) is filtered out using a
TF.IDF-based context selector that identifies the most relevant textual chunks to be used as grounding
information. Meanwhile, the SPARQL library selectively retrieves semantic triples (
          <xref ref-type="bibr" rid="ref4">4</xref>
          ) from the ontology
graph hosted on a SPARQL endpoint. This ontology layer is built on top of established vocabularies
such as DCAT3, DUV, RDF Data Cube, DOV and FOAF. It is structured as a network in the DATA-FW
ontology that define the concept of a dataset and its links to platforms, producers, structure, etc. and its
properties.
        </p>
        <p>
          The retrieval step includes filtering ontology entities to reduce redundancy (e.g., multilingual labels,
duplicate predicates) and reduce the number of tokens while preserving semantic richness. All selected
data, including the user query, context snippets, relevant triples (from the ontology) and dialogue
history are passed to the LLM-based agents (
          <xref ref-type="bibr" rid="ref5">5</xref>
          ) module and integrated into the agents prompts. This
module contains specialized agents: (i) the Query Analyst Agent analyzes the user request in relation to
the ontology and identifies missing or ambiguous criteria (
          <xref ref-type="bibr" rid="ref6">6</xref>
          ), (ii) the Data Identification Agent identifies
appropriate datasets by triggering web search through a tool-integrated LLM and parses the results to
extract download links and metadata (
          <xref ref-type="bibr" rid="ref7">7</xref>
          ) and (iii) the Response Agent synthesizes a fluent, structured
response from the output of the previous agents, ensuring readability and traceability (
          <xref ref-type="bibr" rid="ref8">8</xref>
          ). Agents
operate in isolation but communicate through JSON structures or structured outputs, enabling easier
parsing, traceability and logging. The final answer, together with links to any matched datasets, is
rendered in the user interface, completing the interaction cycle (
          <xref ref-type="bibr" rid="ref10 ref9">9, 10</xref>
          ). Moreover, this architecture
enables flexible domain adaptation by changing the corpus of specific-domain data while keeping the
agent logic reusable.
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Example of User interaction</title>
        <p>
          An illustrative interaction is presented in Figure 2. The user initiates the dialogue with the query:
“Which datasets are available to analyze CO2 concentration in France between 2015 and 2023?”. This
query triggers (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) the complete K2K pipeline: the system extracts ontology entities (classes, object
properties and data properties) associated with datasets,retrieves relevant contextual knowledge from
domain-specific resources (here Earth Observation) using tf.idf scores and generates a natural-language
response. The response comprises several elements. The system provides (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) the criteria extracted from
the user query, (
          <xref ref-type="bibr" rid="ref3">3</xref>
          ) the dataset identified as relevant and a justification explaining why this dataset was
selected. It also issues a disclaimer highlighting possible limitations of the dataset and requests further
details such as preferred file type (CSV, JSON, etc.), specific producers, or direct dataset sources. In the
1https://github.com/DupuyAntoine/K2K
2http://ec2-13-60-15-33.eu-north-1.compute.amazonaws.com:3000
example, the user responds by noting that (
          <xref ref-type="bibr" rid="ref4">4</xref>
          ) the dataset proposed by the system is not appropriate, as
it contains predictive rather than observational data for the requested period. The user further specifies
a preference for CSV format and for data provided by ADEME. In turn, the system (
          <xref ref-type="bibr" rid="ref5">5</xref>
          ) reformulates
the updated search criteria, acknowledges that the previous dataset did not meet the user’s needs and
apologises. It then continues by asking for clarification: whether the user is seeking CO 2 concentration
or CO2 emissions (
          <xref ref-type="bibr" rid="ref6">6</xref>
          ) and which temporal resolution (daily, weekly, monthly, yearly) is desired. At this
stage, the system reports (
          <xref ref-type="bibr" rid="ref7">7</xref>
          ) that no dataset matching all the refined constraints has been identified yet.
However, it emphasizes that it is keeping track of the user’s evolving requirements and (
          <xref ref-type="bibr" rid="ref8">8</xref>
          ) encourages
the user to provide further details that could guide the search more efectively.
        </p>
        <p>
          As illustrated in Figure 3, the user provides additional details: (
          <xref ref-type="bibr" rid="ref9">9</xref>
          ) he specifies that he is interested
in CO2 data (without clarifying whether this refers to emissions or concentrations), with yearly
averages, at the national resolution and focused on France. The system responds by (
          <xref ref-type="bibr" rid="ref10">10</xref>
          ) reformulating
and confirming the updated search criteria, following its established practice of maintaining explicit
traceability of user constraints. Since the distinction between emission and concentration remains
unresolved, the system (
          <xref ref-type="bibr" rid="ref11">11</xref>
          ) reiterates its request for clarification and encourages the user to provide
further details if possible. At this point, the system presents (
          <xref ref-type="bibr" rid="ref12">12</xref>
          ) four candidate datasets related to
CO2 concentration, each accompanied by an explanation of its relevance with respect to the specified
query. The response is concluded with (
          <xref ref-type="bibr" rid="ref13">13</xref>
          ) a renewed request for clarification on whether the user’s
interest lies in concentration or emission data, along with an invitation to refine the query to achieve
more accurate results. Throughout the interaction, (
          <xref ref-type="bibr" rid="ref14">14</xref>
          ) the datasets retrieved by the system are directly
accessible to the user via the file panel displayed on the right-hand side of the interface, ensuring
transparency and immediate usability.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Evaluation</title>
      <p>A benchmark of 8 natural language queries was created, each corresponding to a common theme
encountered in EO scenarios. All material is available online3. The evaluation adopts 4 metrics [15, 16]:
Answer Relevance (how well answers address the query), Contextual Precision and Recall (accuracy
and completeness of explanations against dataset metadata) and Faithfulness (consistency with factual
content). These were computed automatically using the Deepeval library4, which leverages a local
LLaMA 3 8B model running via Ollama to evaluate the generated responses. The 8 queries were
distributed across the models 200 times: 75 were allocated to LLaMA, 50 to Mistral and to Deepseek-R1
and 25 to Qwen. Figure4 compares these results.</p>
      <p>LLaMA 3.3 70B and DeepSeek-R1 70B are the most balanced models, performing well across all
metrics. Qwen excels in precision, while Mistral demonstrates strong faithfulness. It demonstrates that
LLM-based agents, particularly LLaMA 3.3 70B and DeepSeek-R1, are efective at interpreting complex
queries and generating grounded explanations in structured data contexts. An ablation study confirms
the ontology’s role in preserving contextual recall. The system justification mechanism improves
response transparency, as shown by consistently high faithfulness scores (as in Figure 2, step 3 and in
Figure 3, step 12).</p>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion and Future work</title>
      <p>This paper presented K2K, a system that combines LLM-based agents, ontologies, domain-specific
metadata context and a language interaction to improve transparency in data retrieval. Results
corrob3https://github.com/DupuyAntoine/K2K/tree/main/ai-agent/src/agents/evaluation
4https://github.com/confident-ai/deepeval, 28/04/2025
orate the positive impact of integrating ontologies into language-based interaction pipelines. Future
work will explore hybrid agent configurations, usage of embedding models to select relevant chunks of
domain-specific data context (instead of TF.IDF), improvement of the evaluation protocol and integration
of user-centered evaluations.</p>
    </sec>
    <sec id="sec-5">
      <title>Declaration on Generative AI</title>
      <p>In accordance with the CEUR-WS Policy on AI-Assisting Tools 5, the authors disclose the following:</p>
      <p>Tools and services used. During the preparation of this manuscript, we used the following
generative AI tools: ChatGPT (OpenAI).</p>
      <p>Contributions of each tool. ChatGPT was employed solely for *language polishing, paraphrasing
and stylistic refinement*. In no instance was it used to replace original scientific contributions, develop
arguments, draw conclusions or generate novel technical content.</p>
      <p>Human oversight and responsibility. All AI-generated suggestions were meticulously reviewed,
edited and validated by the authors. We take full responsibility for the final content, correctness and
integrity of the manuscript. We afirm that the core scientific insights, results and reasoning remain
entirely the work of the human authors.</p>
      <p>The contributions made by generative AI are acknowledged here in this dedicated section, in
compliance with the transparency and accountability requirements mandated by CEUR-WS.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Quarati</surname>
          </string-name>
          ,
          <article-title>Open government data: Usage trends and metadata quality 49 (</article-title>
          <year>2023</year>
          )
          <fpage>887</fpage>
          -
          <lpage>910</lpage>
          . URL: http: //journals.sagepub.com/doi/10.1177/01655515211027775. doi:
          <volume>10</volume>
          .1177/01655515211027775.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <article-title>Unlocking potential: Harnessing the power of metadata for discoverability and accessibility 43 (</article-title>
          <year>2023</year>
          )
          <fpage>249</fpage>
          -
          <lpage>256</lpage>
          . URL: https://www.medra.org/servlet/aliasResolver?alias=iospress&amp;doi=10.3233/ ISU-230202. doi:
          <volume>10</volume>
          .3233/ISU-230202.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>J.</given-names>
            <surname>Mi</surname>
          </string-name>
          ,
          <article-title>Making open resources discoverable: Collaborative approaches for enhanced access 8 (</article-title>
          <year>2024</year>
          )
          <fpage>17</fpage>
          -
          <lpage>29</lpage>
          . URL: https://journal.calaijol.org/index.php/ijol/article/view/350. doi:
          <volume>10</volume>
          .23974/ ijol.
          <year>2024</year>
          .
          <year>vol8</year>
          .
          <volume>4</volume>
          .350.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>G.</given-names>
            <surname>Guizzardi</surname>
          </string-name>
          , Ontology,
          <source>ontologies and the “i” of FAIR 2</source>
          (
          <year>2020</year>
          )
          <fpage>181</fpage>
          -
          <lpage>191</lpage>
          . URL: https://direct.mit. edu/dint/article/2/1-2/
          <fpage>181</fpage>
          -191/10008. doi:
          <volume>10</volume>
          .1162/dint_a_
          <fpage>00040</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>C.</given-names>
            <surname>Brewster</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Nouwt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Raaijmakers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Verhoosel</surname>
          </string-name>
          ,
          <article-title>Ontology-based access control for FAIR data 2 (</article-title>
          <year>2020</year>
          )
          <fpage>66</fpage>
          -
          <lpage>77</lpage>
          . URL: https://direct.mit.edu/dint/article/2/1-2/
          <fpage>66</fpage>
          -77/9993. doi:
          <volume>10</volume>
          .1162/dint_a_
          <fpage>00029</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>K.</given-names>
            <surname>Janowicz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Haller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. J.</given-names>
            <surname>Cox</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Le</given-names>
            <surname>Phuoc</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Lefrançois, SOSA: A lightweight ontology for sensors, observations, samples</article-title>
          ,
          <source>and actuators 56</source>
          (
          <year>2019</year>
          )
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          . URL: https://linkinghub.elsevier.com/ retrieve/pii/S1570826818300295. doi:
          <volume>10</volume>
          .1016/j.websem.
          <year>2018</year>
          .
          <volume>06</volume>
          .003.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Compton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Barnaghi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bermudez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>García-Castro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Corcho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Cox</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Graybeal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hauswirth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Henson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Herzog</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Janowicz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. D.</given-names>
            <surname>Kelsey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Le</given-names>
            <surname>Phuoc</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Lefort</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Leggieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Neuhaus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Nikolov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Page</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Passant</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sheth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Taylor</surname>
          </string-name>
          ,
          <article-title>The ssn ontology of the w3c semantic sensor network incubator group</article-title>
          ,
          <source>Journal of Web Semantics</source>
          <volume>17</volume>
          (
          <year>2012</year>
          )
          <fpage>25</fpage>
          -
          <lpage>32</lpage>
          . URL: https://www.sciencedirect.com/science/article/pii/S1570826812000571. doi:https://doi.org/10.1016/j.websem.
          <year>2012</year>
          .
          <volume>05</volume>
          .003.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>V.</given-names>
            <surname>Armant</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Vargas-Rojas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Agazzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-C.</given-names>
            <surname>Desconnets</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Mougenot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Beretta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Debard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Symeonidou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mouakher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Guérin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Catry</surname>
          </string-name>
          , E. Roux,
          <article-title>Leveraging Knowledge Graphs for Earth System Dataset Discovery</article-title>
          , in: G. Demartini,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hose</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Acosta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Palmonari</surname>
          </string-name>
          , G. Cheng, H.
          <string-name>
            <surname>Skaf-Molli</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Ferranti</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Hernandez</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          . Hogan (Eds.),
          <source>Lecture Notes in Computer Science</source>
          , volume Volume 15233
          <source>LNCS of Lecture Notes in Computer Science</source>
          , Springer, Baltimore (Maryland),
          <source>United States</source>
          ,
          <year>2024</year>
          , pp.
          <fpage>271</fpage>
          -
          <lpage>288</lpage>
          . URL: https://hal.science/hal-04823866. doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>031</fpage>
          -77847-6\_
          <fpage>15</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Cheng</surname>
          </string-name>
          , C. Zhang,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Meng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Yin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <article-title>Exploring large language model based intelligent agents: Definitions, methods</article-title>
          , and prospects,
          <year>2024</year>
          . URL: https://arxiv.org/abs/2401.03428. doi:
          <volume>10</volume>
          .48550/ARXIV.2401.03428, version Number:
          <volume>1</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>F.</given-names>
            <surname>Ronzano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Nanavati</surname>
          </string-name>
          ,
          <article-title>Towards ontology-enhanced representation learning for large language models</article-title>
          ,
          <year>2024</year>
          . URL: https://arxiv.org/abs/2405.20527. doi:
          <volume>10</volume>
          .48550/ARXIV.2405.20527, version Number:
          <volume>1</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dupuy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Trojahn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Aussenac-Gilles</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Baehr</surname>
          </string-name>
          ,
          <article-title>Data-fw: An ontology network for annotating open datasets</article-title>
          ,
          <source>ACM</source>
          ,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>D.</given-names>
            <surname>Allemang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Sequeda</surname>
          </string-name>
          ,
          <article-title>Increasing the LLM accuracy for question answering: Ontologies to the rescue</article-title>
          !,
          <year>2024</year>
          . URL: http://arxiv.org/abs/2405.11706. arXiv:
          <volume>2405</volume>
          .11706 [cs].
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>N.</given-names>
            <surname>Thakur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Reimers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rücklé</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Gurevych</surname>
          </string-name>
          ,
          <article-title>Beir: A heterogeneous benchmark for zero-shot evaluation of information retrieval models</article-title>
          ,
          <source>in: NeurIPS Datasets and Benchmarks Track (Round 2)</source>
          ,
          <year>2021</year>
          . URL: https://arxiv.org/abs/2104.08663.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Maistro</surname>
          </string-name>
          ,
          <source>Evaluation of IR Systems, Technical Report</source>
          , ACM / SIGIR / Tutorial / survey,
          <year>2024</year>
          . URL: https://doi.org/10.1145/nnnnnnn.nnnnnnn.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Xiong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Jia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Bi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Dai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Retrieval-augmented generation for large language models: A survey</article-title>
          ,
          <year>2023</year>
          . URL: https://arxiv.org/abs/2312.10997. doi:
          <volume>10</volume>
          .48550/ARXIV.2312.10997, version Number:
          <volume>5</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Hu</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y.</surname>
          </string-name>
          <article-title>Lu, RAG and RAU: A survey on retrieval-augmented language model in natural language processing</article-title>
          ,
          <year>2024</year>
          . URL: https://arxiv.org/abs/2404.19543. doi:
          <volume>10</volume>
          .48550/ARXIV.2404.19543, version Number:
          <volume>1</volume>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>