<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>metis: AI Agent Platform for Human-AI Interaction with Knowledge Graphs⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Linn Aung</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Aaron Eberhart</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter Haase</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nicolas Heist</string-name>
          <email>la@metaphacts.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Liubov Kovriguina</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>David Lamprecht</string-name>
          <email>dl@metaphacts.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nazanin Mashhaditafreshi</string-name>
          <email>nm@metaphacts.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>metaphacts GmbH</institution>
          ,
          <addr-line>Wieslocher Str. 38, 69190 Walldorf</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>We present metis, an AI agent platform that assists users with semantic modeling, search, and discovery across knowledge graphs. It combines the conversational capabilities of Large Language Models with the precision of semantic knowledge graphs, integrating seamlessly with the functionalities of the metaphactory platform, such as semantic search and visualization. metis delivers AI agents that provide generative power, semantic precision &amp; contextual, explainable insights.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;metis and metaphactory</kwd>
        <kwd>AI agents for semantic modeling</kwd>
        <kwd>agentic interfaces to knowledge graphs</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>2. metis Foundations and Architecture</title>
      <p>At the heart of metis lies a semantic model, the trust and context layer, that informs and scopes every
AI interaction. This model guides the agents, ensuring not only relevancy of the results but also
explainability and alignment with enterprise semantics.</p>
      <p>The generic architecture of metis 1, is structured into three main layers: the User Interface, the metis
agent layer, and the underlying metaphactory knowledge graph platform. The interaction process
begins when a user sends a Request through the User Interface, which can include a message and files in
natural language. This request is ingested by the Conversational Agent in the metis layer. The agent’s
Context Manager enriches the request by combining it with predefined instructions, dialogue history,
and the relevant semantic model, which is retrieved from metaphactory. This enriched prompt is
then passed to an LLM-powered Orchestrator that plans and executes a workflow to fulfill the request.
If necessary, the Orchestrator performs tool calls, sending specific input parameters to one or more
available tools. These tools are built on top of metaphactory APIs and interact with the data layer, which
is managed by a Repository Manager. The tool returns a Tool Result, consisting of either the requested
output or an error message. The Orchestrator may iteratively call tools until the necessary information
is retrieved and uses the results for Response Formulation. Finally, the resulting Response, which can be
a combination of natural language and various visualizations, such as tables or charts, is sent back to
the User Interface.</p>
      <p>Based on the generic architecture of the metis platform, diferent kind of agents can be configured.
We here focus on two types of agents equipped with a set of tools to support specific tasks: a Semantic
Modeling Assistant to collaboratively develop and refine ontologies 2, and a Search &amp; Discovery Agent
with access to the knowledge graph to resolve information needs of its users (see the architecture in
Appendix A).</p>
      <p>
        Semantic Modeling Assistant. The Semantic Modeling Assistant is designed to support users
in the collaborative development and refinement of ontologies. This agent guides users through a
structured modeling process based on the established Linked Open Terms (LOT) methodology3 [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] as
well as metaphacts’ Semantic Modeling Guidelines4. Instead of a rigid implementation of LOT within
the agent, metis leverages the entire metaphactory ecosystem to cover the ontology lifecycle.
      </p>
      <p>The initial steps of the LOT methodology, such as Ontology Requirements Specification , are driven by
metis. The assistant helps users formulate competency questions and define use cases, which are stored
as explicit modeling artifacts in the ontology. For the Ontology Implementation phase, metis assists
with the crucial sub-activities of reuse, conceptualization, and evaluation. It facilitates ontology reuse
by treating the metaphacts Ontology Repository5 as a primary resource hub. The agent distinguishes
between conceptualization and encoding, presenting modeling concepts like classes and properties
in a human-interpretable format while handling the formal encoding in the background. Ontology
evaluation is supported through both formal SPARQL-based checks and by applying the LLM’s reasoning
capabilities to critique modeling artifacts. The subsequent steps of ontology publication and maintenance
are handled by the robust, built-in functionalities of the metaphactory platform, such as automated
documentation generation and editorial workflows. The assistant’s architecture (Figure ??) features
pre-configured tools tightly integrated with metaphactory to create and persist these modeling artifacts.
Search &amp; Discovery Agent. The Search &amp; Discovery Agent enables users to perform intuitive
exploration and targeted search across knowledge graphs using natural language. This agent is highly
customizable; users can adapt its instructions, enable or disable specific tools, and tune LLM parameters
to fit their domain and use case. Its architecture (cf. Figure A) gives it access to three categories of tools:
• Structured Querying tools perform tasks like entity lookup and linking, as well as translation of
natural language to SPARQL for precise data aggregation and analysis.
• Unstructured Retrieval tools leverage metaphactory’s retrieval functionality to find information
in textual content that cannot be easily mapped to a graph pattern.
• Visualization tools render the retrieved data in various formats, such as tables, charts, and graphs,
to present it to the user in the most appropriate way.
2See the architecture in the platform documentation: https://help.metaphacts.com/resource/Help:SemanticModelingAssistant.
3https://lot.linkeddata.es/
4https://metaphacts.com/images/PDFs/metaphacts_Semantic_Modeling_Guidelines_2.0.pdf
5https://ontologies.metaphacts.com/</p>
      <p>Through this combination of tools, the agent supports multi-turn dialogues, allowing users to
iteratively refine their information needs and build on previous results to gain deep, context-aware
insights from their data.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Demo Scenarios for metis Agents</title>
      <sec id="sec-3-1">
        <title>Demo Scenario for the Semantic Modeling Assistant. The Semantic Modeling Assistant supports</title>
        <p>creating ontologies from scratch by guiding the user through the LOT methodology or refining and
evaluation existing ontologies. The following scenarios are demonstrated in our supplemental material
and partially shown in Figure 1:
1. From specification to implementation . In this scenario the ontology is developed from scratch.
metis guides the user through the ontology requirements specification, ontology formalization,
ontology implementation, ontology evaluation, and ontology publication activities. The user can
provided additional input by uploading files to support the ontology development process.
2. Reusing ontological resources. Ontology developers both reuse and re-engineer ontological
resources. metis identifies existing ontology resources that can be reused facilitating the integration,
adaptation, and extension of these resources to meet specific domain requirements.
3. Evaluate and restructure ontological resources. Ontology engineers systematically evaluate
and restructure (e.g., modularize, prune, extend, specialize) ontological resources. metis
incorporates requirements specification and built-in evaluation mechanisms to assist in assessing and
refining ontological resources.</p>
        <p>Figure 1 shows an example how metis facilitates the construction of a Personal Information ontology
by building a controlled vocabulary for the Topic class, which is derived from a spreadsheet of predefined
topics. During the modeling process metis ensures that each ontology element is explicitly defined
with appropriate metadata. In the example the Task class is enriched with a human-readable label and
description. Furthermore, metis tracks semantic relations and links the Task class to the Resource class
through the newly defined relation uses.</p>
      </sec>
      <sec id="sec-3-2">
        <title>Demo Scenario for the Search &amp; Discovery Agent. We have deployed the Search &amp; Discovery</title>
        <p>
          Agent across several knowledge graphs, covering diferent domains. Notably, SemOpenAlex 6 [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] and
the Dimensions Knowledge Graph 7 that are massive knowledge graphs with billion of RDF triples about
scientific publications and their associated entities, such as authors, institutions, publishers, funders
and sources, from all scientific domains. Additionally, the agent has been integrated with Wikidata [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ],
a comprehensive cross-domain knowledge graph, as well as the metaphacts Enterprise Information
Architecture Knowledge Graph, further demonstrating its versatility and applicability across diverse
knowledge bases. Figure 2 demonstrates how users interact with the Dimensions Knowledge Graph via
the agent’s natural language interface. Retrieved results are grounded in scientific metadata from the
underlying database and appropriately visualized, enabling users to efectively interpret and trust the
presented information.
        </p>
        <p>1. Natural language request. The user formulates an information need to retrieve the number of
publications addressing Alzheimer Disease from the year 2014 to 2024.
2. Transparent Multi-step Reasoning &amp; Tool Orchestration. The agent transparently uses a
sequence of tools to fulfill the user request. First the entity Alzheimer Disease is identified in the
6https://semopenalex.org
7https://metaphacts.com/solutions/products/dimensions-knowledge-graph
request and linked to its corresponding IRI in the Knowledge Graph. Then a structured SPARQL
query is generated and executed to retrieve the number of relevant publications.
3. Result visualization. After the data is retrieved by executing the SPARQL query the agent uses
its visualization capabilities to present the number of publications per year as a bar chart.
4. Explanations. Along with the presented results, explanations about how the result was generated
are provided to the user. Upon user request, detailed information about all tools used and the
entire data lineage are presented.
5. Multi turn dialogue. The user sends a follow-up request to rank all publications from the
previous result by their altmetric score8.
6. Context-aware answer. The agent retrieves the 10 Alzheimer Disease publication with the
highest altmetric score from 2014 to 2024 and presents the results in a table for clear visualization.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Declaration on Generative AI</title>
      <sec id="sec-4-1">
        <title>The authors have not employed any Generative AI tools.</title>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>B. Online Resources</title>
      <sec id="sec-5-1">
        <title>Demo videos of metis are available</title>
        <p>PLIseUuCw9-OAlmJxFDwNfmsRVPVGRHzNN.
https://metaphacts.com/solutions/products/metis.
via https://www.youtube.com/playlist?list=
Further resources on metis are available at</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>L.</given-names>
            <surname>Kovriguina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Teucher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Radyush</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Mouromtsev</surname>
          </string-name>
          , Sparqlgen:
          <article-title>One-shot prompt-based approach for sparql query generation</article-title>
          .,
          <source>in: SEMANTiCS (Posters &amp; Demos)</source>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J.</given-names>
            <surname>Sequeda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Allemang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Jacob</surname>
          </string-name>
          ,
          <article-title>A benchmark to understand the role of knowledge graphs on large language model's accuracy for question answering on enterprise sql databases</article-title>
          ,
          <source>in: Proceedings of the 7th Joint Workshop on Graph Data Management Experiences &amp; Systems (GRADES) and Network Data Analytics (NDA)</source>
          ,
          <year>2024</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Freund</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Dorsch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Schmid</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Wehr</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Harth</surname>
          </string-name>
          ,
          <article-title>Enriching rdf data with llm based named entity recognition and linking on embedded natural language annotations</article-title>
          ,
          <source>in: International Knowledge Graph and Semantic Web Conference</source>
          , Springer,
          <year>2024</year>
          , pp.
          <fpage>109</fpage>
          -
          <lpage>122</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Tsaneva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Dessì</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Osborne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Sabou</surname>
          </string-name>
          ,
          <article-title>Knowledge graph validation by integrating llms and human-in-the-</article-title>
          <string-name>
            <surname>loop</surname>
          </string-name>
          ,
          <source>Information Processing &amp; Management</source>
          <volume>62</volume>
          (
          <year>2025</year>
          )
          <fpage>104145</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D.</given-names>
            <surname>Edge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Trinh</surname>
          </string-name>
          , N. Cheng, J.
          <string-name>
            <surname>Bradley</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Chao</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Mody</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Truitt</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Metropolitansky</surname>
            ,
            <given-names>R. O.</given-names>
          </string-name>
          <string-name>
            <surname>Ness</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Larson</surname>
          </string-name>
          ,
          <article-title>From local to global: A graph rag approach to query-focused summarization</article-title>
          ,
          <source>arXiv preprint arXiv:2404.16130</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>N.</given-names>
            <surname>Fathallah</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. Das</surname>
            ,
            <given-names>S. D.</given-names>
          </string-name>
          <string-name>
            <surname>Giorgis</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Poltronieri</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Haase</surname>
          </string-name>
          , L. Kovriguina,
          <article-title>Neon-gpt: a large language model-powered pipeline for ontology learning</article-title>
          ,
          <source>in: European Semantic Web Conference</source>
          , Springer,
          <year>2024</year>
          , pp.
          <fpage>36</fpage>
          -
          <lpage>50</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>B.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. A.</given-names>
            <surname>Carriero</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Schreiberhuber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tsaneva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. S.</given-names>
            <surname>González</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kim</surname>
          </string-name>
          , J. de Berardinis,
          <article-title>Ontochat: a framework for conversational ontology engineering using language models</article-title>
          ,
          <source>in: European Semantic Web Conference</source>
          , Springer,
          <year>2024</year>
          , pp.
          <fpage>102</fpage>
          -
          <lpage>121</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>P.</given-names>
            <surname>Haase</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. M.</given-names>
            <surname>Herzig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kozlov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Nikolov</surname>
          </string-name>
          , J. Trame,
          <article-title>metaphactory: A platform for knowledge graph management</article-title>
          ,
          <source>Semantic Web</source>
          <volume>10</volume>
          (
          <year>2019</year>
          )
          <fpage>1109</fpage>
          -
          <lpage>1125</lpage>
          . URL: https://doi.org/10.3233/SW-190360. doi:
          <volume>10</volume>
          .3233/SW-190360.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Poveda-Villalón</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Fernández-Izquierdo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fernández-López</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>García-Castro</surname>
          </string-name>
          ,
          <article-title>Lot: An industrial oriented ontology engineering framework</article-title>
          ,
          <source>Engineering Applications of Artificial Intelligence</source>
          <volume>111</volume>
          (
          <year>2022</year>
          )
          <fpage>104755</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M.</given-names>
            <surname>Färber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lamprecht</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Krause</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Aung</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Haase</surname>
          </string-name>
          ,
          <article-title>Semopenalex: the scientific landscape in 26 billion rdf triples</article-title>
          , in: International Semantic Web Conference, Springer,
          <year>2023</year>
          , pp.
          <fpage>94</fpage>
          -
          <lpage>112</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>D.</given-names>
            <surname>Vrandečić</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Krötzsch</surname>
          </string-name>
          ,
          <article-title>Wikidata: a free collaborative knowledgebase</article-title>
          ,
          <source>Communications of the ACM</source>
          <volume>57</volume>
          (
          <year>2014</year>
          )
          <fpage>78</fpage>
          -
          <lpage>85</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>