<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Converging Hypermedia, Protocols, and Knowledge Architectures: A New Paradigm for Grounded and Interoperable LLM-Agent Systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Maksim Ilin</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Transport and Telecommunication Institute, Engineering Faculty</institution>
          ,
          <addr-line>Riga</addr-line>
          ,
          <country country="LV">Latvia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The evolution of LLM-powered agents has led to a distinction between single-purpose AI Agents and collaborative Agentic AI systems. While powerful, both face persistent challenges in grounding, interoperability, and governance when operating in open Web environments. This paper argues that a new, more robust paradigm is emerging from a deep convergence of architectural principles and modern standards. We posit that Hypermedia Multi-Agent Systems (HMAS) provide the foundational philosophy for Web-native agency, where the concepts of W3C Web of Things (WoT) "Things" and Model Context Protocol (MCP) "Tools" are unified as complementary specializations of hypermedia-described Web resources with discoverable afordances. This unified view enables the design of advanced Agentic AI systems-composed of autonomous AI Agents grounded in foundational definitions of agency-that can reliably perceive and act upon a rich environmental fabric of physical (WoT) and semantic (DKG) resources. By analyzing this convergence and presenting an illustrative agentic architecture, we demonstrate how this paradigm ofers enhanced grounding, true interoperability, and clearer pathways to trustworthy governance, providing a structured approach for developing the next generation of scalable and context-aware autonomous systems.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Hypermedia Multi-Agent Systems</kwd>
        <kwd>Model Context Protocol</kwd>
        <kwd>LLM Agents</kwd>
        <kwd>Web of Things</kwd>
        <kwd>Distributed Knowledge Graphs</kwd>
        <kwd>Agent Architectures</kwd>
        <kwd>Interoperability</kwd>
        <kwd>Grounding</kwd>
        <kwd>Semantic Web</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        The proliferation of powerful Large Language Models (LLMs) has spurred the development of "Agentic
AI", where LLMs serve as the cognitive core for autonomous agents. Frameworks such as AutoGen,
CrewAI, and Magentic-One [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] demonstrate this potential by enabling multi-agent teams to tackle
complex tasks through orchestrated collaboration. However, as these systems aspire to operate beyond
predefined teams and engage with the dynamic, open complexity of the World Wide Web, significant
challenges persist. These include ensuring robust factual grounding for LLM outputs, achieving true
interoperability across diverse systems and data silos, and establishing trustworthy governance and
operational safety [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Many current agent orchestration platforms, while adept at managing internal
team dynamics, often lack inherent mechanisms for dynamic discovery and interaction within truly
open, decentralized environments, a core tenet of Web-native agency.
      </p>
      <p>The Hypermedia Multi-Agent Systems (HMAS) paradigm, championed by the HyperAgents
community [3], ofers a visionary blueprint for agents that perceive, reason, and act by navigating a Web
structured as a discoverable hypermedia environment. This paper posits that a new, more potent and
practically realizable paradigm for autonomous agents, especially LLM-based Agents, is emerging from
the synergistic convergence of:
• Advanced Web architectures, specifically the Web of Things [ 5] for grounding agents in physical
reality, and Distributed Knowledge Graphs (DKGs) – enhanced with hybrid vector-graph
capabilities [6] and techniques like Denoising Difused Embeddings [ 7] and Interlocked Hypergraphs
[8] – for providing rich semantic context and verifiable knowledge.</p>
      <p>This paper argues that the path forward lies not in viewing these components as separate pillars
to be integrated, but in recognizing a deeper convergence. We propose a unified paradigm where
HMAS provides the foundational philosophy for Web-native agency, and where WoT "Things" and MCP
"Tools" are understood as complementary specializations of a single concept: the hypermedia-described
Web resource. We detail this unified view, define our terminology for agents and services based on
foundational literature, and present an illustrative Agentic AI system architecture. We demonstrate how
this converged paradigm ofers inherent mechanisms for enhanced LLM grounding, interoperability,
and governance. Finally, we identify key research directions, aiming to contribute to the workshop’s
objective of building conceptual and technological bridges between the AI and Web communities.</p>
    </sec>
    <sec id="sec-2">
      <title>2. A Unified Paradigm for Web-Native Agency</title>
      <p>The proposed paradigm is best understood as a unified architectural vision grounded in the principles
of the Web itself. This vision is defined by its core philosophy (HMAS), its unified view of resources
(Things and Tools), and the rich environments these resources inhabit.</p>
      <sec id="sec-2-1">
        <title>2.1. Defining Agents, Services, and Systems</title>
        <p>To clarify our discussion, we adopt a precise taxonomy. We distinguish a simple program from an
autonomous agent based on the foundational definition by Franklin and Graesser [ 9], which requires an
agent to be situated, sense and act on its environment over time in pursuit of its own agenda, and afect
its future sensing. Building on this, we adopt the modern, LLM-era taxonomy of Sapkota et al. [10]:
• Service or Tool: A reactive program that exposes capabilities (afordances) via an interface (e.g.,
an MCP Server, a WoT device API). It lacks its own overarching agenda and temporal continuity.
• AI Agent: A single, autonomous, tool-using entity that satisfies the Franklin &amp; Graesser criteria,
often with an LLM as its cognitive core.
• Agentic AI System: A collaborative multi-agent system, composed of specialized AI Agents,
working under an orchestration layer to achieve complex, high-level goals.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. HMAS: A Foundational Philosophy for Web-Native Agency</title>
        <p>The HMAS paradigm [3] provides the architectural philosophy. It envisions agents as first-class
citizens of the Web, operating within a distributed hypermedia environment. Key principles include
Environmental Interaction, HATEOAS (Hypermedia as the Engine of Application State), and Dynamic
Discovery of resources and their afordances, allowing agents to "escape the streetlight efect" of only
interacting with pre-known entities [11].</p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. A Unified View of WoT Things and MCP Tools</title>
        <p>A core insight is that WoT "Things" and MCP "Tools" are not fundamentally diferent categories; they are
complementary specializations of the same underlying concept: a Hypermedia-Described Web Resource
with Discoverable Afordances. As Table 1 illustrates, both define interaction models, description formats,
and protocols for agents to perceive and act. Their primary divergence lies in their target domains and
non-functional requirements: WoT is optimized for resources tied to the physical world, while MCP is
optimized for interaction with LLM agents, emphasizing rich context management. A prime example of
this convergence is a WoT-enabled robotic lab controlled via a standardized MCP interface, where a
physical "Thing" exposes its capabilities as a "Tool".</p>
      </sec>
      <sec id="sec-2-4">
        <title>2.4. Rich Web Environments: Grounding in Physical and Semantic Reality</title>
        <p>This unified view of resources is grounded in rich environments that provide context and data:
• Web of Things (WoT): Standardized by the W3C, WoT extends the hypermedia fabric to the
physical world, allowing agents to discover, understand, and interact with sensors and actuators
in a standardized way [12]. Agents can even reason about incomplete WoT descriptions using
existential reasoning [5].
• Distributed Knowledge Graphs (DKGs): Built on Semantic Web technologies, DKGs serve
as the semantic backbone, providing structured knowledge. Modern DKGs leverage Hybrid
Vector-Graph Databases [6] and are enhanced with techniques like DDE [7] and Interlocked
Hypergraphs [8] for completeness and can be made verifiable using blockchain [13].</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Illustrative Architecture: An Agentic Knowledge Retrieval System</title>
      <p>To illustrate the converged paradigm, we conceptualize an "Agentic Knowledge Retrieval System,"
inspired by author’s ongoing development eforts like Knowledge Retrieval Agentic System and OpenAlex
Explorer MCP Server projects. This system is composed of an ensemble of specialized AI Agents, each
satisfying the criteria for autonomous agency as defined by Franklin and Graesser [ 9]. This distinguishes
them from the reactive services they utilize, such as the “OpenAlex MCP Server”. The architecture,
shown in Figure 1, can be implemented using frameworks such as CrewAI and LangGraph.
Key Components &amp; Workflow:
1. Orchestration Layer manages the overall process, delegating tasks to the specialized AI Agents.
2. External Knowledge Access (MCP): The SearchAgent, a specialized AI Agent, receives a
research query. It exhibits autonomy in executing its search plan and is temporally continuous
for the duration of its task. It acts by invoking the “OpenAlex MCP Server” service via MCP.
3. Knowledge Ingestion &amp; Representation (DKG &amp; Vector DB): The IngestionAgent, upon
receiving data, autonomously executes its goal of processing and storing it. It extracts metadata
into a Neo4j DKG and text embeddings into a Qdrant Vector DB.
4. Grounded Synthesis (LLM &amp; RAG): The SynthesisAgent, a cognitive AI Agent with an LLM
core, pursues its goal of generating a response. It exhibits autonomy by first retrieving context
via RAG from the DKG and Vector DB, and then synthesizing its grounded answer.
• Querying the Neo4j DKG for structured information (e.g., influential authors, citation
patterns).
• Performing semantic similarity searches in Qdrant for relevant text passages. (Access to</p>
      <p>Neo4j/Qdrant further could be itself realized via internal MCP tools).
5. State Management: Each AI Agent maintains its own task-specific state, while the orchestrator
manages the overall workflow state, demonstrating a multi-agent system with social ability
through coordination.</p>
      <p>This system, by leveraging standardized protocols for external interactions and rich internal
knowledge structures, exemplifies how agents can perform complex, knowledge-intensive tasks with improved
grounding and modularity.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Key Benefits &amp; Addressed Challenges of the Convergence</title>
      <p>The proposed converged paradigm ofers substantial benefits, directly addressing persistent challenges
in developing advanced autonomous agents:
• Significantly Enhanced LLM Grounding: This is arguably the most critical benefit. LLM-based
Agents often struggle with factual accuracy and "hallucinations" [14, 15]. The converged paradigm
provides multiple grounding layers: real-time data from WoT, structured factual knowledge
and complex relationships from DKGs (further enriched by DDE for hypergraph sparsity) and
Interlocked Hypergraphs (for cross-domain knowledge fusion), and verifiable interactions with
specialized tools via Model Context Protocol (MCP). This rich, multi-faceted context drastically
improves the reliability and trustworthiness of LLM agent outputs.
• Improved Interoperability and Dynamic Coordination: Current agent ecosystems are often
fragmented. The convergence fosters interoperability at multiple levels:
– HMAS principles encourage discoverable services and common Web interaction patterns.
– MCP standardizes agent-tool interfaces.
– Protocols like Agent-to-Agent (A2A) aim to standardize inter-agent dialogues [16].
– WoT standardizes device interaction.
– DKGs with shared ontologies provide semantic interoperability. This multi-layered approach
helps mitigate protocol bridging challenges, enabling flexible, dynamic coordination.
• Pathways to Robust Trust, Security, and Governance:
– The transparency of HMAS interactions with structured MCP logs, enables comprehensive
hypermedia-enabled audit trails, supporting Explainable Agent Design (XAI) [17].
– Secure protocols (MCP with ETDI Framework [18], A2A with DIDs) and verifiable data
layers (DKGs with blockchain-anchored provenance like OriginTrail DKG, or coordination
secured by BlockAgents [19]) address trust and security concerns.</p>
      <p>– This provides a stronger foundation for efective AI governance and building user trust.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion &amp; Future Directions</title>
      <p>The convergence of Hypermedia MAS principles with modern interaction protocols (like MCP) and
advanced Web architectures (WoT, enhanced DKGs) represents a significant evolutionary step towards
more capable, grounded, interoperable, and trustworthy autonomous Web agents. This paradigm shift,
illustrated conceptually by our agentic knowledge retrieval architecture, ofers a structured approach
to leveraging the strengths of LLMs while mitigating their weaknesses through deep integration with
verifiable external knowledge and standardized interaction mechanisms.</p>
      <p>This vision’s realization depends on progress in several key areas. Future research directions include:
1. Comprehensive Standardization: Accelerate eforts within bodies like the W3C WebAgents
Community Group to develop and promote standards for HMAS afordance descriptions,
MCPA2A/ANP interoperability profiles, semantic alignment for WoT-DKG integration, and federated
DKG protocols.
2. Scalable and Secure Converged Architectures: Research into scalable, secure implementations
of all components, including robust protocol bridging fabrics, federated DKG solutions (e.g.,
building on FedMSGL [20]), and zero-trust models spanning the entire converged stack.
3. Deep WoT Integration for Physical Agency: Expand research into practical applications of
WoT within HMAS-MCP architectures, enabling agents to robustly perceive, reason about, and
act upon the physical world, including addressing real-time safety and overall energy eficiency.
4. Advanced HAI and Governance for Converged Systems: Develop intuitive HAI (Human-AI
Interaction) methods and comprehensive governance frameworks (including stateful monitoring
and XAI) tailored to the complexities of these converged, LLM-driven agent ecosystems.</p>
      <p>Addressing these directions will pave the way for a new generation of Web agents capable of realizing
many aspects of the original Semantic Web vision [21], truly transforming the Web into a collaborative
space for both humans and intelligent machines.</p>
    </sec>
    <sec id="sec-6">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, author used several generative AI tools. To support the initial
research phase, Perplexity AI and Google Gemini were employed for Generate literature review and
Content enhancement by helping to identify and synthesize concepts from source literature for the
author’s review. To validate the conceptual framework, tools including GitHub Copilot and Qoder
provided Content enhancement by assisting with code generation and quality improvement (linting) for
software prototypes. Additionally, Google Gemini 2.5 Pro was used to Paraphrase and reword source
material into private research notes. For the manuscript itself, Apple Writing assisted with the author’s
original text by performing Grammar and spelling check and ofering suggestions to Improve writing
style and Paraphrase and reword for clarity. The author confirms that no manuscript text was drafted
or generated by AI and takes full responsibility for the originality and conclusions of this publication.
[3] A. Ciortea, S. Mayer, F. Gandon, O. Boissier, A. Ricci, A. Zimmermann, A Decade in Hindsight:
The Missing Bridge Between Multi-Agent Systems and the World Wide Web, in: Proceedings of
the 18th International Conference on Autonomous Agents and Multiagent Systems (AAMAS ’19),
2019, p. 5. Blue Sky Ideas Track.
[4] Anthropic, Model Context Protocol Documentation, https://modelcontextprotocol.io, 2024.
Accessed: 2025-01-01.
[5] V. Charpenay, S. Käbisch, H. Kosch, Characterizing Web of Things Interactions with Existential</p>
      <p>Reasoning, Semantic Web ((n.d.)).
[6] S. Liu, Z. Zeng, L. Chen, A. Ainihaer, A. Ramasami, S. Chen, Y. Xu, M. Wu, J. Wang, TigerVector:
Supporting Vector Search in Graph Databases for Advanced RAGs, arXiv preprint arXiv:2501.11216
(2025).
[7] S. Wu, J. Yang, G. Xu, J. Zhu, Denoising Difused Embeddings: a Generative Approach for</p>
      <p>Hypergraphs, arXiv preprint arXiv:2501.01541 (2025).
[8] K. Boughanmi, A. Ansari, Y. Li, EXPRESS: Modeling Categorized Consumer Collections with</p>
      <p>Interlocked Hypergraph Neural Networks, Journal of Marketing Research ((n.d.)).
[9] S. Franklin, A. Graesser, Is it an Agent, or just a Program?: A Taxonomy for Autonomous Agents,
in: International workshop on agent theories, architectures, and languages, Springer, 1996, pp.
21–35.
[10] R. Sapkota, K. I. Roumeliotis, M. Karkee, Ai agents vs. agentic ai: A conceptual taxonomy,
applications and challenges, arXiv preprint arXiv:2505.10468 (2025).
[11] S. Bienz, A. Ciortea, S. Mayer, F. Gandon, O. Corby, Escaping the streetlight efect: Semantic
hypermedia search enhances autonomous behavior in the web of things, in: Proceedings of the
9th International Conference on the Internet of Things, 2019, pp. 1–8. doi:10.1145/3365871.
3365901.
[12] J. A. Martins, A. Mazayev, N. Correia, Hypermedia APIS for the Web of Things, IEEE Access 5
(2017) 20058–20067. doi:10.1109/ACCESS.2017.2755259.
[13] M. Pawlak, M. Stolarczyk, A. Poniszewska-Marańda, Food Supply Chain Management with
Blockchain Technology in Implementation of Hyperledger Fabric, in: 2025 IEEE 22nd International
Conference on Software Architecture Companion (ICSA-C), 2025, pp. 261–270.
[14] C. Agostinho, Explainability as the Key Ingredient for AI Adoption in Industry 5.0 Settings,</p>
      <p>Frontiers in Artificial Intelligence 6 (2023). doi:10.3389/frai.2023.1264372.
[15] M. R. Smeets, P. G. Roetzel, R. J. Ostendorf, AI and its Opportunities for Decision-Making in
Organizations: A Systematic Review of the Influencing Factors on the Intention to use AI, Die
Unternehmung 75 (2021) 432–460. doi:10.5771/0042-059X-2021-3-432.
[16] A. Ehtesham, A. Singh, G. K. Gupta, S. Kumar, A survey of agent interoperability protocols: Model
context protocol (mcp), agent communication protocol (acp), agent-to-agent protocol (a2a), and
agent network protocol (anp), arXiv preprint arXiv:2505.02279 (2025).
[17] S. Rodriguez, J. Thangarajah, A. Davey, Design patterns for explainable agents (XAg), in:
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024,
pp. 1621–1629.
[18] V. S. Narajala, I. Habler, Enterprise-grade security for the model context protocol (MCP):
Frameworks and mitigation strategies, arXiv preprint arXiv:2504.08623 (2025).
[19] B. Chen, G. Li, X. Lin, Z. Wang, J. Li, BlockAgents: Towards Byzantine-Robust LLM-Based
Multi-Agent Coordination via Blockchain, in: Proceedings of the ACM Turing Award Celebration
Conference-China 2024, 2024, pp. 187–192. doi:10.1145/3674399.3674445.
[20] D. Li, Z. Yang, S. Xie, FedMSGL: A Self-Expressive Hypergraph Based Federated Multi-View
Learning, in: Proceedings of the AAAI Conference on Artificial Intelligence, 2025, pp. 18244–
18252.
[21] T. Berners-Lee, J. Hendler, O. Lassila, The Semantic Web, in: Linking the World’s Information:
Essays on Tim Berners-Lee’s Invention of the World Wide Web, 2023, pp. 91–103. Reprint of
original 2001 article.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Joshi</surname>
          </string-name>
          ,
          <article-title>Review of autonomous systems and collaborative AI agent frameworks (</article-title>
          <year>2025</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Ferrag</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Tihanyi</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Debbah, From LLM reasoning to autonomous AI agents: A comprehensive review</article-title>
          ,
          <source>arXiv preprint arXiv:2504.19678</source>
          (
          <year>2025</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>