<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <article-id pub-id-type="doi">10.5281/zenodo.16704302</article-id>
      <title-group>
        <article-title>Ontology for the MQT T Message Protocol</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Niklas Doerner</string-name>
          <email>doernern@hsu-hh.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maria Maleshkova</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Professorship of Data Engineering, Helmut Schmidt University</institution>
          ,
          <addr-line>Holstenhofweg 85, 22043 Hamburg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <fpage>2</fpage>
      <lpage>6</lpage>
      <abstract>
        <p>In the Web of Things community, the SSN/SOSA ontology, a W3C recommendation, has established itself as a foundational framework for representing semantic sensor networks, modeling sensors, observations, and actuations. However, an explicit representation of message protocols is missing, which limits semantic interoperability in machine-to-machine communication scenarios. To close this gap, we propose MQTT4SSN as an extension to SSN/SOSA that semantically models the MQTT protocol. Furthermore, the ontology addresses use cases without SSN/SOSA implementations by exclusively describing the MQTT messaging protocol. MQTT4SSN represents MQTT entities such as brokers, clients, control packets, topics, and payload metadata, linking them to SSN/SOSA concepts to enable end-to-end traceability between sensing semantics and communication semantics. The ontology captures heterogeneous payload formats, encodings, and transport metadata, enabling machine-interpretable description and integration of transmitted content. MQTT4SSN represents an extensible and reusable resource that is accessible according to the FAIR principles and documented as an Ontology Specification Draft. Ontology: https://doernern.github.io/MQTT4SSNOntology/MQTT4SSN.owl Documentation: https://doernern.github.io/MQTT4SSNOntology/documentation/index-en.html WebVOWL: https://doernern.github.io/MQTT4SSNOntology/documentation/webvowl/index.html OOPS!: https://doernern.github.io/MQTT4SSNOntology/documentation/OOPSevaluation/oopsEval.html GitHub: https://github.com/doernern/MQTT4SSNOntology License: CC BY-NC-SA 4.0 In the Web of Things (WoT) community, the SSN/SOSA ontology, a W3C recommendation, has established itself as a foundational framework for representing semantic sensor networks. However, while SSN/SOSA considers modeling physical systems, sensor deployments, and observational and actuation processes, it does not consider machine-to-machine (M2M) communication protocols [1]. In particular, Message Queuing Telemetry Transport (MQTT) is a widely adopted message protocol in industrial and Internet of Things (IoT) settings for its lightweight, publish-subscribe-based architecture [2, 3]. The lack of the transport component motivates extending the SSN/SOSA framework with MQTT's main elements, enabling seamless alignment between observation and actuation process descriptions with transport semantics. A further challenge arises from the various payload formats and encodings used in MQTT message protocols. Payloads may contain plain text, comma-separated values, or structured JSON as format, possibly encoded in various character sets [4, 5]. These heterogeneous payloads are not clearly described to semantic systems unless an explicit annotation with metadata that describes their format and structure is missing. Therefore, there is a need to model such payload metadata semantically to enable parsing, interpretation, and integration of the transmitted content. Moreover, MQTT ∗Corresponding author. †These authors contributed equally. https://www.hsu-hh.de/dataeng/en/team/maria-maleshkova/ (M. Maleshkova*) 0009-0004-0088-8633 (N. Doerner*); 0000-0003-3458-4748 (M. Maleshkova*) Proceedings ceur-ws.org</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>https://www.hsu-hh.de/dataeng/en/team/niklas-doerner/ (N. Doerner*);
CEUR
Workshop</p>
      <p>
        ISSN1613-0073
topics and payload structures are generally syntactically defined and application-specific. However, in
the context of semantic sensor networks, topic names and data structures could be derived from the
semantics of the observation [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ]. For instance, hierarchical topic paths could describe the associated
room, sensing device, sensor, observed property, and feature of interest. Likewise, the structure and
arrangement of payload content, such as the column order in CSV or key nesting JSON syntax, could
describe the made observation collection, for instance, with the associated room, sensor, and feature of
interest. The hierarchy of MQTT topic names and payload structures raises the question of whether
there is a relationship to the entities defined by SSN/SOSA.
      </p>
      <p>To address these challenges, we initially defined the following three research questions (RQs):
• RQ1: How can the MQTT message protocol be semantically represented in a way that aligns
with an existing sensor network ontology, such as W3C SSN/SOSA?
• RQ2: How can heterogeneous payload formats and character encodings be represented in a
semantic model to enable machine-interpretable integration of the transmitted data?
• RQ3: To what extent can semantic MQTT topic names and structured payload representations
be derived from observation-related information, such as the observed feature of interest, the
used sensor, and the observed property?</p>
      <p>
        The OASIS standard MQTT (Message Queuing Telemetry Transport) is a messaging protocol designed
for communication in M2M and IoT contexts [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. It operates on a publish/subscribe model with four
main components: clients, which act as publishers and/or subscribers, and a central broker that manages
message routing. Clients publish messages to topics and subscribe to topic filters, which can include
wildcards to capture related topic streams dynamically. For communication, MQTT uses control packets,
which contain a fixed header in all control packets, and depending on the packet type, an optional
variable header and payload. The MQTT specification defines fourteen diferent types of control packets,
managing actions such as connection establishment (CONNECT, CONNACK), publishing (PUBLISH, PUBACK),
and subscription (SUBSCRIBE, SUBACK). The payload of the PUBLISH control packet contains application
data, such as sensor data, and can vary in format and encoding. MQTT delivers Application Messages
according to three diferent Quality of Service ( QoS) levels to ensure reliability in message delivery.
There are additional header flags for transmission, such as the RETAIN and DUP flags. The MQTT
message protocol typically runs over the TCP/IP transport protocol.
      </p>
      <p>
        Our MQTT4SSN ontology focuses on MQTT version 3.1.1[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] to ensure compatibility with established
systems and model the foundation. However, we designed the model to remain extensible for future
integration of MQTT 5.0 features. The newer MQTT Version 5.0[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] standard extends the protocol with
richer metadata, reason codes, user properties, and improved error handling, enhancing flexibility and
scalability for modern IoT applications [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. There exists a draft from the OASIS group, the MQTT For
Sensor Networks (MQTT-SN)[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], which adapts MQTT for non-TCP/IP networks such as UDP, Zigbee,
or Bluetooth. It simplifies message headers and introduces topic IDs to support constrained devices in
wireless sensor networks (WSNs).
      </p>
      <p>The remainder of this paper includes the following: Section 2 provides an overview of related
ontologies and data models relevant to MQTT, IoT, and WoT. Section 3 explains the ontology engineering
process of MQTT4SSN, covering its core concepts 3.1, semantic relations 3.2, and applied design
patterns 3.3. Section 4 outlines the evaluation methodology and results, including ontology verification 4.1
and validation 4.2 based on predefined competency questions. Finally, Section 5 summarizes the paper
and highlights possible directions for future research.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Data Models</title>
      <p>
        In the landscape of semantic data modeling for IoT and WoT, numerous ontologies such as the W3C
SSN/SOSA ontology have been developed that describe sensing and actuation devices, observation
procedures, and environments. However, despite the central role of communication protocols in IoT
infrastructures, very few of these ontologies explicitly address transport mechanisms such as MQTT.
To the best of our knowledge, the only dedicated efort toward semantic representation of MQTT is the
MQTT to RDF draft ontology defined in the W3C WoT Binding Templates. This fragmentation has
created a gap between the semantic modeling of observation processes and the underlying transport
layers that enable real-time M2M communication [
        <xref ref-type="bibr" rid="ref10 ref6">10, 6</xref>
        ].
      </p>
      <p>
        The W3C SSN/SOSA ontology is the most widely adopted standard for modeling sensors,
observations, actuators, and sampling activities in semantic IoT contexts. Originally published as a W3C
Recommendation in 2017[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], SOSA serves as a lightweight core, while SSN adds richer axioms for
more expressiveness. A new working draft further extends the model and enriches its interoperability
across domains [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Despite its complexity, SSN/SOSA does not specify how data is transmitted or
how protocols like MQTT interact with observation streams, leaving a crucial semantic gap.
      </p>
      <p>
        The WoT MQTT to RDF ontology draft, developed as part of the W3C WoT Binding Templates,
provides, to the best of our knowledge, the only formal semantic representation of MQTT, especially
the description of the control packet structure, topic filters, and the network entities such as broker
and client [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. This draft models protocol-level aspects of MQTT but neglects the modeling of sensing
systems, such as SSN/SOSA. Consequently, it cannot directly align MQTT messages with sensor
observations, features of interest, or platforms.
      </p>
      <p>
        The QUDT (Quantities, Units, Dimensions, and Data Types) vocabulary is another essential
component for semantic IoT modeling. Frequently combined with SSN/SOSA, QUDT provides a rigorous
and extensible vocabulary for expressing physical units, scales, and quantity values [
        <xref ref-type="bibr" rid="ref11 ref14">11, 14</xref>
        ]. It enables
precise representation of observation results (e.g., temperature in degree Celsius, length in millimeters),
which is vital for machine interpretation and data integration [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
      </p>
      <p>
        Among the broader IoT ontologies, several have attempted to provide domain models for smart
environments and device interactions:
• SAREF (Smart Applications REFerence ontology), maintained by ETSI, focuses on linking diferent
domains. Its modular structure includes extensions for smart cities, buildings, and industry,
and it emphasizes interoperability across domains [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. However, SAREF does not describe
communication transport, relying instead on linking to existing data representations.
• FIESTA-IoT was developed in the context of IoT testbed federation and emphasizes cross-platform
interoperability. It builds heavily on SSN/SOSA and adds vocabularies for testbeds, datasets, and
experimental metadata [17, 18]. While it provides integration logic, it does not address message
protocols like MQTT.
• IoT-O is an ontology designed to support semantic interoperability across IoT domains. It models
devices, sensors, and platforms and explicitly reuses SSN/SOSA [19]. Although rich in device and
service descriptions, IoT-O lacks support for protocol semantics and dynamic message exchange.
• IoT-Lite is a lightweight ontology designed to enable semantic interoperability in
resourceconstrained IoT environments. It simplifies the modeling of IoT devices and services to facilitate
implementation in low-power or embedded contexts [20]. However, the transport components
are neglected here.
• S3N (Semantic Smart Sensor Network) builds directly on top of SSN/SOSA and WoT Thing
Descriptions to describe smart sensors and their capabilities [21]. While it supports modular
modeling of sensor components and protocols, its MQTT handling is limited and not aligned
with the WoT MQTT to RDF efort.
      </p>
      <p>In summary, although various ontologies exist for describing IoT devices and services, they often lack
a consistent or integrated model for communication transport. The MQTT to RDF ontology from the
WoT Binding Templates is, to the best of our knowledge, a unique attempt to model MQTT semantics,
but does not interlink with domain models like SSN/SOSA. Conversely, SSN/SOSA ofers mature and
extensible semantics for observations but is agnostic to how these observations are communicated.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Ontology Development</title>
      <p>In this work, we propose the MQTT4SSN extension, an ontology developed to provide a semantic
representation of the MQTT message protocol, including the structure of transmitted data. It extends the
well-established SSN/SOSA ontology by introducing MQTT-specific components such as the network
entities broker and client, control packet structures with their payloads, and their associated topics. The
ontology was designed to model automated process data captured and transmitted via MQTT packets
in the context of the EKI1 project. The development was guided by initially defined research questions
(RQs), a comprehensive set of competency questions (CQs), and a review of existing ontology design
patterns (ODPs). These elements shaped the ontology’s structure during design and implementation
and served as the basis for its evaluation.</p>
      <p>Following best practice in ontology engineering, we analyzed existing ODPs regarding reusability
and interoperability. The W3C Web of Things (WoT) MQTT Binding Ontology draft, hereafter referred
to as MQV, was considered as a potential ODP. We used this ontology draft as a starting point for our
ontology design and adapted most of the MQV concepts into our ontology. Thereafter, we extend it with
further elements to fit our requirements on the MQTT side fully, but also enable the semantic relation
to SSN/SOSA. Special attention was paid to the representation of heterogeneous payload encoding,
enabling the ontology to handle structured data such as CSV or JSON. Furthermore, we integrated the
QUDT vocabulary in the traditional SSN/SOSA context and the MQTT4SSN extension.</p>
      <p>To ensure that the ontology is Findable, Accessible, Interoperable, and Reusable (FAIR), MQTT4SSN
is made openly available on GitHub and permanently archived via Zenodo under a DOI, with complete
access to all files. We provide the ontology under an open license and richly annotated with labels,
comments, metadata, and documentation to promote adoption and reuse by the semantic web and
engineering community.</p>
      <sec id="sec-3-1">
        <title>3.1. Core Concepts</title>
        <p>From the outset, we designed MQTT4SSN, an extension of the well-established SSN/SOSA ontology
by incorporating a semantic representation of the MQTT message protocol. While SSN/SOSA provides
a solid foundation for modeling sensing systems, including sensors and actuators and their linkage to
observations, properties, and features of interest, MQTT4SSN complements this by explicitly modeling
the structure and metadata of the transmitted data. The key elements of MQTT, which are the network
entities broker and client, the diferent control packets with their payload, and their associated topics,
form the core of the ontology, along with their interrelations.</p>
        <p>This integration enables seamless alignment between observation, actuation, and sampling
semantics and M2M communication semantics, thereby supporting end-to-end traceability and semantic
interoperability across distributed IoT systems.</p>
        <p>To describe payloads in a meaningful and machine-interpretable way, the ontology includes properties
for specifying the payload content type (MIME type) and character encoding. In addition, MQTT4SSN
considered transport-specific characteristics such as quality of service levels, retain and DUP flags to
allow for the traceability of message delivery. Despite the incompleteness of the MQV draft ontology,
we adapted core concepts in MQTT4SSN and established partial semantic mappings to align both
models for future interoperability.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Semantic Relations</title>
        <p>At the core of this model are four major class clusters: the network infrastructure, the control packet
hierarchy, the topic with its relations to diferent payloads with the relation to SSN/SOSA, and the
topic subject alignment with SSN/SOSA sensing semantics.</p>
        <p>Within the network infrastructure, depicted in figure 1, the ontology defines the entities client
and the broker as subtypes of network participants involved as actors in MQTT communication. A
1https://dtecbw.de/home/forschung/hsu/projekt-eki/, accessed: September 12, 2025
broker has a host address and a port number, while a client has a client ID. The network participants’
interaction is mediated by the network connection class, which can be optionally encrypted with TLS
and is characterized by initiation and acceptance relations: the connection is initiated by the client and
accepted by the broker. The client also explicitly maintains a connection to a specific broker, forming
the foundation for message exchange.</p>
        <p>The Communication is described through the control packet hierarchy, depicted in figure 2. The
publish, subscribe, and unsubscribe packet types are modeled as subclasses of a general control packet
class. These packets contain structured fixed and variable headers as well as payloads. For example
(seen in figures 2, 3), a publish packet includes a fixed header, defining a quality of service level, retain
and DUP flag, a variable header which typically includes the topic, and a publish payload that carries
encoded observation data as content with certain content type, character encoding and perhaps a
content delimiter. Subscribe and unsubscribe packets follow a similar structural breakdown, with
specific headers and payload compositions. A variable header can have a packet identifier. All payloads
have a content-dependent size. Clients are linked to the control packets they send, while brokers can
receive and forward them, enabling the messaging pipeline, as seen in figure 1.</p>
        <p>Topics and topic filters represent another essential part of MQTT communication, depicted in figure 3.
Sensors and actuators that a client hosts can observe or listen to topics. Topic filters, used during
subscription and unsubscription, specify patterns for selecting specific topics. The ontology captures
the relationship between topic filters and matched topics and supports modeling individual subscription
entries with specific quality of service levels. These filters are linked with the subscription payload
over the subscription entry or the unsubscription payload.</p>
        <p>One goal of the MQTT4SSN ontology was the extension of SSN/SOSA, shown in figure 3. The client
class runs on a platform for hosting SOSA sensors or actuators, thus bridging MQTT infrastructure and
the physical sensing world. A sensor hosted by a client may publish data to a topic, while an actuator
may listen to a topic for actuation commands. On the SSN/SOSA side, both systems are hosted by a
platform, and on the MQTT side by a client. Another alignment shows figure 4, depicting a topic that
can be related to one or more topic subjects, represented by SSN/SOSA entities. An observation or
actuation (possibly as a collection) can describe a topic name with an associated feature of interest and
property.</p>
        <p>Further, the ontology relates MQTT topics and payloads to SOSA observations, also depicted in
ifgure 3. Each topic may describe a specific observation or a collection of observations, which are
modeled as being described by topics. Payloads of publish packets can encode single observations or
actuation collections of them.</p>
        <p>Beyond the specification of conceptual axioms, logical constructs such as disjointness were
incorporated to enforce explicit class separation, while inverse relationships were introduced to define
predicates in a bidirectional manner, thereby enhancing semantic rigor and ensuring consistency across
the ontology.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Design Patterns</title>
        <p>The MQTT4SSN ontology integrates MQV vocabulary in the SOSA/SSN ontology stack to provide
a semantic layer for modeling sensor observations and actuator commands over MQTT-based
infrastructures. This alignment bridges the gap between the low-level communication protocol MQTT and
high-level semantic descriptions of IoT systems via SSN/SOSA. We aligned our MQTT4SSN with the
lightweight core ontology SOSA, while also supporting integration with the more expressive SSN due to
the modular structure of SSN/SOSA. The MQTT4SSN and SSN/SOSA alignment is depicted in figures
3, 4, while neglecting detailed visualization of SSN/SOSA object properties due to space constraints.</p>
        <p>To achieve this, key classes and properties from the MQV vocabulary were aligned with
newly defined or extended classes or properties within our MQTT namespace. Several core
classes were mapped directly to their MQV counterparts using owl:equivalentClass. For
example, mqtt:NetworkConnection ≡ mqv:NetworkConnection, mqtt:Broker ≡ mqv:Server, and
mqtt:Client ≡ mqv:Client. These equivalences ensure that MQTT network entities modeled in
MQTT4SSN are semantically consistent with the existing MQV definitions, allowing direct
inferencing across both ontologies and enabling future interoperability. The alignment was also
applied to messaging constructs, such as mqtt:Topic ≡ mqv:TopicName and mqtt:ControlPacket ≡
mqv:ControlPacket.</p>
        <p>Beyond class equivalences, the ontology also maps various object properties using
owl:equivalentProperty. For instance, mqtt:sendsControlPacket ≡ mqv:sendsControlPacket,
mqtt:hasTopicName ≡ mqv:hasTopicName, and mqtt:hasPayload ≡ mqv:hasPayload ensure that
message composition and delivery semantics remain consistent regardless of whether they are
interpreted from the perspective of MQTT4SSN or MQV. These alignments allow for symmetric
entailment: for any triple using an aligned property in one ontology, a corresponding assertion in the
other ontology is logically entailed.</p>
        <p>Beyond class and object property alignments, the ontology also defines a range of datatype
properties that describe literal characteristics of MQTT communication entities such as clients, brokers,
payloads, and headers. Several of these properties are also explicitly aligned with their MQV
counterparts using owl:equivalentProperty. For example, mqtt:isRetainedFlag ≡ mqv:hasRetainFlag
and mqtt:isDUPFlag ≡ mqv:hasDUPFlag ensure that control flags in MQTT publish messages are
semantically interchangeable between the two namespaces.</p>
        <p>In contrast, other datatype properties deliberately avoid equivalence due to structural mismatches.
The most notable example mqtt:hasQosLevel, which expresses the Quality of Service level as a
constrained integer datatype (with values between 0 and 2). While MQV models this concept using the
object property mqv:hasQoSFlag, the type mismatch between the datatype property and the object
property prevents a clean alignment. As such, no equivalence is declared, preserving semantic
consistency and avoiding reasoning errors caused by incompatible property types. We left a corresponding
comment on the definition of the datatype property.</p>
        <p>We add datatype properties to describe essential identifiers and configuration details of
network components. For instance, mqtt:hasClientID, mqtt:hasHostAddress, mqtt:hasPortNumber,
and mqtt:usesTLS provide literal values such as client identifiers, broker addresses, and
encryption status, all of which are critical for modeling MQTT network behavior at the
protocol level that MQV did not implement. Similarly, payload-specific properties like
mqtt:hasPayloadContent, mqtt:hasPayloadContentType, mqtt:hasPayloadContentDelimiter,
and mqtt:hasPayloadEncoding describe the structure and format of transmitted data.</p>
        <p>The ontology integrates the QUDT ontology to represent measurement results in a precise and
interoperable manner. Observation values and payload metadata are expressed as qudt:QuantityValue
instances, which represent a qudt:value annotated with standard units from the QUDT vocabulary
(e.g., qudt:hasUnit unit:MilliM, qudt:hasUnit unit:Byte). Using QUDT, we ensure that units
are machine-readable, semantically precise, and avoid ambiguity in data interpretation. QUDT enables
seamless integration with domain ontologies that rely on quantitative measurements and allows
consistent handling of observational and technical metadata, like payload sizes.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Evaluation</title>
      <p>The ontology evaluation is divided into verification 4.1 and validation 4.2. Verification takes the
previously defined RQs and CQs as requirements and proves their presence in the ontology, represented
by entities and their relations. For validation, the modeled ontology is instantiated with real-world
individuals to determine whether it provides suficient utility in the application. During development
and finally, we checked the ontology with the OntOlogy Pitfall Scanner! (OOPS!) [ 22], locally installed
with the latest Docker Image2. The final OOPS! evaluation detects only minor pitfalls, which warn about
unconnected ontology elements (P04), missing annotations (P08), and missing inverse relationships
(P13). The warnings P04 and P08 refer to imported ontologies, whereas we explicitly modeled the
identified relationships in P13 as non-inverse. The full OOPS! evaluation report 3 is available in the
ontology documentation.</p>
      <sec id="sec-4-1">
        <title>4.1. Ontology Verification</title>
        <p>During the ontology engineering of MQTT4SSN, RQs and CQs were previously defined, representing
the ontology’s requirement profile. The RQs define the coverage of the general purpose in the application.
The CQs determine specific entities and their relations for the model design. Firstly, we identified
CQs to address essential components of the MQTT message protocol. Secondly, we determined CQs
to address the sensing and communication semantics gap. We categorized the CQs into six subjects
to structure them according to their common topics. The complete set of competency questions is
available on GitHub4. Table 1 shows an excerpt of one CQ per subject with entities that represent the
response.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Ontology Validation</title>
        <p>To fully validate MQTT4SSN for real-world applications, it is essential to instantiate the ontology
with individuals from a dataset, such as an industrial production process scenario using the MQTT
message protocol. The next step is to formulate SPARQL queries to test the ontology against the dataset
and execute them. The confirmation of expected answers validates the ontology with conceptual
completeness.
2https://hub.docker.com/r/mpovedavillalon/oops, accessed: September 12, 2025
3https://doernern.github.io/MQTT4SSNOntology/documentation/OOPSevaluation/oopsEval.html
4https://github.com/doernern/MQTT4SSN/blob/main/CQs/MQTT4SSN_CQs.txt</p>
        <p>We have neither acquired a dataset nor created individuals for validation yet. In the future, we plan
to investigate this in a Raspberry Pi simulation setup to generate test data and validate MQTT4SSN
with appropriate SPARQL queries.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion and Future Work</title>
      <p>From the outset, MQTT4SSN was conceived as an extension of the well-established SSN/SOSA ontology,
incorporating a semantic representation of the MQTT message protocol. While SSN/SOSA provides a
robust framework for modeling sensing systems, including sensors, actuators, and their relationships to
observations, properties, and features of interest, MQTT4SSN complements this foundation by explicitly
modeling the structure and metadata of transmitted data. The ontology captures the essential elements
of MQTT, such as the network entities broker and client, the various control packets and their payloads,
the topics that organize communication, and the interrelations between these components. This
integration bridges the gap between sensor semantics and M2M communication semantics, supporting
end-to-end traceability and semantic interoperability across distributed IoT systems.</p>
      <p>In addition, MQTT4SSN describes a payload content with a specific content type and character
encoding, making payload descriptions meaningful and machine-interpretable. Transport-specific
features such as QoS levels and the retain flag are also incorporated to enable traceability of message
delivery. Despite the incompleteness of the MQV draft ontology, MQTT4SSN successfully adapts
selected concepts and establishes partial semantic mappings to facilitate compatibility and reuse.</p>
      <p>Several promising directions are still open for further development of MQTT4SSN. In our work, we
focused exclusively on MQTT version 3.1.1. Future work could extend this scope to include the more
recent MQTT v5 specification, which introduces enhanced message properties, session control, and
user-defined metadata features. Furthermore, the MQTT-SN draft could extend the ontology, which
adapts MQTT for non-TCP/IP networks such as UDP, Zigbee, or Bluetooth.</p>
      <p>Another promising direction is the automated derivation of topic names. While MQTT4SSN currently
supports modeling the relationships between topic naming and SOSA elements such as
FeatureOfInterest, Property, Actuation, ActuationCollection, Observation, and ObservationCollection, an automated
method for deriving meaningful topic names from these linked semantic entities could significantly
reduce configuration efort and improve consistency. Similarly, the PayloadContent could be semantically
derived from SOSA concepts. For instance, if the PayloadContentType is a specified application/json
type, its structure could be automatically inferred from a combination of the aforementioned SOSA
elements, enhancing semantic interoperability and reducing manual modeling.</p>
      <p>We evaluate the ontology only to a limited extent. A valuable next step would be establishing a
simulation environment using Raspberry Pis to emulate processes in industrial automation scenarios
that communicate via MQTT to validate the proposed ontology comprehensively. This setup could be a
testbed for validating the integration and mappings between live MQTT trafic and the MQTT4SSN
ontology under realistic conditions.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This research, as part of the project EKI1, is funded by dtec.bw – Digitalization and Technology Research
Center of the Bundeswehr, which we gratefully acknowledge. dtec.bw is funded by the European Union
– NextGenerationEU.</p>
    </sec>
    <sec id="sec-7">
      <title>Resource Availability Statement</title>
      <p>The MQTT4SSN ontology5 and the competency questions4 are available on GitHub [23] and Zenodo
(DOI: 10.5281/zenodo.16704302). All resources are licensed under Creative Commons
Attribution5https://doernern.github.io/MQTT4SSNOntology/MQTT4SSN.owl
NonCommercial-ShareAlike 4.0 International. An enriched documentation6 of the MQTT4SSN ontology
was automatically generated with the help of the WIDOCO wizard [24].</p>
    </sec>
    <sec id="sec-8">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the author used ChatGPT and Grammarly in order to: Grammar
and spelling check, Paraphrase, and reword. After using these tools, the author reviewed and edited the
content as needed and takes full responsibility for the publication’s content.
6https://doernern.github.io/MQTT4SSNOntology/documentation/index-en.html
[17] FIESTA-IoT Consortium, Fiesta-iot ontology, https://github.com/fiesta-iot/ontology, 2024.
[18] R. Agarwal, D. G. Fernandez, T. Elsaleh, A. Gyrard, J. Lanza, L. Sanchez, N. Georgantas, V. Issarny,
Unified iot ontology to enable interoperability and federation of testbeds, in: 2016 IEEE 3rd World
Forum on Internet of Things (WF-IoT), 2016, pp. 70–75. doi:10.1109/WF- IoT.2016.7845470.
[19] IRIT - MELODI Team, Iot-o: An ontology for semantic interoperability of iot resources, https:
//www.irit.fr/recherches/MELODI/ontologies/IoT-O.html, 2024.
[20] W3C Submission, Iot-lite ontology, https://www.w3.org/submissions/iot-lite/, 2024.
[21] IMT Atlantique, Semantic smart sensor network (s3n) ontology, https://recherche.imt-atlantique.</p>
      <p>fr/info/ontologies/sms/s3n/, 2024.
[22] M. Poveda-Villalón, A. Gómez-Pérez, M. C. Suárez-Figueroa, OOPS! (OntOlogy Pitfall Scanner!):
An On-line Tool for Ontology Evaluation, International Journal on Semantic Web and Information
Systems (IJSWIS) 10 (2014) 7–34.
[23] N. Doerner, M. Maleshkova, doernern/MQTT4SSNOntology: MQTT4SSN v1.0.0: Pre-publication
release, 2025. URL: https://github.com/doernern/MQTT4SSNOntology. doi:10.5281/zenodo.
16704302.
[24] D. Garijo, Widoco: a wizard for documenting ontologies, in: International Semantic Web
Conference, Springer, Cham, 2017, pp. 94–102. URL: http://dgarijo.com/papers/widoco-iswc2017.
pdf. doi:10.1007/978- 3- 319- 68204- 4_9.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>W.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Tropea</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Abid</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Detti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Le Gall</surname>
          </string-name>
          ,
          <article-title>Review of standard ontologies for the web of things</article-title>
          ,
          <source>in: 2019 Global IoT Summit (GIoTS)</source>
          ,
          <year>2019</year>
          . doi:
          <volume>10</volume>
          .1109/giots.
          <year>2019</year>
          .
          <volume>8766377</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>G.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Park</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Chung</surname>
          </string-name>
          ,
          <article-title>An MQTT-Based Context-Aware Autonomous System in oneM2M Architecture</article-title>
          ,
          <source>IEEE Internet of Things Journal</source>
          <volume>6</volume>
          (
          <year>2019</year>
          )
          <fpage>8519</fpage>
          -
          <lpage>8528</lpage>
          . URL: https://ieeexplore. ieee.org/document/8726084/. doi:
          <volume>10</volume>
          .1109/jiot.
          <year>2019</year>
          .
          <volume>2919971</volume>
          , publisher: Institute of Electrical and Electronics
          <string-name>
            <surname>Engineers</surname>
          </string-name>
          (IEEE).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Markovic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Edwards</surname>
          </string-name>
          ,
          <article-title>Enhancing transparency of mqtt brokers for iot applications through provenance streams</article-title>
          ,
          <source>in: Proceedings of the 6th International Workshop on Middleware and Applications for the Internet of Things</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>17</fpage>
          -
          <lpage>20</lpage>
          . doi:
          <volume>10</volume>
          .1145/3366610.3368099.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>T. C.</given-names>
            <surname>Piller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Khelil</surname>
          </string-name>
          , Semsub:
          <article-title>Semantic subscriptions for the mqtt protocol</article-title>
          ,
          <source>in: 2020 IEEE 6th World Forum on Internet of Things (WF-IoT)</source>
          ,
          <year>2020</year>
          . doi:
          <volume>10</volume>
          .1109/wf- iot48130.
          <year>2020</year>
          .
          <volume>9221477</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>T. C.</given-names>
            <surname>Piller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. M.</given-names>
            <surname>Merz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Khelil</surname>
          </string-name>
          , Mqtt-4est:
          <article-title>Rule-based web editor for semantic-aware topic naming in mqtt</article-title>
          ,
          <source>in: 2022 IEEE 19th Annual Consumer Communications &amp; Networking Conference (CCNC)</source>
          ,
          <year>2022</year>
          . doi:
          <volume>10</volume>
          .1109/ccnc49033.
          <year>2022</year>
          .
          <volume>9700533</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>F.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Aubart</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Becker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Özcan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Blankenbach</surname>
          </string-name>
          ,
          <article-title>Real-time sensor data integration for bim-based hydraulic structure monitoring</article-title>
          ,
          <source>Unpublished Manuscript</source>
          (
          <year>2025</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A.</given-names>
            <surname>Banks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <source>Mqtt version 3.1</source>
          .1, http://docs.oasis-open.
          <source>org/mqtt/mqtt/v3.1</source>
          .1/os/mqtt-v3.
          <article-title>1.1-os</article-title>
          .html,
          <year>2014</year>
          .
          <string-name>
            <given-names>OASIS</given-names>
            <surname>Standard</surname>
          </string-name>
          . Latest version: http://docs.oasis-open.
          <source>org/mqtt/mqtt/v3.1.1/ mqtt-v3.1</source>
          .1.html.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Banks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Briggs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Borgendale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <source>Mqtt version 5</source>
          .0, https://docs.oasis-open.org/mqtt/ mqtt/v5.0/os/mqtt-v5.
          <article-title>0-os</article-title>
          .html,
          <year>2019</year>
          .
          <string-name>
            <given-names>OASIS</given-names>
            <surname>Standard</surname>
          </string-name>
          . Latest version: https://docs.oasis-open. org/mqtt/mqtt/v5.0/mqtt-v5.
          <article-title>0</article-title>
          .html.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>OASIS</given-names>
            <surname>MQTT Technical</surname>
          </string-name>
          <string-name>
            <surname>Committee</surname>
          </string-name>
          ,
          <source>Mqtt-sn version 1</source>
          .2, https://groups.oasis-open.org/ higherlogic/ws/public/download/66091/MQTT-SN_
          <article-title>spec_v1.2</article-title>
          .pdf/latest,
          <year>2020</year>
          . Working Draft.
          <source>OASIS Standard</source>
          . Latest version: https://docs.oasis-open.org/mqtt/mqtt/v5.0/mqtt-v5.
          <article-title>0</article-title>
          .html.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>K.</given-names>
            <surname>Sahlmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Mikolajczak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Schnor</surname>
          </string-name>
          ,
          <article-title>Interoperability in the iot - an evaluation of the semanticbased approach</article-title>
          , arXiv (
          <year>2022</year>
          ). doi:
          <volume>10</volume>
          .48550/arXiv.2203.14585, arXiv:
          <fpage>2203</fpage>
          .
          <fpage>14585</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <article-title>W3C Semantic Sensor Network Ontology, Semantic sensor network ontology (ssn</article-title>
          ), https://www. w3.org/TR/vocab-ssn/,
          <year>2017</year>
          . W3C Recommendation.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <article-title>W3C Spatial Data on the Web Interest Group</article-title>
          , Ssn/sosa ontology draft, https://w3c.github.io/ sdw-sosa-ssn/ssn/,
          <year>2024</year>
          . Draft version.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <article-title>W3C Web of Things, Mqtt-to-rdf ontology and binding templates</article-title>
          , https://w3c.github.io/ wot-binding-templates/bindings/protocols/mqtt/ontology.html,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <article-title>QUDT.org, Quantities, units, dimensions and data types (qudt</article-title>
          ), https://www.qudt.org/,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bodenbenner</surname>
          </string-name>
          ,
          <article-title>Providing FAIR Sensor Data Models Using Semantic Web Technologies and Ontologies</article-title>
          ,
          <source>Ph.D. thesis</source>
          , University Thesis,
          <year>2025</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>ETSI</surname>
            <given-names>SmartM2M</given-names>
          </string-name>
          ,
          <article-title>Saref - smart applications reference ontology</article-title>
          , https://saref.etsi.org/extensions. html,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>