<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic Interoperability and Interrater Agreement in Annotation of IoT Data</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Robert Bosch GmbH</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Corporate Research</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Robert-Bosch-Campus</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Renningen</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Germany</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>yulia.svetashova</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>stefan.schmidg@de.bosch.com</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Karlsruhe Institute of Technology</institution>
          ,
          <addr-line>AIFB, Kaiserstr. 89, 76133 Karlsruhe</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>1807</year>
      </pub-date>
      <abstract>
        <p>Data management has become a critical ability in today's data-driven businesses. In the Internet of Things (IoT) domain, sensors, devices and applications generate huge amounts of data. To take advantage of this data, new storage and exchange solutions, e.g., IoT data repositories, enterprise data lakes and IoT data marketplaces, are emerging. These emerging data storage and exchange solutions allow authorized users to discover and access heterogeneous data streams and integrate them across stakeholder boundaries. We present an approach to annotate IoT datasets by mapping their schemata to corresponding ontology terms, paired with an on-the-fly ontology extension mechanism based on templates. We further introduce a framework to evaluate semantic interoperability in this complex setting via the agreement between domain experts on a set of annotationextension tasks and show how this framework can be used to improve the system iteratively and to avoid potential semantic interoperability conflicts.</p>
      </abstract>
      <kwd-group>
        <kwd>Semantic interoperability Semantic annotation sion Interrater agreement Internet of Things</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Data has become a strategic asset nowadays. As has been acknowledged by M. Chui et
al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], one of the main bottlenecks in making it accessible and usable is the lack of
highquality data labeling support. Modern data storage and exchange solutions (e.g., data
catalogues, data marketplaces and enterprise data lakes) allow easy access to various
types of data, as well as integration of data into analytic workflows, and collaboration
of data scientists and business users over data. However, to enable these intelligent
functionalities, the submitted datasets should be semantically interoperable.
      </p>
      <p>
        Semantic interoperability is defined as the ability of two or more systems to
interpret the content and meaning of the exchanged information [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. In the context of
the Internet of Things (IoT), it is typically achieved by adding semantic annotations
to the raw data or by mapping the diverse schemata of each data source to a unified
representation: an ontology, a taxonomy, or a controlled vocabulary (see [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]).
      </p>
      <p>
        The usage of a shared ontology or a set of ontologies as “the best pathway to achieve
semantic interoperability” [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] was tested in several European research projects:
OpenIoT [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ], IoT Lite [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], FIESTA-IoT [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], BIG IoT [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and other. Among the challenges
and requirements, related to interoperability, these projects report that a shared ontology
must be evolvable over time as new data sources, sensors or IoT devices appear.
      </p>
      <p>
        Traditionally, ontology is extended in a centralized manner: data providers or
device owners first submit requests for extension and then use the updated ontology for
annotation. In practice, it means long waiting times before the ontology term is
available to perform a task in question. Multiple rounds of communication between data
providers and ontology engineers take place to clarify the meaning of the requested
ontology terms3. To address this limitation, several approaches to involve data providers
in the ontology extension process were suggested (see, e.g., [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]).
      </p>
      <p>
        The core characteristic of these approaches is the coupling of the annotation process
with the on-the-fly ontology extension. When a data provider lacks a term to perform an
annotation task, s/he can use a simple procedure to add it via the graphical user interface
(GUI). The Schema Editor of OpenIoT middleware infrastructure [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] is an example
of an implemented system. It is a Web-based application, which targets non-ontology
expert users. It allows the addition of new sensor types to the SSN-based OpenIoT
ontology and their later use in defining descriptions of sensor instances. In both cases, the
users provide values via the Web forms. These forms serve as templates that guide the
user, requiring them to supply information and linking it to the corresponding
superclasses. The tool, thus, preserves the ontological foundations of the underlying model.
      </p>
      <p>
        The decentralization of the ontology extension process might become truly
challenging for the external applications that consume newly introduced terms or data which
has been annotated with them. If a term description is not accurate or not complete,
it can cause incompatibilities in data semantics and structures between the systems,
phenomena termed semantic interoperability conflicts by J. Park [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ]. As an
example, imagine that a navigation application expects (in accordance with the
specification/description) an input in newton-seconds, while the other system delivers it
(contrary to the description) in foot-pound-seconds. In 1999, such a conflict (data-unit
conflict in Park’s classification) led to the wrong trajectory computation and sent the
NASA’s Mars Climate Orbiter spacecraft fatally close to the surface of the planet Mars4,
where it simply burned up.
      </p>
      <p>
        In the complex setting of annotation with the option to dynamically extend the
shared ontology, semantic interoperability conflicts can result either from 1) incorrect
mappings of elements of annotated data sources to the ontology terms, or 2)
incorrect/incomplete descriptions of newly introduced terms. In both cases, “the process of
converging to a uniform semantics can be influenced but not controlled” [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ].
      </p>
      <p>
        In order to build decentralized systems with the dynamic ontology extension
component, it is crucial to understand potential causes of semantic interoperability conflicts.
When they are known, one can influence the extension process. Numerous studies in the
bio-medical domain (see, e.g., [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] and related work there) showed how the task of
an3 Here we refer to the experience of the team responsible for the maintenance of the ontology
in the BIG IoT project. This challenge is also valid in the context of describing datasets in the
scientific communities (see [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]).
4 https://www.jpl.nasa.gov/missions/mars-climate-orbiter/.
notation with the existing terms can be explored via the interrater agreement metrics on
a set of tasks addressing potential semantic interoperability conflicts. Our focus stays,
however, on the extension component and its impact on the overall agreement. To assess
this, we conducted an experiment where 23 non-ontology experts annotated 7 samples
of IoT data (56 data points in total) with the terms of a shared ontology and
simultaneously extended it if the needed terms were missing.
      </p>
      <p>
        This paper makes the following contributions:
1) it suggests an experimental setup to explore the complex annotation-extension
setting, which consists of the application to annotate IoT data with an ontology
extension module, an ontology for IoT data and a set of SOSA[
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]-inspired templates, which
cover the main types of extensions;
      </p>
      <p>2) it introduces the modeling of the proposed extensions which is used along with
the selected existing ontology terms to measure interrater agreement in the
annotationextension task;</p>
      <p>3) it presents the results of a user experiment and proposes the interpretation of
agreement scores as signals of the potential semantic interoperability conflicts;
4) it outlines the measures to iteratively improve the system based on the findings.</p>
      <p>The rest of this paper is organized as follows. Section 2 discusses the relevant
research. Section 3 presents our approach to ontology extension implemented as a part
of IoT data annotation environment. Sections 4 and 5 provide the details of the
experiment with domain experts and discuss the results. Section 6 concludes this article with
a summary of accomplished and future research directions.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related work</title>
      <p>
        Previous research relevant for our work can be categorized into three broad areas.
Firstly, our work goes in the direction of collaborative frameworks and tools used
for knowledge engineering and ontology development. Our target users have little (if
any) exposure to ontologies, so it is desirable to hide the complexity of the latter. Thus,
the most closely related implemented systems are the the Linked Earth Framework [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]
and the Schema Editor of OpenIoT [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ].
      </p>
      <p>The Linked Earth Framework supports the paleoclimate community to describe
their datasets and enable users to add new metadata properties. It employs an initial
core ontology and a set of extensions (called proxies), which serve as the basis of the
crowd vocabulary. Proposed new metadata properties are available to the community at
the time of creation, and they can later decide to include them into the core ontology.
In contrast, our tool does not involve community discussions and voting procedures. At
some point, ontology engineers approve proposed extensions. Before that the terms are
available to the users, but have a special namespace.</p>
      <p>
        The Schema Editor also combines the extension of ontologies (the Sensor Type
Editor) with using them to annotate instance data (the Sensor Instance Editor). This
is one of the earlier tools for the annotation of the IoT data, focused on describing
sensor types by providing their names and observed properties with accuracy and
frequency values. As will be shown in Section 3, in our tool we use a different, but also
SSN/SOSA-inspired [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] set of templates and a more elaborate characterization of the
new ontology terms.
      </p>
      <p>
        Secondly, we built our system on the ideas coming from the template-based
ontology extension tools: TermGenie [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], Webulous [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], OTTR [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]; Prote´ge´ plugins
supporting OPPL [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] and MappingMaster [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]. In those tools, ontology design patterns
are used to generate templates. Each template represents a boilerplate for the ontology
entity of some type. Domain experts and data curators populate these templates by
filling in GUI forms or spreadsheets. Their input is further transformed into the axioms of
the shared ontology.
      </p>
      <p>
        Template-based tools consist of four generic components: 1) a templating
mechanism, 2) an input GUI, 3) a template instantiation/expansion processor, and 4)
(optionally) an input validation system. We implement these components in our Web-based
system prototype. We also add prompts in natural language to the corresponding GUI
forms (similar to [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]). Most importantly, we address one research gap of the
templatebased systems. To the best of our knowledge, there is no systematic work on the usage
of templates. Hence, in our experiment, we will try to gain insights on how templates
and sets of templates work in practice.
      </p>
      <p>
        Finally, our experiment design can be compared with the recent study [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ], which
explores the agreement of experts on a task of classifying entities in domain ontologies
under upper ontology classes. We take into account the literature on crowdsourced data
annotation [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], which models interrater agreement and disagreement, but for these
initial experiments we use easily interpretable metrics, such as percent agreement and
Fleiss’ kappa statistic [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <p>In the following we describe our experimental setup and the conducted experiment.</p>
    </sec>
    <sec id="sec-3">
      <title>3 IoT Data Annotation Environment</title>
      <p>
        With the objective of exploring semantic interoperability issues in the context of IoT
data annotation, we built a system prototype where we extended an interface, developed
in the context of the project BIG IoT [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], with a module, which enables experts to add
new ontology entities and use them for data annotation (Figure 1).
      </p>
      <p>Suppose we need to annotate the following piece of car diagnostics data f“speed” :
11.75, “Rpm” : 814.12, “MAF” : 7.86, “ts”: “2017-09-19T23:00:00Z”g. The annotation
process is as follows: a user first selects a class (e.g., “Car”) and then – its corresponding
properties from the dropdown lists (Figure 1. 1 ).</p>
      <p>In Figure 2.1, we show how these properties are modeled in the shared ontology.
Relation to a class is specified by the property “schema:domainIncludes”. Property
“schema:rangeIncludes” characterizes the value type: in this case the float value is
mapped to the “schema:Number” data type.</p>
      <p>Additionally, each ontology entity has some characterization in terms of a
metamodel. This layer reflects core modeling principles of a shared ontology, which are
especially important for extension. For the core use cases we have preferred to use the
simple lightweight RDF(S) ontology. Due to having the meta-model, we can always
generate a SOSA-compliant representation of an annotated dataset.</p>
      <p>
        As the basis of our shared ontology we use the sosa:Observation pattern [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] , [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
In our approach, all entities in the ontology have an associated type:
“meta:ObservableProperty”, “meta:TemporalProperty”, “meta:SpatialProperty”, or “meta:Class”.
      </p>
      <p>Observable The entity type determines a set of meta-predicates, which specify its
semantics. For example, the property “basis:carSpeed” as an instance of
“meta:ObservableProperty” has two meta-predicates: “meta:propertySemantics”, related to the
higherlevel semantics (“meta:Speed”, “meta:Mass”, “meta:Amount”, etc.), and “meta:unit”,
specifying units of measurements. Possible objects for those predicates form closed
sets.</p>
      <p>
        Temporal Temporal properties have three meta-predicates. The first meta-predicate:
“meta:temporalSemantics” models the distinctions from Allen’s [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] seminal work on
temporal modeling: time vs. duration, instance vs. interval. The second predicate:
“meta:serviceSemantics” indicates whether the value of the temporal property is measured
or predicted. The third meta-predicate: “meta:associatedObservableProperty” is related
to the modeling of a “sosa:phenomenonTime”. In the car diagnostic data above, the
timestamp value with the key “ts” is related to all observable properties in the dataset.
Another option was to relate it to a certain observable property or a subset of properties
in a sample.
      </p>
      <p>Spatial Spatial properties with their two meta-predicates conclude the list. Firstly,
we can specify the location of an object as a postal address, coordinates alone or a
geometry, with many finer distinctions. These options form the possible objects of the
predicate “meta:spatialSemantics”. The meta-predicate “meta:structure” handles
differences between flat f“lng” : 43.2, “lat” : 9.3g and nested f“location” : f“lng : 43.2223”,
“lat” : 9.3 gg structures.</p>
      <p>In the following, we explain how the template-based ontology extension works.
Each entity type – “meta:ObservableProperty”, “meta:TemporalProperty”,
“meta:SpatialProperty” and “meta:Class” – have corresponding extension templates. In Figure 2.2,
you can see a skeleton of an observable property. By answering questions in the
userinterface forms (Figure 1. 2 ), a user literally fills the gaps in the proposed elements’
description. A user selects options from the dropdown lists, which represent possible
objects of the meta-predicates for this property. It results into a structured description
of a new element. Most importantly, a new element is linked to the existing ontology
concepts, in accordance with the modeling principles of the shared ontology.</p>
      <p>Wildcard In addition to the supported entity types, there exists also the “Wildcard”
template. This can be used when none of the more expressive templates fits.
4
4.1</p>
    </sec>
    <sec id="sec-4">
      <title>Towards a Framework to Explore Semantic Interoperability</title>
      <sec id="sec-4-1">
        <title>Research Questions</title>
        <p>Semantic interoperability of the annotated data depends to a large extent on how
annotators interpret the underlying shared ontology and a set of templates. Designing this
experiment, we wanted to investigate how the option to extend ontologies on the fly
would influence the results of the annotation task, how the set of templates worked
as a system and what can be said about the quality of a single template based on the
interrater agreement. We stated the following research questions:
1 Will the fine-grained modeling of proposed ontology extensions significantly
influence the overall agreement among experts in the annotation task?
2 Can we detect and categorize various types of conflicts between the existing
ontology entities and the proposed elements via the interrater agreement scores?
3 Can we detect and categorize various types of possible semantic interoperability
conflicts by inspecting the interrater agreement scores for the modeling choices
made while introducing new elements?
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Experimental Setup</title>
        <p>To assess general tendencies and obtain some quantitative estimates for the complex
annotation-extension task, we organized an experiment with 23 participants. All of them
were computer scientists and engineers from various fields. We have not collected any
demographic data and processed the responses as anonymous.</p>
        <p>The participants of the study annotated 7 IoT data sources5 (see Table 1) in JSON
format, each of them with a description in natural language, which contained the
information required to annotate all the key–value pairs.</p>
        <p>The datasets were presented to the participants in increasing order of complexity.
We started with two confidence-building examples, where all needed properties were
present in the shared ontology. Then we gradually introduced datasets, which required
more complex modeling decisions: nested hierarchies, infrequent data value formats
(e.g., duration in the ISO 8601 (“P0DT0H1M12S”) or Unix epoch time), introducing
new properties and classes. The last and most complex dataset represented the output
of a service (predicted waiting time for a bus), which was very different from the core
use case: recorded sensor measurements at a particular timestamp.</p>
        <p>Prior to the experiment, all participants watched an introductory video, where we
explained the context, showed the annotation environment and annotated one data source,
which required adding several terms to the shared ontology.</p>
        <p>In the next section, we introduce the relevant interrater agreement metrics and
discuss the results of the experiment.
5
5.1</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Results and Discussion</title>
      <sec id="sec-5-1">
        <title>Interrater Agreement Measures</title>
        <p>
          Interrater agreement is defined as “the degree to which two or more raters achieve
identical results under similar assessment conditions” [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ]. It can be calculated at the level
of single rated items. For example, the first variable “parkingSpaceId” in Data source 1
was annotated as “schema:identifier” by 20 participants out of 23 (86.95%), the second
used element was “basis:operatorIdentifier” selected by 2 participants (8.7%), and the
third – “schema:description” used once (4.35%). We will refer to this metric as percent
agreement on the 1st, 2nd, and 3rd, etc. most frequent choices for the data item.
        </p>
        <p>
          In addition, we want to obtain a quantitative estimate of experts’ agreement on a
set of tasks for the whole experiment. We achieve that by treating shared ontology
elements as nominal categories, which are assigned by experts to the rated data items. For
5 These data sources and their descriptions are available in the following Github repository:
https://github.com/YuliaS/PatternBasedExtension.git. In this paper, we discuss mostly the
results of introducing new properties.
categorical data, the most common interrater agreement metric is a kappa-type statistic
which measures the observed level of agreement between raters for a set of nominal
ratings (P ) and corrects for agreement that would be expected by chance (Pe) [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]:
=
        </p>
        <p>P
1</p>
        <p>
          Pe
Pe
(1)
J. Cohen [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] was the first to propose kappa for two raters. In our experiments, we use
the Fleiss’ [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] kappa metric generalized to any constant number of raters.
        </p>
        <p>
          We base our interpretation of kappa values on the Landis and Koch (1977) [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ]
benchmark scale. It describes relative strength of agreement associated with kappa
ranges, with values from 0.0 to 0.2 indicating slight agreement, 0.21 to 0.40
indicating fair agreement, 0.41 to 0.60 indicating moderate agreement, 0.61 to 0.80 indicating
substantial agreement, and 0.81 to 1.0 indicating almost perfect agreement. Statistics
&lt; 0:00 is a poor agreement, and = 1 indicates perfect agreement.
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2 Interrater Agreement and Ontology Extension: Tendencies</title>
        <p>To compute the Fleiss’ kappa statistic, we transform our results’ table into a 56 23
matrix in which the columns represent the different raters (n = 23), and the rows represent
data items (N = 56), which the raters have characterized either by existing ontology
terms or proposed extension elements. We treat both as nominal categories (Table 2).
The Fleiss’ kappa computed for this matrix was 0.6386 for the number of categories
k = 218, which indicates substantial interrater agreement.
However, using labels for the proposed elements directly introduced additional 178
categories to the 40 existing ontology elements used for annotation. These labels are not
generalizable in the sense that they do not reflect the underlying modeling decisions.
Thus, to explore the tendencies in the overall agreement, we need to perform several
data transformations.</p>
        <p>In Table 3, we show the choices for the element “proposed:status” used to
annotate data sample 1.1 by user23. These choices come from the closed sets of options
displayed in the drop-down lists of the user interface. By combining the options into
a sequence, e.g. “Observable ParkingSite Text Quality None”, we obtain a compound
name for the proposal, which will be reused for the proposal coined by another user,
if the choices were identical. Note that it is possible to represent just the pattern type
6 Statistics were done using R 3.5.0 (R Core Team, 2018), the irr (v0.84.1; Gamer, 2019) and
the reshape2 (v1.4.3; Wickham, 2007) packages.
(“ObservableProperty”) or pattern and a value type (“ObservableProperty Text”) and
check the expert agreement on a subset of choices. Now, we can substitute the
labels of the proposed elements in the results’ matrix (e.g. “proposed:status” !
“Observable ParkingSite Text Quality None”) with the compound proposals’
characteristics and assess the overall agreement.</p>
        <p>To answer Research Question 1 – will the fine-grained modeling of proposed
elements significantly influence the overall agreement among experts in the annotation
task? – we applied the above mentioned substitutions to our initial result matrix. We
thus created 12 datasets where the characteristics of the proposed entities were
modeled with different levels of granularity.</p>
        <p>As a baseline, we constructed a dataset where all labels of the proposed elements
were replaced by one label “proposed”. This dataset roughly approximates the setting
when only existing elements of shared ontology are used to annotate the datasets.</p>
        <p>More fine-grained modeling of the user choices introduced additional categories.
Their number ranges from 4, when only pattern types (“Observable property”,
“Temporal”, “Spatial”, “Wildcard”) were specified, to 142, when all possible combinations
of options were considered.</p>
        <p>The highest agreement, = 0:773, was obtained for the baseline dataset, the second
largest value, = 0:718 – for the dataset, where only pattern types were modeled. The
difference between the 1st and the 2nd highest kappa statistics is larger than in any
subsequent pairs, where the values decrease almost linearly with the increase in the
number of categories.</p>
        <p>The results in Table 4 show that irrespective of the number of user choices
considered, the Fleiss’ kappa values remain in the range of substantial agreement scores.
Therefore, the overall agreement score for the annotated dataset is influenced by the
fine-grained modeling of proposed elements, but not to the extent that it changes its
position on the Landis and Koch (1977) benchmark scale.</p>
        <p>We will further dive deep into the causes of the interrater disagreement by detecting
and categorizing various types of conflicts between elements used for annotation at the
level of single data items.</p>
      </sec>
      <sec id="sec-5-3">
        <title>5.3 Interrater Agreement and “Competing” Elements</title>
        <p>In order to answer Research Question 2 – can we detect and categorize various types
of conflicts between the existing ontology entities and the proposed elements via the
interrater agreement scores? – we compiled for each data item a list of existing ontology
entities and/or the proposed pattern types used to annotate this item. Then we calculated
percent agreement and sorted the results in decreasing order. Percentages for the three
most frequent choices for each data item are presented in Table 5.</p>
        <p>The table shows that, for the majority of data items (76.78%, or 46/56), the
agreement on the 1st most frequent choice is &gt;78%7. We interpret the percent agreement
&gt;20% for the 2nd most frequent choice as a “competing” alternative.</p>
        <sec id="sec-5-3-1">
          <title>In the results’ distribution, one can find three types of alternatives:</title>
          <p>7 Relatively low agreement scores for the first chosen option in the data items 7.0 and 7.1 are
most probably due to the equally probable conceptualizations. To annotate keys with
values “busStopId” and “busStopName”, participants could either first introduce a new class
“BusStop”, missing in the shared ontology, and then new properties where this class was a
domain, or create new properties with the domains “basis:Bus” or “basis:BusLine”.
– between two existing elements (e.g., “basis:incidentCause” and “schema:-description”
in data item 2.0),
– between a pattern type and an existing term (e.g., an instance of the Observable
property pattern and “schema:description” in 5.8; see also 4.5, 5.9, 5.11, 6.5),
– between two pattern types (e.g., instances of the Observable property and Wildcard
patterns in 5.10; see also 5.12, 6.7, 7.3).</p>
          <p>In 5 out of 6 cases, where the existing term was involved as the competing
alternative, this term was “schema:description”, a property with a very broad definition “A
description of the item” and the “schema:Text” value type. In the systems where
ontology extension by annotators is enabled, the presence of elements with such a general
meaning hinders interoperability. The majority of participants still chose either a more
specific property (“basis:incidentCause” in data item 2.0), or created properties with
a specialized meaning (e.g. by instantiating the Observable property pattern in 5.8 to
describe the availability status of a charging dispenser).</p>
          <p>Another case where an existing ontology element is selected as an alternative to
a pattern instance is a property “basis:carSpeed”. It was used to annotate data item 4.5
with a key name “GPS Speed”; 65.22% created a new property with a specialized
meaning by instantiating the Observable property pattern. Using the existing element is not
wrong in principle. Nevertheless, reusing “basis:carSpeed” to annotate “GPS Speed”
leads to ambiguity: two keys in Data sample 5 are described with the same property in
the shared ontology. In this case, if the shared ontology is not extended, annotating the
key with an existing entity creates semantic interoperability conflict.</p>
          <p>The group where two pattern types were competing as the 1st and the 2nd most
frequent choices, contains three combinations. Firstly, the Wildcard pattern is used along
with the Observable property pattern (items 5.10 and 5.12). In both data items, they
introduce properties related to sensor measurements: one, indicating whether a charging
plug has quick charge support, and the other – a maximum current of a charging
dispenser. These properties represent static attributes of a plug and a dispenser rather than
the values measured by sensors. This shows that such boundary cases can cause
inconsistencies while introducing properties with similar meaning. Discussing this difference
in the introductory materials will foster more uniform modeling.</p>
          <p>In data item 6.7, in order to introduce the property to annotate the key “country”,
nested in the object “city” (along with the key “name”), 43.48% of the participants
chose the Spatial pattern and 30.43% – the Wildcard pattern. In the description of Data
sample 6, a city is defined as a location for which the weather forecast is provided.
Here the “country” attribute is related to spatial modeling, but it differs from the postal
address element used in the address specification for buildings. Both modeling solutions
are possible and, if any of them is preferable, this case should also be discussed in the
introductory materials.</p>
          <p>Data items 7.0–7.3 go beyond prototypical sensor measurement contexts. They
model the output of a service which estimates the waiting time before the next bus
arrives at a bus stop. Even though the agreement in choosing the Temporal pattern to
annotate the key “busTimeToArrival” was 69.57%, 21.74% of the participants modeled
it as an Observable property instance. The latter modeling is also appropriate as this
value is calculated based on a schedule or a position of the bus, which differs from
the modeling of the resulting timestamp of a “sosa:Observation”. This example clearly
points to the limitations of the existing pattern set (see further discussion in Section 5.4)
and the need to reconsider the modeling principles of the shared ontology (if similar
datasets are modeled in the data cataloguing or integration solution).</p>
          <p>To sum up, the presence of “competing” alternatives indicates the need to revise
the shared ontology and a corresponding pattern set (in the case where very dissimilar
patterns are being confused) or to discuss specific modeling scenarios during the
training/introductory phase. The interrater agreement scores serve as reliable indicators of
possible collisions.
5.4</p>
        </sec>
      </sec>
      <sec id="sec-5-4">
        <title>Interrater Agreement and Semantic Interoperability Conflicts</title>
        <p>
          In response to Research Question 3 – can we detect and categorize various types of
possible semantic interoperability conflicts while using a single template? – we will
successively examine the modeling choices in all types of patterns used to introduce
new elements. Agreement metrics will reveal the aspects of semantic interoperability
related to the possible semantic interoperability conflicts (after J. Park, [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ]).
        </p>
        <p>In general, we observe (see Tables 6–9) a high level of agreement in specifying
properties’ domains and ranges in all patterns except for the instances of Temporal
pattern. Agreement on the domain indicates that in most cases ontology extension will
not introduce schema-isomorphism conflicts. Agreement on the range is related to the
value type and possible data-representation conflicts.</p>
        <p>In Table 6, we present percent agreement on each of the chosen pattern fillers in the
Observable property pattern instances, where this pattern was selected by the majority
of the participants to introduce a new property. A hundred percent agreement means that
there was only one value; all other cases show the split between various options, sorted
in decreasing order8. High variability in the specification of the units of measurement
(e.g., for the data sample 4.4, the options were: “KilopascalAbsolute”, “Kilopascal”,
“Hectopascal”, “Pascal”) was somewhat surprising because all units were explicitly
mentioned in the descriptions of the data sources. Even though this situation is less
likely in the real setting (data providers must know their data, in the experiment it could
be a matter of attention), the data annotation system should consider mechanisms to
check the unit of measurement to avoid possible data-unit conflicts.</p>
        <p>We also explored tendencies in how the participants specified the higher level
property semantics. The options here were not mutually exclusive in terms of splitting the
meaning continuum (see “meta:Quantity”, “meta:Amount”, “meta:Mass”, etc.). The
results suggest that higher agreement is achieved for the options mentioned in the label of</p>
        <sec id="sec-5-4-1">
          <title>8 By using multiplication, e.g. “5:0</title>
          <p>5.0% of the participants.</p>
          <p>2”, we indicate that there we two options, each chosen by
an annotated element or in the data source description: “GPS Speed” – “meta:Speed”
(93.33%, data sample 4.5), “Intake Pressure” – “meta:Pressure” (66.67%, data
sample 4.4); or in the label of the related properties: “meta:Concentration” was chosen
by 80% of the participants after annotating several keys with properties labeled
“basis:noxConcentration”, “basis:coConcentration”, etc.</p>
          <p>In the results dataset we did not observe confusion between the concepts of the
highest level of abstraction: “meta:Quantity” and “meta:Quality”. As an example, in
data item 5.8, the semantics of a property introduced to annotate the availability status
of a charging dispenser was marked as “meta:Quality” in 66.67% cases; the second
most selected option was “meta:NoneOfTheListed” (25%).</p>
          <p>The results show that in the contexts focused on achieving semantics
interoperability, the options should be mutually exclusive. In the future experiments, we will split
the set of options into two lists and display them in the separate user interface forms.
The first one – f“meta:Quantity”, “meta:Quality”g – will contain options with most
general distinction that enable alignment with top-level ontologies. The second will be
oriented towards capturing the subsumption relationships (e.g., properties which
indicate speed values “carSpeed” and “GPSSpeed” will be the siblings of one superclass
“meta:Speed”).</p>
          <p>The instances of the Temporal pattern (see Table 7) demonstrated the greatest level
of variability. We start with the data items 5.13 and 6.8. First and foremost, the idea
of specifying a property which is observed by a sensor at a particular timestamp (the
association which is essential to reconstruct the sosa:Observation context) was not
understood by the participants. In the introductory video, we gave a short explanation of
this relationship, which was obviously not sufficient to make reliable choices. This may
also be due to the unspecific prompt “Which property is related to the proposed one?”
used as a label for the options list. The majority of replies were split between the
options: “meta:AllSensorMeasurementsInASample” and “meta:Other”. The former seems
more appropriate because in both datasets there are multiple observable properties and
only one time indicator.</p>
          <p>Another cause of disagreement was the usage of infrequent datatypes for the data
values (see the column “schema:rangeIncludes” in Table 7). For the Unix epoch time
in data item 6.8, the split was 56.56% – 31.25% – 12.5% for the options
“meta:EpochTime”, “meta:DateTime”, “meta:Date”. Similar distribution was obtained for the
duration value (“P0DT0H1M12S”) in data item 7.3. This variability is a potential cause
of the data-representation semantic interoperability conflicts and should be carefully
handled.</p>
          <p>
            The options that specified temporal semantics [
            <xref ref-type="bibr" rid="ref1">1</xref>
            ] were rated most consistently in
all samples: the replies contained at most 2 options, with the agreement ranging from
81.25% to 100.0% on the most frequent choice.
          </p>
          <p>Service semantics for the weather forecast (data item 6.8), despite the dataset
description (“for a particular time in the future”), was specified as measured, not predicted
by the 37.5% of the participants. This distinction obviously needs some elaboration in
the future training or assistance materials.</p>
          <p>We finally discuss a more complex temporal property in the last data item 7.3. The
value of this property presents the output of a service which calculates time remaining
until the bus arrives at the bus stop. Here the modeling depends on the
conceptualization of the situation: whether this attribute is seen as characterizing an arriving bus or
a bus stop or even a bus line. The study participants chose “basis:Bus” class as the
domain of the property in 43.75% cases, 31.25% of the participants introduced a new
class for a bus stop und used it as a domain; another options were – “meta:BusLine”
(18.75%) and “proposed:TimeToArrival” (6.25%). This is one of the examples where
we have a clear indicator of 1) the multiple modeling decisions and 2) the limitations
of the core temporal modeling in the sosa:Observation pattern, which aims at capturing
the time at which the phenomenon took place. Note that more general temporal
modeling (columns “meta:temporalSemantics” and “meta:serviceSemantics”) shows a higher
level of agreement even for very dissimilar properties.</p>
          <p>The instances of other pattern types – Spatial (Table 8) and Wildcard (Table 9) –
demonstrate a very strong agreement between participants. The Wildcard pattern was
introduced to collect properties not covered by the core template set. The agreement
is not surprising – this pattern simply allows the establishment of relationships
between classes, or between a class and a datatype. Nevertheless, these relationships can
be further specialized into a new pattern. For example, the Wildcard instances in the
current result set showed the need of a pattern to express the “part-of” relation (e.g.,
“basis:Dispenser” hasPlug in 5.9). We also need to differentiate a static characteristic
of an object (e.g., “basis:Plug” hasQuickChargeSupport in 5.10) and the changing state
measured by a sensor (see properties like “basis:plugAvailabilityStatus”).</p>
          <p>In this section, we showed how various types of semantic interoperability conflicts
can result from the modeling choices made in the process of using templates. The
agreement scores obtained for the options clearly indicate these problematic cases, where the
unified behavior should be enforced either by training or by automated checks and
suggestions. Also these scores point in many cases to limitations of the shared ontology.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusions</title>
      <p>This work outlines an approach to explore various aspects of semantic interoperability,
related to the annotation of IoT data, by computing the interrater agreement scores
(percent agreement and the Fleiss’ kappa) on a set of annotation tasks. To the best of
our knowledge, it is the first experiment where annotation of IoT data with the elements
of one shared ontology is examined in the situation when the rater not only uses an
ontology (or a set of ontologies), but extends it if needed elements are missing.</p>
      <p>We show the role of the interrater reliability metrics in pointing out the “competing”
elements which might be introduced as the result of the ontology extension process. We
also describe how interrater agreement scores can be used to detect potential
semantic interoperability conflicts, in particular, data-representation, data-unit and
schemaisomorphism conflicts.</p>
      <p>We suggest methods to model proposed elements with various levels of granularity
and to investigate the impact of modeling choices on overall agreement in a complex
annotation-extension task. In the experiment with 23 domain experts, we obtained initial
estimates for the overall agreement and scores on single items. We will further use these
estimates to improve the implemented prototype of the annotation system tailored to the
IoT data.</p>
      <p>
        We believe that our approach can be applied outside the IoT domain as well as
generalized from the data cataloguing and/or data integration solutions to more complex
tasks and environments. In general, it contributes to a much wider topic – how meaning
is created in the community of users, and thus, to realizing the vision of the Semantic
Web: “...the relations allow communication and collaboration even when the
commonality of concept has not (yet) led to a commonality of terms” [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Exploring other
contexts where shared meaning is created and agreed upon in a distributed manner will be
one of the directions of our future research.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Allen</surname>
            ,
            <given-names>J.F.</given-names>
          </string-name>
          :
          <article-title>Maintaining Knowledge about Temporal Intervals</article-title>
          . In: Weld,
          <string-name>
            <given-names>D.S.</given-names>
            and
            <surname>de Kleer</surname>
          </string-name>
          , J. (eds.)
          <article-title>Readings in Qualitative Reasoning About Physical Systems</article-title>
          . pp.
          <fpage>361</fpage>
          -
          <lpage>372</lpage>
          Morgan Kaufmann (
          <year>1990</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Androcˇec</surname>
          </string-name>
          , D. et al.:
          <article-title>Using Semantic Web for Internet of Things Interoperability: A Systematic Review</article-title>
          .
          <source>Int. J. Semant. Web Inf. Syst</source>
          .
          <volume>14</volume>
          ,
          <issue>4</issue>
          ,
          <fpage>147</fpage>
          -
          <lpage>171</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Agarwal</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          et al.:
          <article-title>Unified IoT Ontology to Enable Interoperability and Federation of Testbeds</article-title>
          .
          <source>IEEE 3rd World Forum on Internet of Things (WF-IoT</source>
          )
          <fpage>70</fpage>
          -
          <lpage>75</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Bermudez-Edo</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          et al.
          <article-title>IoT-Lite: a lightweight semantic model for the Internet of Things. Ubiquitous Intelligence and Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Cloud and Big Data Computing, Internet of People, and Smart World Congress</article-title>
          , IEEE,
          <fpage>90</fpage>
          -
          <lpage>97</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Berners-Lee</surname>
            , Tim,
            <given-names>James</given-names>
          </string-name>
          <string-name>
            <surname>Hendler</surname>
            , and
            <given-names>Ora</given-names>
          </string-name>
          <string-name>
            <surname>Lassila</surname>
          </string-name>
          .
          <article-title>The Semantic Web. A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities</article-title>
          .
          <source>Scientific American</source>
          <volume>284</volume>
          .5 (
          <year>2001</year>
          ):
          <fpage>3</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Broering</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          et al.:
          <article-title>Enabling IoT ecosystems through platform interoperability</article-title>
          .
          <source>IEEE software 34.1</source>
          ,
          <fpage>54</fpage>
          -
          <lpage>61</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Chui</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          et al.:
          <article-title>Applying artificial intelligence for social good</article-title>
          .
          <source>Discussion Paper. November</source>
          <year>2018</year>
          . https://www.mckinsey.
          <article-title>com/featured-insights/artificial-intelligence/applying-artificialintelligence-for-social-good</article-title>
          .
          <source>Last accessed 10 May 2019</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>A Coefficient of Agreement for Nominal Scales</article-title>
          .
          <source>Educational and Psychological Measurement</source>
          .
          <volume>20</volume>
          ,
          <issue>1</issue>
          ,
          <fpage>37</fpage>
          -
          <lpage>46</lpage>
          (
          <year>1960</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Dietze</surname>
          </string-name>
          , H. et al.:
          <article-title>TermGenie: A web-application for pattern-based ontology class generation</article-title>
          .
          <source>J Biomed Semantics. 5</source>
          , (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Dumitrache</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          et al.
          <article-title>Empirical methodology for crowdsourcing ground truth</article-title>
          .
          <source>Semantic Web</source>
          <volume>1</volume>
          , (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>Egana</given-names>
            <surname>Aranguren</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          et al.:
          <article-title>Transforming the Axiomisation of Ontologies: The Ontology Pre-Processor Language</article-title>
          .
          <source>Nature Procedings</source>
          . (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Fleiss</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          :
          <article-title>Measuring nominal scale agreement among many raters</article-title>
          .
          <source>Psychological Bulletin</source>
          .
          <volume>76</volume>
          ,
          <issue>5</issue>
          ,
          <fpage>378</fpage>
          -
          <lpage>382</lpage>
          (
          <year>1971</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Forssell</surname>
          </string-name>
          , H. et al.:
          <article-title>Reasonable Macros for Ontology Construction and Maintenance</article-title>
          .
          <source>In: CEUR Workshop Proceedings</source>
          . Vol.
          <year>1879</year>
          . Technical University of Aachen (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Gamer</surname>
            <given-names>M.</given-names>
          </string-name>
          et al:
          <article-title>irr: Various Coefficients of Interrater Reliability and Agreement</article-title>
          .
          <source>R package version 0</source>
          .84.1. https://CRAN.R-project.org/package=irr (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Gangemi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          et al.:
          <article-title>A Pattern-based Ontology for the Internet of Things</article-title>
          . In: WOP@ISWC. (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Ganzha</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          et al.:
          <article-title>Semantic interoperability in the Internet of Things: An overview from the INTER-IoT perspective</article-title>
          .
          <source>Journal of Network and Computer Applications</source>
          .
          <volume>81</volume>
          ,
          <fpage>111</fpage>
          -
          <lpage>124</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Gonzalez-Usach</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          et al.:
          <article-title>Interoperability in IoT</article-title>
          .
          <source>In Handbook of Research on Big Data and the IoT. IGI Global</source>
          ,
          <article-title>(</article-title>
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Gil</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          et al.:
          <article-title>A Controlled Crowdsourcing Approach for Practical Ontology Extensions and Metadata Annotations</article-title>
          . In: d'Amato,
          <string-name>
            <surname>C.</surname>
          </string-name>
          et al. (eds.) The Semantic Web - ISWC
          <year>2017</year>
          . pp.
          <fpage>231</fpage>
          -246 Springer International Publishing,
          <string-name>
            <surname>Cham</surname>
          </string-name>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Janowicz</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          et al.:
          <article-title>SOSA: A lightweight ontology for sensors, observations, samples, and actuators</article-title>
          .
          <source>Journal of Web Semantics</source>
          . (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Jayaraman</surname>
            ,
            <given-names>P. P.</given-names>
          </string-name>
          et al.:
          <article-title>The Schema Editor of OpenIoT for Semantic Sensor Networks</article-title>
          .
          <source>In: Joint Proceedings of the 1st Joint International Workshop on Semantic Sensor Networks and Terra Cognita (SSN-TC 2015) and the 4th International Workshop on Ordering and Reasoning (OrdRing</source>
          <year>2015</year>
          ). Bethlehem, Pennsylvania,
          <string-name>
            <surname>US</surname>
          </string-name>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Jupp</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          et al.:
          <article-title>Webulous and the Webulous Google Add-On-a web service and application for ontology building from templates</article-title>
          .
          <source>J Biomed Semantics. 7</source>
          ,
          <issue>17</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Karlsson</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          et al.:
          <article-title>Semantic Krippendorff's for measuring inter-rater agreement in SNOMED CT coding studies</article-title>
          .
          <source>In MIE</source>
          , (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Khan</surname>
          </string-name>
          , M. T.:
          <article-title>Involving domain experts in ontology construction: A template based approach</article-title>
          . In: Extended Semantic Web Conference. Springer, Berlin, Heidelberg, (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Kottner</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          et al.:
          <article-title>Guidelines for Reporting Reliability and Agreement Studies (GRRAS) were proposed</article-title>
          .
          <source>Journal of Clinical Epidemiology</source>
          .
          <volume>64</volume>
          ,
          <issue>1</issue>
          ,
          <fpage>96</fpage>
          -
          <lpage>106</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Landis</surname>
            ,
            <given-names>J.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koch</surname>
            ,
            <given-names>G.G.</given-names>
          </string-name>
          :
          <article-title>The Measurement of Observer Agreement for Categorical Data</article-title>
          .
          <source>Biometrics</source>
          .
          <volume>33</volume>
          ,
          <issue>1</issue>
          ,
          <issue>159</issue>
          (
          <year>1977</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <given-names>O</given-names>
            <surname>'Connor</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.J.</surname>
          </string-name>
          et al.:
          <article-title>Mapping Master: A Flexible Approach for Mapping Spreadsheets to OWL</article-title>
          . In:
          <string-name>
            <surname>Patel-Schneider</surname>
            ,
            <given-names>P.F.</given-names>
          </string-name>
          et al. (eds.)
          <source>Proceedings of the 9th International Semantic Web Conference on The Semantic Web - Volume Part II</source>
          . pp.
          <volume>194</volume>
          ?208 Springer Berlin Heidelberg, Berlin, Heidelberg (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Park</surname>
          </string-name>
          , J.:
          <source>Information systems interoperability: What lies beneath? ACM Transactions on Information Systems</source>
          .
          <volume>22</volume>
          (
          <issue>4</issue>
          ),
          <fpage>595</fpage>
          -
          <lpage>632</lpage>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Soldatos</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          et al.:
          <article-title>OpenIoT: Open source Internet-of-Things in the cloud</article-title>
          .
          <article-title>In Interoperability and open-source solutions for the Internet</article-title>
          of Things Springer, Cham,
          <fpage>13</fpage>
          -
          <lpage>25</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Stevens</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          et al.:
          <article-title>Measuring expert performance at manually classifying domain entities under upper ontology classes</article-title>
          .
          <source>Journal of Web Semantics</source>
          . (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Vetere</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lenzerini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Models for semantic interoperability in service-oriented architectures</article-title>
          .
          <source>IBM Systems Journal</source>
          <volume>44</volume>
          (
          <issue>4</issue>
          ),
          <fpage>887</fpage>
          -
          <lpage>903</lpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Wickham</surname>
          </string-name>
          , H.:
          <article-title>Reshaping Data with the reshape Package</article-title>
          .
          <source>Journal of Statistical Software</source>
          ,
          <volume>21</volume>
          (
          <issue>12</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>20</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>