Ontology-Based Introspection in Support of Stream Reasoning

Daniel de Leng and Fredrik Heintz
Department of Computer and Information Science
Linköping University, 581 83 Linköping, Sweden
{daniel.de.leng, fredrik.heintz}@liu.se

Abstract

Building complex systems such as autonomous robots usually requires the integration of a wide variety of components, including high-level reasoning functionalities. One important challenge is integrating the information in a system by setting up the data flow between the components. This paper extends our earlier work on semantic matching with support for adaptive on-demand semantic information integration based on ontology-based introspection. We take two important standpoints. First, we consider streams of information, to handle the fact that information often becomes continually and incrementally available. Second, we explicitly represent the semantics of the components and the information that can be provided by them in an ontology. Based on the ontology, our custom-made stream configuration planner automatically sets up the stream processing needed to generate the streams of information requested. Furthermore, subscribers are notified when the properties of a stream change, which allows them to adapt accordingly. Since the ontology represents both the system's information about the world and its internal stream processing, many other powerful forms of introspection are also made possible. The proposed semantic matching functionality is part of the DyKnow stream reasoning framework and has been integrated in the Robot Operating System (ROS).

1 Introduction

Building complex systems such as autonomous robots usually requires the integration of a wide variety of components, including high-level reasoning functionalities. This integration is usually done ad hoc for each particular system. A large part of the integration effort is to make sure that each component has the information it needs, in the form it needs it, when it needs it, by setting up the data flow between components. Since most of this information becomes incrementally available at run-time, it is natural to model the flow of information as a set of streams. As the number of sensors and other sources of streams increases, there is a growing need for incremental reasoning over streams to draw relevant conclusions and react to new situations with minimal delays. We call such reasoning stream reasoning. Reasoning over incrementally available information is needed to support important functionalities such as situation awareness, execution monitoring, and planning.
When handling a large number of streams, it can be difficult to keep track of the semantics of individual streams and how they relate. The same information query further requires different configurations for different systems. Such a manual task leaves ample room for programmer errors, such as misspelled stream names, incorrect stream configurations, and misunderstood stream content semantics. Furthermore, if the indefinite continuation of a stream cannot be guaranteed, manual reconfiguration may be necessary at run-time, further increasing the risk of errors.

In this paper we extend earlier work on semantic matching (Heintz and de Leng 2013), where we introduced support for generating indirectly-available streams based on features. The extension focuses on ontology-based introspection for supporting adaptive on-demand semantic information integration. The basis for our approach is an ontology which represents the relevant concepts in the application domain, the stream processing capabilities of the system, and the information currently generated by the system in terms of the application-dependent concepts. Relevant concepts are for example the objects, sorts and features which the system wants to reason about. Semantic matching uses the ontology to compute a specification of the stream processing needed to generate the requested streams of information. It is for example possible to request the speed of a particular object, which requires generating a stream of GPS coordinates of that object which are then filtered in order to generate a stream containing the estimated speed of the object. Figure 1 shows an overview of the approach. The semantic matching is done by the Semantics Manager (Sec. 4) and the stream processing is done by the Stream Processing Engine (Sec. 3).

[Figure 1: High-level overview of our approach.]

Semantic matching allows for the automatic generation of indirectly-available streams, the handling of cases where there exist multiple applicable streams, support for coping with the loss of a stream, and introspection of the set of available and potential streams. We have for example used semantic matching to support metric temporal logic (MTL) reasoning (Koymans 1990) over streams for collaborative unmanned aircraft missions. Our work also extends the stream processing capabilities of our framework. In particular, this includes ontology-based introspection to support domain-specific reasoning at multiple levels of abstraction.

The proposed semantic matching functionality is integrated with the DyKnow stream reasoning framework (Heintz and Doherty 2004; Heintz 2009; Heintz, Kvarnström, and Doherty 2010; Heintz 2013), which provides functionality for processing streams of information and has been integrated in the Robot Operating System (ROS) (Quigley et al. 2009). DyKnow is related to both Data Stream Management Systems and Complex Event Processing (Cugola and Margara 2012).
The approach is general and can be used with other stream processing systems.

The remainder of this paper is organized as follows. Section 2 starts off by putting the presented ideas in the context of similar and related efforts. In Section 3, we give an introduction to the underlying stream processing framework. This is a prelude to Section 4, which describes the details of our approach and highlights functionality of interest made possible as a result of semantic matching. The paper concludes in Section 5 with a discussion of the introduced concepts and future work.

2 Related Work

Our approach is in line with recent work on semantic modeling of sensors (Goodwin and Russomanno 2009; Russomanno, Kothari, and Thomas 2005) and work on semantic annotation of observations for the Semantic Sensor Web (Bröring et al. 2011; Sheth, Henson, and Sahoo 2008; Botts et al. 2008). An interesting approach is a publish/subscribe model for a sensor network based on semantic matching (Bröring et al. 2011). The matching is done by creating an ontology for each sensor based on its characteristics and an ontology for the requested service. If the sensor and service ontologies align, then the sensor provides relevant data for the service. This is a complex approach which requires significant semantic modeling and reasoning to match sensors to services. Our approach is more direct and avoids most of the overhead. Our approach also bears some similarity to the work by Whitehouse, Zhao, and Liu (2006), as both use stream-based reasoning and are inspired by semantic web services. One major difference is that we represent the domain using an ontology while they use a logic-based markup language that supports 'is-a' statements.

In the robotic domain, the discussed problem is sometimes called self-configuration and is closely related to task allocation. The work by Tang and Parker (2005) on ASyMTRe is an example of a system geared towards the automatic self-configuration of robot resources in order to execute a certain task. Similar work was performed by Lundh, Karlsson and Saffiotti (2008) related to the Ecology of Physically Embedded Intelligent Systems (Saffiotti et al. 2008), also called the PEIS-ecology. Lundh et al. developed a formalisation of the configuration problem, where configurations can be regarded as graphs of functionalities (vertices) and channels (edges), and where configurations have a cost measure. This is similar to considering actors and streams, respectively. A functionality is described by its name, preconditions, postconditions, inputs, outputs and cost. Given a high-level goal described as a task, a configuration planner is used to configure a collection of robots towards the execution of the task. Some major differences between the work by Lundh et al. and the work on semantic information integration with DyKnow are that the descriptions of transformations are done semantically with the help of an ontology. Further, DyKnow makes use of streams of incrementally available information rather than the shared tuples used by channels. The configuration planner presented by Lundh et al. assumes full knowledge of the participating agents' capabilities and acts as an authority outside of the individual agents, whereas we assume full autonomy of agents and make no assumptions about the knowledge of agents' capabilities. Configuration planning further shares some similarities with efforts in the area of knowledge-based planning, where the focus is not on the actions to be performed but on the internal knowledge state. In a broader context, the presented ideas are in line with a trend that moves away from the how and towards the what. Content-centric networks (CCN) seek to allow users to simply specify what data resource they are interested in, and let the network handle the localisation and retrieval of that data resource. In the database community, the problem of self-configuration is somewhat similar to the handling of distributed data sources such as ontologies. The local-as-view and global-as-view approaches (Lenzerini 2002) both seek to provide a single interface that performs any necessary query rewriting and optimisation.

The approach presented here extends previous work (Heintz and Dragisic 2012; Heintz and de Leng 2013; Heintz 2013) where the annotation was done in a separate XML-based language. This is a significant improvement since now both the system's information about the world and its internal stream processing are represented in a single ontology. This allows many powerful forms of introspective reasoning, of which semantic matching is one.

3 Stream Processing with DyKnow

Stream processing is the basis for our approach to semantic information integration. It is used for generating streams by, for example, importing, synchronizing and transforming streams. A stream is a named sequence of incrementally available, time-stamped samples, each containing a set of named values. Streams are generated by stream processing engines based on declarative specifications.

3.1 Representing Information Flows

Streams are regarded as fundamental entities in DyKnow. For any given system, we call the set of active streams the stream space S ⊆ S*, where S* is the set of all possible streams, the stream universe. A sample is represented as a tuple ⟨t_a, t_v, ~v⟩, where t_a represents the time at which the sample became available, t_v represents the time for which the sample is valid, and ~v represents a vector of values. A special kind of stream is the constant stream, which only contains one sample. The execution of an information flow processing system is described by a series of stream space transitions S_t0 ⇒ S_t1 ⇒ ··· ⇒ S_tn. Here S_t represents a stream space at time t such that every sample in every stream in S_t has an available time t_a ≤ t.
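To make the notation concrete, consider a purely illustrative GPS position stream; the stream name, timestamps and values below are invented for illustration and are not taken from the original:

    s_gps = ⟨ (t_a = 1.2, t_v = 1.0, ~v = (58.40, 15.57)),
              (t_a = 2.3, t_v = 2.0, ~v = (58.41, 15.58)), ... ⟩

If s_gps is the only active stream, the execution is described by the transitions S_1.2 ⇒ S_2.3 ⇒ ···, where in S_1.2 the stream s_gps contains only its first sample, while in S_2.3 it also contains the second, since every sample in every stream of S_t must have an available time t_a ≤ t.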
Transformations in this context are stream-generating functions that take streams as arguments. They are associated with an identifying label and a specification determining their parameters, which decouples the implementation of transformations from the stream processing functionality. A transformation thus corresponds to the combination of an implementation and parameters for that implementation. This means that for a given implementation there might exist multiple transformations, each using different parameters for the implementation.

When a transformation is instantiated, the instance is called a computational unit. This instantiation is performed by the stream processing engine. A computational unit is associated with a number of input and output streams, and is able to replace its input and output streams at will. A computational unit with zero input streams is called a source. An example of a source is a sensor interface that takes raw sensor data and streams this data. Conversely, computational units with zero output streams are called sinks. An example of a sink is a storage unit, or a unit that is used to control the agent hosting the system, such as an unmanned aerial vehicle (UAV).

DyKnow's stream processing engine, as shown in Figure 1, is responsible for manipulating the stream space based on declarative specifications, and thereby plays a key role as the foundation for the stream reasoning framework.

3.2 Configurations in DyKnow

A configuration represents the state of the stream processing system in terms of computational units and the streams connecting them. The configuration can be changed through the use of declarative stream specifications. An example of a stream specification is shown in Listing 1; it describes a configuration for producing a stream of locations for detected humans.

Listing 1: Configuration specification format

[Only fragments of the 25-line listing survive in this extract: it is an XML document whose root element references the schema http://www.dyknow.eu/config.xsd and the namespace xmlns:spec="http://www.dyknow.eu/ontology#Specification", and whose body consists of nested cu elements.]
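Because the listing itself is only partially legible, the following is a minimal illustrative sketch of what such a specification tree might look like for the toy example described below. Apart from the cu tag, the label attribute, the output stream name result, and the schema and namespace references taken from the surviving fragments, the element and attribute names and the transformation labels here are invented for illustration and should not be read as the paper's actual listing.

    <?xml version="1.0"?>
    <specification xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
                   xsi:noNamespaceSchemaLocation="http://www.dyknow.eu/config.xsd"
                   xmlns:spec="http://www.dyknow.eu/ontology#Specification">
      <!-- Each cu element declares a computational unit; its label names the
           transformation to instantiate, and its child cu elements are its inputs. -->
      <cu label="human_location_estimator" name="result">
        <cu label="gps_to_coordinates">
          <cu label="gps_source"/>        <!-- source: takes no input streams -->
        </cu>
        <cu label="rgb_video_source"/>    <!-- source -->
        <cu label="ir_video_source"/>     <!-- source -->
      </cu>
    </specification>

In such a sketch, only the root computational unit names its output stream explicitly (result); the remaining streams would be given unique internal names by DyKnow, as explained below.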
The shown specification can be executed by the stream processing engine, which instantiates the declared computational units and connects them according to the specification. In the example shown here, we make use of an XML-based specification tree, where the children of every tree node represent the inputs for that computational unit. The cu tag is used to indicate a computational unit, which may be a source taking no input streams. A computational unit produces at most one stream, and this output stream can thus be used as an input stream for other computational units. Indeed, only one computational unit explicitly defines the output stream name as result. When no explicit name is given, DyKnow assigns a unique name for internal bookkeeping. Note that every cu tag has a label associated with it. This label represents the transformation used to instantiate the computational unit, which is then given a unique name by DyKnow as well. As long as a transformation label is associated with an implementation and parameter settings, the stream processing engine is able to use this information to perform the instantiation. In this toy example, the specification tree uses a GPS to infer coordinates, and combines this with RGB and infrared video data to provide the coordinates of some entities detected in the video data. Since DyKnow has been implemented in ROS, currently only Nodelet-based implementations are supported.

The result of the stream declaration is that the stream processing engine instantiates the necessary transformations and automatically assigns the necessary subscriptions for the result stream to be executed. Additionally, it uses its own /status stream to inform subscribers when it instantiates a