<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Bridging the Gap between Semantic Web and Networked Sensors: A Position Paper</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Intelligent Systems Group and Infotech Oulu</institution>
          ,
          <addr-line>FIN-90014</addr-line>
          ,
          <institution>University of Oulu</institution>
          ,
          <country country="FI">Finland</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this position paper, we present our work towards designing a Semantic Web languages-compatible representation for networked sensors. The representation, Entity Notation, is proposed to connect sensors to the Semantic Web. Entity Notation can express RDF and OWL ontology models in a uniform format. Meanwhile, it o ers a lightweight alternative for sensors with limited computation and communication capabilities. We present motivation and design issues of Entity Notation in this paper.</p>
      </abstract>
      <kwd-group>
        <kwd>Entity Notation</kwd>
        <kwd>Semantic Web Representations</kwd>
        <kwd>Embedded Sensing</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Sensors and sensor networks will be everywhere, from environment monitoring to
planet exploration. Increasing maturity of sensors and sensing technology opens
up questions about how to e ectively operate with massive amounts of data
produced by current large scale deployments. What kind of solutions should be
developed on top of these sensor networks to make the best use of all the data?</p>
      <p>Semantic Web approaches this challenge from the opposite direction and
o ers solutions for a heterogeneous, large scale inter-operating networks. When
sensors meet Semantic Web, there will be bene ts for both sides. For the
Semantic Web, sensors and sensor networks bring a whole new peripheral of massive
data from real time measurements. For sensing systems, Semantic Web enables
resource sharing, resource reallocation, and advanced functionality like inference
over events in space and time within a particular context.</p>
      <p>
        One main challenge for connecting networked sensors to the Semantic Web
is that most sensors produce raw data in their own formats. Hence, each sensor
model needs its own solution when sensors are connected to Semantic Web. We
tackle the gap between Semantic Web and networked sensors at the data
interchange level. As RDF [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and ontology [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] models are based on triplet
representations, we design a uniform format, Entity Notation (EN), based on triplets for
expressing RDF models and OWL ontologies. Moreover, we suggest a compact
format for transmission when resources of sensor networks are constrained. The
compact format shortens our representation based on techniques of templates
and pre xes. Entity Notation is unique in its combination of expressiveness and
compactness. This combination makes it ideal for connecting networked sensors
to the Semantic Web.
      </p>
      <p>We start this paper by introducing related data representations in section 2.
Then, we present our requirements and design details in section 3. Finally, we
discuss our future work in section 4.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Numerous data representations have been de ned in di erent sensing systems,
like Generic Sensor Format [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], Unisens [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and Uni ed Transportation Sensor
Data Format [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Sensor model speci c data formats introduce dependencies into
the system, and even set up a barrier to web applications. Binary XML emerges
as a new trend to support convenient solutions for transferring raw data to XML
representation. There are many competitive binary XML formats, like X.694 [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ],
E cient XML Interchange Format (EXI) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], and Fast Infoset [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Data in these
formats can be transformed into XML straightforwardly, and some of them can
achieve good compression ratio. However, XML representation has no embedded
semantics, and the gap between syntax and semantics still exists.
      </p>
      <p>
        On the other hand, there are some compact serializations for representing
knowledge, like N3 [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and Turtle [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], which can express RDF in a
straightforward manner. They are exible languages with strong expressive capability. But
unfortunately, they normally contain shorthand syntaxes to shorten the les.
This is not a suitable solution for resource-constrained sensors, because they do
not have enough processing power to process these shorthand syntaxes.
      </p>
      <p>
        Another e ort to increase interoperability is to annotate sensor data with
semantic metadata. Metadata languages, like RDFa, are utilized to provide context
information for raw data [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. However, as the expressivity of annotation formats
is limited, they cannot capture the full semantics of sensor data. In a nutshell,
more work is needed in developing representation that can interoperate Semantic
Web knowledge with sensing data straightforwardly and unambiguously.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Designing Entity Notation</title>
      <p>Our ultimate goal is to design a communication-convenient serialization for
Semantic Web languages. This serialization will ease the transferring of knowledge
models and advance interoperability between Semantic Web and networked
sensors.</p>
      <p>We examine W3C Semantic Web standards, and consider RDF and OWL
ontology as essential models for realizing knowledge interoperability. Intelligent
Semantic Web applications normally build on data represented by RDF and
knowledge based on ontologies. Then inference tools can be deployed on this
knowledge base to enable intelligent functions. At current stage, EN can serialize
RDF models and OWL DL ontology in a uniform format, and we will extend
this format for other standard languages.</p>
      <sec id="sec-3-1">
        <title>Design Requirements</title>
        <p>We have speci ed the following main design requirements for EN. First, EN must
be expressive enough for representing RDF and OWL DL ontology models in a
uniform format. It should be possible to transform any description of RDF and
OWL DL knowledge into EN packets, and vice versa. Ful lling this goal enables
transforming RDF/XML and OWL into EN packets. This is essential, as XML
is one of the most widely used serialization, and OWL utilizes XML syntax too.
Many available reasoning tools are based on RDF/XML and OWL. Second, EN
must support distributed knowledge production. Any snippet of RDF and
ontology knowledge can be de ned by any device and Semantic Web applications,
and these snippets can be discovered, transferred and integrated into larger
ontologies. This will be a unique feature that to the bests of our knowledge has
not been reported before. Third, EN must be suitable for bandwidth limited
and ultra-low power communication links of sensor networks. For example, an
unsophisticated embedded sensor should be able to compose EN packets using
minimal computation power and deliver packets to Semantic Web applications
in an energy e cient manner. Fourth, EN must be able to carry any kind of
information between sensors and Semantic Web applications. Fifth, the
information carried in EN packets needs to be identi ed in a unique fashion. This is
an important feature for an Internet scale system when all the possible usages
of the information cannot be speci ed in advance.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>General EN Packet Formats</title>
        <p>The basis of RDF models is a sequence of triplets (Subject, Predicate, Object ),
and ontologies also contain simple de nitions and triples, which describe
relations among concepts. Complex relations are built on simple de nitions and
triplets. Besides, all information is identi ed in a unique fashion by URIs, except
literals.</p>
        <p>
          To ful ll the expressibility and lightweight requirements, we de ne two EN
formats: the complete format, and the short format. Complete format has enough
expressibility, as it follows the triple notation. Moreover, we adopt almost all
terms from OWL in the complete format. It's straightforward to serialize RDF
and ontologies into complete EN packets and vice versa. Lightweight short
packets can support resource-constrained devices and slow communication links.
Furthermore, EN supports composition and decomposition of knowledge, and thus
enables incremental knowledge de nition and communication. We de ne for RDF
and ontology knowledge a uniform EN format that does not constrain the type
of information. Finally, EN uses UUIDs (Universally Unique Identi ers) [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] and
URIs for unique identi cation.
        </p>
        <p>Generally, an EN packet depicts an entity and its relationships with values
and other entities. An entity is some identi able whole, concrete or abstract.
Entities in the RDF level knowledge tend to be concrete, like sensors, robots and
measurements. Entities in the ontology level are abstract concepts and relations
in the ontology. An entity description is of the form:
EntityType EntityID
PropertyName PropertyValue
...</p>
        <p>PropertyName PropertyValue</p>
        <p>The subject can be same for multiple triplets, and this happens for most
entities in actual knowledge models. Hence, we identify one entity in an entity
description, and then any number of triplets can be de ned about it. Besides,
we include type information for EntityID, because this is important information
to identify its subsumption.</p>
        <p>Square brackets and angle brackets are utilized to identify the level of
knowledge an EN packet should mapped to. When an entity description is wrapped in
square brackets ([ and ]), this EN packet should be transferred to RDF models.
When an entity description is wrapped in angle brackets (&lt; and &gt;), this EN
packet should be transferred to OWL ontologies.</p>
        <p>When an EN packet is mapped to RDF statements, EntityID maps to the
subject of a triplet, PropertyNames map to predicates, PropertyValues map
to Objects, and EntityType maps to a class this entity is subsumed. When it
should be mapped to ontology snippets, EntityID maps to an element name,
EntityType indicates the type of the element, and PropertyNames are mapped
to relationships or EntityCharacteristics. Relationships between entities, like
rdfs:subClassOf, and rdfs:domain, can be mapped to OWL constructors.
EntityCharacteristics can be mapped to the term rdf:type, while CharacteristicsName
can be mapped directly to OWL types.</p>
        <p>For resource-constrained sensors, complete EN packets are still verbose as
full URIs are used. We do not utilize complex compression algorithms, because
this would introduce additional resource-consumption for sensors. Instead, we
suggest a short packet format for decreasing the need for computations and
communication bandwidth.</p>
        <p>The short EN format uses templates and pre xes to shorten packets. The
basic idea is that a template contains a description of the constant part of a
sequence of EN packets and placeholders for the variable items. The packet sent
over the communication links needs to contain only a template identi er and the
variable items. A sequence of complete EN packets can then be assembled by
replacing the template's placeholders with the values contained in a short packet.
Pre xes are used to shorten URI references. The receiver, like a gateway, needs to
possess the corresponding template in order to assemble complete packets but a
resource-constrained sensor needs only the template identi er. Pre xes are used
to shorten URI references. Similar to complete packets, square or angle brackets
are utilized to wrap following descriptions depending on the level of knowledge:
UUID PropertyValue ... PropertyValue</p>
        <p>UUID is an identi er that is guaranteed to be unique across space and time.
In its canonical form, a UUID is 128 bits long. We utilize UUIDs to identify
templates. When a short packet format includes EntityIDs the format is:
UUID
EntityID PropertyValue ... PropertyValue</p>
        <p>......</p>
        <p>EntityID PropertyValue ... PropertyValue
3.3</p>
        <p>The Role of EN in Connecting Semantic Web and Networked
Sensors
Figure 1 presents the role of EN in bridging the gap of Semantic Web and
networked sensors. Embedded sensors can compose complete or short EN packets
depending on their capabilities, but they all can transformed into Semantic Web
representation at gateways. Knowledge base and Semantic Web applications can
access sensor produced data and knowledge straightforwardly. Moreover, some
devices in sensor networks can actuate commands what Semantic Web
applications produce.
In this short paper, we presented motivation of designing a novel representation
for connecting networked sensors to Semantic Web. Then, we introduced design
requirements, general formats of EN, and EN's role in connecting networked
sensors to Semantic Web. We have implemented EN in a simulator, wearable
sensors, and embedded GPS sensors of Nokia N95 cell phones. Implementation
details are not included in this paper.</p>
        <p>EN is unique in its combination of expressiveness and compactness.
Utilization of UUIDs and URIs enables EN a scalable solution for Semantic Web. It is
recommended to adopt full 128 bits UUIDs for web-scale system. The shorter
alternatives, like 16 bit, 32 bit UUIDs, guarantee uniqueness in smaller scale
systems. UUIDs o er a convenient solution for mapping shorter UUIDs to full
ones, which facilities the integration of small scale systems to large, even Internet
scale systems.</p>
        <p>The current EN version does not support nesting although nesting is widely
supported by data representations and it compresses descriptions. Instead of
nesting we divide descriptions into several small EN packets and use URI
references and blank node identi ers to determine relations between them. Small
packets facilitate optimizing the communication. For example, packets can have
di erent routing paths or they can be sent over a longer period of time. We will
consider nesting in near future; we will analyze the computational load it sets
for constrained devices and decide based on this analysis whether EN should
support nesting.</p>
        <p>This is a early work, and we will continue it with several aspects. We will
specify formal syntax and semantics for our serializtion, and perform more
experiments based on standard dataset to evaluate it. Moreover, we will study
how to represent SWRL and RIF rules in EN. This is another important step
for building a uniform format for transferring representations of Semantic Web.
Finally, we will explore possibilities of applying EN and Semantic Web
technologies in sensor networks, especially how to infer knowledge from sensory data in
ubiquitous environments.</p>
        <p>Acknowledgments. This work was funded by Infotech Oulu and the MOTIVE
research project program of the Academy of Finland.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>RDF</given-names>
            <surname>Primer</surname>
          </string-name>
          , http://www.w3.org/TR/rdf-primer/
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Studer</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Benjamins</surname>
            ,
            <given-names>V.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fensel</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Knowledge Engineering:Principles and methods</article-title>
          .
          <source>Data Knowledge Engineering</source>
          .
          <volume>25</volume>
          ,
          <issue>161</issue>
          {
          <fpage>197</fpage>
          (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Generic</given-names>
            <surname>Sensor</surname>
          </string-name>
          <article-title>Fromat (GSF)</article-title>
          , http://www.saic.com/maritime/gsf/
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>4. Unisens, http://www.unisens.org/index.php</mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Kwon</surname>
            ,
            <given-names>T.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dhruv</surname>
          </string-name>
          , N.:
          <article-title>Uni ed Transportation Sensor Data Format (UTSDF) for E cient Archiving and Sharing of Statewide Transportation Sensor Data</article-title>
          .
          <source>In: The Transportation Research Board 83rd Annual Meeting</source>
          , Washington D.C. (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>International</given-names>
            <surname>Telecommunication Union</surname>
          </string-name>
          ,
          <string-name>
            <surname>X.</surname>
          </string-name>
          <year>694</year>
          , http://www.itu.int/ITU-T/ studygroups/com17/languages/X694.pdf
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>W3C</given-names>
            <surname>Candidate Recommendation</surname>
          </string-name>
          ,
          <article-title>E cient XML Interchange (EXI) Format 1</article-title>
          .0, http://www.w3.org/TR/exi/
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>Fast</given-names>
            <surname>Infoset</surname>
          </string-name>
          <article-title>Binary XML for</article-title>
          .Net, http://www.noemax.com/products/ fastinfoset/index.html
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Notation</surname>
          </string-name>
          <article-title>3 (N3) A readable RDF syntax</article-title>
          , http://www.w3.org/DesignIssues/ Notation3
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Turtle - Terse RDF Triple Language</surname>
          </string-name>
          , http://www.w3.org/TeamSubmission/ turtle/
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Sheth</surname>
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Henson</surname>
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sahoo</surname>
            <given-names>S.S.</given-names>
          </string-name>
          :
          <article-title>Semantic Sensor Web</article-title>
          .
          <source>IEEE Internet Computing. 1089-7801/08</source>
          , 78{
          <fpage>83</fpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>A</given-names>
            <surname>Universally Unique</surname>
          </string-name>
          <article-title>IDenti er (UUID) URN Namespace</article-title>
          , http://www.ietf.org/ rfc/rfc4122.txt
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>