<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Determining Information Usefulness in the Semantic Web: A Distributed Cognition Approach</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Santtu Toivonen</string-name>
          <email>santtu.toivonen@vtt.fi</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tapio Pitka¨ranta</string-name>
          <email>tapio.pitkaranta@vtt.fi</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oriana Riva</string-name>
          <email>oriana.riva@hiit.fi</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Helsinki Institute for Information Technology P.</institution>
          <addr-line>O.Box 9800, FIN-02015 HUT</addr-line>
          ,
          <country country="FI">FINLAND</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>VTT Information Technology P.</institution>
          <addr-line>O.Box 1203, FIN-02044 VTT</addr-line>
          ,
          <country country="FI">FINLAND</country>
        </aff>
      </contrib-group>
      <fpage>113</fpage>
      <lpage>117</lpage>
      <abstract>
        <p>Determining the usefulness of domain-specific information in the Semantic Web is a critical operational precondition that must be addressed in order to realize the Semantic Web's potential. We approach the problem through the notion of distributed cognition, which emphasizes the inclusion of external elements in agents' thinking processes. We concentrate on multi-agent scenarios of distributing cognition, meaning that a single externalized piece of distributed cognition can be internalized and utilized by multiple agents. We decompose the problem of determining information usefulness into the problems of understanding the information and subsequently determining its relevance.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Physical space</title>
    </sec>
    <sec id="sec-2">
      <title>Virtual space</title>
      <p>Semantic
Note</p>
      <p>SA
distributed. In the physical world, anything conceivable to a thinking creature can be
used for distributing cognition. In the Semantic Web, instead, the distribution media
are more restricted, as Figure 1 depicts. Human agents (HA) can distribute their
cognition to calculators, notebooks, tools, and so on, but software agents (SA) only to media
accessible from the virtual space they reside in.</p>
      <p>In principle also software agents could use physical structures for distributing
cognition, for example by printing on paper, as depicted by the narrow arrow in Figure 1,
but a more typical scenario is that software agents distribute their cognition in a digital
form. We use the term Semantic Note to refer to these kinds of entities. A Semantic Note
stores and transmits some meaningful piece of information, such as a definition of some
complex concept or instructions for completing a procedure. The domain of
information stored in Semantic Notes is unrestricted, meaning that a Semantic Note can contain
a definition of a complex concept from any area. That is why Semantic Notes are
defined functionally as being representations of one or more entities potentially of use in
carrying out a domain-specific task. In the following sections we limit the definitions to
cover only the Semantic Note, since it is the atomary unit of distributing cognition in
the Semantic Web, and hence enough for our purposes. However, the definitions could
be applied to other information content, too.
2</p>
      <p>Determining the Usefulness of a Semantic Note
A Semantic Note can be decomposed into its constituents, namely statements.
Statements are opinions about states-of-affairs, such as The web site ’http://www.vtt.fi/tte/
proj/dynamos’ is created by Santtu Toivonen. The terms in a statement can be organized
in the subject-predicate-object model of RDF, and conform to concepts in an ontology.
This kind of machine-accessibility is especially important for software agents. Using
RDF, the above statement could be defined as follows:
&lt;rdf:Description rdf:about="http://www.vtt.fi/tte/proj/dynamos/"&gt;
&lt;dc:creator&gt;Santtu Toivonen&lt;/dc:creator&gt;
&lt;/rdf:Description&gt;</p>
      <p>Of the above RDF excerpt’s terms, only the predicate (dc:creator) explicitly
refers to an ontology, namely that of the Dublin Core metadata elements [5].
Combining the notion of statements and the approach adopted in [6], an agent can be said to
understand a statement found in a Semantic Note as follows:
Definition 1. An agent (a) understands a statement (s), iff all the terms (t) constituting it conform
to concepts (φ) found in an ontology (o), which is accessible to a:</p>
      <p>understands(a, s) ↔ ∀t : (t ∈ s → ∃φ : (conforms(t, φ) ∧ φ ∈ o ∧ access(a, o))).</p>
      <p>We assume that one statement is either understood or not understood by an agent.
In principle a more specific definition could be given based on the understanding of the
terms constituting the statement. However, for our purposes a statement is on a more
appropriate level of granularity. By applying a function und we assign the statements
values, denoted by su, as follows:
und(s) = su =
1 if all terms (t ∈ s) are understood
0 otherwise
nu represents the agent’s level of understanding of the Semantic Note (n). Let Sn
be the set of statements included in n so that s1, s2, ..., sk ∈ n, where k = |Sn|.
nu receives values between 0 and 1 based on the number of understood statements
(su1, su2, ..., suk ∈ n) divided by the number of all statements in the Semantic Note
(|Sn|) as follows:
0 ≤nu =
nu = 0</p>
      <p>1 ∗
|Sn|
|Sn|
i=1
sui ≤ 1</p>
      <p>
        Sn 6= ∅
Sn = ∅
(
        <xref ref-type="bibr" rid="ref1">1</xref>
        )
(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )
      </p>
      <p>Following [6], we assume that for an agent to understand a Semantic Note that
another agent has created or modified, the statements in it conform to an ontology known
by both agents. Based on that, we give the following definition for agents to share
knowledge via Semantic Notes:
Definition 2. A necessary condition for an agent a1 to share knowledge via a Semantic Note (n)
with agent a2 is that n conforms to a set of ontologies (O), which is a disjunction of the ontologies
accessible to a1 (O1) and a2 (O2):</p>
      <p>shares(a1, a2, n) → (understands(a1, n) ∧ understands(a2, n)).</p>
      <p>This entails that the set of ontologies (O1,2) has to be accessible to both a1 and
a2. Notes can also be partially shared between agents. Consider a simple case with
two agents (a1 and a2) and two partly overlapping ontologies (o1 and o2) so that
access(a1, o1) and access(a2, o2). Suppose that a1 has created a Semantic Note (n)
which contains two statements (si and sii). All the terms (ta, tb, and tc) of si conform
to respective concepts (φa, φb, φc) ∈ (o1 ∩ o2), and can therefore be shared between a1
and a2. sii, in contrast, has the terms ta, tb, and td, of which td conforms to a concept
φd ∈/ o2. Because of this, sii is not shared between the agents. Based on the number of
mutually understood statements, we can therefore conclude that 50% of n is shared.</p>
      <p>We define a new variable nrel for indicating the level of relevance the information
carried by a Semantic Note has. A rule-based approach is adopted for determining the
information relevance. The information content, of which the relevance is to be
determined, is connected with user context via general preference rules specified by the
user. The user context describes some essential details about the user’s current
situation, for example her location and current activity. Both the information content (i.e.,
the Semantic Notes) and the user context are realized as sets of statements.
Definition 3. If there exists a term (tctx) in a statement found in the user context, as well as a
term (tn) in a statement found in the Semantic Note so that both of those conform to respective
concepts (φctx,n) which are navigable from the concepts (φr1,r2) found in the rule (r), the rule
is said to be applicable (ra):</p>
      <p>∃tctx : conforms(tctx, φctx) ∧ ∃tn :
conforms(tn, φn) ∧ navigable(φr1, φctx) ∧ navigable(φr2, φn) → ra
where navigable(x,y) means that there exists a network of concepts and relationships,
realized as one ontology or several connected ontologies, that enables navigating
between x and y. A positive match indicates that an applicable rule is found, as well as
suitable values to satisfy it. Negative match means that there exists an applicable rule,
but that the statements plugged in it do not have suitable values. In order to assign
relevance values for the Semantic Notes utilizing the applicable rules, we define the
following abstract function:
nuse = a ∗ nu + b ∗ nrel</p>
      <sec id="sec-2-1">
        <title>1 positive match 0 negative match</title>
        <p>The function app is realized as various concrete rules, that determine the relevance
assignment (rm, where m comes from “match”). The applicable rules (ra) as well as
the match value (rm) are utilized in the relevance equation for Semantic Notes. Let Ra
be the set of applicable rules so that ra1, ra2, ..., rak, where k = |Ra|. The Semantic
Note relevance (nrel) can receive values between 0 and 1 as the ratio between the sum
of the match values (rm1, rm2, ..., rmk) and the number of applicable rules (|Ra|):
0 ≤nrel =
nrel = 0</p>
        <p>1 ∗
|Ra|
|Ra|
i=1
rmi ≤ 1</p>
        <p>Ra 6= ∅
Ra = ∅</p>
        <p>
          We define the usefulness of a Semantic Note for an agent to consist of both
understanding the note and considering it relevant. The information usefulness variable
(nuse) also receives values between 0 and 1, and is formalized as follows:
(
          <xref ref-type="bibr" rid="ref3">3</xref>
          )
(
          <xref ref-type="bibr" rid="ref4">4</xref>
          )
(
          <xref ref-type="bibr" rid="ref5">5</xref>
          )
where 0 ≤ a + b ≤ 1 and a, b ∈ R+. Parameters a and b in Equation 5 indicate the
weights that are assigned to the understanding (nu) and relevance (nrel), respectively.
The emphasis on these weight parameters depends on the application.
3
        </p>
        <p>Conclusions and Future Work
We described an approach for determining information usefulness in the Semantic Web
from a single agent’s point of view. Information usefulness is formed based on the levels
of understanding and context-dependent relevance of the information. We introduced a
notion of Semantic Note to refer to the meaningful unit of information for an agent
acting in the Semantic Web. Determining information usefulness forms a part of a broader
approach, namely applying the theory of distributed cognition in the Semantic Web.
Since the Semantic Web is an environment for software agents in addition to humans to
operate, both were considered as “cognition distributors”.</p>
        <p>Among our future work is to consider various context-aware filters with our model.
In addition to the most typical context attributes, namely location and time, activities
and user interests associated with them could be taken into account when evaluating the
relevance of content. Other future work includes developing a more refined
classification of content creators—ranging from individual users to commercial parties, public
administration, and virtual communities—and considering their impact in the
information usefulness determination. In our current implementation, developed in terms of the
DYNAMOS project3, we have support only for dividing between service providers and
individual users, but we plan to extend this. We will also pay more attention to the
interrelationships and relative importances of various statement kinds in Semantic Notes,
as well as to the rules that connect the Semantic Notes with users’ current contexts.
Acknowledgements. The work reported in this paper was conducted in terms of the
DYNAMOS project, funded by Tekes4, TeliaSonera, Suunto, ICT Turku, and VTT.</p>
      </sec>
      <sec id="sec-2-2">
        <title>3 Dynamic Composition and Sharing</title>
        <p>http://www.vtt.fi/tte/proj/dynamos/
4 National Technology Agency of Finland</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>T.</given-names>
            <surname>Berners-Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hendler</surname>
          </string-name>
          , and
          <string-name>
            <surname>O. Lassila.</surname>
          </string-name>
          <article-title>The semantic web</article-title>
          .
          <source>Scientific American</source>
          ,
          <volume>284</volume>
          (
          <issue>5</issue>
          ):
          <fpage>34</fpage>
          -
          <lpage>43</lpage>
          , May
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>J.</given-names>
            <surname>Hendler</surname>
          </string-name>
          .
          <article-title>Agents and the semantic web</article-title>
          .
          <source>IEEE Intelligent Systems</source>
          ,
          <volume>16</volume>
          (
          <issue>2</issue>
          ):
          <fpage>30</fpage>
          -
          <lpage>37</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>J.</given-names>
            <surname>Hollan</surname>
          </string-name>
          , E. Hutchins, and
          <string-name>
            <given-names>D.</given-names>
            <surname>Kirsh</surname>
          </string-name>
          .
          <article-title>Distributed cognition: Toward a new foundation for human-computer interaction research</article-title>
          .
          <source>ACM Transactions on Computer-Human Interaction</source>
          ,
          <volume>7</volume>
          (
          <issue>2</issue>
          ):
          <fpage>174</fpage>
          -
          <lpage>196</lpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>E. Hutchins.</surname>
          </string-name>
          <article-title>Cognition in the Wild</article-title>
          . MIT Press, Cambridge, MA,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. Dublin Core Metadata Initiative.
          <source>Dublin Core Metadata Element Set, Version 1.1</source>
          ,
          <year>1999</year>
          . Specification identifier: http://dublincore.org/documents/2004/12/20/dces/.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>A.</given-names>
            <surname>Williams</surname>
          </string-name>
          and
          <string-name>
            <given-names>Z.</given-names>
            <surname>Ren</surname>
          </string-name>
          .
          <article-title>Agents teaching agents to share meaning</article-title>
          .
          <source>In AGENTS '01: Proceedings of the fifth international conference on Autonomous agents</source>
          , pages
          <fpage>465</fpage>
          -
          <lpage>472</lpage>
          , Montreal, Quebec, Canada,
          <year>2001</year>
          . ACM Press.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>