<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>SAM: Semantics Aware Instant Messaging for the Networked Semantic Desktop</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Thomas Franz</string-name>
          <email>franz@uni-koblenz.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Steffen Staab</string-name>
          <email>staab@uni-koblenz.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ISWeb, University of Koblenz-Landau</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>While instant messaging (IM) became a mature communication means in business organizations over the last years, IM systems did not follow this evolution comparably. Communicated content is often stored insucffiiently and hard to recall, integration into other desktop applications impossible. In this paper, we address these shortcomings and provide concepts for novel instant messaging. In contrast to prior work such as the Haystack system, which integrates IM data into a personal information management application, we enhance IM based on a ready to integrate ontological meta model that introduces semantics to instant messaging and its content to foster advanced management. In particular, we address networked exchange of semantic meta information to integrate IM into the Networked Semantic Desktop. The Semantics Aware Messenger (SAM) is a prototypical implementation of the concepts presented in this paper.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>The objective of the Semantic Desktop is to improve personal information
management (PIM) by combining all content available on the desktop and relevant
to the user to i) easily manage that content, regardless of which type, and to ii)
simplify utilization of it.</p>
      <p>
        The Networked Semantic Desktop as envisioned in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] describes a networked
infrastructure that combines the Semantic Desktop with Social Networking and
P2P systems to benefit novel applications such as group collaboration.
      </p>
      <p>
        In this paper, we address instant messaging (IM) on the Networked
Semantic Desktop. In contrast to prior work such as the Haystack system [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], which
integrates various desktop sources into a consistent, meta data driven personal
information management application, we enhance IM based on a ready to
integrate ontological meta model that introduces semantics to instant messaging and
its content to foster advanced management. In particular, we address networked
exchange of semantic meta information to integrate IM into the Networked
Semantic Desktop.
      </p>
      <p>
        Today, communicating by instant messaging mainly comprises typing
messages and viewing incoming messages, while most of the time, no further
processing of messages is done or offered so that message content gets lost in plain
text communication logs that are more or less accessible depending on the client
application. Despite poor traceability, recent studies claim that IM usage has
matured and IM is employed for miscellaneous tasks including complex
conversation [
        <xref ref-type="bibr" rid="ref11 ref9">11, 9</xref>
        ]. Accordingly, we consider the content communicated via this
media as of increasing value that should be recallable and integrated into the
Networked Semantic Desktop.
      </p>
      <p>
        The objective of this paper is to tackle traceability shortcomings of IM,
improve management of IM content, and move IM towards the Networked Semantic
Desktop. We are proposing i) an ontological meta model for instant messaging
which ii) supports integration into the Networked Semantic Desktop, and iii)
introduces meta data and semantics for IM to enable iv) sophisticated
reutilization of instant messaging data. Based on the meta model and an v) identification
scheme for IM data including meta information and semantics we vi) enable
networked exchange of such information via IM to vii) ground novel applications as
envisioned in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>In Sect. 2, we sketch a typical IM scenario to indicate shortcomings of current
IM systems (Sect. 3), explain our concepts to overcome these shortcomings (Sect.
4), and illustrate the implementation of these concepts by examples of that
scenario in Sect. 6. We give a detailed overview of the ontological meta model
in Sect. 5, and contrast our work with related work to provide a conclusion in
Sect. 7. In Sect. 8, we suggest future research and give an outlook.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Scenario</title>
      <p>The extracts of chat conversations in this section render a common instant
messaging scenario and indicate different functions and particularities of IM.</p>
      <p>The scenario: Steffen, being the lecturer of the Semantic Web lecture uses
IM to get some quick responses concerning organizational issues from Thomas,
who held the last exercise session for the lecture:
[ 0 9 : 2 9 : 0 6 ] S t e f f e n : how was t h e e x e r c i s e s e s s i o n ?
[ 0 9 : 2 9 : 3 3 ] S t e f f e n : d i d you t e l l them t h e d a t e o f t h e exam ?
[ 0 9 : 3 0 : 3 3 ] Thomas : s o l u t i o n s were ok , p a r t i c i p a t i o n was weak
[ 0 9 : 3 0 : 5 6 ] Thomas : yes , i g u e s s a b o u t 20 w i l l s i g n up f o r i t</p>
      <sec id="sec-2-1">
        <title>Listing 1.1. Exercise Session</title>
        <p>At a later time, Thomas informs Steffen about his work on a paper he is writing
for the Semantic Desktop Workshop.
[ 1 2 : 0 7 : 4 5 ] Thomas : i w i l l put new v e r s i o n s o f t h e p a p e r f o r
t h e sdws a t h t t p : / / i s w e b . p a p e r s . x . y /sam . t e x
[ 1 2 : 0 8 : 1 8 ] S t e f f e n : ok , what i s g o i n g t o change ?
[ 1 2 : 0 8 : 3 5 ] Thomas : d e s c r i b e an IM s c e n a r i o t o i n d i c a t e c u r r e n t
s h o r t c o m i n g s , p r o p o s e improvements , and d e m o n s t r a t e SAM
i n t e r m s o f t h e s c e n a r i o</p>
      </sec>
      <sec id="sec-2-2">
        <title>Listing 1.2. Semantic Desktop Paper After lunch, Thomas contacts Steffen about the scenario he mentioned in Listing 1.2.</title>
        <p>[ 1 3 : 5 5 : 0 4 ] Thomas : any i d e a s f o r a s u i t a b l e s c e n a r i o ?
[ 1 3 : 5 5 : 2 4 ] S t e f f e n : why don ’ t u s e t h i s c o n v e r s a t i o n ?
[ 1 3 : 5 5 : 4 3 ] Thomas : r i g h t ! i t s a s u f f i c i e n t exa mp le o f a
work r e l a t e d c h a t
[ 1 3 : 5 6 : 1 5 ] Thomas : i ’ l l u s e o u r today ’ s e a r l i e r c h a t s
a s w e l l . t h e y n i c e l y i n d i c a t e d i f f e r e n t f u n c t i o n s o f IM</p>
      </sec>
      <sec id="sec-2-3">
        <title>Listing 1.3. Scenario for the Paper</title>
      </sec>
      <sec id="sec-2-4">
        <title>Later on, Steffen talks to Bernhard, a co-worker in project X:</title>
        <p>[ 1 7 : 1 5 : 4 2 ] S t e f f e n : w r t t h e p r o j e c t you might be
i n t e r e s t e d i n what thomas i s c u r r e n t l y d o i n g ; i ’ l l s e n d
you what thomas t o l d me a b o u t t h a t s o f a r
[ 1 7 : 1 6 : 0 3 ] B e r n h a r d : t h a n k s , i ’ l l c o n t a c t him when
i ’ ve r e a d i t</p>
      </sec>
      <sec id="sec-2-5">
        <title>Listing 1.4. Project X Work</title>
        <p>2.1</p>
        <sec id="sec-2-5-1">
          <title>Terminology and Observations</title>
          <p>
            Isaacs et al. [
            <xref ref-type="bibr" rid="ref9">9</xref>
            ] discovered that – in professional environments – IM messages
mostly are work-related (61.8%), followed by scheduling and coordinating ones
(30.8%) and those that resemble simple questions and information (27.8%).1
          </p>
          <p>Based on that terminology, we classify the chat excerpts (Listing 1.1 to 1.4)
as follows: Listing 1.1 is a sample of simple questions and information, while
Listings 1.2, 1.3, 1.4 represent work-related messages. In the given scenario, we
excluded scheduling/coordinating conversations, as they resemble a typical IM
function, but do not contribute much here.
2.2</p>
        </sec>
        <sec id="sec-2-5-2">
          <title>Use Cases</title>
          <p>Due to the fact that most IM conversations are about work, the content of
such conversations needs to be available for later reuse as illustrated by the two
following use cases.</p>
          <p>Use Case 1: About one week after the day when the listed conversations took
place, Steffen wants to check where Thomas stored that lfie on the server, and
what exactly he stated about his current work. As Thomas is not available he
cannot ask him again.</p>
          <p>Use Case 2: In order to track project development, and summarize the
current stage of project X, Steffen wants to compile all project X related content,
including messages that deal with the project.
1 Messages could be classified for more than one category.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Accomplishing the Use Cases Today</title>
      <p>Today’s instant messengers usually store messages in plain text logs and provide
a user interface to view the logs, sometimes ordered by message date or lfitered
by user. More sophisticated messengers may supply an additional search over
the message logs. Accomplishing the use cases with current systems reveals the
following shortcomings:</p>
      <sec id="sec-3-1">
        <title>1. Weak Message Classification:</title>
        <p>
          Finding appropriate messages by browsing the message logs requires high
user effort as the given classicfiations (by user, by date) do not narrow the
search space enough to easily nfid messages: Given that Steffen does not
recall the exact day when Thomas told him about his current work, he has
to read all the messages from several days to find the one he seeks.
2. Keyword search is unsuitable due to missing content semantics and
particularities of chat conversation style:
(a) A term denoting the subject of a message, or significantly distinguishing
a message from others is not necessarily contained in a message so that
creating efficient search strings is delicate. Entering a query that finds
the message Steffen looks for in use case 1 may be difficult as Thomas
did not use keywords like ”store”, ”server”, or ”lfie” that directly relate
to the semantics of his message in Listing 1.2.
(b) Ambiguity of search terms further decreases the average relevance of
search results. If Steffen searches for paper, he may receive messages
that deal with different concepts of paper such as writing paper, abrasive
paper, and research paper while only the latter is relevant for him.
3. Missing Context:
(a) Instant messages are rather short, and informal [
          <xref ref-type="bibr" rid="ref12 ref6 ref7">7, 6, 12</xref>
          ] therefore
become meaningless without context. In Listing 1.1, Thomas said ”the
solutions were ok, participation was weak”. Without the message’s
context it is hard to predict which solutions Thomas points at. Current
IM systems do not provide message context so that identifying relevant
messages is difficult.
(b) Topic switching and interleaving messages are particularities of IM
conversation. Listing 1.1 has interleaving messages, as Thomas’ first message
replies to Steffen’s first message although it appears after Steffen’s
second message. The context of interleaving messages is not based on the
sequence in which they appear in time so that even browsing message
logs ordered by time does not necessarily provide relevant context.
4. Missing Messaging Semantics:
        </p>
        <p>Current IM clients do not identify message properties, e.g. the creation date,
or sender of a message. Consequently, relations between them cannot be
exploited:
(a) Missing messaging semantics inhibit integration into the Networked
Semantic Desktop.
(b) Information exchange is of low value as just meaningless plain text can
be exchanged.
(c) Semantic querying using restrictions on properties is impossible, e.g.
querying for messages within a date range, sent by a certain user et
cetera.
4
4.1</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Improvements by SAM</title>
      <sec id="sec-4-1">
        <title>Message Classification for Message Semantics</title>
        <p>The first shortcoming mentioned in Sect. 3 denotes weak message classicfiations
provided by current IM clients. SAM offers a user-definable taxonomy that is
used to add semantics to messages by annotating them with entries from the
taxonomy. For instance, Steffen might denfie the category work with two
subcategories teaching and projectX. If he annotates any message related to project
X with the corresponding entry in the taxonomy, accomplishing the second use
case is as easy as browsing for all messages annotated with projectX. Message
classicfiation also benetfis search, as queries can restrict search results to be
annotated with certain taxonomy entries. How annotations and the taxonomy are
designed is detailed in Sect. 5, how the user annotates with SAM is explained in
Sect. 6.2.</p>
        <p>The main drawback of message classification is the user effort required to
annotate messages appropriately. This effort is lowered by automatic
annotation exchange between conversation partners as detailed in Sect. 4.3 and 6.4,
however, manual annotation still has to be done by at least one of a
conversation’s participants in order to gain benefits. The user interface of SAM tries to
minimize this effort as much as possible (see Sect. 6.2) and for future work we
propose to integrate automatic message classicfiation based on machine learning
technologies.
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Ontological Meta Model</title>
        <p>We employ a meta model for instant messaging in form of a unified messaging
ontology (cf. Sect. 5) that tackles many of the shortcomings listed in Sect. 3.</p>
        <p>The ontological meta model provides semantics for IM entities such as
persons, messages, conversations, annotations, and message texts as it identiefis and
relates such entities to each other by meaningful properties. This permits several
enhancements as detailed in the following:
Message Context: Any message is accompanied by its context, i.e. messages link
to their following message, their sender and recipient and so on. Accordingly,
messages displayed while browsing or in search results are much more informative
thus reducing the user effort of determining whether or not they are relevant.
Semantic Querying: Querying becomes more powerful as the ontological meta
model permits to denfie what to query for, e.g. one can not only query for
messages but also for users or taxonomy entries. Moreover, restrictions on properties
can be defined, e.g. Steffen can request messages sent by Thomas within a
certain date range, including the keyword ”paper” in their message text. Resulting
messages will directly link to related messaging entities to provide context.
Integration: As the ontology unambiguously denfies messaging entities it
integrates IM into the Networked Semantic Desktop by providing interoperability
between applications. For instance, the sender of a message in Steffen’s store
can be identified as the author of a document on his hard disk, or the sender
of an email in his email client. Such features require, however, that applications
commit to the same ontology. Thus, SAM does not employ a proprietary
representation of persons, but integrates the Friend-of-a-Friend (FOAF2) ontology as
it is widely recognized for expressing identity.</p>
        <p>
          The ontology abstracts the concept of a message considering interoperability
of different message channels as proposed in [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. A unified view of messaging
aims at seamless integration between different messaging applications as it
allows to track conversations that comprise different message types and message
channels, e.g. receiving an email message and answering with an instant message.
4.3
        </p>
      </sec>
      <sec id="sec-4-3">
        <title>Meta Data Exchange</title>
        <p>All participants of a conversation deal with the same set of messages. As each
user decides how to annotate a message and which concepts to have in his
taxonomy, there are cases where annotations differ between users, and where one
user annotated a message while the other one did not. A common meta model
on each peer, unique identicfiation of IM entities, and provenance information
established by the messaging ontology enables automatic annotation exchange
between peers to either add further message semantics through additional
annotations, or add annotations for not yet annotated messages. The latter case is
especially important to reduce annotation effort for the user. As each user
maintains his own taxonomy, annotation exchange may also introduce new taxonomy
entries. SAM offers different user options to deal with incoming annotations as
explained in Sect. 6.2. Technical aspects of meta data transfer are mentioned in
Sect. 6.4.</p>
        <p>Meta data exchange is not only useful to decrease annotation effort, it
permits several novel applications. In Listing 1.4, Steffen tells Bernhard to send
him, what Thomas told him. Meta data exchange as proposed by SAM allows to
automatically integrate messages sent between Thomas and Steffen into
Bernhard’s data store so that Bernhard can utilize all features of SAM to access these
messages.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>The Ontology</title>
      <p>2 http://www.foaf-project.org/
m:Conversation, which relates to messages exchanged within a conversation by
the m:hasMessage property. A message is a subclass of foaf:Document and is
associated to its content by the m:hasText and m:hasBinary properties. The
m:follows property and its inverse, m:precedes, track the chronological order
in which messages appear, while the m:repliesTo property records further
valuable context information that goes beyond chronological ordering: It relates a
message to the message it replies to thus relating these messages based on the
semantics of their content. This property is significant to store appropriate
context information for interleaving messages as illustrated in Listing 1.1. Section
6.2 explains how this property is set using SAM.</p>
      <p>Persons are represented by foaf:human as defined in the FOAF ontology,
which already features messaging relations, including instant messaging
properties such as foaf:jabberID.</p>
      <p>In order to add semantics to messages and conversations, they are
annotated with entries of a taxonomy. The taxonomy is defined using the
Simple Knowledge Organization System (SKOS3), an ontology to describe concept
schemes providing several predefined classes and properties for this purpose. The
skos:narrower and skos:broader properties are used to build a skos:Concept
hierarchy, while the skos:subject property is used to associate things - in our
case messages and conversations - with concepts.</p>
      <p>Employing a standard meta ontology for knowledge representation fosters
integration of ontologies that are based on the same meta ontology. However, as
the hierarchical structure is established by only two relations, namely broader
3 http://www.w3.org/2004/02/skos/
and narrower, transforming existing taxonomies or lexica defined with other
meta ontologies to a SKOS representation is straightforward as well. As an
example, Wordnet4 can be transformed to a concept hierarchy defined with SKOS
by interpreting the hypernym and hyponym relations of Wordnet as narrower
and broader relations of SKOS.</p>
      <p>Provenance data for annotations that allows to track who annotated what
and when is established by individuals of m:AnnotionStatement that references
the creator (m:annotator) and creation date of an annotation. Any such
annotation is a reiefid statement that points at the resources representing the
annotation.</p>
      <p>Provenance information is also kept for messages and taxonomy entries by
the m:sender, and m:conceptCreator properties as illustrated in Fig. 1.
6
6.1</p>
      <p>SAM</p>
      <sec id="sec-5-1">
        <title>Technologies Enabling SAM</title>
        <p>SAM builds upon the instant messaging client BuddySpace5 [17], which was
developed during research on online presence in instant messaging at Open
University. BuddySpace is a client for the Jabber6 network which we extended to
use the ontology depicted in Sect. 5. A programming interface was developed
that encapsulates the ontological model and provides methods to write to it and
read from it, such as adding an annotation, or retrieving messages annotated
with a given concept.</p>
        <p>
          The messaging ontology is defined using the Web Ontology Language (OWL)[
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
It denfies the properties and classes as explained in Sect. 5, including appropriate
restrictions for them (range, domain, cardinality, functional, inverse, et cetera).
Instances of the classes defined in the ontology are represented as RDF to
support integration with the Networked Semantic Desktop and to establish a well
structured and easy to access data store that simplifies incorporation of meta
information, interlinking of resources, and exchange. The Jena7 RDF API for
Java is used to access the store.
        </p>
        <p>
          The communication protocol used by the Jabber network is the Extensible
Messaging and Presence Protocol (XMPP)[
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], an XML-based protocol that is
well supported by multiple open source programming libraries.
6.2
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>Annotations and Context</title>
        <p>In contrast to common IM clients, the chat window of SAM contains an
additional taxonomy panel (cf. Fig. 2). The chat window permits message annotation,
taxonomy management, and the addition of context information while chatting.
Both, the message panel and the taxonomy panel allow to accomplish multiple
4 http://wordnet.princeton.edu
5 http://kmi.open.ac.uk/projects/buddyspace/
6 http://www.jabber.org
7 http://jena.sourceforge.net/
(a) Selecting multiple messages.</p>
        <p>(b) Annotating with a taxonomy entry.
annotations at once to reduce user effort. Annotations are made either by
doubleclicking on a particular message that automatically annotates that message with
all taxonomy entries that are currently selected, or by double-clicking a
taxonomy entry which automatically annotates all selected messages with that entry
as illustrated in Fig. 2. To further minimize user effort, if no message is selected,
double-clicking on a concept contained in the taxonomy automatically annotates
the last displayed message. As direct visual feedback, annotated messages are
displayed as child nodes in the taxonomy (cf. Fig. 2b).</p>
        <p>New annotations are automatically sent to the conversation partner to further
reduce annotation effort and gain additional message semantics. We propose
different policies (cf. Table 1) that define how new annotations that potentially
introduce new taxonomy entries are handled based on how much trust is given
to the creator of an incoming annotation.</p>
        <p>For any created message, the m:follows, m:precedes, m:sender, m:recipient,
m:hasText, and m:hasConversation properties are automatically set by SAM
to establish context information. The m:repliesTo property can be set through
the message panel of the chat window as illustrated in Fig. 3: Selecting a message
with a right-click automatically sets the m:repliesTo property of the next sent
message to the selected one. Messages that have this property set are
automatically displayed underneath the message they reply to. As IM conversations often
have interleaving messages (cf. Listing 1.1) with different topics, this feature does
not only provide additional message context, but also eases IM conversation as it
assists the user in identifying related messages. All context information created
for a message on one client is automatically transferred to the recipient when
that message is sent to provide as much meta information as possible on both
sides of a conversation. Section 6.4 describes in more detail how the transfer of
such information is implemented.</p>
        <p>(a) Selecting message to reply to.</p>
        <p>(b) After sending the message.
SAM allows to combine full-text search in message texts with semantic search
features as illustrated in Fig. 4a. The user can restrict a search by specifying
a date range for the message creation time, require specific persons to be the
sender and the recipient, and restrict search results to be associated with certain
taxonomy entries. Resulting messages are displayed with their context available
for further exploration through the property explorer that opens by clicking on
non-literal objects such as persons and taxonomy entries (cf. Fig. 4b).</p>
        <p>The semantic browser (cf. Fig. 5) allows to view messages classiefid by the
individual taxonomy. Non-annotated messages are associated with an additional
taxonomy entry so that the user can still access them. As for search results,
object properties (displayed underlined) can be further examined (cf. Fig. 4b).
6.4</p>
      </sec>
      <sec id="sec-5-3">
        <title>Meta Data Transfer</title>
        <p>Every messaging entity (e.g. person, message) is identiefid by its uniform resource
identifier (URI) to support global identicfiation and thus exchange of such
entities. For example, each new instance of m:Message needs to be available for the
sender and the recipient as both may want to reutilize it.</p>
        <p>SAM exploits the extension mechanism of the XMPP to transfer messaging
entities between different SAM clients, and to support automatic meta data
exchange. Different extension types, namely message, annotation, resourceRequest,
(a) Semantic search.</p>
        <p>(b) Property Explorer.
and resourceResponse are denfied for this purpose. A message extension contains
the RDF representation of a message while an annotation extension contains an
instance of a m:AnnotationStatement. The two other extensions enable to
request and retrieve one or multiple RDF resources with all their properties. The
following two use cases exemplify how the extensions are used:
1. When sending a chat message, SAM automatically creates a new instance of
m:Message with corresponding properties, and attaches its RDF
representation in a message extension to the XMPP packet that sends the message.
The receiving SAM client extracts the RDF data contained in the packet’s
extension and adds it to its own store.
2. When a client receives an annotation with a taxonomy entry that is not
contained in its RDF store, the client repeatedly requests more general
(skos:broader) taxonomy entries from the sender until a retrieved entry
matches an entry in the local taxonomy so that the new taxonomy entry
can be correctly inserted into the taxonomy and the annotation becomes
effective.
7</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusion &amp; Related Work</title>
      <p>This paper presents concepts and an implementation of enhanced IM with
respect to the Social/Networked Semantic Desktop. SAM introduces rich meta
data, including semantics, to instant messaging and its content to provide
enhanced management features that exploit such additional information. The main
achievement, distinguishing SAM from existing systems, is the establishment of
an IM infrastructure to globally exchange content and its semantic meta data in
order to gain knowledge. This ability grounds several novel applications such as
knowledge collaboration.</p>
      <p>Zhang et al. present the Small World Instant Messenger in [18]. They build
user prolfies based on users’ bookmarks or homepages, which are then used for
expertise search. In contrast to our approach they rather exploit the
infrastructure provided by instant messaging without addressing any issues of instant
messaging itself. Consequently, they disregard management, reusability, and
integration issues while establishing a service on top of instant messaging.</p>
      <p>
        The Haystack system [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] comes with a general notion of messaging including
a unified messaging ontology [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] similar to the ontology presented in Sect. 5.
While the Haystack system focuses on integrating messaging into a personal
information manager, SAM considers the networked exchange of meta information
and is ready to integrate with other applications on the Semantic Desktop.
      </p>
      <p>
        Chirita et al. explain how to use activity based semantic meta data [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] in their
desktop search prototype. Exemplarily, they deal with email, file system, and web
cache meta data and have developed an architecture that combines such meta
data with standard full text search. While our work also combines full-text search
and meta data to improve management, we address different enhancements and
options for exploitation that are specicfi to the instant messaging context.
      </p>
      <p>Vogiazou et al. established enhanced symbolic presence for instant messaging
[17]. One outcome of this research is the BuddySpace instant messaging client
and server component that allow to automatically group buddies and visualize
their location and presence information respectively. SAM extends BuddySpace
by semantic annotations, semantic search, semantic browsing, and (semantic)
meta data communication.</p>
      <p>
        The CoAKTinG (Collaborative Advanced Knowledge Technologies in the
Grid) project [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] developed a meeting ontology to summarize content of different
collaborative technologies. The summarized content is used to provide meeting
replays that span content communicated via multiple channels, such as instant
messaging, or video conferencing. While CoAKTinG imports BuddySpace
communication logs into the meeting ontology, SAM contributes to CoAKTinG by
providing already well structured additional (semantic) meta data.
      </p>
      <p>
        The Gnowsis system [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] provides an architecture and server component for
integrating arbitrary applications on the Semantic Desktop. Applications are
required to describe their data by ontologies and are connected to the Gnowsis
desktop service by plugins. As a result, different data from various desktop
applications is unified through a single Gnowsis user interface. As SAM already
employs ontologies to represent all its data, integration into the Gnowsis system
is at hand.
8
      </p>
    </sec>
    <sec id="sec-7">
      <title>Outlook</title>
      <p>Meta data exchange as explained in Sect. 4.3 and 6.4 can enrich knowledge
bases but also institutes several applications that go beyond that scope.
Taxonomy overlappings between different communication partners represent a shared
view, naturally established based on communication of taxonomy entries and
their relations. Accordingly, rejecting and accepting incoming taxonomic data
is a simplistic example of online collaboration on a concept hierarchy. Further
work on generalizing the process model will allow online collaboration that is
independent of a specific problem domain.</p>
      <p>
        While IM is employed by business organizations, improving company wide
knowledge management through expertise search might be a welcomed feature
in businesses. The Bibster project [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] establishes semantic routing based on the
expertise of peers. In Bibster, expertise is computed from annotations of
bibliographic data with topics from the ACM topic hierarchy. The knowledge base
provided by SAM can be exploited similarly, however, not to implement semantic
routing but to compute the expertise of users and provide an expertise search.
      </p>
      <p>
        As mentioned in Sect. 4, we consider automatic message classification as a
future improvement. As instant messages differ from other text documents [
        <xref ref-type="bibr" rid="ref12 ref9">9,
12</xref>
        ], we consider classification of such messages as a challenging task. However,
as SAM provides rich message context, any message is usually related to several
other messages that may be exploited to improve classification. Moreover, if
each SAM client runs a classifier that works on a potentially different knowledge
base, we may investigate how to combine different classiefirs and their results to
improve overall classification quality.
      </p>
      <p>
        A very significant open issue is how to incorporate security and privacy issues,
especially trust as denfied in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] as credibility and reliability of resources.
      </p>
      <p>Application oriented visions include the integration with existing software
for the Semantic Desktop such as the Haystack or the Gnowsis systems.</p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgments</title>
      <p>We would like to thank Arup Malakar for his contributions to the development of
SAM. This work is conducted with respect to the upcoming project Knowledge
Sharing and Reuse across Media (X-Media), funded by the Information Society
Technologies (IST) programme of the 6th Framework Programme.
17. Yanna Vogiazou, Marc Eisenstadt, Martin Dzbor, and Jiı Kr´ omzak. From
Buddyspace to CitiTag: Large-Scale Symbolic Presence For Community Building and
Spontaneous Play. In ACM SAC, pages 1600–1606, 2005.
18. Jun Zhang and Marshall W. van Alstyne. SWIM: Fostering Social Network Based
Information Search. In CHI Extended Abstracts, page 1568, 2004.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Grigoris</given-names>
            <surname>Antoniou</surname>
          </string-name>
          and Frank van Harmelen.
          <article-title>Web Ontology Language: OWL</article-title>
          . In Handbook on Ontologies, pages
          <fpage>67</fpage>
          -
          <lpage>92</lpage>
          .
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Michelle</given-names>
            <surname>Bachler</surname>
          </string-name>
          , Simon Buckingham Shum,
          <string-name>
            <surname>Yun-Heh</surname>
            Chen-Burger,
            <given-names>Jeff</given-names>
          </string-name>
          <string-name>
            <surname>Dalton</surname>
          </string-name>
          , David De Roure, Marc Eisenstadt, Jiri Komzak, Danius Michaelides, Kevin Page, Stephen Potter, Nigel Shadbolt, and Austin Tate.
          <article-title>Collaboration in the Semantic Grid: a Basis For E-Learning</article-title>
          .
          <source>In Grid Learning Services Workshop, 7th International Conference On Intelligent Tutoring Systems</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Paul-Alexandru</surname>
            <given-names>Chirita</given-names>
          </string-name>
          , Rita Gavriloaie, Stefania Ghita, Wolfgang Nejdl, and
          <string-name>
            <given-names>Raluca</given-names>
            <surname>Paiu</surname>
          </string-name>
          .
          <article-title>Activity Based Metadata For Semantic Desktop Search</article-title>
          . In ESWC, pages
          <fpage>439</fpage>
          -
          <lpage>454</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Stefan</given-names>
            <surname>Decker</surname>
          </string-name>
          and
          <string-name>
            <surname>Martin R. Frank</surname>
          </string-name>
          .
          <article-title>The Networked Semantic Desktop</article-title>
          .
          <source>In WWW Workshop On Application Design, Development and Implementation Issues in the Semantic Web</source>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Jennifer</given-names>
            <surname>Golbeck</surname>
          </string-name>
          , Bijan Parsia, and
          <string-name>
            <surname>James</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Hendler</surname>
          </string-name>
          .
          <article-title>Trust Networks On the Semantic Web</article-title>
          .
          <source>In CIA</source>
          , pages
          <fpage>238</fpage>
          -
          <lpage>249</lpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Rebecca</surname>
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Grinter</surname>
            and
            <given-names>Margery</given-names>
          </string-name>
          <string-name>
            <surname>Eldridge</surname>
            .
            <given-names>Y Do</given-names>
          </string-name>
          <string-name>
            <surname>Tngrs</surname>
          </string-name>
          <article-title>Luv 2 Txt Msg</article-title>
          ? In ECSCW, pages
          <fpage>219</fpage>
          -
          <lpage>238</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Rebecca</surname>
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Grinter</surname>
            and
            <given-names>Leysia</given-names>
          </string-name>
          <string-name>
            <surname>Palen</surname>
          </string-name>
          .
          <article-title>Instant Messaging in Teen Life</article-title>
          . In CSCW, pages
          <fpage>21</fpage>
          -
          <lpage>30</lpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>Peter</given-names>
            <surname>Haase</surname>
          </string-name>
          , Jeen Broekstra, Marc Ehrig, Maarten Menken,
          <string-name>
            <given-names>Peter</given-names>
            <surname>Mika</surname>
          </string-name>
          , Mariusz Olko, Michal Plechawski, Pawel Pyszlak, Boj¨rn Schnizler, Ronny Siebes,
          <string-name>
            <surname>Steffen Staab</surname>
            , and
            <given-names>Christoph</given-names>
          </string-name>
          <string-name>
            <surname>Tempich</surname>
          </string-name>
          . Bibster - A
          <string-name>
            <surname>Semantics-Based Bibliographic</surname>
          </string-name>
          Peerto-Peer
          <string-name>
            <surname>System</surname>
          </string-name>
          .
          <source>In International Semantic Web Conference</source>
          , pages
          <fpage>122</fpage>
          -
          <lpage>136</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Ellen</given-names>
            <surname>Isaacs</surname>
          </string-name>
          , Alan Walendowski, Steve Whittaker,
          <string-name>
            <given-names>Diane J.</given-names>
            <surname>Schiano</surname>
          </string-name>
          , and
          <string-name>
            <surname>Candace</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Kamm</surname>
          </string-name>
          . The Character, Functions, and
          <article-title>Styles of Instant Messaging in the Workplace</article-title>
          .
          <source>In CSCW</source>
          , pages
          <fpage>11</fpage>
          -
          <lpage>20</lpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>David</surname>
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Karger</surname>
            , Karun Bakshi, David Huynh,
            <given-names>Dennis</given-names>
          </string-name>
          <string-name>
            <surname>Quan</surname>
            ,
            <given-names>and Vineet</given-names>
          </string-name>
          <string-name>
            <surname>Sinha</surname>
          </string-name>
          .
          <article-title>Haystack: A General-Purpose Information Management Tool For End Users Based On Semistructured Data</article-title>
          .
          <source>In CIDR</source>
          , pages
          <fpage>13</fpage>
          -
          <lpage>26</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Michael J. Muller</surname>
          </string-name>
          , Mary Elizabeth Raven, Sandra L.
          <string-name>
            <surname>Kogan</surname>
            ,
            <given-names>David R.</given-names>
          </string-name>
          <string-name>
            <surname>Millen</surname>
            , and
            <given-names>Kenneth</given-names>
          </string-name>
          <string-name>
            <surname>Carey</surname>
          </string-name>
          .
          <article-title>Introducing Chat Into Business Organizations: Toward an Instant Messaging Maturity Model</article-title>
          . In GROUP, pages
          <fpage>50</fpage>
          -
          <lpage>57</lpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Bonnie</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Nardi</surname>
            , Steve Whittaker, and
            <given-names>Erin</given-names>
          </string-name>
          <string-name>
            <surname>Bradner</surname>
          </string-name>
          .
          <article-title>Interaction and Outeraction: Instant Messaging in Action</article-title>
          .
          <source>In CSCW</source>
          , pages
          <fpage>79</fpage>
          -
          <lpage>88</lpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>D.</given-names>
            <surname>Quan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Bakshi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Karger</surname>
          </string-name>
          .
          <article-title>A Unified Abstraction For Messaging On the Semantic Web</article-title>
          .
          <source>In The Twelfth International World Wide Web Conference</source>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>Dennis</given-names>
            <surname>Quan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>David</given-names>
            <surname>Huynh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and David R.</given-names>
            <surname>Karger</surname>
          </string-name>
          .
          <article-title>Haystack: A Platform For Authoring End User Semantic Web Applications</article-title>
          . In International Semantic Web Conference, pages
          <fpage>738</fpage>
          -
          <lpage>753</lpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15. Ed.
          <source>Saint-Andre. RFC</source>
          <volume>3920</volume>
          :
          <article-title>Extensible messaging and presence protocol (XMPP)</article-title>
          .
          <source>Technical report.</source>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <given-names>Leo</given-names>
            <surname>Sauermann</surname>
          </string-name>
          .
          <article-title>The Gnowsis Semantic Desktop For Information Integration</article-title>
          .
          <source>In WM 2005: Professional Knowledge Management</source>
          , pages
          <fpage>39</fpage>
          -
          <lpage>42</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>