=Paper= {{Paper |id=None |storemode=property |title=Publishing Semantic Personal Notes as Linked Data |pdfUrl=https://ceur-ws.org/Vol-674/Paper161.pdf |volume=Vol-674 |dblpUrl=https://dblp.org/rec/conf/ekaw/DraganPGH10 }} ==Publishing Semantic Personal Notes as Linked Data== https://ceur-ws.org/Vol-674/Paper161.pdf
        Publishing Semantic Personal Notes as Linked Data

                  Laura Drăgan                       Alexandre Passant                       Tudor Groza
           DERI, National University of            DERI, National University of       DERI, National University of
                Ireland, Galway                         Ireland, Galway                    Ireland, Galway
            laura.dragan@deri.org                alexandre.passant@deri.org tudor.groza@deri.org
                                                   Siegfried Handschuh
                                                   DERI, National University of
                                                        Ireland, Galway
                                                 siegfried.handschuh@deri.org

ABSTRACT                                                            it requires two prerequisite steps, provided by existing ap-
There is an obvious shift of focus towards the Web, as peo-         plications: (i) the note-taking process and annotation of the
ple spend more of their time and store more of their data           note (adding the context), and (ii) the identification of Web
online. However, the desktop still handles a large amount of        URIs which represent the same real-world thing as the desk-
the personal data. The Semantic Desktop brings semantics            top resources that belong to the context of a note.
to the desktop, better interlinking and organization of the
data, thus allowing better management. However, in spite            Our contributions include: (i) mappings between the rela-
of common representation formats, personal and online data          tively small number of desktop vocabularies and the most
is still difficult to interlink, notably because of the different   popular Web vocabularies. The mappings are used in the
vocabularies used to describe it as well as the lack of com-        transformation of the desktop data, represented with the
mon identifiers between desktop applications and Web-based          desktop ontologies, to data represented with the Web vo-
services. We describe here a solution for easily publishing         cabularies, ready to be published online; (ii) a process for
and sharing of personal notes as Linked Data. We provide            the publication of desktop information on the Web using the
a two-phased publishing and interlinking process. It can be         Linked Data principles, while at the same time protect the
used to publish any kind of information from the desktop to         sensitive private data from being shared unwillingly; (iii) a
the Web, enabling integration of small chunks of personal           system implementation that allows sharing of semantic per-
knowledge into the Web of Data and focusing on a user-              sonal notes as semantic blog posts, interlinked with existing
driven approach of knowledge management.                            information within the Linked Data cloud.

1.   INTRODUCTION                                                   2.     USE CASE - SEMANTIC BLOGGING
The Linked Data initiative and the Semantic Desktop are             Two important characteristics of blog posts are: (i) their
two relevant subdomains of the Semantic Web vision. The             topics are of interest to the author and thus are very likely
first focuses on globally interlinking resources on the Web,        to have references to things present on the desktop; (ii) they
while the second enables information integration and inter-         belong to a context consisting of the references made in their
linking on the desktop. Although the two share the represen-        content. Writing a blog post in a desktop application can
tation models (RDF(S)/OWL), there is still a gap between            offer several benefits, if the application is a semantic one,
data from the Web and the desktop, due to different sets of         on a Semantic Desktop, where desktop resources are inter-
vocabularies that are used, and generally distinct identifiers.     connected [1]. Semantic note-taking tools like SemNotes1
                                                                    automatically generate relations between the notes and the
We describe here an approach for integrating data from              desktop things mentioned in their content. Such annotations
these two environments, specifically for publishing personal        give context to the note and should be preserved when the
notes from the desktop (using Semantic Desktop technolo-            note is published as a blog post on the Web, since it enables
gies) to the Web (using the Linked Data principles). The            serendipitous browsing and information discovery, as they
main requirement is to publish data online without losing           contain relevant additional links to other entities.
the personal context established on the desktop. Our ap-
proach consists of two main steps: (i) preparing the desktop        Currently, personal notes, even the ones semantically en-
data for sharing, and (ii) publishing it online. In addition,       riched using Semantic Desktop applications, must be pub-
                                                                    lished as blog posts by being manually copied into a blog-
                                                                    ging tool. In this way, any additional semantic information
                                                                    available on the desktop becomes lost or, if copied, leads to
                                                                    broken references as they point to the local resources which
                                                                    are not accessible outside of the desktop. The note-taking
                                                                    to publishing process is sometimes shortcut by using the
                                                                    drafting functionality that some systems like WordPress or
                                                                    Blogger offer, so that users can directly take the notes in
                                                                    1
                                                                        http://smile.deri.ie/projects/semn
the blogging tool, usually online, thus replacing the desk-       context of the Semantic Web browser Haystack [4]. The ex-
top note-taking application. Using online tools deprives the      isting systems for semantic blogging fall into two categories:
user from having the personal context automatically added         (i) desktop applications that involve publishing the actual
to the blog post, since desktop information cannot be easily      local resource information together with the blog post, or
integrated in Web-based interfaces.                               (ii) online applications that do not have access to desktop
                                                                  data relevant to the user. The first category, represented by
3.   SYSTEM OVERVIEW                                              tools like SemiBlog [5] or SemBlog [6], gives the user bet-
We propose an approach that enables the publishing and            ter access to the relevant data from the desktop. However,
sharing of personal notes by extending the functionality pro-     both tools require that the resources that contain private in-
vided by SemNotes. The process has two steps: (i) trans-          formation are published together with the blog post, which
formation and (ii) publication. In the first step, the note is    might lead to privacy issues. They are used for exchange of
transformed locally for publication, and private local data is    personal information in the blog posts, which differs from
replaced with public server references. In the second step,       our approach of using already published web data as to pro-
the transformed note is published online on a dedicated           tect the privacy of the personal information. The process
“SemNotes server”, where the resources referenced and the         described implies manually adding the metadata, while our
tags assigned, are shared between the notes of all users. As      approach relies on automatic export. Online services like
we mentioned above, there are also two prerequisite steps:        BlogAccord [2] for music information or Zemanta2 blogging
(i) the note-taking process and annotation of the note, which     assistant, belong to the second category. They enhance the
is the usual note-taking approach, and (ii) the identification    blogging experience by providing access to various online re-
of Web aliases for the desktop resources related to a note,       sources to create the context of a blog post, but not to the
where URIs are mined from the Web for locally defined re-         personal context of the user.
sources, such as people, events or projects. The annotation
is done semi-automatically and is an existing feature in Sem-     5.     CONCLUSION
Notes. The second prerequisite step consists in finding Web       We presented an approach for publishing personal notes as
resource for each of the desktop entity linked to the note that   Linked Data on the Web. The aim of our work was to pro-
is about to be published. This step is executed by a desk-        vide a way for publishing and sharing complete information
top service that relies on the Semantic Web index Sindice to      by preserving the personal context of the notes without com-
retrieve results.                                                 promising privacy. Our solution makes a step towards bridg-
                                                                  ing the gap between local semantic data and Linked Data.
The publishing system is divided between its local part and       We defined a publishing process that comprises two steps:
its remote part. The local part handles local private data,       (i) preparation – the note is transformed into a SIOC-based
while the remote one handles online public data. The sep-         Web representation; and (ii) publication / sharing – the note
aration between them extends over three layers: ontology,         is published online following the Linked Data principles.
data and application.
                                                                  6.     ACKNOWLEDGMENTS
On the ontology level, the NEPOMUK desktop ontologies             The work was supported by the Lı́on-2 project funded by Sci-
are used locally while popular Web vocabularies like SIOC         ence Foundation Ireland under Grant No. SFI/08/CE/I1380.
are used on the server-side. These ontologies are used to
describe the data exchanged between the applications. Per-
sonal desktop data is stored in the local repository, which       7.     REFERENCES
exists on any NEPOMUK Semantic Desktop, while Web                 [1] A. Bernardi, S. Decker, L. van Elst, G. Grimnes,
data is distributed in the Linked Data cloud. Finally, on             T. Groza, S. H. M. Jazayeri, C. Mesnage, K. Moeller,
the application level, the local component is an extension to         G. Reif, and M. Sintek. The Social Semantic Desktop:
SemNotes that provides publishing functionality for notes,            A New Paradigm Towards Deploying the Semantic Web
and the remote component is a server that hosts and pub-              on the Desktop. 2008.
lishes online the notes.                                          [2] S. Cayzer. What next for semantic blogging? In Proc.
                                                                      of the SEMANTICS 2006 conference, pages 71–81,
The first step of the process is executed on the local side,          Vienna, Austria, November 2006.
by an extension of the SemNotes application. It consists of       [3] S. Cayzer and P. Shabajee. Semantic blogging and
replacing the links to the local resources mentioned in the           bibliography management. In BlogTalk Proc., 2003.
note with their Web aliases, and enriching the content of         [4] D. R. Karger and D. Quan. What would it mean to
the note with RDFa. Then, the publication step is done by             blog on the semantic web? In Proc. of the 3rd ISWC,
the server, which receives information from the desktop and           Hiroshima, Japan, pages 214–228. Springer, 2004.
publishes the note according to the Linked Data publishing        [5] K. Möller and S. Decker. Harvesting desktop data for
principles. On the server, the notes, linked resources and            semantic blogging. In Proc. of Semantic Desktop
tags have dereferenceable URIs. The resources and tags                Workshop, ISWC, Galway, Ireland, 2005.
are shared among notes and users, thus providing object           [6] H. Takeda and I. Ohmukai. Semblog project. In
centered sociality. The dataset is also linking to external           Activities on Semantic Web Technologies in Japan, A
resources that are found to be sameAs the local ones.                 WWW2005 Workshop, 2005.

4.   RELATED WORK
Semantic blogging was introduced by Cayzer and Shabajee
                                                                  2
in [3]. Karger and Quan describe semantic blogging in the             http://www.zemanta.com