<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Understanding Personal Data as a Space { Learning from Dataspaces to Create Linked Personal Data</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Laura Dragan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Markus Luczak-Roesch</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nigel Shadbolt</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Southampton</institution>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2014</year>
      </pub-date>
      <fpage>18</fpage>
      <lpage>25</lpage>
      <abstract>
        <p>In this paper we argue that the space of personal data is a dataspace as de ned by Franklin et al. We de ne a personal dataspace, as the space of all personal data belonging to a user, and we describe the logical components of the dataspace. We describe a Personal Dataspace Support Platform (PDSP) as a set of services to provide a uni ed view over the user's data, and to enable new and more complex work ows over it. We show the di erences from a DSSP to a PDSP, and how the latter can be realized using Web protocols and Linked APIs.</p>
      </abstract>
      <kwd-group>
        <kwd>Personal Information Management</kwd>
        <kwd>Dataspaces</kwd>
        <kwd>Linked Data</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        In their 2005 paper [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] Franklin et al. introduced Personal Information
Management (PIM) as one of the two usage scenarios for what they called dataspaces.
They de ne a dataspace as an \abstraction for data management" across
heterogeneous collections, and propose the design and development of a suite of basic
services, collectively named DataSpace Support Platforms (DSSPs), to solve in
a general way the recurring data management chalenges identi ed for
heterogeneous collections of data: search and query, integration, availability, recovery,
access control, evolution of data and metadata. At that time, PIM included
mostly desktop data, with the extension to remote le storage, and few Web
services and mobile devices, but since then, the notion of personal data evolved,
as did the online services available to users. Personal data shifted from being
desktop centric to the Web, and the types and amount of personal information
that were being captured increased and diversi ed greatly. An important
consequence of this shift was that the data, while still personal, moved out of the
users' control, and under the control of the many organizations providing Web
services, like Facebook, Runkeeper, Amazon, etc.
      </p>
      <p>Not being in full control of its data is one of the key distinguishing
characteristics of a DSSP. In this paper we argue that the space of personal data is a
dataspace as de ned by Franklin et al., and we describe the characteristics of a
Personal Dataspace Support Platform (PDSP). We show the di erences from a
DSSP to a PDSP, and how the latter can be realized using Web protocols and
Linked APIs.</p>
    </sec>
    <sec id="sec-2">
      <title>Background and Related Work</title>
      <p>In this section we start by detailing the characteristics of dataspaces as originally
de ned. Then, we present existing related work which describes the Web of Data
as a dataspace, followed by related work in the eld of Personal Information
Management.
2.1</p>
      <sec id="sec-2-1">
        <title>Characteristics of Dataspaces</title>
        <p>
          Franklin et al. describe dataspaces as having the following distinguishing
characteristics [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]: C1) Support all the data in the dataspace, without leaving out
any. This requires handling a wide variety of formats, systems, and interfaces.
C2) Is not in full control of the data it handles, as this data might be accessible
and changed though other, native intefaces. C3) May o er varying levels of
service, returning best-e ort or approximate answers, depending on the available
data sources. C4) Must provide the tools to create better integration between
data sources, in a pay-as-you-go manner. This includes the creation of links and
mappings between data from unconnected sources.
        </p>
        <p>
          Dataspaces are modeled [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] as a set of participants and relationships. The
participants are the data sources which are available in the dataspace, regardless
of the types of data, data access, and available services they provide.
Relationships are all references between any two or more of the participants, describing
mappings at the schema or instance level, but also rules describing the
transformation of the data from one participant into a di erent form, which results in
two distinct participants { the original data source and the transformed replica.
The dataspace should be able to model any kind of relationship between any of
its participants.
        </p>
        <p>
          Franklin et al. propose a suite of key services, under the name of DataSpace
Support Platform (DSSP) from which we highlight the following: (a) access to
data, (b) cataloging, (c) browsing, (d) search and querying, (e) monitoring and
event detection. The goal of a DSSP is to provide a set of basic functions over the
heterogeneous data in a dataspace, to support the incremental building of more
complex and specialized services. While dataspaces are \not a data integration
approach, they are more of a data co-existence approach", information can be
integrated in a \pay-as-you-go" fashion, as described by Madhavan et al.[
          <xref ref-type="bibr" rid="ref9">9</xref>
          ].
Heath and Bizer adopted this terminology to highlight the evolutionary nature
of Linked Data on the Web, which we brie y describe below.
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>The Web of Data</title>
        <p>
          What has started as a Web of Documents, has evolved into a Web of Data. The
Web architecture is well suited to solve data integration problems at very large
scale in an evolutionary fashion. By means of the Resource Description
Framework (RDF) a graph-based data representation is provided, which, in
combination with the use of stable HTTP URIs, allows for integrating data without the
need for fully- edged a priori schema alignment. The support for such exible,
pay-as-you-go type of data integration [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] makes the Web of Data \a realization
of the dataspaces concept on global scale"[
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
2.3
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Personal Information Management</title>
        <p>Personal Information Management stemmed as a eld of study in response to the
problem of information overload. Many methods and tools have been developed
to enable us to organize information better, so that we can easily nd it and
re- nd it when needed. However, the growing number of applications designed
to aid the management of the information had the side e ect of the information
being trapped in unconnected repositories, and incompatible (proprietary) data
formats, which in turn led to duplication of data and increased e ort required
for organizing.</p>
        <p>
          The Semantic Desktop [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] systems aimed to solve the fragmentation problem
by marrying PIM with Semantic Web technologies like common representation
languages and ontologies. Systems like Gnowsis [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], IRIS [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ], SEMEX [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ],
XCOSIM [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], Nepomuk [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], use prede ned ontologies to classify the resources
they manage. These ontologies vary from general to detailed, from xed to user
editable and from monolithic to layered and modular. Some of the systems use a
central data store for all the personal semantic information extracted from
nonsemantic applications, while others propose the replacement of the non-semantic
tools with semantic counterparts. They all provide enhanced search and query
functionality, as well as facetted browsing and link traversal for
\follow-yournose" exploratory browsing. Dittrich et al. [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] de ne the concept of a Personal
Dataspace, as containing the entire personal information of a user, and de ne
an architecture for a Personal Dataspace Management System (PDSMS), the
equivalent of a DSSP. However, the iMemex system described is more similar to
the vision of the Semantic Desktops.
        </p>
        <p>With the shift to the Web, much of the personal information residing online
consists of the usual PIM elements: documents, email, and calendar, but is
ampli ed by social networks, multimedia, personal annotations, browsing history,
activity feeds, location feeds, and much more, all out of the control of the user.</p>
        <p>Personal Data Stores are a solution to give users back some control over their
personal information. Some come in the form of cloud services which collect and
integrate data from numerous other services on behalf of the users, and some are
systems that can be installed locally or on servers under the administration of
the users themselves. Many Personal Data Store solutions rely on the principles
and standards of the Web architecture, as they provide a exible platform to
realise highly interoperable network-based systems.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Personal Data as a Dataspace</title>
      <p>
        Personal Information Management is cited as the rst of two scenarios to
illustrate dataspaces by Franklin et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], and at that time it ful lled three out of
the four characteristics of dataspaces: PIM contained all the personal data (C1),
it supported pay-as-you-go integration (C4), and would return best e ort results
to queries, based on the available data (C3). Since then we have seen much
advancement in the area of PIM, including the development of Semantic Desktops,
which aim to solve the data integration problem on the desktop, and Personal
Data Stores which bring personal data back under users' control. However, with
the shift to the Web, the Semantic Desktops no longer manage all the user's
data, and the remaining characteristic becomes true for personal dataspaces {
the information contained is not under the full control of the DSSP (C2).
      </p>
      <p>Based on these characteristics we argue that the space of personal data, as
it is now, matches the de nition of a dataspace, by ful lling the characteristics
described above. The personal dataspace contains heterogeneous data, is
distributed, controlled by many di erent service providers, unlimited in relation to
the number and types of information that it can contain, and most importantly,
is centered around the user whose data it contains. This last characteristic is
what di erentiates a personal dataspace from a generic dataspace.</p>
      <p>In the next sections we describe in more detail the components of a personal
dataspace, and the services that a Personal Dataspace Support Platform (PDSP)
could provide.
3.1</p>
      <sec id="sec-3-1">
        <title>Participants and Relationships</title>
        <p>In our application to personal data, all the data sources that collect, store and
manage personal information are participants in the dataspace. In the past, the
desktop contained most of the personal data, so the participants in the personal
dataspace would have been the individual applications { le system and
manager, email clients, calendar, task manager, etc. Semantic Desktops uni ed most
of the desktop information under a single uniform data representation and
storage, regardless of the application it comes from. Thus, such a Semantic Desktop
would be seen as a single participant in the personal dataspace. However,
desktop applications whose data was not included in the Semantic Desktop can be
separate participants, even if they reside on the same desktop. Another set of
participants to the dataspace consists of the online applications that collect and
store information about the user. This set can include shopping history and wish
lists, activity and food tracking, trip planning, emails and calendaring, blogging
and micro-blogging, social networks, etc. Mobile applications which track the
user movements and activities are also participants in the dataspace.</p>
        <p>One of the key characteristics of a dataspace is that it contains all the data
in the space. Applied to personal data, this means that the personal dataspace
must contain all the data about the person at the centre of the space. However,
despite the ever growing number of applications and devices which collect and
deal with personal data, only those systems which handle data about the central
user, are participants in the personal dataspace of that user. Let's assume we
have two people, Alice and Bob, each using tness tracking applications: Alice
uses Runkeeper and Fitbit, and Bob uses Fitbit and Endomondo. The Fitbit
application will be a participant in the personal dataspaces of both Alice and
Bob, while Runkeeper will be a participant only in Alice's personal dataspace.
Going further with the example, while Fitbit is a participant in both users'
personal dataspaces, it participates in Alice's dataspace only with Alice's collected
data, and respectively in Bob's dataspace with Bob's collected data. So, while
under the control of the same organization (in this example Fitbit), the
participants are in fact distinct for the two di erent users. We can make the distinction
better by specifying that in the case of a Web application where the user has
an account, the participant to that user's dataspace provides in fact only the
view that the user's account in that application has over the data. Some service
providers might collect more information on the users than they make available
back to the users { for example the browsing history in online shops, and
unpublished posts in social networks. Such information, while it is \personal data"
is not available in the personal dataspace.</p>
        <p>Another consequence of the personal dataspaces revolving around a single
person's data is that the personal dataspace of a user can act as a participant in
another user's personal dataspace, by means of sharing information between the
two. This possibility requires access control management in the PDSP, a service
described in the following section.</p>
        <p>The relationships between the participants can be explicit and formally
expressed { like for example when the user manually connects two applications and
allows them to share data completely or partially. For example, our user Alice
can explicitly allow access to her Fitbit data from her Runkeeper account, for
better monitoring of her daily activity. Relationships can be also implicit, when
two participants manage the same type of data, like for instance social
connections or tness tracking, but without actually sharing any of the data itself. For
example if Bob would also buy and use a Nike+ Fuelband, both his Fitbit and
the Fuelband would record activity and movement information, like number of
steps, distance, and calories, but independently of each other and without
exchanging any data. Such relations can be made explicit by providing a mapping
between the schemas, and thus enabling better data integration outside of the
systems holding the data. The mappings can be created in the pay-as-you-go
fashion, as needed and as available. The PDSP must support the creation of
such mappings and the import of existing mappings from third parties.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Services and Functions</title>
        <p>Like a DSSP, a Personal Dataspace Support Platform (PDSP) should provide
a set of services to access the data contained in the dataspace, and allow basic
handling of this data, to support the development of more specialized services
on top of the platform.</p>
        <p>
          We list below the services a PDSP should provide, although di erent PDSPs
can choose to implement just a subset, or add more services. Some of the services
here were identi ed in [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] for a DSSP, and are relevant for personal dataspaces
as well. In their description of the iMemex, Dittrich et al. [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] divide the list of
services in two layers, the Physical Data Independence Layer (PHIL), which is
closer to the data layer, and the Logical Data Independence Layer (LIL), closer
to the applications which are built on top of the platform. Although we also
de ne services which rely on other services, some closer to the data, and some
closer to the user and applications, we do not de ne speci c layers, because the
dependencies and relative positions of the services could change over time, we
do not restrict their relative positions, nor possible interactions.
        </p>
        <p>Data access The data available from the participants is heterogeneous and
dynamic. Accessing it requires support for multiple Web APIs, formats, and
protocols, as well as high tolerance to errors and unknowns. This service must
have available all the information required to be able to access the data in all
the participating data sources, thus relying on the identity management and
cataloging services.</p>
        <p>Cataloging Depending on the kinds of APIs o ered by the participants, the
catalog can contain alternative access points for di erent formats { for
example HTML or JSON, or capabilities provided by the participant { for example
SPARQL or REST. The catalog should also contain a schema for the data it
contains. The VOID1 vocabulary is one possible way of describing the
participants exposing RDF data. The catalog does not have the function of storage or
archive, but rather of a meta-data store containing information about the data
available.</p>
        <p>
          Indexing Search is one of the essential services of a PDSP, and it requires
a reliable indexing service. Some of the participants might already contain an
index over their data, and allow direct access to it, or mediated by a search
and query function, but the PDSP should provide a uniform API over all the
participants. Specialized indexes are recommended for special types of data like
time and location. In the context of personal information the time dimension is
important for reminding [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. With the growth of location services and activity
tracking, geographical information becomes also an important dimension, which
can enable useful aggregations and lters.
        </p>
        <p>Mapping The PDSP should provide a way for incremental accumulation of
schema mappings between its participants as to enable the pay-as-you-go data
integration. The mappings can be manually created by the user, automatically
generated, or imported. They can be partial, when not all the possible types of
data from a participant are mapped. The mappings are data themselves, and as
such they can be shared with other users' PDSPs.</p>
        <p>Integration Pay-as-you-go data integration is one of the key functions of
a dataspace. In the PDSP it is provided by the mapping service, which allows
the incremental creation of a directory of mappings between the data structures
provided by various participants. Deeper integration can be done by allowing
the creation of relations between resources from di erent participants.</p>
        <p>Monitoring The PDSP must be aware of the changes in the data and in the
state of the participants, and update its catalog and indexes accordingly. The
monitoring service can use event detection mechanisms, subscriber APIs, or any
other feature provided by the data sources.</p>
        <p>Browsing, search, and query Browsing, search, and query services should
allow the users to explore their dataspace, through keyword search as well as
1 http://www.w3.org/TR/void/
structured query, by iteratively re ning and restricting a domain. These services
should work uniformly across the dataspace, regardless of the structure and
mode of access of the particular participants, as this allows the user to have a
uni ed and uniform view into the data, independently of where the data shown
comes from. The user should be able to use these services on data as well as on
metadata. As with the indexing service, some participants may already provide
a search interface.</p>
        <p>Identity management As described in the previous section, some of the
participants which provide Web APIs to access the users' personal information
require that the users are authenticated, or that the application used to access
the data on their behalf is authorised. The method for authentication and
authorisation could vary, and it is recommended that the PDSP supports all or
most of the existing methods. The account information and access tokens must
be securely stored.</p>
        <p>Access control In combination with the data access and the identity
management service, the PDSP becomes a gateway to the user's entire personal
information, thus it needs to ensure that the privacy and security of the data
is maintained. Additionally, in the case of one user's PDSP becoming a
participant in another user's dataspace, access must be controlled so that only the
authorised part of the personal data is disclosed.</p>
        <p>Annotation Annotation refers to the creation of metadata, and in PIM it
is an important feature. The PDSP should support annotation of any type of
resources, data, metadata, or participants. Annotations can include data from
multiple participants. Provenance is a special type of annotation, and all
annotations are data, which can be shared, annotated, queried, etc.</p>
        <p>Update Some participants may support updating data through Web APIs,
although not all, and not all types of data. In cases when the update cannot be
propagated to the source, the service could support saving local versions of the
updated data. This can however lead to con icting versions of the same data,
which is why provenance information and change logs are important. There are
cases when updating data cannot be done at all, for example when it comes
from sensors whose readings cannot be changed. In these cases annotations can
replace the update.</p>
        <p>The architecture of the PDSP that we envision does not provide any
applications, although it does not prohibit them either. We see applications as an
extra layer on top of the PDSP, and not at the core of the architecture as part of
the foundational layers. We propose that the applications, as well as higher level
services, are built on top of the PDSP, using the APIs provided by its services.</p>
        <p>We consider storage as a higher level service, thus not part of the core suite
of services provided by the PDSP. We do not require that the PDSP fetches and
keeps duplicates for the personal data already stored by the participating data
sources. The way the storage of data is handled is one of the main di erences
between a PDSP and Personal Data Stores.
A personal dataspace is the space of a user's entire personal data, regardless
of where it is located, on the desktop or on the Web; integrated in a Semantic
Desktop, or fragmented across many Web platforms. Following the original
definition of dataspaces by Franklin et al., we de ne the logical components of a
personal dataspace, and we describe a Personal Dataspace Support Platform as
a set of services to provide a uni ed view over the user's data.</p>
      </sec>
      <sec id="sec-3-3">
        <title>Acknowledgments</title>
        <p>This work was supported by the EPSRC Theory and Practice of Social Machines
Programme Grant, EP/J017728/1.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bernardi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Decker</surname>
            , S., van Elst,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grimnes</surname>
            ,
            <given-names>G.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Groza</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Handschuh</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jazayeri</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mesnage</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moeller</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reif</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sintek</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The Social Semantic Desktop: A New Paradigm Towards Deploying the Semantic Web on the Desktop, chap</article-title>
          . 118, pp.
          <volume>2279</volume>
          {
          <fpage>2303</fpage>
          .
          <string-name>
            <given-names>IGI</given-names>
            <surname>Global</surname>
          </string-name>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heath</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Berners-Lee</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Linked data - the story so far</article-title>
          .
          <source>International Journal on Semantic Web and Information Systems</source>
          <volume>5</volume>
          (
          <issue>3</issue>
          ),
          <volume>1</volume>
          {
          <fpage>22</fpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Cheyer</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Park</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Giuli</surname>
          </string-name>
          , R.:
          <source>IRIS: Integrate. Relate. Infer. Share. In: Proceedings of the 1st Workshop on The Semantic Desktop</source>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Decker</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frank</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The social semantic desktop</article-title>
          . In: Workshop on Application Design,
          <article-title>Development and Implementation Issues in the Semantic Web (</article-title>
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Dittrich</surname>
            ,
            <given-names>J.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blunschi</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Farber</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rene</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shant</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karakashian</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , Antonio,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Salles</surname>
          </string-name>
          ,
          <string-name>
            <surname>V.</surname>
          </string-name>
          :
          <article-title>From Personal Desktops to Personal Dataspaces: A Report on Building the iMeMex Personal Dataspace Management System</article-title>
          .
          <source>In: Proceedings of BTW</source>
          <year>2007</year>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Dong</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Halevy</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>A Platform for Personal Information Management and Integration</article-title>
          .
          <source>In: Proceedings of the Second Biennial Conference on Innovative Data Systems Research (CIDR2005)</source>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Franklin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Halevy</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maier</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>From databases to dataspaces: A new abstraction for information management</article-title>
          .
          <source>SIGMOD Rec</source>
          .
          <volume>34</volume>
          (
          <issue>4</issue>
          ),
          <volume>27</volume>
          {33 (Dec
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Franz</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Staab</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Arndt</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>The X-COSIM Integration Framework for a Seamless Semantic Desktop</article-title>
          .
          <source>In: Proceedings of the 4th International Conference on Knowledge Capture (K-CAP2007)</source>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Madhavan</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Je ery, S.R., Cohen,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Dong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.L.</given-names>
            ,
            <surname>Ko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Halevy</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          :
          <article-title>Webscale Data Integration: You Can Only A ord to Pay As You Go</article-title>
          .
          <source>In: Proceedings of CIDR2007</source>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Ringel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cutrell</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumais</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horvitz</surname>
          </string-name>
          , E.:
          <article-title>Milestones in Time: The Value of Landmarks in Retrieving Information from Personal Stores</article-title>
          .
          <source>In: Proceedings of the International Conference on Human Computer Interaction</source>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Sauermann</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>The Gnowsis Semantic Desktop Approach to Personal Information Management</article-title>
          .
          <source>Ph.D. thesis</source>
          , Fachbereich Informatik der Universitat Kaiserslautern (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>