<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The Effect of Introducing Content Price in Distributed Social Networks</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Christos Tryfonopoulos Dept. of Informatics and Telecommunications University of the Peloponnese GR22131</institution>
          ,
          <addr-line>Tripoli</addr-line>
          ,
          <country country="GR">Greece</country>
        </aff>
      </contrib-group>
      <fpage>19</fpage>
      <lpage>27</lpage>
      <abstract>
        <p>-Over the last few years a number of distributed social networks with content management capabilities have been introduced both by academia and industry. However, none of these efforts has so far focused on supporting both information retrieval and filtering functionality in a distributed social networking environment. In this work we present a social networking architecture that offers both functionalities -in addition to the usual social interaction tasks- in distributed social networks, outline the associated distributed protocols, and introduce a novel data source selection mechanism for identifying good data sources. This novel data source selection mechanism is designed to take into account a combination of resource selection, predicted publication behaviour, and content cost to improve the selection of information producers by users. To the best of our knowledge our approach, coined AGORA, is the first work to model the price of content and to study its effect on retrieval efficiency and effectiveness in a distributed social network setting. Finally, our work goes beyond modelling by providing proof-of-concept experiments with real-world corpora and social networking data.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Index Terms—distributed social networks, content
management, information retrieval/filtering, content price, economic
modelling, experimental evaluation</p>
    </sec>
    <sec id="sec-2">
      <title>I. INTRODUCTION</title>
      <p>
        Much information of interest to humans is available today
on the Web, making it extremely difficult to stay informed
without sifting through enormous amounts of information. In
addition, a vast amount of this information is published and
shared through social networking sites by users that participate
in ‘social’ activities through the generation, commenting,
tagging, liking and sharing vast amounts of digital content.
As users engage increasingly more in the usage of social
networks, demand for content management capabilities has forced
social networks to go beyond the usual social interactions
(e.g., like, post, or poke) and offer basic content management
functionality. To this end, many social networks adopt the
traditional approach of content search (information retrieval - IR)
[
        <xref ref-type="bibr" rid="ref23">23</xref>
        ], [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ], [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ], [
        <xref ref-type="bibr" rid="ref33">33</xref>
        ], [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. All these approaches however
ignore that the information filtering paradigm may also provide
a good alternative of keeping the users informed and at the
same time avoiding the information avalanche. In information
filtering (IF) –also referred to as publish/subscribe, continuous
querying, information push, or information dissemination–
users are able to subscribe to information sources and be
notified when content of interest is published. This need for
content-based push technologies to cope with the information
explosion is also stressed by the deployment of tools such
as Google Alert and CNN alerts. In an IF scenario, a user
posts a subscription (or continuous query) to the system to
receive notifications whenever certain events of interest take
place (e.g., when a blog post on Winter Olympics becomes
available).
      </p>
      <p>
        However, the increasing usage of social networks gave rise
to skepticism about use of centralised services that are able
to withhold all uploaded content. Such centralised social
networking services are typically owned by private companies and
users need to upload their content, thus giving away ownership
rights to make it available to others. This allows companies
to exploit user data and sell them to advertisers for profit.
To circumvent such practises, distributed social networking
services –building upon results from the P2P paradigm (e.g.,
[
        <xref ref-type="bibr" rid="ref29">29</xref>
        ], [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ], [
        <xref ref-type="bibr" rid="ref40">40</xref>
        ])– have been proposed both by academia and
industry in the form of distributed social platforms [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ],
[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ] and distributed social content management
systems [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ], [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ], [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ], [
        <xref ref-type="bibr" rid="ref33">33</xref>
        ], [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. Although all these
approaches offer different types of distributed social networks
that allow users to create communities, share content, and send
messages, none of them focuses on supporting information
filtering functionality in such a setup. Moreover, such
distributed environments without centralised control, prove an
interesting business model for pricing the content delivered
by an information producer. In this setup, each information
producer (e.g., a news agency, a digital library, or a prolific
blogger) might have its own customer base of interested
followers/subscribers, and may charge the delivered content
by subscription or per item.
      </p>
      <p>In this work, we present AGORA, a distributed social
networking architecture that allows users to share, search for
(IR), and subscribe to (IF) content in a fully decentralised
way, while at the same time maintaining ownership of their
content. Such a design is ideal for creating social content
marketplaces1, where users are able to price and distribute
their content while at the same time they maintain ownership
rights. Content in our setup can be textual, such as status
updates/blog posts/tweets, or multimedia, such as photos or</p>
    </sec>
    <sec id="sec-3">
      <title>1AGORA is the Greek word for marketplace</title>
      <p>Copyright held by the author(s)
videos, appropriately tagged. Our proposed solution offers
fundamental social interactions, while emphasising on content
management (IR and IF) and economic aspects such as the
price/quality tradeoff of content. To the best of our knowledge,
this is the first approach that aims at combining IR and IF
in a social networking context, while taking into account
aspects of economic modelling. In the light of the above, the
contributions presented in this work are the following:
We propose a social networking architecture that offers
content management functionality in terms of IR and IF,
in addition to the usual social interaction tasks typically
supported in such scenarios. This is the first approach in
the literature to offer both functionalities.</p>
      <sec id="sec-3-1">
        <title>We present the distributed protocols and services that reg</title>
        <p>ulate node interactions, provide details on the distributed
IR and IF, and outline the different friendship facets
introduced in the architecture.</p>
        <p>We devise a novel method to rank information producers
according to (i) the content already published, (ii) the
expected future publishing behaviour, and (iii) the price
announced by the information producer. This method
allows us to achieve high recall with low cost (by searching
at/subscribing to a small number of information
producers).</p>
        <p>We study the effect of content price in our setup and
experimentally demonstrate that it is a key element on the
delivered content quality. Our modelling utilises concepts
such as correlation between the quality/expertise of the
information producer, demand-driven price rates, and cost
of resources.</p>
        <p>Figure 1 shows a high-level view of the envisioned
distributed architecture with different types of information
producers and users with varying information needs (IR, IF or
both).</p>
        <p>The rest of the paper is organised as follows. Section II
overviews related research, while Section III discusses the
AGORA architecture and the associated distributed protocols.
Subsequently, Section IV highlights the economic and
qualitative aspects of AGORA and presents the price/quality tradeoff,
while Section V experimentally demonstrates the effect of
cost in retrieval and filtering quality and compares the system
effectiveness with/without monetary flow. Finally, Section VI
discusses high-level conclusions, open issues, and possible
extensions.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>II. RELATED WORK In this section, we discuss related work in the context of systems that are designed with content management in mind, and platforms that provide distributed social networks.</title>
      <sec id="sec-4-1">
        <title>A. Distributed Social Platforms</title>
        <p>
          All available social networks (e.g., Facebook, LinkedIn,
Elgg) are currently based on centralised solutions both for
storing and managing of content, which set scalability
limitations on the system and reduce fault-tolerance. Industry
has already detected these drawbacks and has lately turned
into solutions that diverge from the centralised model of the
existing systems by developing platforms, such as Diaspora,
KrawlerX and OpenSocial, that provide APIs to support
application hosting in remote application servers, owned and
managed by the application providers. In a similar spirit,
a strand of research work also moved towards hierarchical
organisations for supporting distributed social networking. The
distributed social systems SuperNova [
          <xref ref-type="bibr" rid="ref34">34</xref>
          ] and Scope [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ] are
based on a two-tier architecture, where nodes with higher
computing capability become super-nodes and form an overlay
to provide distributed data management of the P2P social
network. Client nodes connect to super-nodes and rely on
them for bootstrapping, sharing their content, and accessing
the shared information. Although all of the platforms and
system schemes propose the decentralisation of the social
services, one of the main issues of the centralised architectures
persists: the existence of a single point where user information
is collected and may be exploited.
        </p>
        <p>
          To alleviate the above disadvantage, distributed platforms
for social online networks based on the P2P paradigm were
proposed [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ], [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. LifeSocial.KOM [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] is a
pluginbased extendible social platform that provides secure
communication and user-based data access control, and integrates
a monitoring component that allows users and operators to
observe the quality of the distributed system. Similar efforts
aimed at spontaneous social networking; they include
proposals for distributed social services in resource constrained
devices (like tablets or smartphones) [
          <xref ref-type="bibr" rid="ref35">35</xref>
          ], [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], or in environments
with no infrastructure guarantees (e.g., high-attendance events)
[
          <xref ref-type="bibr" rid="ref24">24</xref>
          ]. All these approaches offer different types of distributed
social platforms that allow users to create communities, share
content, and send messages, but do not emphasise expressive
content search mechanisms.
        </p>
        <p>
          Finally, other approaches in distributed social networking
emphasise on delivering innovative and competitive services;
SCIMS [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ] relies on an ontology-based model for managing
social relationships and status, the work in [
          <xref ref-type="bibr" rid="ref37">37</xref>
          ] aims at
personalising search results based on user context and friendship
relations, while Gemstone [
          <xref ref-type="bibr" rid="ref36">36</xref>
          ] targets data availability in the
absence of the data owner. To achieve this, a replica storage
scheme based on social relationships, online patterns of nodes,
and user experiences is utilised.
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>B. Distributed Social Data Management</title>
        <p>
          Our work fits mainly into the area of content management
in distributed social networks and is inspired by previous
approaches on Semantic Overlay Networks (SONs) [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ], [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ],
[
          <xref ref-type="bibr" rid="ref40">40</xref>
          ] and based on works that emphasise on distributed content
location in social networks. Works like the eXO [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ] and
SoNet [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ] systems are, similarly to AGORA, inspired by the
P2P paradigm to provide content location and management
services on large-scale decentralised social networks. To do
so, the authors rely on a structured overlay and exploit the
accurate location mechanisms, but de-emphasise node
autonomy. Contrary to these approaches, AGORA employs a loose
component architecture and introduces a new type of social
relations between nodes: the semantic closeness of content.
In this way, nodes that are similar in terms of content, create
emergent groups likewise to the creation of social relations.
Our work shares ideas with the SocialCDN system [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], where
social caches (links among friends) are introduced as a way to
alleviate the network traffic and optimise data dissemination
(mainly by social updates). In [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], social cache selection
is formulated as the neighbour-dominating set problem and
a family of algorithms is proposed and evaluated. Contrary
to AGORA, where the emphasis is on efficiently supporting
expressive content retrieval in the social paradigm, the
emphasis on SocialCDN is on the reduction on network traffic to
facilitate fundamental social interactions.
        </p>
        <p>
          The loose component architecture and the emphasis on node
autonomy of [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ] resemble the architectural design of AGORA,
where an unstructured overlay network of nodes is utilised
to support the distributed social infrastructure. However, the
focus of [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ] is on the design of gossip protocols for efficiently
disseminating profile updates to all interested users and does
not put any attention to the problem of content search and
management.
        </p>
        <p>
          Furthermore, the concept of creating and maintaining social
connections in distributed infrastructures is affined with the
problem of distributed data management in P2P networks. In
SONs (e.g., [
          <xref ref-type="bibr" rid="ref40">40</xref>
          ], [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]), “social” connections between the peers
(e.g., similarity of content, pattern, or distance in a physical
level) are exploited to direct the search to nodes with relevant
data (e.g., as in [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ] that studies query routing strategies based
on “social” relationships). Other works on SONs (e.g., [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ])
focus more on the organisation of P2P networks as
smallworld networks, where peers self-organise in groups of similar
interests to facilitate message-efficient query answering. Our
work on AGORA borrows concepts and ideas from research on
SONs and extends them for facilitating efficient and effective
data management in a social network setting. We suggest that
SONs offer the most promising architectural solution inspired
from the P2P paradigm; it is a perfect fit for a distributed
social networking scenario providing high decentralisation,
high node autonomy, support for emergent semantic and social
structures, and effective object location mechanisms. Contrary,
DHT-based architectures [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ], [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ] ignore node autonomy (by
enforcing deterministic key/content placement) and emergent
structures (by enforcing network structure).
        </p>
        <p>
          Finally, a large number of research in the domain of
distributed social networks consists of studies on system security
[
          <xref ref-type="bibr" rid="ref28">28</xref>
          ], [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], user privacy [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ], [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ], [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], [
          <xref ref-type="bibr" rid="ref41">41</xref>
          ], distributed
access control [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ], [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], and authentication mechanisms [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
Clearly, security issues are also relevant in our design and the
AGORA system could benefit by adopting approaches like [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]
or [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] that enforce user privacy and access control. However,
the problem of security is orthogonal to our design and is not
further analysed as it is not the emphasis of this work.
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>III. SYSTEM OVERVIEW In this section, we provide an overview of the architectural components of the proposed system and provide the distributed protocols executed locally by users and information producers.</title>
      <sec id="sec-5-1">
        <title>A. Architecture</title>
        <p>We consider a distributed social network, where each user,
characterised by its interests, is connected to friends and
information producers with similar interests. The interests of a
user are identified automatically, i.e., by applying clustering on
its local content repository. The network nodes use a heartbeat
protocol that runs continuously and aims at identifying new
information producers based on the likelihood to have similar
interests with the node at hand. Each user maintains two
routing indices holding information for social friends and
thematic friends. Social friend links correspond to the social
relationship aspect of the network, while thematic friend links
are of two types: short-range links (i.e., links to information
producers with similar interests) and long-range links (i.e.,
links to information producers with dissimilar interests to
maintain connectivity between different clusters in the system).
The reorganisation procedure is executed locally by each
node and aims at bringing together users and information
producers with similar interests that are likely to share/search
for/subscribe to content.</p>
      </sec>
      <sec id="sec-5-2">
        <title>B. Joining Agora</title>
        <p>When a user node connects to the network, its interests are
automatically derived by its local content. For each interest,
the node maintains a thematic index (T) containing the contact
details and interest descriptions of information producers.
These links form the thematic neighbourhood of the node;
the links contained in T are refined accordingly by using
the rewiring service described below. Furthermore, each node
maintains a friend index (F) containing the contact details
and interest descriptions of the social neighbourhood of the
node, comprised of explicitly declared friends in the network.
Figure 2 shows the F and T for an arbitrary node.</p>
      </sec>
      <sec id="sec-5-3">
        <title>C. Locating information producers</title>
        <p>This service is applied to locate new information producers
by establishing new connections and discarding old ones.
Each node may initiate this procedure by computing a scoring
function that combines the quality and price of all information
producers in its thematic index (T). If the computed score is
greater than a threshold then the node does not need to take
any further action, since it is already aware of information
producers that match its needs and budget. Otherwise, the
node initiates a process to identify new information producers
by forwarding a message in the network –bounded by a
time-to-live (TTL) mechanism– using the thematic and social
connections and collecting the interests of other information
producers.</p>
        <p>The issued message is forwarded with equal probability
to (i) a number of randomly chosen entries contained in a
node’s T, (ii) a number of randomly chosen entries contained
in a node’s F, or (iii) the most similar nodes to the message
initiator, found in either T or F. The rationale of applying
either of the forwarding strategies is that the message initiator
should be able to reach information producers both directly
(through other similar nodes), but also indirectly (through
propagation of the message through non-similar nodes). Each
node that receives the message adds its interest in it, reduces
TTL by one, and forwards it in the same manner. When the
TTL of the message reaches zero, the message containing
the contact info and interests of all information producers
that received the message is sent back to its initiator. To
speed up the process, every intermediate node receiving the
message may utilise the information in it to refine its thematic
connections.</p>
      </sec>
      <sec id="sec-5-4">
        <title>D. Subscribing for content at information producers</title>
        <p>
          Queries/subscriptions are issued as free text or keywords
under the Vector Space Model and are formulated as term
vectors. The user subscribing for specific content forwards
a message in the network with a TTL using both its social
and thematic connections. The issued message is forwarded
both to (i) friends that have interests similar to the query and
are contained in F and (ii) a small number of information
producers contained in the T chosen as described below.
Initially, the message initiator compares the user subscription
against its interests and, if similar, the message is forwarded to
all of its short-range links, i.e., the message is broadcasted to
the node’s neighbourhood (explosion). Otherwise, the message
is forwarded to a small fixed number of nodes that have the
highest similarity to the subscription (fixed forwarding). The
combination of the two routing strategies is referred to in
the literature as the fireworks technique [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ]. All the nodes
receiving the message reduce TTL by one and apply the
same forwarding technique; the message is not forwarded
further in the network when TTL reaches zero. Additionally
to forwarding, every node receiving a message compares the
subscription against its identified interests and, if similar,
stores it in its local continuous query data structures to match
it against future publications. Content is kept locally at each
information producers in the spirit of [
          <xref ref-type="bibr" rid="ref42">42</xref>
          ]. This lack of
publication dissemination is a design decision that offers increased
scalability (by trading recall) and complies with the content
marketplace paradigm we aim for. Thus, only the nodes
indexing a subscription can notify the interested user, although
other information producers may also publish relevant content.
When an information producer wants to publish new content
to the network, it matches it against its local continuous query
database to decide which continuous queries match it and thus,
which user node should be notified. Then, the information
producers delivers a notification for each continuous query by
sending to the query initiator a pointer to the matching content;
if the user is not online, the provider stores the message
and delivers it upon user reconnection. Notice that since the
information producer maintains the content ownership, it is
now able to charge the user at the announced content price.
        </p>
      </sec>
      <sec id="sec-5-5">
        <title>E. Searching for content at information producers</title>
        <p>A user issuing content search forwards a message in the
network following a mechanism similar to the one described in
the previous section. Additionally to query forwarding, every
information producer receiving a query message compares
it against the identified interests and, if similar, matches it
against the locally stored content. Subsequently, pointers to the
matching content are sent to the query initiator (along with the
announced price for the content). Subsequently answers from
all information producers are ranked by a combination of price
and similarity to the query (discussed in the next section) and
the list is presented to the user.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>IV. RANKING OF INFORMATION PRODUCERS</title>
      <p>To select which information producer will be monitored,
the protocols described in the previous section use a scoring
function to rank information producers. In this section, we
quantify these concepts and give the rationale between our
choices.</p>
      <sec id="sec-6-1">
        <title>A. Quality vs Price</title>
        <p>
          The ranking strategy for the information producers is a
critical component since it allows users to locate relevant
information and information producers to maximise their revenue.
Empirical studies have shown that price and quality are the
two key determinants of the consumer’s choice to buy or not
a product [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. Thus, to make an informed selection on the
information producers, a user has to rank them based on a
blend (denoted by tunable parameter ) of content quality and
content price both in a retrieval and a filtering scenario. This
combination describes the benefit/cost ratio for the content and
allows the user to assign a score to every information producer.
To this end, the score for each information producer is given
by:
score(p; i) =
price(p; i)
(1
) quality(p; i)
(1)
        </p>
        <p>Here, p denotes an information producer, q denotes an
information need (in the form of a query or a subscription),
price(p; i) refers to the announced price an information
producer provides for the provided content, and quality(p; i)
denotes how relevant p is to i. Information producers compute
the price based on the demand and according to popularity of
themselves and that of their content. Price and quality have
the same range of values to allow for their combination, while
price is typically recomputed whenever the popularity of the
information producer changes. In Section V we study the price
in different scenarios, and show the effect on the quality of
retrieved content (i.e., recall) when the price choice is (i)
random, (ii) strongly correlated with quality, and (iii) partially
correlated with quality.</p>
      </sec>
      <sec id="sec-6-2">
        <title>B. Content price specifics</title>
        <p>
          In this section, we analyse the economic modelling of
AGORA and review the basic assumptions and expectations
from such a modelling. Usefulness of the information goods
received by a subscriber is a qualitative criterion, that is
difficult to model. In AGORA, we model usefulness by matching
interests, i.e., by assuming that all received content relevant
to the information need are useful to the subscriber, and do
not discuss issues such as novelty, coverage of the field, or
user effort. In our modelling, after a user acquires a history
of transactions with certain information producers, develops
an affect for some of them. Affect can be modelled in various
ways, depending on the task at hand, and can be either positive
or negative [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. In AGORA, a user does not a priori know the
quality of the information goods, but uses the affect developed
from previous transactions to approximate it. Subsequently, he
compares the values of information quality to the expected
values and update its affect [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ].
        </p>
        <p>
          The costs in AGORA are results of actions [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ], such as
transactions, network communication, and use of common
infrastructure. Information producers try to maximise their
revenue while minimising expenses that occur due to their
actions. Contrary users try to maximise the utility of the
received content, while minimising expenses that occur due to
actions. In general, the content market in AGORA is not a pure
competitive market in the sense of e.g., [
          <xref ref-type="bibr" rid="ref39">39</xref>
          ], since users do
not know in advance the exact content quality they are buying.
AGORA resembles the modelling of a team of sales people
[
          <xref ref-type="bibr" rid="ref38">38</xref>
          ], where stakeholders try to collaborate with others in order
to get their expertise for a (cross/up) sale. After deciding who
to collaborate with, it is possible to model the gap between
the initial expectations and the actual actions. In [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ], it is
shown that this gap is smaller in a competitive relationship
compared to that of a cooperative relationship. As in many
cooperative environments each stakeholder usually retains its
connections with the others, while also being free to explore
new mutually beneficial connections. This is in line with the
notion of friendship connections in any social network.
        </p>
        <p>The main goal of this modelling is to study the influence of
the cost component on the quality of received content, study
the interactions between users and information producers, and
gain insights about the overall behaviour of this prototype
content market.</p>
      </sec>
      <sec id="sec-6-3">
        <title>C. Content quality specifics</title>
        <p>The quality of the content is a difficult thing to model
as it cannot be known prior to acquiring it. Hence the best
option is to assess the quality of the information producer.
To do so, one has to take into account the dual capacity of
information producers in AGORA: to answer one-time content
requests (IR) and satisfy long-term information needs (IF).
To this end, the quality of an information producer has to be
based on (i) the quality of already published content (since
this depicts its ability to satisfy IR tasks from users) and (ii) a
predicted quality of the content to be published in the future
(since this depicts its ability to satisfy IF tasks from users).
The necessity of both facets is better illustrated in the case of
an information producer that provides content in the form of
technical articles and news items; articles have a long
shelflife and are good candidates for recurrent sales, while news
items have an extremely short shelf-life and after some time
users loose interest in them.</p>
        <p>
          1. Quality of published content. The quality of the already
published content is known as the resource selection problem
in the areas of information retrieval and databases. Hence,
a number of standard resource selection algorithms such as
tf-idf based methods, CORI, or language models (see [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ]
for a nice overview) have been proposed. We use a standard
tf-idf based component that combines accuracy of modelling
and ease of implementation. Thus, in our setup, quality of
published content is given by:
        </p>
        <p>X [0:5 log (dfp;t) + 0:5 log tfpm;tax ]
t2i
(2)</p>
        <p>
          Our approach uses the standard IR constructs of document
frequency (df ) and maximum term frequency (tf max) [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ],
while a balanced combination of these two metrics (0.5 in the
above equation) is used to equally emphasise importance of
df and tf max according to [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ], [
          <xref ref-type="bibr" rid="ref43">43</xref>
          ].
        </p>
      </sec>
      <sec id="sec-6-4">
        <title>2. Predicted quality of content to be published. To pre</title>
        <p>
          dict the quality of the content to be published, we model
IR statistics per information producer as time-series data
and use statistical analysis tools [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] to predict future values
based on past observations. Such techniques take into account
assumptions about the internal structure of the time series
like trends and seasonality and tend to emphasise recent
observations. Since seasonality requires long-term statistics
that are infeasible to maintain (e.g., several years of data to
observe seasonality in the publication of Christmas content),
we resort to double exponential smoothing techniques [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] as
our prediction mechanism, since it requires a small amount of
data, emphasises recent over older data observations, and
allows for trend identification (useful for news items or trending
topics). Thus, in our setup, predicted quality of content to be
published is given by:
        </p>
        <p>In the formula above, function (dfp;t) stands for the
difference between the predicted and the last document frequency
(df ) observed, and (cs ) is the difference in the collection
size of information producer p reflecting its overall expected
future publishing activity. These values are calculated for all
terms t contained in the user subscription. In this way we
model two aspects of the information producer’s behaviour:
(i) its potential to publish relevant documents in the future
(reflected by (dfp;t)) and (ii) its overall expected future
publishing activity (reflected by (cs )). Notice also that,
in the above formula, the publication of relevant documents
(i.e., (dfp;t)) is more emphasised than the publication rate
(delta(cs )) due to the nesting of the log functions. The
addition of 1 in the log functions is used to yield positive
predictions and avoid log(0).</p>
      </sec>
      <sec id="sec-6-5">
        <title>3. Putting it all together. Given Formulas 2 and 3, the</title>
        <p>overall quality of an information producer p with respect to
an information need i is given by:
quality(p; i) =</p>
        <p>X[(0:5 log (dfp;t) + 0:5 log tfpm;tax )+
t2i
log
(dfp;t) + log
(csp) + 1 + 1 ]
(4)</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>V. EXPERIMENTAL EVALUATION</title>
      <p>In this section, we present our findings from the introduction
of content cost, and how it affects the effectiveness of the
system. We study the behaviour of AGORA using different
scenarios, while varying the correlation between price, quality
and customer demand.</p>
      <sec id="sec-7-1">
        <title>A. Experimental Setup and Metrics</title>
        <p>We used a real-life document collection containing 2M
documents with a vocabulary of about 600K terms from a focused
Web crawl. The documents were categorised in ten different
categories by content, with the smallest category having 67K
documents and the largest one of 325K. In all experiments,
the network consists of 1; 000 information producers and 1M
users. Each information producer started with a database of
300 documents initialised with 15% random category, 10%
non-categorised, and 75% single category documents, resulting
in 100 specialised information producers for each category. We
artificially constructed information needs for the users (in the
form of queries and subscriptions with multiple terms) using
the document corpus and by selecting terms that are strong
representatives of a document category (i.e., a frequent term
in documents of one category and infrequent in documents of
the other categories). The simulation was done in rounds.</p>
        <p>
          We introduced budget constraints on a per user basis; thus,
each user was able to follow only a few selected information
producers that were chosen according to his own information
needs and ranked using the formulas discussed in Section IV.
For deciding the per user budget, we relied on studies about
budget distribution and spending for a variety of cases, ranging
from family budgets to consumer budgets [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ], [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. The main
conclusions drawn from these studies are that (i) budget
distribution follows a power law, with a small percentage
of families/consumers having a high (yearly) budget, and a
large percentage of the families being in the (long) tail of
the distribution, with a low budget, and (ii) the percentage
of the income spend on content does not vary with the
budget. According to [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] and the above remarks, we divided
the users into three classes: low, average and high budget
users with a population of 600K (or 60% of total users),
300K (or 30% of total users) and 100K (or 10% of total
users) respectively. Subsequently, we experimentally computed
a budget that would allow the users to subscribe to the top-100
(i.e., 10%) information producers, and allowed the low budget
users to have 60%, the medium budget users to have 80% and
the high budget users to have 120% of this ideal budget.
        </p>
        <p>To measure the effect of content cost in quality (with and
without monetary flow) we utilise the following metrics:
Messages. We measure the message traffic per action in
AGORA.</p>
        <p>Recall. We measure average recall over all rounds by
computing the ratio of the total number of retrieved
content to the total number of relevant content for all
users.</p>
        <p>Ranking. We use an extension of Spearman’s footrule
distance to compare rankings of information producers
calculated by users. This metric allows us to compare two
different information producer rankings by calculating the
distance between the elements in two ranking lists. In our
implementation if an element from list A is not present
in list B, it is considered as being in the last available
position in B.</p>
        <p>Finally, to assist the reproducibility of our results we plan
to publicly release our code in an appropriate repository after
the publication of the work.</p>
      </sec>
      <sec id="sec-7-2">
        <title>B. Varying the price-quality correlation</title>
        <p>In this experiment, we aimed at observing the behaviour of
AGORA when varying the correlation 0 1 between the
price and the quality of an information producer; k = 1 means
that the price an information producer charges for content
is fully correlated to the quality of that producer. Notice
that quality in AGORA is easily calculated/forecasted using
Equation 4 and in this case users know that the content from
this information producer will be expensive but relevant (i.e.,
useful to them). The other extreme is when k = 0, where
prices have no correlation to the quality of the information
producer and are chosen randomly. For the rest of the cases,
i.e., 0 &lt; &lt; 1, the correlation is modelled as the likelihood
that an information producer sells % of content underpriced
(up to 20% of the initial value), and 1 overpriced (up to
20% of the initial value).</p>
        <p>Figure 3 presents the achieved recall for different values
of and , where we notice that the introduction of price
for content reduces the observed recall (notice that recall has
the highest values for = 0, i.e., no pricing is involved in
information producer ranking). This is an important result,
showing that when information producers charge for content,
consumers trade quality for cheaper options. Notice also that
for every value tested recall is retained high, as long as it
plays the most important role in the ranking ( &lt; 50%). This
was also expected, since when price is of importance,
consumers will choose cheaper information producers, leading to
a reduction in the observed recall. Additionally, when the price
is the only ranking criterion for users ( = 1), recall is close to
that of a random choice of information producers (remember
that users monitor only 10% of information producers in the
system).</p>
        <p>0.2
0.4
0.8</p>
        <p>1</p>
        <p>Figure 4 shows how recall is affected for three different
values of , when increases. When one of the two components
becomes dominant in the ranking function, it outweighs the
effect of the other. This is in line with our expectations, since
price dominates the ranking function and quality is sacrificed
to reduce costs. Finally, notice that consistent recall between
β = 0.2
ββ == 00..58
0.4
0.2
lcea 0.3
r
0.6
0.5
0.4
0.1
0
0
0.2
0.4
0.6
0.8</p>
        <p>1
β
0.2
0.4
0.8</p>
        <p>1.0
0.6
β
different values of is an effect of the modelling of AGORA as
a closed system where monetary flow is limited by the budgets
of the users and no new wealth is produced.</p>
        <p>Figure 5 shows how differently users rank information
producers for varying values of and . The difference in
the ranking of information producers is measured using an
extension of Spearman’s footrule metric. To produce a point
in the graph we compare the ranked lists of information
producers for each pair of users for and each value of in
the x-axis and average the results. Notice that the case of
= 0 is omitted as the extended Spearman’s footrule metric
is always 0 since the lists compared are identical (users take
into account only the quality to rank information producers).
When increases (i.e., price becomes more important in
the ranking process) Spearman’s metric increases too, as
information producers with high quality get lower positions in
the ranking, while information producers with lower quality
(but cheaper) are ranked high. Finally, when the price of
an information producer is not associated with its quality
(random price setup or = 100%), there are big differences in
the ranking of information producers (especially in the cases
where price matters more – the leftmost points in the graph)
due to the introduced randomness.
0.2
0.4
0.6
0.8</p>
        <p>1.0
β
Fig. 6. Message traffic against .
3.5
3
lrcea 0.3
0.45
0.4</p>
      </sec>
      <sec id="sec-7-3">
        <title>C. System Performance</title>
        <p>In the first series of experiments we targeted the
performance of AGORA in terms of message traffic. Figure 6 shows
that the message traffic per user incurred in AGORA is reducing
as increases. This happens because users utilise the price
component to rank information producers, thus choosing those
of lower price and poor quality. This reduces the overall
message number as less content is exchanged in the system
due to users subscribing to non-expert information producers;
our observations here are consistent with those in Section V-B
that correlate recall and .</p>
        <p>Figure 7 demonstrates the total amount of traffic observed in
AGORA and how this traffic is split in the various message
categories, as the price-quality correlation is varied. As expected
the heart-beat protocol messages dominate the messaging load,
as necessary messages with information producer statistics
and prices are disseminated in the system. Notice also that
these messages are not affected by price-quality correlation,
since the information producers have to update their
publication statistics and prices, regardless of their customer base.
Finally, notice that the number of messages for querying
for/subscribing to/receiving content are affected as increases
since information producers that are of high quality widen their
customer base with more users.</p>
      </sec>
      <sec id="sec-7-4">
        <title>D. Varying the behaviour of information producers</title>
        <p>In this section we look into recall and how this is affected by
two very different information producer behaviours: topic
specialisation and topic shift. In topic specialisation, information
producers disregard market conditions and maintain their topic
specialisation even if this results in lower revenues. Contrary in
topic shift, information producers may alter their specialisation
topic over time based on changes in user demand, revenue and
market conditions. In the latter case, an information producer
initially publishes content from one category, and some rounds
later may decide to switch to a different category to simulate
changes in portfolio or a different business strategy.</p>
        <p>Figure 8 shows the observed recall for both scenarios and
different values of . The most important observations in this
0
0.2
0.4
0.6
0.8</p>
        <p>1
β
graph are (i) the drop in recall for both scenarios as increases
and (ii) the higher recall values for the case of topic
specialisation. The reason for the first observation is the shift of users
to cheaper information producers due to the importance of
content price in the ranking function. Additionally, the reason
for the second observation is that the build up of expertise by
the information producers reflects on the higher quality values,
which in turn leads to ranking quality information producers
higher. Contrary, when information producers shift their topic,
users are not able to correlate price and quality and thus,
select information producers of poor quality (hence the drop
in recall).</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>VI. CONCLUSIONS AND FUTURE WORK</title>
      <p>In this work, we proposed AGORA, a social networking
architecture and the related distributed protocols to facilitate
distributed content management in the form of information
retrieval and information filtering. In addition, we introduced
a novel selection mechanism for information producers that
allows users to rank them according to (i) their expertise,
(ii) their predicted future content publications, and (iii) the
price of content they deliver. The experimental evaluation of
our proposal demonstrated the behaviour of such a content
marketplace in terms of price/quality tradeoffs, recall and
message traffic. To the best of our knowledge, these are
the first results that connect recall and message traffic in
a distributed social network with the content cost, and put
economic modelling at the heart of system design. The most
important outcome of our study is that price should participate
in the ranking of information producers less than a quarter
of the total score to avoid the delivery of irrelevant content
to users. We also showed that the introduction of content
price improves system scalability by reducing message traffic
and imposing a reasonable use of resources to stakeholders.
Overall, introducing a monetary value for the production and
dissemination of content in a distributed social network proves
an interesting business model that conserves resources and
improves scalability. However, this should be executed in a
careful fashion to avoid user dissatisfaction that may rise from
the content cost itself, and the reduced access to relevant
information.</p>
      <p>
        Future directions of research include more extensive
experimentation (using a grid service for vast-scale experiments
and analytics), more detailed economic modelling (e.g., model
AGORA as an open system, perform monetary flow
monitoring/analysis, incorporate agent-style BDI modelling [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]), and
implementation of a prototype system.
      </p>
      <p>ACKNOWLEDGEMENTS</p>
      <p>The author would like to thank the anonymous reviewers
for their comments and suggestions on improving the quality
and presentation of the work.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Abbas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pouwelse</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Epema</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Sips</surname>
          </string-name>
          .
          <article-title>A Gossip-Based Distributed Social Networking System</article-title>
          .
          <source>In WETICE</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Backes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Maffei</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Pecina</surname>
          </string-name>
          .
          <article-title>A Security API for Distributed Social Networks</article-title>
          .
          <source>In NDSS</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>L. P.</given-names>
            <surname>Breker</surname>
          </string-name>
          .
          <article-title>A survey of network pricing schemes</article-title>
          .
          <source>In Theoretical Computer Science</source>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Buchegger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Schioberg</surname>
          </string-name>
          , L.
          <string-name>
            <surname>-H. Vu</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <surname>A. Datta.</surname>
          </string-name>
          <article-title>PeerSoN: P2P social networking: early experiences and insights</article-title>
          .
          <source>In SNS</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>C.</given-names>
            <surname>Chatfield</surname>
          </string-name>
          .
          <source>The Analysis of Time Series - An Introduction. CRC Press</source>
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Crespo</surname>
          </string-name>
          and
          <string-name>
            <given-names>H.</given-names>
            <surname>Garcia-Molina</surname>
          </string-name>
          .
          <article-title>Routing indices for peer-to-peer systems</article-title>
          .
          <source>In Proceedings of 22nd IEEE International ICDCS Conference</source>
          , pages
          <fpage>23</fpage>
          -
          <lpage>32</lpage>
          , Vienna, Austria,
          <year>July 2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>L. A.</given-names>
            <surname>Cutillo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Molva</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Strufe</surname>
          </string-name>
          .
          <article-title>Safebook: A privacy-preserving online social network leveraging on real-life trust</article-title>
          .
          <source>IEEE Communications Magazine</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>A. de Spindler</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Grossniklaus</surname>
            , and
            <given-names>M. C.</given-names>
          </string-name>
          <string-name>
            <surname>Norrie</surname>
          </string-name>
          .
          <article-title>Development Framework for Mobile Social Applications</article-title>
          . In CAiSE,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>J. B.</given-names>
            <surname>DeLong. Six Families Budget Their</surname>
          </string-name>
          <article-title>Money</article-title>
          .
          <source>In Lecture notes for American Economic History</source>
          , University of California at Berkeley,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dumrogsiri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Jain</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Moinzadeh</surname>
          </string-name>
          .
          <article-title>A supply chain model with direct and retail channels</article-title>
          .
          <source>In European Joumal of Operational Research</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Faig</surname>
          </string-name>
          and
          <string-name>
            <given-names>B.</given-names>
            <surname>Jerez</surname>
          </string-name>
          . Inflation, Prices, and
          <article-title>Information and Competitive Search</article-title>
          .
          <source>Journal of Macroeconomics</source>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>L.</given-names>
            <surname>Forker</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Stannack</surname>
          </string-name>
          .
          <article-title>Cooperation versus Competition: do buyers and suppliers really see eye-to-eye?</article-title>
          <source>In European Journal of Purchasing and Supply Management</source>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Georgeff</surname>
          </string-name>
          , B.
          <string-name>
            <surname>Pell</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Pollack</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Tambe</surname>
            , and
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Wooldridge</surname>
          </string-name>
          .
          <article-title>The Belief-Desire-Intention Model of Agency. In Intelligent Agents V: Agents Theories, Architectures, and</article-title>
          <string-name>
            <surname>Languages</surname>
          </string-name>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>K.</given-names>
            <surname>Graffi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Gross</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mukherjee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kovacevic</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Steinmetz</surname>
          </string-name>
          . LifeSocial.KOM:
          <article-title>A P2P-Based Platform for Secure Online Social Networks</article-title>
          .
          <source>In P2P</source>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>B.</given-names>
            <surname>Greschbach</surname>
          </string-name>
          , G. Kreitz, and
          <string-name>
            <given-names>S.</given-names>
            <surname>Buchegger</surname>
          </string-name>
          .
          <article-title>The devil is in the metadata - New privacy challenges in Decentralised Online Social Networks</article-title>
          .
          <source>In PerCom Workshops</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>F.</given-names>
            <surname>Gunther</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Manulis</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Strufe</surname>
          </string-name>
          .
          <article-title>Key management in distributed online social networks</article-title>
          .
          <source>In WOWMOM</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>L.</given-names>
            <surname>Han</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Punceva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Nath</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Muthukrishnan</surname>
          </string-name>
          , and
          <string-name>
            <surname>L. Iftode.</surname>
          </string-name>
          <article-title>SocialCDN: Caching techniques for distributed social networks</article-title>
          .
          <source>In P2P</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>B.</given-names>
            <surname>Johansson</surname>
          </string-name>
          and
          <string-name>
            <given-names>H.</given-names>
            <surname>Persson</surname>
          </string-name>
          .
          <article-title>Self-organised adjustments in a market with price-setting firms</article-title>
          .
          <source>In Chaos, Solitons and Fractals</source>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Kabir</surname>
          </string-name>
          , J. Han,
          <string-name>
            <given-names>J</given-names>
            .
            <surname>Yu</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Colman. SCIMS: A Social Context</surname>
          </string-name>
          <article-title>Information Management System for Socially-Aware Applications</article-title>
          . In CAiSE,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>B. B. A.</given-names>
            <surname>Kennickell</surname>
          </string-name>
          and
          <string-name>
            <given-names>K.</given-names>
            <surname>Moore</surname>
          </string-name>
          . Recent Changes in U.S. Family Finances:
          <article-title>Evidence from the 2001 and 2004 Survey of Consumer Finances</article-title>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>C.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Singh</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Sycara</surname>
          </string-name>
          .
          <article-title>A Dynamic Pricing Mechanism for P2P Referral Systems</article-title>
          . In AAMAS,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>G.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Shen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>L.</given-names>
            <surname>Ward</surname>
          </string-name>
          .
          <article-title>An efficient and trustworthy P2P and social network integrated file sharing system</article-title>
          .
          <source>In P2P</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>A.</given-names>
            <surname>Loupasakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ntarmos</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Triantafillou</surname>
          </string-name>
          . eXO: Decentralized Autonomous Scalable Social Networking.
          <source>In CIDR</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>M.</given-names>
            <surname>Mani</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.-M. Nguyen</surname>
            , and
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Crespi</surname>
          </string-name>
          . SCOPE:
          <article-title>A prototype for spontaneous P2P social networking</article-title>
          .
          <source>In PerCom Workshops</source>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>C. D. Manning</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Raghavan</surname>
          </string-name>
          , and H. Schu¨tze. Introduction to Information Retrieval. Cambridge University Press,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>G.</given-names>
            <surname>Mega</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Montresor</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Picco</surname>
          </string-name>
          .
          <article-title>Efficient dissemination in decentralized social networks</article-title>
          .
          <source>In P2P</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>J.</given-names>
            <surname>Mitchell-Wong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Goh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Chhetri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kowalczyk</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Q.</given-names>
            <surname>Vo</surname>
          </string-name>
          .
          <article-title>A Framework for Open, Distributed and Self-Managed Social Platforms</article-title>
          .
          <source>In VECN</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>R.</given-names>
            <surname>Narendula</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Papaioannou</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Aberer. My3</surname>
          </string-name>
          :
          <article-title>A highly-available P2P-based online social network</article-title>
          .
          <source>In P2P</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>C. H.</given-names>
            <surname>Ng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. C.</given-names>
            <surname>Sia</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C. H.</given-names>
            <surname>Chang</surname>
          </string-name>
          .
          <article-title>Advanced Peer Clustering and Firework Query Model in the Peer-to-Peer Network</article-title>
          .
          <source>In WWW</source>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>H.</given-names>
            <surname>Nottelmann</surname>
          </string-name>
          and
          <string-name>
            <given-names>N.</given-names>
            <surname>Fuhr</surname>
          </string-name>
          .
          <article-title>Evaluating Different Methods of Estimating Retrieval Quality for Resource Selection</article-title>
          .
          <source>In SIGIR</source>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>P.</given-names>
            <surname>Raftopoulou</surname>
          </string-name>
          and
          <string-name>
            <surname>E. Petrakis.</surname>
          </string-name>
          <article-title>iCluster: a Self-Organising Overlay Network for P2P Information Retrieval</article-title>
          . In ECIR,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>P.</given-names>
            <surname>Raftopoulou</surname>
          </string-name>
          , E. Petrakis, and
          <string-name>
            <given-names>C.</given-names>
            <surname>Tryfonopoulos</surname>
          </string-name>
          .
          <article-title>Rewiring strategies for semantic overlay networks</article-title>
          .
          <source>In DPD</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>P.</given-names>
            <surname>Raftopoulou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Tryfonopoulos</surname>
          </string-name>
          , E. Petrakis, and
          <string-name>
            <given-names>N.</given-names>
            <surname>Zevlis</surname>
          </string-name>
          . DS4:
          <article-title>Introducing Semantic Friendship in Distributed Social Networks</article-title>
          .
          <source>In CoopIS</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>R.</given-names>
            <surname>Sharma</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Datta</surname>
          </string-name>
          . SuperNova:
          <article-title>Super-peers based architecture for decentralized online social networks</article-title>
          .
          <source>In COMSNETS</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>P.</given-names>
            <surname>Stuedi</surname>
          </string-name>
          , I. Mohomed,
          <string-name>
            <given-names>M.</given-names>
            <surname>Balakrishnan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Mao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Ramasubramanian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Terry</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Wobber</surname>
          </string-name>
          . Contrail:
          <article-title>Enabling Decentralized Social Networks on Smartphones</article-title>
          . In Middleware,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>F.</given-names>
            <surname>Tegeler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Koll</surname>
          </string-name>
          , and
          <string-name>
            <given-names>X.</given-names>
            <surname>Fu</surname>
          </string-name>
          . Gemstone:
          <article-title>Empowering Decentralized Social Networking with High Data Availability</article-title>
          .
          <source>In GLOBECOM</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>B.</given-names>
            <surname>Upadhyaya</surname>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Choi</surname>
          </string-name>
          . Social Overlay:
          <article-title>P2P Infrastructure for Social Networks</article-title>
          .
          <source>In NCM</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38] T. U¨stu¨ner and
          <string-name>
            <given-names>D.</given-names>
            <surname>Godes</surname>
          </string-name>
          .
          <article-title>Better Sales Networks</article-title>
          .
          <source>In Harvard Business Review</source>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>H. R.</given-names>
            <surname>Varian</surname>
          </string-name>
          .
          <article-title>Pricing Information Goods</article-title>
          . In Research Libraries Group Symposium,
          <source>Harvard Law School</source>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>S.</given-names>
            <surname>Voulgaris</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. van Steen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Iwanicki</surname>
          </string-name>
          .
          <article-title>Proactive Gossip-based Management of Semantic Overlay Networks</article-title>
          .
          <source>CCPE</source>
          ,
          <volume>19</volume>
          (
          <issue>17</issue>
          ),
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [41]
          <string-name>
            <given-names>M.</given-names>
            <surname>Xue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Carminati</surname>
          </string-name>
          , and
          <string-name>
            <surname>E. Ferrari.</surname>
          </string-name>
          <article-title>P3D - Privacy-Preserving Path Discovery in Decentralized Online Social Networks</article-title>
          .
          <source>In COMPSAC</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          [42]
          <string-name>
            <given-names>C.</given-names>
            <surname>Zimmer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Tryfonopoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Berberich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Koubarakis</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Weikum. Approximate Information</surname>
          </string-name>
          <article-title>Filtering in Peer-to-Peer Networks</article-title>
          .
          <source>In WISE</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [43]
          <string-name>
            <given-names>C.</given-names>
            <surname>Zimmer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Tryfonopoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Berberich</surname>
          </string-name>
          , G. Weikum, and
          <string-name>
            <given-names>M.</given-names>
            <surname>Koubarakis</surname>
          </string-name>
          .
          <article-title>Node Behavior Prediction for Large-Scale Approximate Information Filtering</article-title>
          .
          <source>In LSDS-IR @ SIGIR</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>