<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Patient Engagement: Theoretical and Heuristic Approaches for Supporting the Clinical Practice.</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Italo Zoppis</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Riccardo Dondi</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sara Manzoni</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giancarlo Mauri</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department Computer Science, University of Milano Bicocca</institution>
          ,
          <addr-line>Milano</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Letters</institution>
          ,
          <addr-line>Philosophy, Communication</addr-line>
          ,
          <institution>University of Bergamo</institution>
          ,
          <addr-line>Bergamo</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Social interaction allows to support the disease management by creating online spaces where patients can interact with clinicians, and share experiences with other patients. Therefore, promoting targeted communication in online social spaces is a mean to group patients around shared goals, offer emotional support, and nally engage patients in their healthcare decision-making process. In this paper, we approach the argument from a theoretical perspective: we design an optimization problem aimed to encourage the creation of (induced) sub-networks of patients which, being recently diagnosed, wish to deepen the knowledge about their medical treatment with some other similar pro led patients, which have already been followed up by speci c (even alternative) care centers. In particular, due to the computational hardness of the proposed problem, we provide approximated solutions based on distributed heuristics (i.e., Genetic Algorithms). Results are given for simulated data using Erdos-Renyi random graphs.</p>
      </abstract>
      <kwd-group>
        <kwd>Social Network</kwd>
        <kwd>Distributed Heuristic</kwd>
        <kwd>Genetic Algorithm</kwd>
        <kwd>Dense Communities</kwd>
        <kwd>2-club optimization problem</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Social media can directly support the disease management by creating online
spaces where patients can interact with clinicians, and share experiences with
other patients [
        <xref ref-type="bibr" rid="ref11 ref27">11,27</xref>
        ]. For example, cancer patients use Twitter to discuss
treatments and provide psychological support [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ], and online engagement seems to
correlate with lower levels of self reported stress and depression [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>
        Similarly, wellness programs frequently incorporate social media to create a
sense of community [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ], group people around shared goals, and offer social and
emotional support. A trial reported that adding on line community features to
an Internet-mediated wellness and walking program improves adherence, and
did reduce participant attrition [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ].
      </p>
      <p>Nevertheless, much more work remains to be carried out for sharing targeted
and optimized information content. How can we optimize a procedure which is
able to facilitate the encounter between patients who want to deepen or share
experiences about treatments, care points, and specialists? How to correlate, for
example, similar clinical pro les, while inducing networks of medical stuff, and
treated patients which offer their availability to share experiences or
suggestions? These are exactly the issues we deal with in this paper.</p>
      <p>In particular, we focus on the problem of creating a space of individuals and
care centers, by considering the case where recently diagnosed patients could be
interested to meet some other patients (experience), for sharing information on
their disease and the suggested (or available) care center. In this situation, it
would be useful, for example, to encourage the diagnosed subjects to socialize,
and confront with the experience of other patients with similar clinical pro le,
who have been already followed up within the same (or even alternative)
proposed care point.</p>
      <p>
        It is clear that a proper handling of procedures and data is fundamental in
order to convert available information into useful formulation that leads to
particular (induced) communities. From a theoretical perspective this consideration
can be expressed as the problem of nding dense or cohesive subgraphs (with
particular properties), inside a network. Unfortunately, as we will report in the
next sections, the intrinsic complexity of the considered computational problem
make optimization potentially impracticable. For this reason, we design a
distributed heuristic (i.e., Genetic Algorithms, GAs) to seek faster approximation
solutions (see, e.g., [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] for details). In particular, we propose a general
framework, which, similarly to a Map-Reduce programming paradigm, abstracts away
the complexity of learning by allowing programmers to describe their processes
in terms of two main procedures, here referred as Training, and Combine. With
this approach we are able to provide both distributed concept learning and a
proper delivering of the trained models on different nodes of the learning system.
      </p>
      <p>In Section 2 we introduce the main theoretical aspects, focusing on the
computational hardness of the considered problem (Sections 2.2). Then, we
discuss the GA-based approach to seek approximated results for our formulation
(Section 3). In Section 4, we extend this approach to a general distributed
environment. Finally, after reporting numerical experiments on simulated data
(Section 5), we conclude the paper (Section 6) by discussing our results and
describing future directions of this research.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Main De nitions</title>
      <p>
        Suppose we wish to model the situation where recently diagnosed patients (shortly
reported as RD patients) are fostered to deepen the knowledge of their
diseases with some other patient experience (say, engaged patients or shortly, ED),
or they need more information concerning the suggested (or even alternative)
health-care centers (HC). In this case, RD patients could bene t from the
social interaction with similarly pro led (ED) patients, which have already been
followed up by speci c HC centers. To this aim, it should be useful for a social
platform to encourage, and optimize, the creation of a sub-network from the
available social data. From a theoretical point of view, a network is most
commonly modeled using a graph which represents relationships between objects, V
(vertices), through a set of edges, E. In this way, our goal can be formulated
by maximizing, within a de ned graph, a cohesive sub-graph (i.e., by seeking
the largest cohesive sub-graph) with particular properties (as we will detail in
the following). Finding cohesive subgraphs inside a network is a well-known
problem that has been applied in several contexts. For example, in computational
biology, dense sub-graphs are sought in protein interaction networks, as they
are considered related to protein complexes [
        <xref ref-type="bibr" rid="ref31 ref4">4,31</xref>
        ]. In gene networks, dense
subgraphs are generally applied to detect relevant co-expression clusters [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ]. While
a classical approach to compute dense sub-graphs is the identi cation of cliques
(i.e., complete sub-graphs induced by a set of vertices which are all pairwise
connected by an edge), this de nition is often too stringent for particular
applications. This is the case, when the knowledge on how an individual (vertex)
is embedded in the sub-network (e.g.,, some vertices could act as \bridges"
between groups, as in our case) is a critical issue to take into account. Therefore
alternative de nitions of cohesive sub-graphs can be introduced, for example by
relaxing some constraints, leading to the concept of relaxed clique [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. Here,
we follow this approach by relaxing the de nition of distance between vertices.
In a clique distinct vertices are at distance of 1, in our case, vertices can be at
distance of at most s = 2. A sub-graph where all the vertices are at distance of
at most 2 is called a 2-club (or, more in general, s-club for different values of
s). 2-clubs have been extensively applied to social networks analysis [
        <xref ref-type="bibr" rid="ref1 ref19 ref21 ref22">21,1,19,22</xref>
        ],
and biological network analysis, e.g., protein-protein interaction networks, [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ].
Note that, when s = 1, a 1-club is exactly a clique. An example of 2-club with
5 vertices is represented in Figure 1.
Given a set V ′ V , we say that V ′ induces the graph G[V ′]3.
      </p>
      <p>The distance dG(u; v) between two vertices u; v of G, is the length of a shortest
path in G which has u and v as endpoints. The diameter of a graph G = (V; E)
is maxu;v2V dG(u; v), i.e., the maximum distance between any two vertices of
V . In other words, a 2-club in a graph G = (V; E) is a sub-graph G[W ], with
W V , that has diameter of at most 2.</p>
      <p>Moreover, given a vertex v 2 V , we de ne the set N (v) as follows</p>
      <p>N (v) = fu : fv; ug 2 Eg
N (v) is called the neighborhood of v. Finally, we denote by N [v] the closed
neighborhood of v, where N [v] = N (v) [ fvg.</p>
      <p>We will formulate the problem using 2-clubs whose shortest path connecting
RD patients with specialist centers/staff (e.g., care center, hospital, or clinical
staff) has to \transit" through, at least one EP patient who has already been
followed up by the considered specialist \point".</p>
      <p>Formally, for any pair, (d; h), composed by the recently diagnosed patient,
d, and, e.g., the health care center, h, the social platform should suggest for the
patient d, to compare (even to meet) with an identi ed (available) patient x's
experience. More speci cally, we are currently seeking (within the input \social
graph") a 2-Club, G[D [X [H], where D, X and H represent the sets of recently
diagnosed patients, experienced patients, and care point centers. respectively.
Please note that, when such a structure (i.e., a maximum size 2-clubs) exists,
within the identi ed starting social space, then for any pair of vertices, it must
exist at least one simple path of length 2, i.e., a path composed by a triple of
vertices. This, in turn, will be also true for any pair, (d; h) where d 2 D; h 2 H.
Indeed, our goal will be to nd a largest-size 2-clubs which has the further
property of providing, for any pair (d; h), a shortest path characterized by the
triple of vertices (d; x; h) 2 (D X H). In this case, the set of edges, modeling
the starting social network, will be de ned as follow.</p>
      <p>{ Edges between similar pro led patients.
{ Edges expressing that an experienced patient x, has already been followed
up from the care center, h. In this case the edges in X H will be constructed
by knowing both the clinical history of each (experienced) patient, x, and
the clinical staff or hospital h, which has already properly followed up the
patients, x.
{ Edges between vertices h1; h2 2 H, for example because two care centers are
similar (have similar services or are part of the same institution).</p>
      <p>In this situation, the simple path given by the triple of vertices (d; x; h) 2
(D X H) in the 2-club G would suggest for patient d 2 D to contact the
patient, x 2 X, about the health care center (or specialist staff), h 2 H. For
3 Notice that all the graphs we consider are undirected.
sake of clarity, before de ning computationally the problem, we refer to any
pair of vertices, (d; h) 2 D H (such that the minimum path connecting d
to h is given by the vertex sequence (d; x; h), for any x 2 X), as a "feasible
pair". Considering the above discussion, we can de ne the following variant of
the 2-clubs maximization problem.</p>
      <p>Problem 1. Maximum 2-Club (Max-2-club)
Input: a graph G = (D [ X [ H; R [ F ).</p>
      <p>Output: a set V ′ D [ X [ H such that G[V ′] is a 2-club having maximum
size, and for each pair of vertices (d; h) 2 D H in G[V ′], a minimum path
connecting d to h is given by the vertex sequence (d; x; h) for some x 2 X (i.e.,
(d; h) is feasible).
2.2</p>
      <sec id="sec-2-1">
        <title>Computational hardness</title>
        <p>
          The complexity of the problem of Maximum s-club has been extensively studied
in literature, and unfortunately it turns to be NP-hard for each s 1 [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ];
Maximum s-Club is NP-hard even if the input graph has diameter s + 1, for each
s 1 [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. The same property holds for our variant of Maximum 2-club. Indeed,
the computation of a 2-club of maximum size containing a speci c vertex v is
also NP-hard. By de ning D = fvg, X = N (v) and H the remaining set of
vertices, it follows that the "feasibility" property holds.
        </p>
        <p>
          Given an input graph G = (V; E), Maximum s-club is not approximable within
factor jV j1=2 ", for any " &gt; 0 and s 2 [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. On the positive side,
polynomialtime approximation algorithms [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] have been given, with factor jV j1=2 for every
even s 2, and factor jV j2=3 for every odd s 3. The parametrized
complexity of Maximum s-Club has also been studied, leading to xed-parameter
algorithms [
          <xref ref-type="bibr" rid="ref10 ref18 ref28">28,18,10</xref>
          ]. Maximum 2-Club has been considered also for speci c
graph classes [
          <xref ref-type="bibr" rid="ref15 ref16">16,15</xref>
          ].
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>A Genetic Algorithm</title>
      <p>
        The complexity (and approximation) complexity of the problem introduced so
far make optimization potentially impracticable. For this reason, we designed
a Genetic Algorithm (GA) to seek faster approximation solutions see, e.g., [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]
for details. In particular, given an input graph G = (V; E), the GA represents
a solution (a subset V ′ V such that G[V ′] is a 2-club of G) as a binary
chromosome c, of size n = jV j, such that for all vi 2 V , c[i] is either 1 or 0. Note
that, with a slight abuse of notation, we will denote by G[c] the subgraph of
G induced by the representation of chromosome c. Similarly, V [c] and E[c] will
denote the set of vertices (V ′) and edges of G[c] = G[V ′].
      </p>
      <p>During the offspring generation, chromosomes are interpreted as hypotheses of
feasible solutions (i.e., they may represent unfeasible solutions, for example an
s-club, with s &gt; 2, or a disconnected graph, during the rst phases of
evolution: they can later evolve in feasible solutions), undergoing to mutation,
crossvalidation and selection. Chromosome evaluation (i.e., hypothesis on a potential
2-club) is then provided, as usually, through the tness function.
3.1</p>
      <sec id="sec-3-1">
        <title>Fitness de nition</title>
        <p>In this paper, tness is designed to promote adaptation in such a way that new
candidate chromosomes, able to represent graphs with a feasible (correct)
diameter value (i.e., not grater than 2) will \evolve" through the elitism mechanism. In
this way, a small proportion of ttest candidates (chromosomes with high tness
values) are selected and preserved unchanged in the next generation. Speci cally,
given a chromosome c, and an input graph G = (V; E), the tness realizes such
a mechanism using the following quantities.
1. An estimation of the number of the pairs of vertices, (vi; vj ) 2 V [c] V [c], for
which at least one shortest path (between vi and vj ) of maximum distance 2
exists (in the considered chromosome representation G[c].). For a consistent
discussion, we will refer to such a collection of pairs (vi; vj ) as the Feasible
Set (FS), C V [c] V [c].
2. The number of vertices, nV , of the (sub)graph, G[c], induced by c.</p>
        <p>In particular, for any chromosome c, we observed a sample (of dimension
n) S C, and by considering the induced subgraph representation G[c], we
evaluated the following tness
f (nv; diam) =
{(1
(1
np=nS )nv
np=nS ) n1v
if 0</p>
        <p>diam
if 2 &lt; diam ;
2 ;
of the (\unfeasibile") set of pairs, (vi; vj ), whose shortest paths have
dimension greater than 2, and</p>
        <p>In this way, by using the frequency, (1 np=nS ), of the sampled feasible
pairs, the tness weights (when 0 diam 2) the number of vertecies nv in
G[c], thus promoting large (sub)graph (size). On the other hand, given diam &gt; 2
(i.e., unfeasible solutions) and (1 np=nS ), we have tness values which decrease
asymptotically as n1v , for large (sub)graph size, nv, thus penalizing the
corresponding chromosome.
where:
{
{ nv is the (set of vertices) cardinality of G[c] induced (speculated) by c;
{ np=nS is the (simple plug-in sample) proportion</p>
        <p>n
n 1 ∑ I((vi; vj )k 2= C);</p>
        <p>k=1
I(vi; vj )k =
{1 if (vi; vj )k 2= C ;
0
otherwise:
(1)
(2)
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Search operators</title>
        <p>To provide adaptation (with regard to the Maximum 2-club problem), we
redened the following standard operators.
{ Mutation (applied with probability 0.75). Mutation must be carefully
considered. Indeed, vertex mutations can affect not only the vertex involved in the
current operations, but can even modify other parts of the solution. For this
reason we de ned two further mutation operators, both with the objective
to correct hypotheses (i.e., chromosomes) consistently and parsimoniously.
More speci cally three type of mutations are considered.</p>
        <p>Base Mutation (applied with probability 0.5, given that mutation has
been chosen). Similarly to the standard case, each individual from the
current population at time i is modi ed with a given probability (see
details in the experimental section). In this case, mutation ips a bit of
the selected chromosome c, in such a way that the corresponding vertex
is either removed (i.e., bit ipped to 0) or added (i.e., bit ipped to 1) to
the solution induced by c. Note that eliminating or adding vertices can
induce subgraphs that actually do not have feasibility, i.e., they are not
2-clubs, since the property of being a 2-club is not hereditary. Moreover,
modi cations of chromosomes introduce the possibility of overcoming
local minimum.</p>
        <p>Non Standard Mutation 1 (applied with probability 0.25, given that
mutation has been chosen). This modi cation has the objective to correct
hypotheses (i.e., chromosomes) consistently and parsimoniously. Since
any chromosome, by representation, induce a sub-graph G[V ′] of G,
which in turn may re ects feasible solutions, such hypotheses are
partially veri ed using the following principle. Given a selected chromosome
c, a vertex v′ is (randomly) sampled from the set V+ = fvi : c[i] = 1g and
the minimum length of simple paths connecting every pair (vi; v′); vi 2
V+ is checked to be consistent with the chromosome representation, i.e.,
since each chromosome \speculates" a feasible 2-club, for such
hypothesis to be true, there must be, at least, a simple path of size at most equal
to 2 connecting any vi 2 V+ with v′. If a negative feedback is observed
after this veri cation, then the sampled vertex v′ is ipped to 0.
Non Standard Mutation 2 (applied with probability 0.25, given that
mutation has been chosen). This modi cation has the objective to
increment (parsimoniously) the size of a solution. In this case, given a selected
chromosome c a vertex v′ is sampled from V = fvj : c[j] = 0g and the
minimum length of simple paths connecting every pair (vi; v′) (vi 2 V+)
is checked to be consistent with the current representation of the
chromosome c. In this case, we consider to extend the hypothesis represented
by c, by adding v′ to V+ if the minimum distances from v′ to vertices of
V+ are not larger than 2.
{ Cross-over (applied with probability 0.25). The following operations are
provided.</p>
        <p>Standard cross-over (applied with probability 0.5, given that crossover
has been chosen). Offspring is generated by copying and mixing parts
of parents' chromosomes. In this case a standard one-point crossover is
implemented, and applied by the GA with probability 0.25.</p>
        <p>
          Logical AND between parents (applied with probability 0.25, given that
crossover has been chosen). This operation has the objective to provide
an offspring consistent with the selected parents. For this, pairs of
chromosomes are generated through logical AND operations between the
ascendents. This operator i chosen with probability 0.5 (conditioned by
the application of Logical Cross-over
Logical OR between parents (applied with probability 0.25, given that
crossover has been chosen). This operation has the objective to provide
offspring extending parent hypotheses. Extension is given by realizing a
logical OR operation between two selected parents.
{ Elitist selection (or elitism) In order to guarantee that solution quality does
not decrease from one generation to another [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], best (25%) hypotheses (high
tness values) are allowed to be part of a new offspring.
3.3
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>Computational Cost of the Search Operators</title>
        <p>Mutation and cross-over do not affect the computational cost of the GA
evolution. As discussed above, given a selected chromosome c, a vertex v′ is
(randomly) sampled from V+ = fvi : c[i] = 1g and the minimum length of simple
paths connecting every pair (vi; v′); vi 2 V+ is checked to be consistent with the
chromosome representation. For this, we intersect, for each vi 2 V+, the closed
neighborhoods N [vi] and N [v′]. If an empty set is obtained (i.e., any vertex
adjacent both to vi and v′ exists), then the corresponding i-th bit in v′, is ipped to
0, this way, thus not increasing the complexity of the procedure. A similar
argument applies when switching a bit from 0 to 1 (i.e., mutation 2; as de ned above).</p>
        <p>Conceptually, crossover is \easier" than mutation: it permits the generation
of new chromosomes by implementing simple logical AND (respectively, OR)
operations (as de ned previously), which still do not affect, computationally,
the process.</p>
        <p>Finally, by observing the tness formulation, we note that its complexity
is mainly due to the diameter evaluation, which is known to be bounded by
O(jV j3), thus preserving the tractability of the whole procedure.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Distributed learning</title>
      <p>As reported above, the huge amount of data obtainable from social interaction
activities, and the intrinsic complexity of the proposed computational problem
make practically untreatable the optimization goal. An effective distribution of
the computational load over different processors is therefore desirable.</p>
      <p>In this section, we focus on how to obtain an efficient load distribution, by
providing both parallel concept learning and a proper delivering of the trained
(i.e., evolved) models on different nodes of a learning system. To this concern,
we design a general framework, which applies the previous GA approach, and
similarly to the Map Reduce programming paradigm, abstracts away the
computational complexity of \learning" by allowing programmers to describe their
processes in terms of two main procedures, which we will call Training, and
Combine.
4.1</p>
      <sec id="sec-4-1">
        <title>A genetic cascade model</title>
        <p>Learning is practically based following the process represented in Figure 2.
Different nodes (referred as \sites") realize, locally, the training phases by
implementing the GAs described in Section 3. Beginning at the highest level, and
progressing towards the lowest level, information on models, trained by the
higher sites, are combined (equivalently, following a Map-Reduce \logic", models
are \reduced") into sites of lower levels. The following functions are provided by
the framework.
1. Init. To initialize both the training parameters and the number of rst
level sites. Init realizes the \data mapping" over the rst level sites, i.e., it
distributes the input data for the rst training phase.
2. Training. Sites read local data as mapped by the Init function, and
provides the rst level models (i.e., best tted chromosomes): GAs are executed
by using the process described in the Training function. Similarly to an
ensemble learning systems, sites are trained independently by applying the
approach described previously.
3. Combine. Sites are used as lters. Following a boosting approach, the i-th
level in Figure 2 attempts to create more accurate and general levels (i.e., i +
1) of solutions by inserting into site si+1;k's population (i.e., suggesting to site
k of level i + 1) the best chromosome (solutions) obtained by both \parents"
si;k and si;k+1. In other terms, a lower level population is initialized with
parents' best chromosomes.</p>
        <p>Large scale optimization is divided into independent, local smaller
optimization problems, where best tted chromosomes are distributed among different
sites of the system. Best solutions are nally \reduced" (i.e., Combined ) into the
lowest level chromosome population, in a hierarchical fashion. Moreover,
without any loss of generality, by assuming that patients and structures (i.e., input
data and care centers) are initially ordered, each site provides local solutions (if
any) on the corresponding input subnetwork. This process is detailed in Figure
3. In this case, since best tted chromosomes from site A1 can clearly represent
(speculate) feasible solutions from its input data only, one can obtain from site
A1 a partial community (solution) referred to the associated input subnetwork
A1, or equivalently corresponding to the respective adjacency (sub)matrix.
Similarly, from site A2 input data, we can obtain a feasible local community for
the corresponding subnetwork A2. The Combine operation extends the analysis
to obtain feasible chromosomes for the subnetwork B1: local solutions
represented by best chromosomes of site A1, and those from site A2 are combined and
inserted into the new population of site B1. The new offspring and the
obtained best chromosomes representation will provide optimized feasible solutions
(if any) for the corresponding extended input data, which in turns represent a
larger local solution suggested by the (\parents") A1 and A2 sites. Finally the
lowest level site completes the evolution (for all the input data), starting from an
optimized (suggested) population. Its solutions correspond (represent) to 2-club
communities for the global input network.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Numerical Experiments</title>
      <p>
        The genetic algorithm described in Sec. 3 was coded in R using the Genetic
Algorithm (\igraph") package [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ]. As for the cascade model, we used the standard
\Multiprocessing" Pyhton package, working on local concurrency with 4 cores
(i.e., the cascade model uses 4 starting sites). R and Python interfaces were
managed by the rpy2 (https://rpy2.readthedocs.io/en/version_2.8.x)
utility. Experiments are executed on Apple OSX 10:12:6; System Type MacBook
Pro Retina; Processor: Intel(R), Core(TM) i5-6360U CPU @ 2.00GHz, 3.100
Ghz, 2 Core(s), 4 Logical Processors; Installed Physical Memory (RAM) 8,00
GB. Results are given for synthetic data, by sampling Erdos-Renyi random
graphs ER(n; p) with different values of number of vertices, n, and probability,
p, to create edges between (pair of) vertices [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Numerical experiments have
the main objective to evaluate both the effectiveness of the distributed learning
and the capability of the GAs to obtain correct solutions in a reasonable time.
      </p>
      <p>To provide correctness at \reasonable" cost, we followed the standard practice
of the evolutionary algorithms: keep the tractability of the search operators and
the tness, and promote, at the same time, evaluable chromosomes, which in our
case, provide feasible input diameter values. Moreover, a widely applied principle
for termination has been applied: we set up both the number of re-evaluation
of the tness over new populations (equivalently, the number of the GA
iterations), and the number of consecutive generations without improvement of the
best tness value4. Note that to check the robustness of the solutions, each ER
model has been sampled iteratively 5 times. The corresponding standard
deviation, obtained in each experiment, will be reported together with the obtained
performances in the following section.
5.1</p>
      <sec id="sec-5-1">
        <title>Results</title>
        <p>The results are reported in Tables 1 and 2. In particular the following attributes
are considered.</p>
        <p>
          { Input and output diameters. The best GA solution is re-coded, and the
resulting graph diameter is reported.
{ Number of Output vertices. Input vertices are given by the rst parameter,
n, of the random model ER(n; p)
{ Fitness value. Fitness is described in Sec. 3.
{ Ratio between the number of the input graph vertices and the number of
vertices of the resulting 2{club.
4 See, e.g. [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ], for a critical analysis of various aspects associated with the speci cation
of termination conditions in a simple genetic algorithm.
before the GA search is halted.
        </p>
        <p>
          iterations of the lowest site.
{ Average CPU User Time per level in seconds. In case of a single processing
unit, i.e., standard GA evolution, this quantity corresponds to the GA
execution time. When the distributed process is considered this time is averaged
by the number of framework levels.
{ Early stopping for no improvement. The number of consecutive generations
without improvement in the best tness value before the GA is stopped.
{ Max Number of Generation. The maximum number of iterations to run
{ Final generation number. Iteration number associated to the obtained
solution. Note that for the distributed evolution this number corresponds to the
The following main considerations emerge from the results.
{ In all model, execept ER(500; 0:1), we obtained feasible 2-clubs (with null
standard deviation).
{ As for the considered network: we are able to
nd combinatorial structures
which actually requires impractical computational time. In particular, this
is interesting for those solutions which have been obtained for a large input
graph (e.g., networks with more than 1000 vertices).
{ It is clear that, due to the complexity of the problem, we cannot compare
the size of the obtained 2-club community with the size of an optimal
solution. Therefore, to give a qualitative idea of the solutions, we reported the
ratio between the vertices of the input and the output graphs. Note that, the
approximation complexity for the Maximum 2-club shows that the problem
is not approximable within factor jV j1=2 ", for each " &gt; 0 [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. Thus, even for
the approximability, the problem is very hard to approach. Observing our
results, and in particular the ratio in Table 2, we nd that, the most
compelling solutions are those for which, the larger is the input graph (number
of vertices), the lower is the difference between the ratio and the unit.
{ While the size of the inferred communities does not seem to differ, the
(average) number of iterations of the last level site, for the distributed case,
is much lower than the iterations reported by the standard, centralized
evolution. This is also evident from the decrease in the average execution time
per level, reported by the distributed case. Since GA uses the same
parameters for the standard and the distributed evolution, this behavior can surely
be traced back to the initial suggestions that each lower level site receives
from the higher level.
{ Computational cost seems to depend by the edge number (i.e., high expected
connectivity of the random models E(n; p)) as well as the number of the
input vertices. As discussed for the tness 3.3, this assertion can be easily
justi ed by considering the diameter computational cost, which is known to
be bounded by O(jV j3).
6
        </p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusions</title>
      <p>
        While GAs optimization technique is not new in literature, a new design of
these models is now needed to cope with the hardness of many computational
problems, which actually nd new applications in many contexts [
        <xref ref-type="bibr" rid="ref13 ref14">13,14</xref>
        ]. In this
paper we focused on the optimization of a social network of patients which aim
to deepen the knowledge about the available care centers for their pathologies
through the help of other \experienced" patients. We considered this problem
from a computational point of view by de ning a variant of the max 2-club
problem.
      </p>
      <p>Due to computational complexity of the de ned formulation, we provided
GAbased heuristics on a distributed environment. The main advantage of this
approach can be shortly summarized as follows.</p>
      <p>{ Similarly to the Map Reduce programming paradigm, the genetic cascade
model described in Section 4.1 abstracts the computational complexity of
learning by allowing programmers to describe the learning process in terms
of Training, and Combine functions.
{ Every site (as described in Section 4.1) provides local learning over the
considered input graph. In fact, best tted chromosomes generated by each GA's
site represent local sub-graphs solutions. In this way, learning can be
addressed to obtain feasible communities (i.e., 2-clubs) for speci c issues: e.g.,
local optimized sub-networks characterized by particular (type of) patients
or care centers.
{ Large scale data optimization problems can be divided into independent,
smaller optimization tasks. In this case, only local solutions are distributed
downwards the system to create more accurate and general site level of
computation.
{ The proposed distributed learning environment is able to decrease the
standard GA evolution time thanks to a series of appropriate initial suggestions,
organized in a hierarchical manner.
{ Finally, by considering a different learning algorithm, we can easily generalize
the approach, simply by properly coding the Training function provided by
the framework.</p>
      <p>
        It is clear that, like in any problem solving paradigm, the question whether
the considered approach reaches optimal solutions cannot be unconditionally
positive expected. In fact, while the attempts to establish a theoretical account
of GA's operations have proved more difficult in literature [
        <xref ref-type="bibr" rid="ref12 ref2 ref25 ref33">33,12,25,2</xref>
        ], the
correctness of the solution is one of the desirable, yet not the only crucial issue
(another is, for example, the time needed to reach correctness.).
      </p>
      <p>Practice indicates that self-adaptation is a powerful tool to provide correct
solutions: new chromosomes (i.e., solution representations) are generated by
properly mutating, and/or recombining old solutions, and evaluable candidates (i.e.,
chromosomes) can survive (i.e., elitism) in a new population which, in turn, is
immediately re-evaluated through the tness function. Similarly, the number of
re-evaluations is one of the most commonly used practical measure for
assessing whether \efficiency", is (practically) achieved with reasonable cost. The
approach given in this paper follows such a standard practice: while keeping
the tractability of the search operators, i.e., mutation and crossover (as detailed
in Sec. 3.3), the tness (re)evaluation promotes correct populations by
guaranteeing, in our case, feasible diameter values (as discussed in Sec. 3.1).</p>
      <p>Many different extensions can be provided for this work. First of all, while
a detailed comparison, between the presented approach and others comparable
alternatives, is important for evaluating the practical conclusions of our paper,
here, to the best of our knowledge, there are no state-of-the-art works to
consider, for such a critical analysis. This question will be an issue which one should
be consider for a further future (literature) research, in detail. Moreover, the
distributed architecture reported in Section 4.1 was mainly introduced to help
the combination and ow of information from higher level sites to lower level
ones. In this paper, to speed up the experiments on laptops, we have used only a
reduced number of levels and sites. Although this architecture is easily scalable
to many different levels and sites, it is clear that, further architectures can be
similarly applied. This is another interesting issue for a future research in this
context.</p>
      <p>Finally, it is important to emphasize that the work described in this paper
has to be considered, as discussed in Section 1, a means to facilitate and
promote the patient engagement. It is not absolutely intended as a constrain to
the spontaneous nature of the communication and interaction activity, which
characterize the freedom of a social networks.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>R. D.</given-names>
            <surname>Alba</surname>
          </string-name>
          .
          <article-title>A graph-theoretic de nition of a sociometric clique</article-title>
          .
          <source>Journal of Mathematical Sociology</source>
          ,
          <volume>3</volume>
          :
          <fpage>113</fpage>
          {
          <fpage>126</fpage>
          ,
          <year>1973</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Lee</given-names>
            <surname>Altenberg</surname>
          </string-name>
          .
          <article-title>The schema theorem and prices theorem</article-title>
          .
          <source>Foundations of genetic algorithms, 3:23{49</source>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Yuichi</given-names>
            <surname>Asahiro</surname>
          </string-name>
          , Eiji Miyano, and
          <string-name>
            <given-names>Kazuaki</given-names>
            <surname>Samizo</surname>
          </string-name>
          .
          <article-title>Approximating maximum diameter-bounded subgraphs</article-title>
          .
          <source>In LATIN 2010: Theoretical Informatics, 9th Latin American Symposium</source>
          , Oaxaca, Mexico,
          <source>April 19-23</source>
          ,
          <year>2010</year>
          . Proceedings, pages
          <volume>615</volume>
          {
          <fpage>626</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Gary</surname>
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Bader</surname>
            and
            <given-names>Christopher W. V.</given-names>
          </string-name>
          <string-name>
            <surname>Hogue</surname>
          </string-name>
          .
          <article-title>An automated method for nding molecular complexes in large protein interaction networks</article-title>
          .
          <source>BMC Bioinformatics</source>
          ,
          <volume>4</volume>
          :
          <fpage>2</fpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Balabhaskar</given-names>
            <surname>Balasundaram</surname>
          </string-name>
          , Sergiy Butenko, and
          <string-name>
            <given-names>Svyatoslav</given-names>
            <surname>Trukhanov</surname>
          </string-name>
          .
          <article-title>Novel approaches for analyzing biological networks</article-title>
          .
          <source>J. Comb. Optim.</source>
          ,
          <volume>10</volume>
          (
          <issue>1</issue>
          ):
          <volume>23</volume>
          {
          <fpage>39</fpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Shumeet</given-names>
            <surname>Baluja</surname>
          </string-name>
          and
          <string-name>
            <given-names>Rich</given-names>
            <surname>Caruana</surname>
          </string-name>
          .
          <article-title>Removing the genetics from the standard genetic algorithm</article-title>
          .
          <source>In Machine Learning, Proc. of the 12th Int. Conf. on Machine Learning</source>
          , Tahoe City, California, USA, pages
          <volume>38</volume>
          {
          <fpage>46</fpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Christopher</surname>
            <given-names>E</given-names>
          </string-name>
          <string-name>
            <surname>Beaudoin</surname>
          </string-name>
          and
          <string-name>
            <surname>Chen-Chao Tao</surname>
          </string-name>
          .
          <article-title>Modeling the impact of online cancer resources on supporters of cancer patients</article-title>
          .
          <source>New Media &amp; Society</source>
          ,
          <volume>10</volume>
          (
          <issue>2</issue>
          ):
          <volume>321</volume>
          {
          <fpage>344</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>B.</given-names>
            <surname>Bollobas</surname>
          </string-name>
          . Random Graphs. Cambridge University Press,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Jean-Marie</surname>
            <given-names>Bourjolly</given-names>
          </string-name>
          , Gilbert Laporte, and
          <string-name>
            <given-names>Gilles</given-names>
            <surname>Pesant</surname>
          </string-name>
          .
          <article-title>An exact algorithm for the maximum k-club problem in an undirected graph</article-title>
          .
          <source>European Journal of Operational Research</source>
          ,
          <volume>138</volume>
          (
          <issue>1</issue>
          ):
          <volume>21</volume>
          {
          <fpage>28</fpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Maw-Shang</surname>
            <given-names>Chang</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ling-Ju</surname>
            <given-names>Hung</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chih-Ren Lin</surname>
          </string-name>
          , and
          <string-name>
            <surname>Ping-Chen Su</surname>
          </string-name>
          .
          <article-title>Finding large k-clubs in undirected graphs</article-title>
          .
          <source>Computing</source>
          ,
          <volume>95</volume>
          (
          <issue>9</issue>
          ):
          <volume>739</volume>
          {
          <fpage>758</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>Enrico</given-names>
            <surname>Coiera</surname>
          </string-name>
          .
          <article-title>Social networks, social media, and social diseases</article-title>
          .
          <source>BMJ: British Medical Journal (Online)</source>
          ,
          <volume>346</volume>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>Pierre</given-names>
            <surname>Collet</surname>
          </string-name>
          and
          <string-name>
            <surname>Jean-Philippe Rennard</surname>
          </string-name>
          .
          <article-title>Stochastic optimization algorithms</article-title>
          .
          <source>arXiv preprint arXiv:0704.3780</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>R.</given-names>
            <surname>Dondi</surname>
          </string-name>
          , G. Mauri,
          <string-name>
            <surname>and I. Zoppis.</surname>
          </string-name>
          <article-title>Orthology correction for gene tree reconstruction: Theoretical and experimental results</article-title>
          .
          <source>Procedia Computer Science</source>
          ,
          <volume>108</volume>
          :
          <fpage>1115</fpage>
          {
          <fpage>1124</fpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Riccardo</surname>
            <given-names>Dondi</given-names>
          </string-name>
          , Giancarlo Mauri, and
          <string-name>
            <given-names>Italo</given-names>
            <surname>Zoppis</surname>
          </string-name>
          .
          <article-title>Clique editing to support case versus control discrimination</article-title>
          .
          <source>In Intelligent Decision Technologies</source>
          <year>2016</year>
          , pages
          <fpage>27</fpage>
          {
          <fpage>36</fpage>
          . Springer,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Petr</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Golovach</surname>
          </string-name>
          , Pinar Heggernes, Dieter Kratsch, and
          <article-title>Arash Ra ey</article-title>
          .
          <article-title>Finding clubs in graph classes</article-title>
          .
          <source>Discrete Applied Mathematics</source>
          ,
          <volume>174</volume>
          :
          <fpage>57</fpage>
          {
          <fpage>65</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Sepp</surname>
            <given-names>Hartung</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Christian</given-names>
            <surname>Komusiewicz</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Andre</given-names>
            <surname>Nichterlein</surname>
          </string-name>
          .
          <article-title>Parameterized algorithmics and computational experiments for nding 2-clubs</article-title>
          .
          <source>J. Graph Algorithms Appl</source>
          .,
          <volume>19</volume>
          (
          <issue>1</issue>
          ):
          <volume>155</volume>
          {
          <fpage>190</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <given-names>Christian</given-names>
            <surname>Komusiewicz</surname>
          </string-name>
          .
          <article-title>Multivariate algorithmics for nding cohesive subnetworks</article-title>
          .
          <source>Algorithms</source>
          ,
          <volume>9</volume>
          (
          <issue>1</issue>
          ):
          <fpage>21</fpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <given-names>Christian</given-names>
            <surname>Komusiewicz</surname>
          </string-name>
          and
          <string-name>
            <given-names>Manuel</given-names>
            <surname>Sorge</surname>
          </string-name>
          .
          <article-title>An algorithmic framework for xedcardinality optimization in sparse graphs applied to dense subgraph problems</article-title>
          .
          <source>Discrete Applied Mathematics</source>
          ,
          <volume>193</volume>
          :
          <fpage>145</fpage>
          {
          <fpage>161</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Steven</surname>
            <given-names>Laan</given-names>
          </string-name>
          , Maarten Marx, and
          <string-name>
            <given-names>Robert J.</given-names>
            <surname>Mokken</surname>
          </string-name>
          .
          <article-title>Close communities in social networks: boroughs and 2-clubs</article-title>
          .
          <source>Social Netw. Analys. Mining</source>
          ,
          <volume>6</volume>
          (
          <issue>1</issue>
          ):
          <volume>20</volume>
          :1{
          <fpage>20</fpage>
          :
          <fpage>16</fpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <given-names>Melanie</given-names>
            <surname>Mitchell</surname>
          </string-name>
          .
          <article-title>An introduction to genetic algorithms. Complex adaptive systems</article-title>
          . MIT press, Cambridge (Mass.),
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <given-names>Robert</given-names>
            <surname>Mokken</surname>
          </string-name>
          .
          <article-title>Cliques, clubs and clans</article-title>
          .
          <source>Quality &amp; Quantity: International Journal of Methodology</source>
          ,
          <volume>13</volume>
          (
          <issue>2</issue>
          ):
          <volume>161</volume>
          {
          <fpage>173</fpage>
          ,
          <year>1979</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Robert J. Mokken</surname>
          </string-name>
          ,
          <string-name>
            <surname>Eelke M. Heemskerk</surname>
            , and
            <given-names>Steven</given-names>
          </string-name>
          <string-name>
            <surname>Laan</surname>
          </string-name>
          .
          <article-title>Close communication and 2-clubs in corporate networks: Europe 2010</article-title>
          .
          <article-title>Social Netw</article-title>
          .
          <source>Analys. Mining</source>
          ,
          <volume>6</volume>
          (
          <issue>1</issue>
          ):
          <volume>40</volume>
          :1{
          <fpage>40</fpage>
          :
          <fpage>19</fpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <given-names>Srinivas</given-names>
            <surname>Pasupuleti</surname>
          </string-name>
          .
          <article-title>Detection of protein complexes in protein interaction networks using n-clubs</article-title>
          .
          <source>In Evol. Comput., Machine Learning and Data Mining in Bioinf., 6th EvoBIO</source>
          <year>2008</year>
          , Naples, Italy, March
          <volume>26</volume>
          -28,
          <year>2008</year>
          , pages
          <fpage>153</fpage>
          {
          <fpage>164</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Caroline R Richardson</surname>
          </string-name>
          , Lorraine R Buis, Adrienne W Janney, David E Goodrich, Ananda Sen, Michael L Hess, Kathleen S Mehari, Laurie A Fortlage, Paul J Resnick,
          <string-name>
            <surname>Brian J Zikmund-Fisher</surname>
          </string-name>
          , et al.
          <article-title>An online community improves adherence in an internet-mediated walking program. part 1: results of a randomized controlled trial</article-title>
          .
          <source>J. of Med</source>
          . Internet Res.,
          <volume>12</volume>
          (
          <issue>4</issue>
          ),
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Gu</surname>
          </string-name>
          <article-title>nter Rudolph. Convergence properties of evolutionary algorithms</article-title>
          .
          <source>Kovac</source>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Mart</surname>
            n Safe, Jessica Carballido, Ignacio Ponzoni, and
            <given-names>Nelida</given-names>
          </string-name>
          <string-name>
            <surname>Brignole</surname>
          </string-name>
          .
          <article-title>On stopping criteria for genetic algorithms</article-title>
          .
          <source>Advances in Arti cial Intelligence{SBIA</source>
          <year>2004</year>
          , pages
          <fpage>405</fpage>
          {
          <fpage>413</fpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Eugenio</surname>
            <given-names>Santoro</given-names>
          </string-name>
          , Gianluca Castelnuovo, Italo Zoppis, Giancarlo Mauri, and
          <string-name>
            <given-names>Francesco</given-names>
            <surname>Sicurello</surname>
          </string-name>
          .
          <article-title>Social media and mobile applications in chronic disease prevention and management</article-title>
          .
          <source>Frontiers in psychology, 6</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28. Alexander Schafer, Christian Komusiewicz, Hannes Moser, and
          <string-name>
            <given-names>Rolf</given-names>
            <surname>Niedermeier</surname>
          </string-name>
          .
          <article-title>Parameterized computational complexity of nding small-diameter subgraphs</article-title>
          .
          <source>Optimization Letters</source>
          ,
          <volume>6</volume>
          (
          <issue>5</issue>
          ):
          <volume>883</volume>
          {
          <fpage>891</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29. L. Scrucca. GA:
          <article-title>A package for genetic algorithms in R</article-title>
          .
          <source>Journal of Statistical Software</source>
          ,
          <volume>53</volume>
          (
          <issue>4</issue>
          ):1{
          <fpage>37</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <given-names>Roded</given-names>
            <surname>Sharan</surname>
          </string-name>
          and
          <string-name>
            <given-names>Ron</given-names>
            <surname>Shamir</surname>
          </string-name>
          .
          <article-title>Center CLICK: A clustering algorithm with applications to gene expression analysis</article-title>
          .
          <source>In Proc. of the Eighth Int. Conf. on Intelligent Syst. for Molecular Biol., August 19-23</source>
          ,
          <year>2000</year>
          , La Jolla / San Diego, CA, USA, pages
          <volume>307</volume>
          {
          <fpage>316</fpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <given-names>V.</given-names>
            <surname>Spirin</surname>
          </string-name>
          and
          <string-name>
            <given-names>L. A.</given-names>
            <surname>Mirny</surname>
          </string-name>
          .
          <article-title>Protein complexes and functional modules in molecular networks</article-title>
          .
          <source>Proc. of the Nat. Academy of Sciences</source>
          ,
          <volume>100</volume>
          :
          <fpage>12123</fpage>
          {
          <fpage>12</fpage>
          {
          <fpage>128</fpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Atsushi</surname>
            <given-names>Tsuya</given-names>
          </string-name>
          , Yuya Sugawara, Atsushi Tanaka, and
          <string-name>
            <given-names>Hiroto</given-names>
            <surname>Narimatsu</surname>
          </string-name>
          .
          <article-title>Do cancer patients tweet? examining the twitter use of cancer patients in japan</article-title>
          .
          <source>Jour. of mMed. Internet research</source>
          ,
          <volume>16</volume>
          (
          <issue>5</issue>
          ),
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33.
          <string-name>
            <surname>Thomas</surname>
            <given-names>Weise</given-names>
          </string-name>
          , Michael Zapf, Raymond Chiong, and Antonio J Nebro.
          <article-title>Why is optimization difficult? In Nature-Inspired Algorithms for Optimisation</article-title>
          , pages
          <fpage>1</fpage>
          <lpage>{</lpage>
          50. Springer,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          34.
          <string-name>
            <surname>Italo</surname>
            <given-names>Zoppis</given-names>
          </string-name>
          , Giancarlo Mauri, Ferancesco Sicurello, Eugenio Santoro, Giada Pietrabissa, and
          <string-name>
            <given-names>Gianluca</given-names>
            <surname>Castelnuovo</surname>
          </string-name>
          .
          <article-title>Diabesity: A study for mhealth integrated solutions</article-title>
          .
          <source>In International Conference on Wireless Mobile Communication and Healthcare</source>
          , pages
          <volume>195</volume>
          {
          <fpage>199</fpage>
          . Springer,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>