<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>On Parameter Identification Methods for Markov Models Applied to Social Networks</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Denis Fedyanin</string-name>
          <email>dfedyanin@inbox.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>V.A. Trapeznikov Institute of Control Sciences, Russian Academy of Sciences Russia</institution>
          ,
          <addr-line>Moscow, 117997, Profsoyuznaya ulitsa, 65</addr-line>
        </aff>
      </contrib-group>
      <fpage>21</fpage>
      <lpage>29</lpage>
      <abstract>
        <p>In this paper we investigate the mutual influence of participants (agents) of a social network on each other using the framework of Markov models. The main objective of this study was to check several hypotheses concerning dependencies between the influence of agents and their impact on several computational models.</p>
      </abstract>
      <kwd-group>
        <kwd>social network</kwd>
        <kwd>dissemination of information</kwd>
        <kwd>Markov model</kwd>
        <kwd>influence</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        In this paper, we investigate the mutual influence of participants of a social network
(we will call them agents in accordance with the terminology used in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]). We used a
general approach taking its roots in previous work [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The initial project was divided
into several subprojects. The same data analysis methods were used but the input
datasets were different. Work [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] used an on-line community consisting of 964
members. In this paper we use well-known methods but apply them to a different on-line
community consisting of 2960 members.
      </p>
      <p>
        Influence is understood as a process of changes in a subject caused by the behavior
of other entities, their settings, intentions, views, assessments and their actions during
cooperation with them [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Observations of psychologists show [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] that agents in a
social network often do not have sufficient information for decision-making or are
unable to handle available information, which causes that their decisions can be based
on the decisions and/or views of other agents (social influence) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>Our analysis is based on data of three communities extracted from the Live Journal
website. Live Journal (http://www.livejournal.com) consists of blogs, which contain
sequences of messages called posts. Additionally we have available event logs, online
diaries and other website content including images, multimedia, texts, etc.</p>
      <p>The differences between a blog and a traditional diary are caused by the
environment: blogs are usually public and involve third-party readers, who may enter into a
public debate with the author, by commenting on blogs.</p>
      <p>Authors of posts are called bloggers. The majority of posts are available for
reading and commenting by other bloggers. Live Journal also provides an opportunity to
bloggers to unite in a community, and subscribe to a community to read their blogs. In
this case all the new posts in the selected blogs are displayed in a special news feed.
The blogger can belong to several communities at the same time.</p>
      <p>Information about communities, subscriptions and records themselves in most
cases, are open and accessible to any Internet user. For each of the communities anyone
can get the list of participants and the list of friends for each participant. Data consists
of three tables: the list of communities, the list of bloggers and the list of links
between bloggers. Further in this paper the terms "blogger" and "agent "will be used as
synonyms. The number of entries in the list of participants who are members of one
of the three communities of Live Journal are 964, 2960, 6587, and the number of links
are 6359, 49504 and 190427 respectively.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Motivation</title>
      <p>
        There are numerous works about properties of Markov models [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], which describe a
social network. Traditional, analysis is mostly theoretical [
        <xref ref-type="bibr" rid="ref1 ref16">1, 16</xref>
        ], but for the
successful application of the obtained theoretical results it is necessary to have
wellproven algorithms for identification of the model from the observed data. There are
some works where these methods are described, for example [
        <xref ref-type="bibr" rid="ref1 ref16">1,16</xref>
        ].
      </p>
      <p>Despite the high effectiveness of existing computational algorithms, the main
disadvantage of a Markov model is the need to build the initial matrix of influence in
the infinite or a high degree. In addition there is some uncertainty in the determination
of the initial matrix of mutual trust agents and their relationships with other agents.</p>
      <p>In this paper preliminary comparison of different methods for determining the
influence of agents of the social network without taking into consideration the data of
the messages exchanged between the agents, was conducted. The basis for the
identification of the network was data of the agents about whose blogs they read. The
format of available data and the sample data is presented in table 1.</p>
      <p>The ID of the
connection
between agents</p>
    </sec>
    <sec id="sec-3">
      <title>Review of existing mathematical models</title>
      <p>
        In literature, several approaches have been proposed to describe the interaction
between participants in a social network: a Markov model or model of De Groot [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], a
Linear Threshold Model [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], Independent Cascade Model [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], a filtering and intrusion
model, Ising model, cellular automata model, etc. [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. The models have been
investigated from several perspectives: the conditions of convergence of opinions of
members of the social network (see [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]), the dynamic of changes of power, the speed
of convergence, the condition of the uniqueness of the final opinion (see [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]). In this
work, we will use a model, described in detail in the book [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        In some models, ranking of agents is used, for example, by means of power
indices, index of Houde-Bakker [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], calculation of impact-factor of journals,
ranking of web pages, PageRank algorithms, as well as the ordering of parameters
"betweenness" [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] , "centrality"[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], "clustering" etc. [
        <xref ref-type="bibr" rid="ref12 ref15 ref5">5,12,15</xref>
        ].
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Abbreviations and definitions</title>
      <p>
        Because of its wide popularity, the description of Markov models in this work for the
sake of brevity was not given. Details can be found, for example, in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Note that
transitive influence of the i-th agent is defined by
      </p>
      <p>where aij is an element of the transitive closure of the matrix of direct influence,
can also be computed for the original stochastic matrix of direct influence. In this case
we will call it direct influence of i-th agent. The common method of agent
identification is based on the direct influence matrix which is derived from the
adjacency matrix by the formula where aij is a weight in the matrix of direct influence
and bij is an element of the adjacency matrix.</p>
      <p>wj   aij ,</p>
      <p>i
aij </p>
      <p>bij
 bij</p>
      <p>i
aij </p>
      <p>fi bij 
 fi bij </p>
      <p>i</p>
      <p>In some cases, one can try to take into account the impact of the authority of the
agent on the strength of the influence. We consider the case when the impact of
authority is proportional to the number of friends of the agent, where fj represents the
credibility of the i-th agent.
(1)
(2)
(3)
5</p>
      <p>Hypotheses</p>
      <p>
 
fi  x    bij 
 i 
(4)
1. Direct influence depends on the number of friends of an agent.
2. The number of friends is not correlated with transitive influence.
3. There is a correlation between transitive influences of agents, calculated by
different methods taking into account the authority of agents.
4. The direct influence of the agent does not correlate with its transitive influence.
5. Implementation of hypotheses does not depend on the size of the network.
6</p>
    </sec>
    <sec id="sec-5">
      <title>Data analysis results</title>
      <p>Testing hypothesis 1 reveals that direct influence depends on the number of friends of
an agent, and the relationship between them is close to a power-law function as shown
in figure 1. The coefficient of correlation is 0.85.</p>
      <p>Testing hypothesis 2 revealed that the number of friends is not correlated with
transitive influence. This is shown in figure 2. The coefficient of correlation is 0.72. In
addition to a linear dependence we observe the almost vertical "tail". Its presence
means that there are several agents who have a small number of friends, but a
substantial influence. In particular, there are three agents for whom the transitive
influence exceeds the transitive influence of the agent with the highest number of friends
(whose influence can be assumed). The existence of this phenomenon has been
theoretically predicted, but validation on real experimental data had not been performed
yet. Note that we do not yet have an explanation for the presence of only two main
lines in diagram and this issue should be investigated more thoroughly in the future.
Fig. 3. The transitive influence of agents, calculated without taking into account their authority
(horizontal axis) and calculated while taking authority into account (β=4) (vertical axis).</p>
      <p>Testing hypothesis 3 revealed that there is no correlation between transitive
influences of agents, neither without taking into account the authority, nor when
taking authority into account. In figure 3 we see that there is no correlation. The
coefficient of the correlation is 0.31. This is an important observation because by
making assumptions about the impact of the number of friends of an agent on his
credibility, you can get, generally speaking, different results. If we ignore some of the
outlier observations, we can once again identify two "tails". The main tail shows a
linear dependency, which is not equal to the constant β, and the second tail indicates a
non-increasing transitive influence of the agents, despite of the increase in their
transitive influence in the case of not taking into account their credibility. Moreover,
the figure shows that there are a number of influential agents with low authority. This
is consistent with the result that we received in the process of verification of
hypothesis 2. So it can be argued that the correlation between transitive influences is
complex in nature, and thus, hypothesis 3, cannot be affirmed without additional
clarifications.</p>
      <p>Testing hypothesis 4 revealed that the direct influence of an agent does not
correlate with its transitive visibility. In figure 4 you can see that the linear correlation
between direct and transitive influences is not clear. The coefficient of correlation is
0.78. This is also interesting, since we believe that not all agents can make decisions
based on the computation of transitive influence, and therefore are forced to use direct
influence measures.
Fig. 5. The dependency of the transitive influence of the agent (vertical) from its direct
influence (horizontal), calculated taking into account the authority of agents
Then we come to a rather obvious conclusion, that in real social networks such agents
can be mistaken. However, we conclude that there is no ground for hypothesis 4. In
the case shown in figure 5, the linear correlation is noticeable. However, it is different
for small values of direct influence than for higher values, where you can also identify
the correlation. The coefficient of the correlation is 0.92.</p>
      <p>Hypothesis 5 states the assumption that the validity of the hypotheses does not
depend on the size of the network. This has not yet been verified and is a possible
direction for future research.
7</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusions and future work</title>
      <p>The study showed the presence of a certain number of anomalies and effects that need
to be taken into account while identifying optimal Markov model parameters for
experimental data. It was shown that the credibility of agents has a significant impact on
the influence of agents. It was shown that there is a specific dependence between
transitive influence and direct influence. We identified an abnormal cluster of agents,
which have a small number of friends, but which have a great transitive influence.</p>
      <p>It may be interesting to continue our study by verifying hypothesis 5, as well as
including in the analysis the possibility of taking into account the exchange of messages
between agents. We also intend to investigate the ranking of agents using methods
such as alpha-centrality, the PageRank algorithm, as well as other widely used
methods based on direct and transitive influences of agents.</p>
      <p>Acknowledgement. The research is supported by the grant 10-07-00129 of Russian
Foundation for Basic Research. We would like to express our gratitude to Jonas
Poelmans and Dmitry Ignatov for improving the language quality.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Gubanov</surname>
            ,
            <given-names>D.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Novikov</surname>
            ,
            <given-names>D.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chkhartishvili</surname>
            ,
            <given-names>A. G.</given-names>
          </string-name>
          :
          <article-title>Social network: a model of information influence, control and confrontation</article-title>
          .
          <source>Fizmatlit</source>
          ,
          <volume>228</volume>
          p.,
          <string-name>
            <surname>Moscow</surname>
          </string-name>
          (
          <year>2010</year>
          )
          <article-title>(in Russian)</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Fedyanin</surname>
            ,
            <given-names>D.N.</given-names>
          </string-name>
          :
          <article-title>Application of Markov models for the analysis of influence of the participants of the Internet-community</article-title>
          .
          <source>In: Lecture Notes of the all-Russian scientific-practical conference "Analysis of Images, Networks and Texts" (AIST</source>
          <year>2012</year>
          ), pp.
          <fpage>132</fpage>
          -
          <lpage>143</lpage>
          .
          <article-title>The national Open University "INTUIT"</article-title>
          ,
          <source>Yekaterinburg</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <source>Glossary on Control Theory and its Applications</source>
          , http://glossary.ru (in Russian)
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Deutsch</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gerard</surname>
          </string-name>
          , H.:
          <article-title>Study of Normative and Informational Social Influence upon Individual Judgment</article-title>
          .
          <source>In: Journal of Abnormal and Social Psychology</source>
          . no.
          <issue>51</issue>
          , pp.
          <fpage>629</fpage>
          -
          <lpage>636</lpage>
          . (
          <year>1995</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Zuyev</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fedyanin</surname>
            ,
            <given-names>N.A.</given-names>
          </string-name>
          :
          <article-title>Model of management of views agents in co-social networks</article-title>
          .
          <source>In: the Problems of management. № 1</source>
          . pp.
          <fpage>37</fpage>
          -
          <lpage>45</lpage>
          . ICP RAS, Moscow (
          <year>2011</year>
          ).
          <article-title>(in Russian)</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>DeGroot</surname>
          </string-name>
          , M.H.:
          <article-title>Reaching a Consensus</article-title>
          .
          <source>In: Journal of American Statistical Association. №69</source>
          , pp.
          <fpage>118</fpage>
          -
          <lpage>121</lpage>
          (
          <year>1974</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ganovetter</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Threshold Models of Collective Behavior</article-title>
          .
          <source>In: American Journal of Sociology</source>
          , vol.
          <volume>83</volume>
          . №6, pp.
          <fpage>1420</fpage>
          -
          <lpage>1443</lpage>
          (
          <year>1978</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Goldberg</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Libai</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Muller</surname>
          </string-name>
          , E.:
          <article-title>Talk of the Networks: A Complex Systems looks at the Underlying Process of Word-of-Mouth</article-title>
          . In: Marketing Letters, №
          <volume>2</volume>
          , pp.
          <fpage>11</fpage>
          -
          <lpage>34</lpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Berger</surname>
            ,
            <given-names>R.L.</given-names>
          </string-name>
          :
          <article-title>Necessary and Sufficient Conditions for Reaching a Consensus using DeGroot's method</article-title>
          .
          <source>In: Journal of American Statistical Association</source>
          , vol.
          <volume>76</volume>
          , pp.
          <fpage>415</fpage>
          -
          <lpage>419</lpage>
          (
          <year>1981</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Golub</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jackson</surname>
            ,
            <given-names>M.O.</given-names>
          </string-name>
          :
          <article-title>Naive Learning in Social Networks: Convergence, Influence and Wisdom of Crowds</article-title>
          .
          <source>Technical Report 64</source>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Hoede</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bakker</surname>
          </string-name>
          , R.:
          <article-title>A Theory of Decisional Power</article-title>
          .
          <source>In: Journal of Mathematical Sociology, №8</source>
          , pp.
          <fpage>309</fpage>
          -
          <lpage>322</lpage>
          . (
          <year>1982</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Rusinowska</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Swart</surname>
          </string-name>
          , H.:
          <article-title>Generalizing and Modifying the Hoede-Bakker Index</article-title>
          .
          <source>In: Theory and Applications of Rational Structures as Knowledge Instruments. №2. Springer's Lecture Notes in Artificial Intelligence 4342</source>
          , pp.
          <fpage>60</fpage>
          -
          <lpage>88</lpage>
          . Springer (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Freeman</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>A set of measures of centrality based upon betweenness</article-title>
          .
          <source>In: Sociometry №40</source>
          , pp.
          <fpage>35</fpage>
          -
          <lpage>41</lpage>
          . (
          <year>1977</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Borgatti</surname>
            ,
            <given-names>S</given-names>
          </string-name>
          , Everett,
          <string-name>
            <surname>M.</surname>
          </string-name>
          :
          <string-name>
            <given-names>A</given-names>
            <surname>Graph-Theoretic Perspective</surname>
          </string-name>
          on Centrality.
          <source>In: Social Networks</source>
          ,
          <volume>28</volume>
          . pp.
          <fpage>466</fpage>
          -
          <lpage>484</lpage>
          . Elsevier (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Wasserman</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Faust</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Social Network Analysis: Methods and Applications</article-title>
          . Cambridge: Cambridge University Press (
          <year>1994</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Jackson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          : Social and
          <string-name>
            <given-names>Economic</given-names>
            <surname>Networks</surname>
          </string-name>
          . Princeton: Princeton University Press (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>