<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Simulation of the Enhanced Version of Prisoner's Dilemma Game</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Tatyana Vlasova</string-name>
          <email>vlasova.tatyana@bk.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maryna Vladymyrova</string-name>
          <email>vladymyrova@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dmitry Shabanov</string-name>
          <email>d.a.shabanov@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>V.N. Karazin Kharkiv National University</institution>
          ,
          <addr-line>4, Svobody Sqr., Kharkiv, 61077</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <fpage>82</fpage>
      <lpage>86</lpage>
      <abstract>
        <p>This paper presents the model and software to explore pair interactions of objects with different behavior and their patterns. The research is based on the enhanced version of a classic prisoner's dilemma game. The non-cooperative finite and infinite pair games with non-zero sums are investigated. Pure and mixed strategies with finite and infinite memory developed by Biology School of V. N. Karazin Kharkiv National University are used to analyze the results.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        This paper presents the model and software to explore pair interactions of biological
objects with specified behavior. The research was carried out on request of the
herpetology department of V. N. Karazin Kharkiv National University and its results
are used in educational process to illustrate some topics of the ecology discipline as
well as in biology students’ research for studying patterns of pair behavior of some
biological species. Our model is based on the classic prisoner’s dilemma game [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1,2,3</xref>
        ]
as it is widely accepted as the model to study the pair interactions and behavior of
different agents from interactions of animals in nature up to economical transactions
in human world [
        <xref ref-type="bibr" rid="ref4 ref5 ref6 ref7 ref8">4, 5, 6, 7, 8</xref>
        ]. The main goal of the game in its classic version is to
get maximal score doing preset number of steps with given values of the fine, the
cooperation bonus and the cooperation award. Each participant can remember not
more than two its own and the opponent’s previous actions.
      </p>
      <p>In distinction to classic case we enhance the rules of the game by allowing more
complex behavior of agents depending on their ability to remember their own or
opponent’s previous steps and on values of fines, bonuses and awards.</p>
      <p>The important factor which influences the evolution of living organisms including
humans is their communications with environment. It is only typical for such
communications to have conflicts of interest between opponents, absence of
information about future actions of an opponent and need to foresee its future actions
only on the base of the prehistory of similar interactions. The paradox of the game
clearly shows the contradiction between group interests and individual ones: what is
optimal for the group of two is not good for each member of the group. In fact the
same is true for multilevel systems with optimization functions on different levels:
their behavior is intuitively unpredictable and even paradoxical.</p>
      <p>It is absolutely obvious that the outcome of the game fully depends on the
participants’ strategies. Here we define strategy in a slightly different way than it is
done in the game theory.</p>
      <p>Since the objective of this research is to explore the pair interactions in real
biological environment it is natural to define strategy as a set of rules used by a
participant to make its next step. Because of the infinite variety of strategies for
different biological objects we use simulation with strategies constructed by the
experimenter. The developed software allows experimenter to set different strategies
as input information for simulation. The main goal of experimenter is to find optimal
strategy and the value of the game in their classical sense.</p>
    </sec>
    <sec id="sec-2">
      <title>2 Model and simulation description</title>
      <p>We consider the pair non-cooperative finite and infinite games with a non-zero sum
[11; 12; 14] so the general result can be both positive (in the case when participants
cooperate during the whole game) and negative. We take the game in its normal form
N  N ; X1,..., X i ,..., X n ; K1,..., Ki ,..., Kn  , where N is the notion of the game,
N  {1,...,i,..., n} is a set of participants, X i  {xi} is a set of strategies for the i-th
participant and Ki (x1,..., xi ,..., xn ) is the gain function for the i-th participant, the
value of which is the gain obtained by the i-th participant if participants use
strategies (x1,..., xi ,..., xn ) . In our model we allow participants to use pure or mixed
strategies, values of the gain function, transition rules for a participant’s behavior, and
initial conditions. Transition rules depend on the type of a participant’s strategy which
can be pure or mixed. Simulation repeats the preset number of times or till the
winning of one of its participants.</p>
      <p>There exist four scenarios of simulation each corresponding to four different goals.</p>
      <p>First scenario is the experiment between the two chosen strategies (“pure-pure”,
“mixed-pure”, “mixed-mixed”), when the values of gain/bonus/fine (M, L, and K,
respectively) are fixed. The simulation results are presented by the graph, where
xaxis denotes the sequence of steps and y-axis denotes the quantitative characteristics
of the gain/loss (Fig.1).</p>
      <p>
        The result data of this scenario show that among the strategies with zero and finite
memory, the strategy of random steps and zero memory wins in most cases. Among
the strategies with finite and infinite memory the finite memory strategy wins in most
cases. Analogous conclusions were made previously by R. Dockins [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] when he
analyzed the classical version of the game.
      </p>
      <p>Analyzing the obtained results of pair interaction between the pure and mixed
strategies we may conclude that the mixed strategy is more advantageous than any of
pure strategies. This can be explained by the flexible behavior of the participant with
the mixed strategy.</p>
      <p>The second scenario is the evaluation of competition between the one fixed
strategy and the variety of others. The results of the experiment are presented by the
bar chart displaying the number of win points over each of the chosen strategies.</p>
      <p>Besides the graphical visualization one can look through the steps history (absolute
values of gains or losses) of each of the simulation participants.</p>
      <p>The third scenario allows experimenter to approve or disprove the hypothesis that
the strategy of a participant depends on the gain function. In order to do it we
implemented the feature which allows experimenter to conduct the simulation with
fluctuating parameters M, L, K. In this case the results are presented by the graph and
the table showing the data of each participant’s results for chosen strategies with
respect to varying values of K, L, and M (Fig.2).</p>
      <p>The results of the experiments show that the outcome of the game depends on the
values of the gain function. So one can find experimentally the values of the gain
function for any given strategy to be optimal.</p>
      <p>
        Fourth scenario deals with a strategy as an element of the given set. We do not use
the formal methods of working with such sets but accepted the way used by
experimenters (biologists) to form them. In this case our goal was to find the
probability for a strategy from one set to win a strategy from another one. For
example our experimenters divided strategies into three sets namely provocative,
forgiving and neither provocative nor forgiving exactly in the same way as it is done
in R. Axelrod’s experiment [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In our case a provocative strategy means immediate
change of behavior in condition of the participant’s own loss (or the opponent’s gain)
and keeping it till the next loss or till the end of the game. A forgiving strategy means
that the participant changes its behavior under the same conditions but keeps it only
some limited time (definite number of steps). In some way one can see it as follows:
in forgiving strategy the participant only fights back as a response to the smack while
in provocative strategy the participant not only fights back as a response to the smack
but retaliates. Such division is just conditional as it reflects the view of experimenters.
      </p>
      <p>The results of experiments show that the provocative strategies win in more cases
than forgiving ones.</p>
    </sec>
    <sec id="sec-3">
      <title>3 Conclusion</title>
      <p>The software for simulation the enhanced version of the prisoner’s dilemma game to
set up experiments and explore pair interactions of objects with different behavior has
been developed. The software allows experimenter to set pure and mixed strategies
with finite and infinite memory. The simulation depends on its goal: whether it is
maximization of the participant’s gain or minimization of the opponent’s loss. The
participant’s gain depends on the values of the gain function. If the participants’
strategies are fixed then one can find experimentally such values of the gain / bonus /
fine that the strategy of one of the participants is optimal. The results of the
simulation comply with those given in literature in the case of classic game which can
be accepted as an adequacy of the model.</p>
      <p>The software was tested and operates at Biology School of V. N. Karazin Kharkiv
National University but it can be successfully used at other schools and fields such as
economy, sociology, etc.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Merrill</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <article-title>Flood: Some Experimental Games</article-title>
          .
          <source>Research Memorandum RM 789</source>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Tucker</surname>
            <given-names>A. W.</given-names>
          </string-name>
          <string-name>
            <surname>On</surname>
          </string-name>
          <article-title>Jargon: The Prisoner's Dilemma</article-title>
          .
          <source>UMAP Journal</source>
          <volume>1</volume>
          (
          <year>1950</year>
          ):
          <fpage>101</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Rheingold</surname>
            ,
            <given-names>H.: The</given-names>
          </string-name>
          <string-name>
            <surname>Virtual Community - Perennial</surname>
          </string-name>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Axelrod</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <source>The evolution of cooperation</source>
          . New York: Basic Books (
          <year>1984</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Axelrod</surname>
          </string-name>
          , Robert and Hamilton, William D.: The Evolution of Cooperation. Science,
          <volume>211</volume>
          (
          <year>1981</year>
          )
          <fpage>1390</fpage>
          -
          <lpage>1396</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Dawkins</surname>
            ,
            <given-names>R.:</given-names>
          </string-name>
          <article-title>The selfish gene (</article-title>
          <year>1993</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>The</given-names>
            <surname>Iterated Prisoner's Dilemma</surname>
          </string-name>
          <string-name>
            <surname>Competition</surname>
          </string-name>
          [Electronic resource]. - Mode of access: http://www.prisoners-dilemma.com/
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Adaptive</surname>
            <given-names>modeler</given-names>
          </string-name>
          [Electronic resource]. - Mode of access: http://www.softsoft.ru/business/investment-tools/3210.htm
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Vorobjev</surname>
            ,
            <given-names>N.N.</given-names>
          </string-name>
          :
          <article-title>Game theory</article-title>
          . (
          <year>1976</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Grabovski</surname>
            ,
            <given-names>B.I.</given-names>
          </string-name>
          :
          <article-title>Cellular automata, as simple models of complex systems</article-title>
          . (
          <year>1995</year>
          )
          <fpage>412</fpage>
          -
          <lpage>419</lpage>
          .
          <fpage>11</fpage>
          .
          <string-name>
            <surname>Karlin</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Mathematical methods in the game theory, programming and economy (</article-title>
          <year>1964</year>
          ).
          <fpage>12</fpage>
          .
          <string-name>
            <surname>Tseitlin</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>Research on the theory of state machines and modeling of biological systems</article-title>
          . (
          <year>1969</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>