<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Language Contact: Peaceful Coexistence or Emergence of a Contact Language</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jérôme Michaud</string-name>
          <email>jerome.michaud84@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gerhard Schaden</string-name>
          <email>gerhard.schaden@univ-lille3.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Université Lille SHS, CNRS UMR 8163 STL</institution>
          ,
          <addr-line>59000 Lille</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Edinburgh, SOPA</institution>
          ,
          <addr-line>Peter Guthrie Tait Road, EH9 3FD Edinburgh</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper presents a simple model of linguistic priming between languages in contact, based on the utterance selection model (USM) for language change of Baxter et al. (2006). It will be shown that the emergence or the non-emergence of a new contact language depends on the way potentially bilingual agents choose a language to communicate.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>One major factor driving language evolution is the
interaction of its speakers. In our paper, we consider
a situation where speakers of two different
communities are in contact, and where (at least some of)
the speakers of the two groups need to
communicate with one another. There are basically two ways
of resolving the communicative problem in such
cases: speakers can either use (some variant of)
their community languages or a contact language
can emerge — which corresponds to neither of
the two community languages. This new language,
which can take the form of a pidgin, is not random
and highly correlates with the two languages it
originates from. The fine-grained processes controlling
this process are poorly understood. In this paper, we
provide a simple computational simulation of the
stochastic dynamics of a contact situation. We show
that the way agents choose a language when they
interact partly controls the emergence of a contact
language.</p>
      <p>
        In order to capture the stochastic aspects of
linguistic interactions, Baxter et al. (2006) designed
the utterance selection model (USM) for language
change
        <xref ref-type="bibr" rid="ref5">(see also Croft, 2000)</xref>
        . This is a stochastic
agent-based model that accounts for the evolution
of a single (socio-)linguistic variable
        <xref ref-type="bibr" rid="ref7">(Tagliamonte,
2011)</xref>
        , which can be instantiated in a finite number
of equivalent variants. USMs can be seen as formal
models of what Calvet (1999) calls an
ecolinguistic system. The USM is well-adapted to capture
the dynamics of a single linguistic variable and its
stochastic evolution. It can also be used to predict
the evolution of a linguistic variable in a larger
population using coarse-graining techniques as shown in
Michaud (2017). Other modelling methods, such as
the model of Tria et al. (2015), can accurately
reproduce the conditions under which a creole emerges
based on census data. However, their model is highly
idealized and makes assumptions such as “if the
hearer does not already possess the language of
the utterance in her repertoire and therefore cannot
make sense of it, she learns it by adding it to her
repertoire”
        <xref ref-type="bibr" rid="ref8">(Tria et al., 2015, p. 6)</xref>
        , which do not
seem very realistic. Our aim is to provide a (still
very simple) model, but whose agents correspond
more closely to ‘real’ humans’ capacities.
      </p>
      <p>We study a simple extension of the USM that
models potentially bilingual agents and explicitly
takes into account a priming effect between the two
languages to model a situation of language contact.
In particular, we study how the choice of a specific
language in the interaction can lead either to the
coexistence of the two group languages or to the
emergence of a new contact language.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Methodology</title>
      <p>
        Our model is an extension of the USM for language
change
        <xref ref-type="bibr" rid="ref1">(Baxter et al., 2006)</xref>
        that takes into account
potentially bilingual agents and models a priming
effect between a group language and a non-group
language. Below, we recall the definition of the
USM and then explain the modifications made to
model potentially bilingual agents. We conclude
this section by explaining how we measure this
stochastic system and explain how we decide when
a contact language emerges.
The USM models the evolution of the usage
frequency of a linguistic variable with V equivalent
variants. The probability distribution over the
different variants of an agent i is represented by the vector
x(i), where a component xv(i) represents the
probability/frequency with which agent i uses variant v.
x(i)
      </p>
      <p>U
h, λ
u(i)
u(j)</p>
      <p>h, λ
U
x(j)</p>
      <p>In order to communicate, an agent i produces
an utterance u(i) of length L from a production
process U (u := U x). The process U is defined
by</p>
      <p>1</p>
      <p>U x = L MMulti(L; x); (1)
where M is a matrix representing production errors
or innovations and Multi(L; x) is a vector counting
the outcome of L multinomially sampled variables.</p>
      <p>During an interaction, two connected agents are
randomly selected. Then, they both produce an
utterance u and update their usage frequency
distribution x by</p>
      <p>x(i);new = x(i);old + x(i);
where the increment x(i) is defined by
x(i) = [(1
h) u(i) + hu(j)
x(i);old];
where is a usually small learning parameter and
the attention parameter h controls the relative
importance of the utterance u(j) of the other agents
with respect to her own utterance u(i). The presence
of u(i) accounts for a self-monitoring process and
the presence of u(j) accounts for an accommodation
process. This learning rule assumes communicative
success.1</p>
      <p>1One way of interpreting this is to assume that the context
and non-verbal communication provide sufficient clues for the
interpretation.
(2)
(3)</p>
      <p>
        The USM has been used to study the conditions
under which a consensus can be achieved in a
population
        <xref ref-type="bibr" rid="ref1 ref6">(Baxter et al., 2006; Michaud, 2017)</xref>
        . It has
been applied to test the hypothesis of Trudgill about
the emergence of New Zealand English
        <xref ref-type="bibr" rid="ref2">(Baxter et
al., 2009)</xref>
        and to test under which conditions the
time series of usage frequency of an innovative
variant takes the form of an S-shaped curve
        <xref ref-type="bibr" rid="ref3">(Blythe
and Croft, 2012)</xref>
        .
2.2
      </p>
      <sec id="sec-2-1">
        <title>Bilingual agents, social structure and priming</title>
        <p>In order to model a language contact situation, the
USM needs to take into account the possibility
that agents become bilingual. We assume that each
agent belongs to a group labelled by capital letters
(A; B; : : : ) and every agent knows the group
membership of every other agents. An agent belonging
to some group Y is able to represent two languages
and we denote the corresponding frequency
vectors xY for the group language Y and xY¯ for the
non-group language Y¯ . With this modification, the
utterance production and learning rules have to be
adapted.</p>
        <p>During an interaction, if two agents belong to
the same group, they interact as usual using the
standard USM production and learning rules. If the
two agents belong to different groups, we consider
two scenarios:</p>
      </sec>
      <sec id="sec-2-2">
        <title>Scenario 1: Symmetric adaptation When two</title>
        <p>agents of different groups interact, they both
adapt to the other agent. For example if agent
i of group A and agent j of group B interact,
they both use their non-group language, i.e. A¯
and B¯, respectively.</p>
        <p>Scenario 2: Unilateral adaptation When two
agents of different groups interact, for each
interaction they randomly choose a group
language to use, either A (with probability
p) or B (with probability 1 p), and the
agent who doesn’t know the language uses his
non-group language. For example, if agent
i belongs to group A and agent j belongs
to group B, then one language is chosen
randomly, say language of group A, then
agent i uses her group language and j her
non-group language B¯.</p>
        <p>When an agent uses her group language, her
knowledge of the language is assumed to be perfect and she
uses the corresponding frequency vector. However,
when an agent needs to use a non-group language,
her knowledge is only partial and the non-group
language is primed by the group language. This
priming is implemented by the rule that whenever a
non-group language has to be used, instead of using
the frequency vector xA¯ purely, the group language
frequencies modifies the distribution through
xA¯;eff = (1
) xA¯ + xA:
(4)
The priming parameter models the degree of
mixing between languages A and A¯. If = 0, then
there is no priming and the effective frequency
vector boils down to xA¯ and if = 1, then priming
is total and the effective frequency distribution
xA¯;eff = xA. In the production rule (1), it is the
effective frequency vector xA¯;eff that is sampled.
The learning rule (2) is the same but is only applied
to the languages associated with the interaction.</p>
        <p>The social structure used in our model is made of
two random regular graphs of degree 3, containing
20 agents each, connected with each other by 5
connexions, see Fig. 2. The agents situated at an
end of an intergroup connexion are the potentially
bilingual agents, the other agents are monolingual,
since they never use their non-group language.
We measure the outcome of the simulation by
computing Pearson’s correlation coefficient between the
time series of the averaged use of a language by
each group. Note that the non-group languages are
only used by agents with intergroup connexions
and only these agents are updating their non-group
language and can, therefore, become bilinguals.</p>
        <p>We introduce the following notation: rXY
correlation between language X of group A and language
Y of group B, illustrated in Fig. 3. If rXY is close
to 1, then the two languages can be considered
Group A
xA
x A¯
rAB
as being the same. If rXY is close to 0, the two
languages are independent. For medium values of
rXY the languages are different but correlated.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>For the simulation of the two scenarios, we used
the network topology discussed in Sec. 2.2 and
illustrated in Fig. 2. The parameters are N = 40
agents with 5 intergroup connexions, the number of
variants is V = 3, and the utterance length is L = 2.
The learning parameter = 0:1 and the attention
parameter h = 0:5. The matrix M used to simulate
errors and innovations is of the form
21
6
M = 66 q
66 0
4
q
1 q
0
q
1
q 3</p>
      <p>7
0 77 ;</p>
      <p>7
q7
5
(5)
where q = 3 10 4. The structure of this matrix is
such that the innovations are ordered and variant 1
can only be transformed into variant 2, but not
into variant 3, and similarly for the other variants.
The pattern of mutation/innovation should be read
columnwise. The simulations have been performed
for T = 5000 full network updates and the priming
parameter is varied.</p>
      <p>In Scenario 1, two interacting agents of
different groups used their non-group language. Results
are displayed in Fig. 4 and we observe that the
correlation between xA and xB (r AB) is close to
zero for all values of the priming parameter ; the
correlation between xA¯ and xB¯ (r A¯B¯ ) is close to
one for all values of the priming parameter ; the
other correlation coefficients grow from 0 to about
0:7 when is increased. From these results, one
can conclude that there are three languages in these
settings, the language of group A, the language of
group B, and a new contact language A¯ = B¯ partly
correlated with both languages.</p>
      <p>In Scenario 2, when two agents of different
groups interact, at each interaction, they choose
language A with probability p and language B
10−3 10−2 10−1
Priming Parameter ρ
100
with probability 1 p. Here p = 0:5 and the two
languages are equivalent. Results are displayed in
Fig. 5 and we observe that r AB¯ and r A¯B are close to
one for all values of and the other correlation
coefficients increase from zero to one as increases. In
this situation, there are only two languages present,
the two group languages. When is large enough,
the two languages converge to the same language
and there is a single language remaining.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Discussion</title>
      <p>We have shown that the decision of which language
to use has an important impact on the outcome of the
simulation, and can lead either to the emergence of
a contact language, or to the stable cohabitation of
the two group languages. Compared to the naming
rAB
rA B¯
rAB
game model of Tria et al. (2015), the agents of
our model do not instantaneously learn or forget a
language but gradually adapt their behaviour. As
a result, the emergence of a contact language, or
absence thereof, is more gradual and better accounts
for the influence of the stance that agents take during
intergroup communication.</p>
      <p>Our model makes a number of idealising
assumptions. First of all, we assume that there is no reason
to choose one language rather than the other for
intergroup communication — which implies the
absence of any hierarchy between the languages (or
groups). This is probably a rather rare setting in the
wild. There are different degrees of divergence from
this configuration: instead of a perfectly symmetric
situation with a probability p = 0:5 for using each
language, there may be a different p tilted towards
one group language. In extreme cases, if p = 1
or 0, or when the priming parameter = 1, the
agents of one group do not adapt to the language
of the other group at all. Therefore, their group
language will always be used, forcing the agents of
the other group to adapt. Furthermore, in our model,
the preferences and attitudes of the agents as well
as the network structures do not evolve over time
(bilinguals cannot switch group allegiance, etc.).</p>
      <p>That being said, in which circumstances of
reallife language contact would we expect the two
scenarios we have considered to arise? Notice first
that the asymmetric scenario should have a lower
cognitive cost than the symmetric one, since only
one agent in an intergroup interaction needs to
adapt his behaviour, whereas scenario 1 requires
both agents to do so. Using this argument, scenario
2 should be preferred overall and no contact
language should emerge. One can also argue that an
asymmetric scenario will take longer to reach a
consensus through the population. As a consequence,
if the pressure for communication is strong enough,
the more costly, but more rapidely converging
scenario 1 would be preferred and a contact language
is likely to emerge. The additional cognitive cost of
a symmetric adaptation should be partly
compensated by the fact that contact languages are usually
simpler than fully-fledged languages.</p>
      <p>To conclude, scenario 1 is expected if
communication pressure is strong and the group languages are
unrelated. Otherwise, we expect scenario 2. This is
consistent with the conclusions of Tria et al. (2015)
concerning the influence of population structure on
communicative needs and creole-formation.
We would like to thank the three anonymous
reviewers for their comments on a previous version of
the paper. We would also like to thank the members
of the project “Parallel Evolutions”, on whom we
inflicted a first version of this paper. All remaining
errors and omissions are ours alone.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Gareth J. Baxter</surname>
            , Richard A. Blythe,
            <given-names>William</given-names>
          </string-name>
          <string-name>
            <surname>Croft</surname>
          </string-name>
          , and
          <string-name>
            <surname>Alan J. McKane</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>Utterance selection model of language change</article-title>
          .
          <source>Physical Review E</source>
          ,
          <volume>73</volume>
          (
          <issue>4</issue>
          ):
          <fpage>046118</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Gareth J. Baxter</surname>
            , Richard A. Blythe,
            <given-names>William</given-names>
          </string-name>
          <string-name>
            <surname>Croft</surname>
          </string-name>
          , and
          <string-name>
            <surname>Alan J. McKane</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Modeling language change: An evaluation of Trudgill's theory of the emergence of New Zealand English</article-title>
          .
          <source>Language Variation and Change</source>
          ,
          <volume>21</volume>
          (
          <issue>02</issue>
          ):
          <fpage>257</fpage>
          -
          <lpage>296</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Richard A.</given-names>
            <surname>Blythe</surname>
          </string-name>
          and
          <string-name>
            <given-names>William</given-names>
            <surname>Croft</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>S-curves and the mechanisms of propagation in language change</article-title>
          .
          <source>Language</source>
          ,
          <volume>88</volume>
          (
          <issue>2</issue>
          ):
          <fpage>269</fpage>
          -
          <lpage>304</lpage>
          , June.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Louis-Jean Calvet</surname>
          </string-name>
          .
          <year>1999</year>
          .
          <article-title>Pour une écologie des langues du monde</article-title>
          .
          <source>Plon</source>
          , Paris.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>William</given-names>
            <surname>Croft</surname>
          </string-name>
          .
          <year>2000</year>
          .
          <article-title>Explaining language change: An evolutionary approach</article-title>
          . Pearson Education.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Jérôme</given-names>
            <surname>Michaud</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Continuous time limits of the utterance selection model</article-title>
          .
          <source>Phys. Rev. E</source>
          ,
          <volume>95</volume>
          :
          <fpage>022308</fpage>
          ,
          <string-name>
            <surname>Feb</surname>
          </string-name>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Sali A.</given-names>
            <surname>Tagliamonte</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Variationist sociolinguistics: Change, observation, interpretation</article-title>
          , volume
          <volume>39</volume>
          . John Wiley &amp; Sons.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Francesca</given-names>
            <surname>Tria</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Vito D.P.</given-names>
            <surname>Servedio</surname>
          </string-name>
          , Salikoko S. Mufwene, and
          <string-name>
            <given-names>Vittorio</given-names>
            <surname>Loreto</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Modeling the emergence of contact languages</article-title>
          .
          <source>PloS one</source>
          ,
          <volume>10</volume>
          (
          <issue>4</issue>
          ):
          <fpage>e0120771</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>