Investigating cross-linguistic differences in the
                    pragmatics of scalar terms1

                      Izabela Skoczeń1 and Aleksander Smywiński-Pohl2
             1
                 Department of Legal Theory, Jagiellonian University, Cracow, Poland
                     2
                       WIET AGH University of Technology in Cracow, Poland
                                    izabela.skoczen@uj.edu.pl


         Abstract. N. Goodman and A. Stuhlmüller made an experiment to empirically
         confirm a model that predicted the probability of implicature formation in con-
         texts where speaker and hearer do not have full knowledge of the situation.
         However, there remained the question whether the effects were cross-linguistic
         or rather confined to the semantics of the English language. In our paper we
         present the replication of the above-described experiment in the Polish lan-
         guage. The data depicts differences that may point out to differences in the
         pragmatics of the Polish word for some, namely ‘niektóre’, since the scalar im-
         plicature is not cancelled in incomplete knowledge contexts.

         Keywords: scalar implicature, incomplete knowledge, rational speech act mod-
         el.


1        Introduction

   The aim of the present paper is to answer two main questions. Firstly, we investi-
gate whether the rational Speech Act model is cross-linguistic. Secondly, we investi-
gate whether pragmatic behavior, namely cancelling scalar implicatures in incomplete
knowledge situations occurs cross-linguistically.
1.1   Scalar implicatures

          In 1975, Paul Grice, a British philosopher, introduced in his article ‘Logic
and Conversation’ the notion of implicature (Grice, 1975). Grice noticed that through
uttering sentences in context, people convey more than just the meaning of the words
they use. Consider the following example:
A: Are you hungry?
B: I have had breakfast.
Imagine that this conversation takes place before noon. Although, literally taken, B’s
reply has not much in common with A’s question, the conversation is perfectly under-
standable. This is because it is possible to infer from B’s reply that she is not hungry.

1
    The research was funded by the National Science Centre, Poland, Preludium grant nr
     2015/19/N/HS5/00029. The Authors would like to thank Leon Bergen, Daniel Lassiter, Ka-
     tarzyna Kijania-Placek and Krzysztof Kasparek for very helpful comments on data analysis.
Thus, the implicature carried by B’s answer will be that ‘B is not hungry’. Moreover,
the speaker can cancel implicatures explicitly. Consider the following example:
         (I.) A: Are you hungry?
         (II.) B: I have had breakfast, but a very modest one.
    The second part of B’s reply is an explicit cancellation of the implicature formed
on the basis of the first part of the sentence, namely that ‘B is not hungry’.
    Scalar implicatures are special because they are based on conventional meanings
attributed to words with the use of lexical scales (Horn, 2006). If you hear ‘some of
the students have passed the exam’ you will infer that not all of them have passed.
    The reason for which the ‘not all; part of ‘some’ is pragmatic rather than semantic
is that if you compare two utterances:
         (III.)      ‘Some of the students have passed the exam’, (while in fact all of
               them have)
         (IV.)        ‘All of the students have passed the exam’, (while in fact some of
               them have)
    The utterer of (IV) is more likely to be accused of lying than the utterer of (III).
Note that this is a contextual intuition: if you say ‘some of the accused were sen-
tenced’, while in fact all of them have been sentenced, your intuition might change.
For this reason, the ‘not all’ part of some seems contextual rather than semantic.
    A lexical scale is defined as a ‘totally ordered set of lexical items which vary
along a single dimension’. (Bergen et al, 2015, p. 10) Placing a word in a definite
position on a scale enables the speakers to attribute it a definite meaning that does not
have to be consistent with the lexical meaning of the word. The most discussed exam-
ple in the literature is the quantifier term “some” that can mean either ‘at least some’
or ‘some but not all’. In ordinary conversation, ‘some’ is understood as ‘some but not
all’. Thus, we take ‘at least some or some but not all’ to be the lexical meaning of
some. An analogous phenomenon is observed in the Polish language. The quantifier
‘niektóre’ (some) can mean either ‘przynajmniej niektóre’ (at least some) or ‘tylko
niektóre’ (some but not all). Moreover, in an ordinary conversation ‘niektóre’ (some)
is understood as ‘tylko niektóre’ (some but not all).
    Consider the following example:
         (V.) Some of the guests have gone.
    When uttering (V.) the speaker is normally taken as implicating that not all of the
guests have gone. This is because, had the speaker wanted to be understood as impli-
cating ‘at least some of the guests have gone, in fact all of them’ she would have used
the word ‘all’ rather than ‘some’. The above considerations are valid in situations
where the speaker and hearer have full contextual knowledge. This means that the
speaker knows exactly how many of the guests have gone. She knows whether all of
them have gone or only a number of them have gone. Moreover, the hearer knows
that the speaker has this knowledge. Thus, the scale is the following:
    <some, all>
Before proceeding further, one last remark concerning the semantics of the Polish
word ‘niektóre’. It is worth noting that although in example (V) the Polish word
would be understood as ‘some but not all’ and the implicature would be cancellable,
there are differences in the remaining possible uses of the English ‘some’. Namely,
the English ‘some’ can be used as a singular or plural indefinite in the following sen-
tences:
        (VI.)      Would you like some biscuits? Yes, I would like some.
        (VII.)     Yesterday, I went to the store and I bought some books.
By contrast, the Polish translation of the sentences above could not include the word
‘niektóre’. Polish users would rather employ words such as ‘jakieś’ or ‘pewne’.

1.2    The Rational Speech Act model of scalar implicatures

The scalar inference should not occur when both speaker and hearer do not have full
information about the number of guests that are gone from the party. In such situation,
when the speaker uses the word ‘some’ she should be understood as implicating ‘at
least some’. Thus, the scalar implicature ‘some but not all of the guests are gone’
should be cancelled.
   Numerals such as “two” or “twenty” are also scalar notions. This is because, lexi-
cally speaking ‘two’ could mean: ‘exactly two’, ‘at least two’ and ‘not more than
two’. Analogously ‘twenty’ could mean ‘exactly twenty’, ‘at least twenty’ and ‘not
more than twenty’. When the speaker has full knowledge and the hearer knows that
the speaker has full knowledge then through using ‘two’ or ‘twenty’ the speaker will
usually be taken as implicating ‘exactly two’ and ‘exactly twenty’. It is based on the
assumption that if the speaker had meant more or less than ‘twenty’ she would have
used a word that is higher on the scale (for instance ‘twenty-one’) or lower on the
scale (for instance ‘nineteen’). (Horn, 2006) Thus, the scale is this:
   <one, two, three, four, five, etc.>
   By contrast, the implicated exactness should be cancelled when the speaker does
not have full contextual knowledge. (Goodman and Stuhlmüller, 2013) Imagine a
party to which three guests have been invited. The speaker knows that two of the
guests have left the party, yet the speaker does not know whether the third guest has
left the party. The speaker says:
         (VIII.)    Two of the guests have gone.
   If the hearer is aware of the speaker’s partial knowledge, the hearer should cancel
(or not infer) the implicature ‘exactly two’.
   Noah Goodman and Andreas Stuhlmüller have created a model based on game
theory and Bayesian decision theory, which aims to predict the probability of (not)
cancelling the scalar implicature, in scenarios where the speaker has partial contextual
knowledge and the hearer knows that the speaker has partial contextual knowledge.
   The model is a ‘rational speech act model’. It views ‘language comprehension as a
rational inference based on an intuitive theory of language production (...) the listener
infers the world state, s, given the speaker’s utterance, w, and shared information
about the speaker’s (possibly incomplete) information access, a. By Bayes’ rule:
                          P!"#$%&%' (s|w,a)∝P!"#$%#& (w|s,a)P(s)
   where P(s) captures the listener’s prior beliefs about the world state and Pspeaker
(w⁄s,a) describes the listener’s intuitive theory of how the speaker chooses words.’
(Goodman and Stuhlmüller, 2013, p. 175)
   A speaker ‘makes observation o about the true state of the world and selects an ut-
terance w to convey information about the world state to a listener and does so by
soft-max optimizing expected utility’:
                        P!"#$%#& w|o,a ∝exp(αE! !|#,% [U(w;s)])
   ‘The speaker’s utility function, U(w;s), captures the value of saying w if the world
is actually s. The expectation is taken over the speaker’s belief state, P(s|o,a), because
the speaker may still be uncertain about the state of the world. The parameter α con-
trols the deviation from optimality.’ (Goodman and Stuhlmüller, 2013, p. 175)
   ‘To capture a motivation to be informative, utility must be related to the infor-
mation conveyed in the utterance. More specifically, utility is related to the amount of
information that a literal listener would not yet know about state s after hearing it
described by utterance w (the negative surprisal):
                                 U w;s = ln (P!"# (s|w))
   where the literal interpretation probability Plex(s|w) is determined by the lexicon
‘(Goodman and Stuhlmüller, 2013, p. 175)
   The ‘speaker’s access a is common knowledge of speaker and listener, but the lis-
tener still does not know what observation
                                         .
                                            the speaker made:
                     P!"#$%#& w|s,a =       P!"#$%#& (w|o,a)P(o|a,s)
                                        !
   For this reason, the hearer is performing a summing on the possible observations
made by the speaker. (Goodman and Stuhlmüller, 2013, p. 175)
   The model remains agnostic on the ‘question under discussion’, which is a re-
striction of the available alternatives, since this concept was introduced later (see for
instance Benz et al, 2017).
   This model has been empirically tested and confirmed in the English language in
(Goodman and Stuhlmüller, 2013) During the experiment the speaker saw for exam-
ple three letters. However, he had information concerning whether there are checks in
only two of the letters. The hearer was aware of the speaker’s state of knowledge. The
utterance ‘some of the letters have checks inside’ cancelled the implicature ‘not all’
since hearers inferred a similar probability of two and three letters having checks. If
the speaker saw two out of three letters and said ‘one of the letters has a check inside’,
then the hearer inferred a similar probability of one and two of the letters having
checks inside and a low probability of three letters having checks inside. Thus, the
implicature ‘not more than one’ was cancelled. During the experiment six scenarios
were used and the speaker had a varied access to objects and their features, while the
hearer was aware of the speaker’s state of knowledge.
   Each scenario began with a boosting of the probability that all the objects have the
salient property, for example:
     Letters to Laura almost always have checks inside. (Goodman and Stuhlmüller,
     2013, p. 178)
   Next, came information about the number of considered objects and a question
about the prior probability distribution:
     Today Laura received three letters. How many of the 3 letters do you think have
     checks inside? (Goodman and Stuhlmüller, 2013, p. 178)
The participants had to bet a total of 100 dollars on each of the options, namely zero,
one, two or three objects having the property. After having captured the priors came
the information about the number of objects checked by the speaker:
     Laura tells you on the phone: “I have looked at 2 of the 3 letters. Some of the let-
     ters have checks inside.” (Goodman and Stuhlmüller, 2013, p. 178)
Finally, the posterior probability distribution as well as a control question followed:
     Do you think Laura knows exactly how many of the 3 letters have checks inside?
     (Goodman and Stuhlmüller, 2013, p. 178)
   The experiment confirmed a fine-grained interaction between the state of
knowledge about the world and pragmatic inference. It also provided a strong argu-
ment against modular theories of mind since the data showed that the language faculty
and the inference about world state were strongly connected. However, there re-
mained the question whether the effects were cross-linguistic or rather confined to the
English language.
   Below we present the replication of the above-described experiment in the Polish
language. We translated the same scenarios into Polish. We employed the Polish
words ‘niektóre’ for ‘some’ and ‘wszystkie’ for ‘all’ as well as the numerals ‘jeden,
dwa, trzy’ for ‘one, two, three’ respectively. The data proved to be a replication of
Goodman and Stuhlmüller’s experiment with some differences in the results. Namely,
there was an interaction between the state of knowledge about the world and pragmat-
ic inference conforming to the model predictions. However, the scalar implicature of
the word some, that is ‘not all’ was not cancelled when the speaker had access to two
out of three objects. In the paper, we investigate hypotheses that explain the differ-
ences.


2      Replication results in Polish

                                        Methods
   We carried four experiments in Polish: two that tested numerals and two that tested
the quantifier ‘some’. The experiments were an exact methodological replication of
the original experiment. Participants were presented two control questions. In the
numeral-experiment we employed 6 scenarios that appeared in a randomized order. In
the quantifier experiment we also employed 6 scenarios but each participant was pre-
sented with only 3 of them in a randomized order. Each scenario was about an three
objects and a potential property that these objects could have. Each participant dis-
posed of a quantity of 100 units (zlotys) and had to bet on four possibilities: zero of
the objects having the property, one of the objects having the property, two of the
objects having the property and three of the objects having the property.
   We tested 6 conditions: ‘”One” access 1’ where the speaker had access to infor-
mation concerning the properties of one object out of three and uttered ‘one’. ‘Access
2’ –the speaker has access to information concerning the properties of two objects out
of three and ‘access 3’ – the speaker has access to information concerning the proper-
ties of three objects out of three. The possible utterances were ‘one’, ‘two’ and
‘three’.
   The participants were recruited through an online platform ‘Research online’ and
performed the experiment for a small payment. Each of the experiments was made
with the participation of 50 persons – 50% men and 50% women.
   Interestingly, we carried a pilot study involving 10 academics and it proved to be a
neat replication of the American results. The bets that the speaker had complete
knowledge in partial-access conditions were: M=18,89 SD=26,07, in complete access
conditions: M=92,59 SD=21,94 compared to the original: M=42, SD=3,4 and M=92,1
SD=1,6.
   The graphs present mean participant bet on each world state, varying the word the
speaker used and the speaker’s perceptual access. Error bars are standard error of the
mean. Statistical significance was calculated using the t-student test for dependent or
independent measurement. The confidence interval (CI) was 95%. The effect size
calculations were based in Cohen’s d.
                                     Testing numerals
   50 students too part in our first experiment and the results replicated (see Fig 1).
The bets that the speaker had complete knowledge in partial-access conditions were:
M=27,95 SD=33, in complete access conditions: M=91,79 SD=14,94 compared to the
original: M=42 SD=3,4 and M=92,1 SD=1,6.


    Fig 1 replication with the participation of 50 students - Mean participant bet on each world
state, varying the word the speaker used and the speaker’s perceptual access. Data have been filtered to
include only trials where the participant’s bet that the speaker had complete knowledge was greater than 70
in the expected direction. Error bars are standard error of the mean. The formulation ‘”One” access 1’
means that the speaker has access to information concerning the properties of one object out of three and
utters ‘one’. ‘Access 2’ –the speaker has access to information concerning the properties of two objects out
of three and ‘access 3’ – the speaker has access to information concerning the properties of three objects
out of three.
   In the experiment represented in Fig 2, the sample was modified to 20 people aged
16-29 years (10 men and 10 women), 20 people aged 30-49 (10 men and 10 women),
10 people aged over 50 (5 men and 5 women). Subjects were recruited through an
online survey platform ‘Research online’. Differences appeared in the ‘one access
one’ and ‘two access two’ conditions. The difference in the ‘two access two’ condi-
tions was crucial since in the original experiment the scalar implicature ‘not more
than two’ was cancelled, while in the data we collected the scalar implicature persist-
ed. In other words, the bets on ‘two’ and ‘three’ in the ‘”two” access 2’ condition
were similar in the American experiment, while in the replication in Polish the bets on
‘two’ were significantly higher than the bets on ‘three’. The bets that the speaker had
complete knowledge in partial-access conditions were: M= 48,53 SD= 37,04, in
complete access conditions: M= 86,47 SD= 20,47 compared to the original: M=42
SD=3,4 and M=92,1 SD=1,6.


  Fig 2 replication with the participation of 50 subjects from varied age groups


   Fig 3 Differences between the original experiment (to the left, reuse license number
4343821165053, source: https://onlinelibrary.wiley.com/doi/abs/10.1111/tops.12007) and
the replication (to the right) in conditions ‘”One” access 1’ and ‘”Two” access two’.
                              Testing the quantifier ‘some’
   Next, we conducted a replication of the experiment with the use of the quantifier
some (‘niektóre’). The participants had to bet again on zero, one, two and three ob-
jects having the property. In the original experiment, the condition ‘some access 2’
generated higher bets on three objects having the property rather than one object hav-
ing the property. The replicated results provide a reverse trend, which points out to a
lack of cancellation of the scalar implicature ‘not all’. In other words, the implicature
was inferred, while the predictions were that it would not be formed (see Fig 5).


  Fig 5 replication with the participation of 50 students


   Fig 6 The original experiment in condition ‘”some” access two’. Reuse license number
4343821165053, source: https://onlinelibrary.wiley.com/doi/abs/10.1111/tops.12007

   The means and standard deviations for the first experiment with the quantifier
‘some’ are presented in Figure 7.

                                     Original                     Replication
          Access 1                     27.1                          39,18
                                       (4,9)                         (4,9)
          Access 2                     34,8                          46,68
                                       (5,7)                         (6,1)
          Access 3                     93,0                          89,2
                                    (2,7)                            (3,3)
  Fig 7 Means and standard deviations for the replication with the quantifier ‘some’

                                         Discussion
  Our hypothesis was that a quantifier scope ambiguity occurred during the experi-
ment, which is presented in Figure 8.


   Fig 8 Hypothesis concerning quantifier scope ambiguity in the condition ‘”Some” ac-
cess two’

   Polish participants seemed not to interpret the speaker as indicating that some of all
of the objects have the salient feature. Rather, they seemed to adopt the interpretation
that some of the checked objects (for instance letters) had the salient feature (for in-
stance checks inside). Let us focus on the ‘”Some” access 2’ condition presented in
Figure 5. The difference between the bets on one of the object having the property
and two of the objects having the property are of 7,7%. This is an effect size of only
0,26%. By contrast, the difference between the bets on two of the objects having the
property and three of the objects having the property are of 31,4%. This is an effect
size of 1,12%. Thus, the bets on three are significantly lower than the bets on two,
which supports the hypothesis of Polish speakers interpreting the scenario utterance as
‘some of the checked objects’. We decided to test this hypothesis.


3      The quantifier scope disambiguation experiment

   To test our hypothesis, we modified the content of the scenarios by adding ‘some
of all’. Thus, for instance the letter scenario took the following form:
     ‘Letters to Laura almost always have checks inside.
     Today Laura received three letters.
     How many of the 3 letters do you think have checks inside?
     Laura tells you on the phone: “I have looked at 2 of the 3 letters. Some of all of
     the letters have checks inside.”
     Now how many of the 3 letters do you think have checks inside?
     Do you think Laura knows exactly how many of the 3 letters have checks in-
     side?’
   The results received (see fig 9) provided an only 7% weaker implicature that how-
ever still persisted since the bets on three objects having the property in questions
were still lower than the bets on one object having the property in question compared
to the previous results without introducing the disambiguation to the scenario (see Fig
5). In other words there was still no cancellation of the scalar implicature.


   Fig 9 Comparing the ‘some’ quantifier experiment without disambiguation (B1) and
with disambiguation (B2)


4      General discussion

   Thus, the experiment depicted in Figure 9 disproved our hypothesis concerning
quantifier scope ambiguity in experiments employing the quantifier ‘some’. For this
reason, there remained the question on why Polish participants did not cancel the
scalar implicature ‘not all’ in partial knowledge conditions.
   Our hypothesis is that Polish people are ‘cache’ing’ the mechanisms of implicature
formation from full-knowledge contexts and are cognitively biased in applying them
to contexts where the knowledge about the situation is incomplete.
   What could back this hypothesis is the fact that the participants’ bets while answer-
ing to the question measuring their prior probability distribution was not as predicted.
Participants did not bet as if the relevant objects ‘almost always’ had the property in
question. Thus, they probably ignored the ‘almost always’- experimental manipula-
tion in their answers on prior probability distribution and employed a prior probability
distribution based on their every-day experience.
5      Conclusion

The carried experiments depicted that the RSA model is cross-linguistic as the pat-
terns of interaction between knowledge and linguistic inference replicated. However,
it also depicted differences in pragmatic inferences such as scalar implicatures carried
by English and Polish native speakers.


6      References

       1.   Bergen L., Levy R., Goodman N., 2015, Pargmatic reasoning through semantic in-
            ference, Semantics and Pragmatics 9, doi:http://dx.doi.org/10.3765/sp.9.20
       2.   Benz, A., Jasinskaja K., 2017. Questions Under Discussion: From Sentence to
            Discourse,       Discourse         Processes,         54:3,      177-186,      DOI:
            10.1080/0163853X.2017.1316038
       3.   Chierchia, G., 2004. Scalar implicatures, polarity phenomena and the syn-
            tax/pragmatics interface, in: Bettelli, A. (Ed.), Structures and beyond: The Cartog-
            raphy of Syntactic Structures. Oxford University Press, pp. 39–103.
       4.   Goodman, N.D., Stuhlmüller, A., 2013. Knowledge and Implicature: Modeling
            Language Understanding as Social Cognition. Top. Cogn. Sci. 5, 173–184.
            doi:10.1111/tops.12007
       5.   Grice, P., 1975. Logic and Conversation, in: Cole, P., Morgan, J.. (Eds.), Syntax
            and Semantics. Academic Press, New York.
       6.   Horn, L.R., 2006. Implicature, in: The Handbook of Pragmatics. Blackwell,
            Malden, MA; Oxford.
       7.   Noveck, I.A., Sperber, D., 2004. Experimental pragmatics. Palgrave Macmillan,
            Basingstoke, Hampshire; New York.


7      Appendix - vignettes

Link to repository: http://www.apohllo.pl
   Below are the scenarios we employed translated into Polish:
1.       Nasiona
Nasiona słonecznika prawie zawsze kiełkują w ciągu jednego dnia, gdy umieści się je
w wodzie.
Dwa dni temu botanik Jan umieścił trzy nasiona słonecznika w wodzie.
Jak myślisz, ile ziaren słonecznika spośród trzech wykiełkowało?
Jan informuje Cię przez telefon: “Sprawdziłem dwa spośród trzech nasion.
Dwa nasiona wykiełkowały.”
Jak myślisz, ile spośród trzech nasion wykiełkowało?
Czy uważasz, że Jan wie ile dokładnie spośród trzech nasion wykiełkowało?
2.       Losy
Losy na loterii fantowej prawie zawsze wygrywają.
Janusz zakupił trzy losy na loterii fantowej wczoraj.
Jak myślisz, ile spośród tych trzech losów wygrało?
Janusz informuje Cię przez telefon: “Sprawdziłem dwa spośród trzech losów.
Dwa spośród trzech losów wygrały.’
Jak myślisz ile spośród trzech losów wygrało?
Czy uważasz, że Janusz wie ile dokładnie spośród trzech losów wygrało?
3.       Egzaminy
Uczestnicy kursu ‘Wprowadzenie do biologii’ prawie zawsze zdają egzamin.
Trzech studentów Marka podeszło wczoraj do egzaminu z ‘Wprowadzenia do bi-
ologii’.
Jak myślisz, ilu spośród tych trzech studentów zdało egzamin?
Marek informuje Cię przez telefon: “Sprawdziłem dwa spośród trzech egzaminów.
Dwóch studentów zdało egzamin.”
Jak myślisz, ilu spośród trzech studentów zdało?
Czy uważasz, że Marek wie ilu dokładnie spośród trzech studentów zdało egzamin?
4.       Owoce
Jagody Goi to małe owoce, które prawie zawsze mają we wnętrzu wyschnięty miąższ.
Monika zakupiła trzy jagody Goi wczoraj.
Jak myślisz, ile spośród trzech jagód Goi ma wyschnięty miąższ?
Monika informuje Cię przez telefon: “Sprawdziłam dwa spośród trzech owoców.”
Dwa owoce mają wyschnięty miąższ.
Jak myślisz, ile spośród trzech owoców ma wyschnięty miąższ?
Czy uważasz, że Monika wie ile dokładnie spośród trzech owoców ma wyschnięty
miąższ?
5.       Telefony
Zepsute telefony marki Sigo prawie zawsze mają przepalone tranzystory.
Bartosz musi naprawić trzy telefony marki Sigo.
Jak myślisz, ile spośród trzech telefonów marki Sigo ma przepalone tranzystory?
Bartosz informuje Cię przez telefon: “Sprawdziłem dwa spośród trzech telefonów.”
Dwa telefony mają przepalone tranzystory.
Jak myślisz, ile spośród trzech telefonów ma przepalone tranzystory?
Czy uważasz, że Bartosz wie ile dokładnie spośród trzech telefonów ma przepalone
tranzystory?
6.       Listy
Listy do Laury prawie zawsze mają czeki w środku. Dzisiaj Laura otrzymała trzy
listy.
Jak myślisz, ile spośród trzech listów ma czeki w środku?
Laura informuje Cię przez telefon: “Sprawdziłam dwa spośród trzech listów.
Dwa listy mają czeki w środku.”
Jak myślisz, ile spośród trzech listów ma czeki w środku?
Czy uważasz, że Laura wie ile dokładnie spośród trzech listów ma czeki w środku?