=Paper=
{{Paper
|id=Vol-1113/paper7
|storemode=property
|title=Logics for Non-Cooperative Games with Expectations
|pdfUrl=https://ceur-ws.org/Vol-1113/paper7.pdf
|volume=Vol-1113
|dblpUrl=https://dblp.org/rec/conf/eumas/GodoM13
}}
==Logics for Non-Cooperative Games with Expectations==
Logics for Non-Cooperative Games with Expectations Lluı́s Godo1 , Enrico Marchioni2 1 IIIA, Artificial Intelligence Research Institute CSIC, Spanish National Research Council Campus de la Univ. Autònoma de Barcelona s/n 08193 Bellaterra, Spain Email: godo@iiia.csic.es 2 Institut de Recherche en Informatique de Toulouse Université Paul Sabatier, 118 Route de Narbonne 31062 Toulouse, France Email: enrico.marchioni@irit.fr Abstract. We introduce the logics E(G) for reasoning about probabilis- tic expectation over classes G of games with discrete polynomial payoff functions represented by finite-valued Lukasiewicz formulas and provide completeness and complexity results. In addition, we introduce a new class of games where players’ expected payoff functions are encoded by E(G)-formulas. In these games each player’s aim is to randomise her strategic choices in order to affect the other players’ expectations over an outcome as well as their own. We offer a logical and computational characterisation of this new class of games. 1 Introduction In this work, we introduce the logics E(G) for reasoning about probabilistic expectation over classes G of games with discrete polynomial payoff functions represented by finite-valued Lukasiewicz formulas. This type of games, called Lukasiewicz games [14], is a generalisation of Boolean games [1, 12] and involves a finite set of players P = {P1 , . . . , Pn }, each controlling a finite set of propositional variables Vi , so that the sets Vi are mutually disjoint. Being in control of a set Vi of propositional variables means that Pi assigns to the variables in Vi values from the scale 1 k−1 Lk = 0, , . . . , ,1 . k k A strategy for a player Pi is a function s : Vi → Lk that corresponds to a valuation of the variables controlled by Pi . Strategies can be interpreted as efforts or costs, and each player’s strategic choice can be seen as an assignment to each controlled variable carrying an intrinsic cost. Each player is assigned Sn a finite-valued Lukasiewicz formula ϕi , with proposi- tional variables from i Vi , whose valuation is interpreted as the payoff function for Pi and corresponds to the restriction over Lk of a continuous piecewise linear polynomial function with integer coefficients. Notice that not all variables in ϕi might be under Pi ’s control and, consequently, Pi ’s payoff from playing a certain strategy (i.e. choosing a certain variable assignment) also depends on the choices made by (some of) the other players. In this work, we expand Lukasiewicz games by providing an explicit definition of a class G of games, and also defining suitable concepts of a mixed strategy, best response and Nash Equilibrium, adapted to this framework. Then, we define a probabilistic logic E(G) over many-valued formulas (of finite-valued Lukasiewicz logic) to reason about expectations in a class G of Lukasiewicz games. This might be in principle a bit surprising since probabilities are not the same as ex- pectations, but the reason is simple. The generalisation of a classical probability measure on Boolean algebras to the algebraic setting of MV-algebras [3] (the algebraic counterpart of Lukasiewicz logics) corresponds to the so-called states [16], which can be seen as averages of truth-values. Indeed, a state (or finitely- additive probability) σ over the set of Lukasiewicz logic formulas L (built from a given set of propositional variables V ) is a mapping σ : L → [0, 1] such that: – σ(1) = 1, – σ(ϕ ⊕ ψ) = σ(ϕ) + σ(ψ), if ¬(ϕ&ψ) is a Lukasiewicz tautology, – σ(ϕ) = σ(ψ), if ϕ ↔ ψ is a Lukasiewicz tautology. When we consider only finite-valued Lukasiewicz logics Lk , as proved in [18], a mapping σ : L → [0, 1] is a state on formulas iff there exists a probability distribution π : Ωk → [0, 1] on the set of Lk -valuations Ωk on L such that X σ(ϕ) = p(w) · w(ϕ). w∈Ωk The state σ(ϕ) corresponds to a weighted average of the truth-values of ϕ under all possible valuations, and it can also be regarded as the expected value of the function fϕ : Ωk → [0, 1], defined as fϕ (w) = w(ϕ) for all w ∈ Ωk . If we look at Lukasiewicz formulas as a particular class of functions, to reason about the probability of these formulas amounts to reasoning about the expectation of the corresponding functions. This is the view we take in this paper. Note that a logic, called F P (Ln , L), to reason about the probability of Lk - formulas was defined and axiomatised in [6]. The logic E(G) introduced here is a (partially) syntactical and semantic expansion of F P (Ln , L), and its expres- sive power makes it possible to represent expectations in games where players’ payoffs are given by discrete polynomial functions with integer coefficients over Lk (encoded by finite-valued Lukasiewicz formulas). E(G) is built from a set of non-modal formulas ϕ, ψ, . . . that correspond to formulas of the finite-valued Lukasiewicz logic Lck (with additional truth con- stants for each element c ∈ Lk ), and a set of modal formulas Eϕ, Eψ, . . . to repre- sent the expectation associated to each ϕ, ψ, . . . . Modal formulas are combined with the connectives of the LΠ 21 -logic [5], which makes it possible to express combinations of polynomial equalities and inequalities with rational coefficients. Based on the logics E(G), we also introduce a new class of games, called Lukasiewicz games with expectations, that generalise Lukasiewicz games. In the situations of strategic interactions modelled in Game Theory, the goal of each player is essentially the maximisation of her own expected payoff. Players, how- ever, often care not only about maximising their own expectation, but also about influencing other players’ expected outcomes. Lukasiewicz games with expecta- tions offer a formalisation of these kinds of strategic interactions where each player’s aim is to randomise her strategic choices in order to affect the other play- ers’ expectations over an outcome as well as their own expectation. Lukasiewicz games with expectations expand Lukasiewicz games by assigning to each player Pi a modal formula Φi of E(G), whose interpretation corresponds to a piecewise rational polynomial function whose variables are interpreted as expected val- ues. The modal formula Φi assigned to each player is then meant to represent a player’s goal concerning the relation between her and other players’ expectations. This work is organised as follows.1 The next section introduces the basic background notions about Lukasiewicz logics and the LΠ 12 -logic. In Section 3, we present the main definition of a Lukasiewicz game, define the concept of a class G of games and build the logics E(G) to represent expectations in each G. We provide both completeness and complexity results for E(G). In Section 4, we introduce games with expectations based on E(G) and offer some examples along with a logical and computational characterisation. We end with some final remarks. 2 Logical Background The language of Lukasiewicz logic L (see [3]) is built from a countable set of propositional variables {p1 , p2 , . . .}, the binary connective → and the truth con- stant 0 (for falsity). Further connectives are defined as follows: ¬ϕ is ϕ → 0̄, ϕ ∧ ψ is ϕ&(ϕ → ψ), ϕ&ψ is ¬(ϕ → ¬ψ), ϕ ∨ ψ is ((ϕ → ψ) → ψ), ϕ ⊕ ψ is ¬(¬ϕ&¬ψ), ϕ ↔ ψ is (ϕ → ψ)&(ψ → ϕ), ϕ ψ is ϕ&¬ψ, d(ϕ, ψ) is ¬(ϕ ↔ ψ). Let Form denote the set of Lukasiewicz logic formulas. A valuation e from Form into [0, 1] is a mapping e : Form → [0, 1] assigning to all propositional variables a value from the real unit interval (with e(0) = 0) that can be extended to complex formulas as follows: e(ϕ → ψ) = min(1 − e(ϕ) + e(ψ), 1) e(¬ϕ) = 1 − e(ϕ) e(ϕ&ψ) = max(0, e(ϕ) + e(ψ) − 1) e(ϕ ⊕ ψ) = min(1, e(ϕ) + e(ψ)) e(ϕ ψ) = max(0, e(ϕ) − e(ψ)) e(ϕ ∧ ψ) = min(e(ϕ), e(ψ)) e(ϕ ∨ ψ) = max(e(ϕ), e(ψ)) e(d(ϕ, ψ)) = |e(ϕ) − e(ψ)| e(ϕ ↔ ψ) = 1 − |e(ϕ) − e(ψ)| 1 Notice that the proofs of the main results are either simply sketched or omitted due to space constraints. A valuation e satisfies a formula ϕ if e(ϕ) = 1. As usual, a set of formulas is called a theory. A valuation e satisfies a theory T , if e(ψ) = 1, for every ψ ∈ T . Infinite-valued Lukasiewicz logic has the following axiomatisation: (L1) ϕ → (ψ → ϕ), (L2) (ϕ → ψ) → ((ψ → χ) → (ϕ → χ)), (L3) (¬ϕ → ¬ψ) → (ψ → ϕ), (L4) ((ϕ → ψ) → ψ) → ((ψ → ϕ) → ϕ). The only inference rule is modus ponens, i.e.: from ϕ → ψ and ϕ derive ψ. A proof in L is a sequence ϕ1 , . . . , ϕn of formulas such that each ϕi either is an axiom of L or follows from some preceding ϕj , ϕk (j, k < i) by modus ponens. We say that a formula ϕ can be derived from a theory T , denoted as T ` ϕ, if there is a proof of ϕ from a set T 0 ⊆ T . A theory T is said to be consistent if T 6` 0. Lukasiewicz logic is complete with respect to deductions from finite theories for the given semantics, i.e.: for every finite theory T and every formula ϕ, T ` ϕ iff every valuation e that satisfies T also satisfies ϕ. For each k ∈ N, the finite-valued Lukasiewicz logic Lk is the schematic ex- tension of L with the axiom schemas: (L5) (n − 1)ϕ ↔ nϕ, (L6) (kϕk−1 )n ↔ nϕk , for each integer k = 2, . . . , n − 2 that does not divide n − 1, and where nϕ is an abbreviation for ϕ ⊕ · · · ⊕ ϕ (n times) and ϕk is an abbreviation for ϕ& . . . &ϕ, (k times). The notions of valuation and satisfiability for Lk are defined as above just replacing [0, 1] by 1 k−1 Lk = 0, , . . . , ,1 k k as set of truth values. Every Lk is complete with respect to deductions from finite theories for the given semantics. It is sometimes useful to introduce constants in addition to 0 that will denote values in the domain Lk . Specifically, we will denote by Lck the Lukasiewicz logic obtained by adding constants c for every value c ∈ Lk . We assume that valuation functions e interpret such constants in the natural way: e(c) = c. A McNaughton function [3] is a continuous piecewise linear polynomial func- tions with integer coefficients over the nth-cube [0, 1]n . To each Lukasiewicz formula ϕ(p1 , . . . , pn ) we can associate a McNaughton function fϕ so that, for every valuation e fϕ (e(p1 ), . . . , e(pn )) = e(ϕ(p1 , . . . , pn )). Every L-formula is then said to define a McNaughton function. The converse is also true, i.e. every continuous piecewise linear polynomial function with integer coefficients over [0, 1]n is definable by a formula in Lukasiewicz logic. In the case of finite-valued Lukasiewicz logics, the functions defined by formulas are just the restrictions of McNaughton functions over (Lk )n . In this sense, we can associate to every formula ϕ(p1 , . . . , pn ) from Lk a function fϕ : (Lk )n → Lk . As for each Lck , the functions defined by a formula are combinations of restrictions of McNaughton functions and, in addition, the constant functions for each c ∈ Lk . The class of functions definable by Lck -formulas exactly coincides with the class of all functions f : (Lk )n → Lk , for every n ≥ 0. The expressive power of infinite-valued Lukasiewicz logic lies in, and is limited to, the definability of piecewise linear polynomial functions. Expanding L with the connectives , →Π of Product logic [10], interpreted as the product of reals and as the truncated division, respectively, significantly augments the expressive power of the logic. The LΠ 12 logic [5] is the result of this expansion, obtained by adding the connectives , →Π , 12 , whose valuations e extend the valuations for L as follows: ( 1 e(ϕ) ≤ e(ψ) e(ϕ ψ) = e(ϕ) · e(ψ) e(ϕ →Π ψ) = e(ψ) otherwise eϕ 1 e(ϕ) = 1 1 1 e 2 = 2 e(∆ϕ) = 0 otherwise Notice that the presence of the constant 12 makes it possible to define constants for all rationals in [0, 1] (see [5]). LΠ 12 ’s axioms include the axioms of Lukasiewicz and Product logics (see [10]) as well as the following additional axioms, where ∆ϕ is ¬ϕ →Π 0: (LΠ1) (ϕ ψ) (ϕ χ) ↔ ϕ (ψ χ), (LΠ2) ∆(ϕ → ψ) → (ϕ →Π ψ), (LΠ3) ∆(ϕ →Π ψ) → (ϕ → ψ), (LΠ4) 12 ↔ ¬ 12 . The deduction rules are modus ponens for & and →, and the necessitation rule for ∆, i.e.: from ϕ derive ∆ϕ. LΠ 12 is complete with respect to deductions from finite theories for the given semantics [5]. While L is the logic of McNaughton functions, LΠ 21 is the logic of piecewise rational functions with integer coefficients over [0, 1]n (see [15]). In fact, the function defined by each LΠ 21 -formula corresponds to a supremum of rational fractions P (x1 , . . . , xn ) Q(x1 , . . . , xn ) over [0, 1]n , where P (x1 , . . . , xn ), Q(x1 , . . . , xn ) are polynomials with rational co- efficients. Conversely, every piecewise rational function with rational coefficients over the unit cube [0, 1]n can be defined by an LΠ 21 -formula. 3 Logics for Representing Expectation In this section we first introduce the concept of a Lukasiewicz game on Lck along with the notion of a class G of Lukasiewicz games. Then, we define the logic E(G) to represent expected payoffs for games in G, and provide completeness and complexity results. 3.1 Lukasiewicz Games A Lukasiewicz game G on Lck [14] is a tuple G = hP, V, {Vi }, {Si }, {ϕi }i where: 1. P = {P1 , . . . , Pn } is a set of players; 2. V = {p1 , . . . , pm } is a finite set of propositional variables; 3. For each i ∈ {1, . . . , n}, Vi ⊆ V is the set of propositional variables under control of player Pi , so that the sets Vi form a partition of V, |Vi | = mi , and P n i=1 mi = m. 4. For each i ∈ {1, . . . , n}, Si is the strategy set for player Pi that includes all valuations s : Vi → Lk of the propositional variables in Vi , i.e. Si = {s | s : Vi → Lk }. 5. For each i ∈ {1, . . . , n}, ϕi (p1 , . . . , pt ) is an Lck -formula, built from variables in V, whose associated function fϕi : (Lk )t → Lk corresponds to the payoff function of Pi , and whose value is determined by the valuations in {S1 , . . . , Sn }. We denote by S = S1 × · · · × Sn the product of the strategy spaces. A tuple s = (s1 , . . . , sn ) ∈ S of strategies is called a strategy combination. s−i denotes the tuple of strategies (s1 , . . . , si−1 , si+1 , . . . , sn ) not including si . Aside from s, we use the form (si , s−i ) do denote a strategy combination. With an abuse of notation, we denote by fϕi (si , s−i ) (or, equivalently, fϕi (s)) the value of the payoff function fϕi under the valuation corresponding to the strategy combination (si , s−i ) [or, equivalently, s].2 Given a game G, let δ : P → {1, . . . , m} be a function assigning to each player Pi an integer from {1, . . . , m} that corre- sponds to the number of variables in Vi : i.e.: δ(Pi ) = mi . δ is called a variable distribution function. Given a game G, the type of G is the triple hn, m, δi, where n is the number of players, m is the number of variables in V, and δ is the variable distribution function for G. 2 fϕi (si , s−i ) can be also seen as the value of e(ϕi ), when e coincides with the valuation s. Definition 1 (Class). Let G and G 0 be two Lukasiewicz games G and G 0 on Lck of type hn, m, δi and hn, m, δ 0 i, respectively. We say that G and G 0 belong to the same class G if there exists a permutation of the indices {1, . . . , n} such that, for all Pi , δ(P(i) ) = δ 0 (Pi ). Notice that what matters in the definition of a type is not which players are as- signed certain variables, but rather their distribution. For instance, take two games G and G 0 all having three players P1 , P2 , P3 and the same variables p1 , . . . , p6 . Suppose that, in G, P1 controls p1 , P2 controls p2 , p3 , and P3 con- trols p4 , p5 , p6 , while in G 0 , P2 controls p3 , P3 controls p4 , p5 , and P1 controls p1 , p2 , p6 . G and G 0 have the same type, since they have the same number of players, the same number of variables, and the permutation , where 1 7−→ 2 2 7−→ 3 3 7−→ 1, is such that δ P(i) = δ 0 (Pi ) for all Pi . Given the above definitions, we can adapt some well-known game-theoretic concepts to this settings. We introduce the notion of mixed strategy in order to define the concept of expected payoff along with the notions of best response and equilibrium. Let G = hP, V, {Vi }, {Si }, {ϕi }i be a Lukasiewicz game on Lck . A mixed strat- egy πi for player Pi is a probability distribution on the strategy space Si . By π−i , we denote the tuple of mixed strategies (π1 , . . . , πi−1 , πi+1 , . . . , πn ). Given a tuple (π1 , . . . , πn ) of mixed strategies for P1 , . . . , Pn , respectively, the expected payoff for Pi of playing πi , when P−i play π−i , is given by X Yn expϕi (πi , π−i ) = πj (sj ) · fϕi (s) s=(s1 ,...,sn )∈S j=1 Let Σi denote the set of mixed strategies of Pi . Pi ’s best response to P−i ’s mixed strategy combination π−i is a mixed strategy πi such that, for all strategies πi0 ∈ Σi : expϕi (πi , π−i ) ≥ expϕi (πi0 , π−i ). Definition 2 (Nash Equilibrium). Let G be a Lukasiewicz game on Lck . We call a tuple of mixed strategies (π1∗ , . . . , πn∗ ) a Nash Equilibrium for G if each player’s mixed strategy πi∗ is a best response to the other players’ mixed strategy ∗ combination π−i . Example: Matching Pennies. The following game is a generalisation of Match- ing Pennies3 , a classic example of a zero-sum game without a pure strategy equi- librium. In the original game, two players P1 and P2 both have a penny and must secretly choose to turn it to head or tails and reveal their choice at the same 3 The idea behind this generalisation comes from [13]. time. If their choices agree, P1 takes both pennies, but if they do not match, P2 is the one winning both. Now, imagine that both players have a dice with n + 1 faces, they both choose one and reveal it at the same time. P1 ’s overall strategy is to be as close as possible to P2 ’s choice, who, instead, wants to keep the greater possible distance between the outcomes. Clearly, we can represent each player’s strategy space with the set Lk . P1 ’s payoff function is given by the formula ¬d(p1 , p2 ), whose associated function is 1−|e(p1 )−e(p2 )|, while P2 ’s payoff is defined by the formula d(p1 , p2 ), which corresponds to the function |e(p1 ) − e(p2 )|. The game is formally defined on Lck as follows: G= {P1 , P2 }, {p1 , p2 }, {p1 }1 , {p2 }2 , {s1 : {p1 } → Lk }, {s2 : {p2 } → Lk }, {¬d(p1 , p2 ), d(p1 , p2 )} . Table 1 shows the payoff matrix for this generalised version of Matching Pennies with k = 5 (the values in each cell correspond to the first and second player payoffs under the valuation to the variables p1 , p2 ). By Nash’s Theorem [17], we know that an equilibrium with mixed strategies always exists for the generalised version of Matching Pennies. P2 0 1/5 2/5 3/5 4/5 1 0 1, 0 4/5, 1/5 3/5, 2/5 2/5, 3/5 1/5, 4/5 0, 1 1/5 4/5, 1/5 1, 0 4/5, 1/5 3/5, 2/5 2/5, 3/5 1/5, 4/5 2/5 3/5, 2/5 4/5, 1/5 1, 0 4/5, 1/5 3/5, 2/5 2/5, 3/5 P1 3/5 2/5, 3/5 3/5, 2/5 4/5, 1/5 1, 0 4/5, 1/5 3/5, 2/5 4/5 1/5, 4/5 2/5, 3/5 3/5, 2/5 4/5, 1/5 1, 0 4/5, 1/5 1 0, 1 1/5, 4/5 2/5, 3/5 3/5, 2/5 4/5, 1/5 1, 0 Table 1. Generalised Matching Pennies. 3.2 The Logic E(G) The aim of this section is to introduce the logic E(G) for reasoning about ex- pected utility in games with Lukasiewicz strategies. Notice that, while sometimes we refer to E(G) as “a logic”, we are actually defining a whole family of logics, one for each class G. Syntax The construction of E(G) mimics the one provided for logics for reason- ing about uncertainty in [7]. The syntax of E(G) is built by taking the m-variable fragment mLck 4 of Lck as inner logic and LΠ 21 as outer logic. Its language is defined as follows: 1. The set NModF of non-modal formulas corresponds to the set of mLck - formulas built from the propositional variables p1 , . . . , pm . 2. The set ModF of modal formulas is built from the atomic modal formulas Eϕ, with ϕ ∈ NModF, using the LΠ 12 connectives. Eϕ is meant to encode a player’s expected payoff from playing a mixed strategy, given the payoff function associated to ϕ. Nested modalities are not allowed. Semantics Given a class of games G on mLck , a model M for E(G) is a tuple hS, e, {πi }i, such that: 1. S is the set of all strategy combinations {s = (s1 , . . . , sn ) | (s1 , . . . , sn ) ∈ S1 × · · · × Sn }. 2. e : (NModF × S) → Lk is a valuation of non-modal formulas, such that, for each ϕ ∈ NModF e(ϕ, s) = fϕ (s), where fϕ is the function associated to ϕ and s = (s1 , . . . , sn ). 3. πi : Si → [0, 1] is a probability distribution, for each Pi . Given a formula Φ, the truth value of Φ in M at the combination s, denoted kΦkM,s , is inductively defined as follows: 1. If Φ is a non-modal formula ϕ ∈ NModF, then kϕkM,s = e(ϕ, s), 2. If Φ is an atomic modal formula Eϕ, then X n Y kEϕkM,s = expϕ (π1 , . . . , πn ) = πj (sj ) · e(ϕ, s) .5 s=(s1 ,...,sn )∈S j=1 3. If Φ is a non-atomic modal formula, its truth value is computed by evalu- ating its atomic modal subformulas and then by using the truth functions associated to the LΠ 12 -connectives occurring in Φ. 4 Notice that the functions definable in mLck are exactly all the functions f : (Lk )t → Lk , where t ≤ m. 5 Notice that the notation is slightly different from the one used above for the definition of expected payoff but the meaning is the same. Since the valuation of a modal formula Φ does not depend on a specific strategy combination but only on the model M, we will often simply write kΦkM to denote the valuation of Φ in M. As usual, we say that a formula Φ is satisfiable if there exists a model M such that kΦkM = 1. Similarly, a modal theory Γ , i.e. a set of modal formulas, is satisfiable if there exists a model that satisfies each and every one of the formulas contained in Γ . Validity clearly means satisfiability in all models. Axiomatisation The axioms of E(G) are the following: 1. All the Lck -tautologies for mLck , i.e.: all the Lck -tautologies in the variables p1 , . . . , pm , for non-modal formulas. 2. All the LΠ 12 -axioms and rules for modal formulas. 3. Probabilistic axioms for E, with ϕ, ψ, r ∈ NModF: (a) E(¬ϕ) ↔ ¬Eϕ (b) E(ϕ ⊕ ψ) ↔ [(Eϕ → E(ϕ&ψ)) → Eψ] (c) Er ↔ r 4. Independence axioms for E, where p1i , . . . , pmi is the tuple of variables as- signed to Pi , for all tuples r11 , . . . , rm1 , . . . , r1n , . . . , rmn ∈ (Lk )m : V V Jn Vmi n mi (a) E i=1 ji =1i (∆ (p ji ↔ r ji )) ↔ i=1 E ji =1i ∆ (p ji ↔ r ji ) 5. The following inference rules for E, with ϕ, ψ ∈ NModF: (a) Necessitation: from ϕ derive Eϕ (b) Monotonicity: from ϕ → ψ derive Eϕ → Eψ Notice that each mi ^ (†) (∆ (pji ↔ rji )) ji =1i encodes a particular strategy for player Pi . In fact, each ∆ (pji ↔ rji ) is satisfi- able if and only if e(pji ) = rji . So, given a tuple r1i , . . . , rmi , (†) is satisfiable by a strategy si if and only if si (pji ) 7−→ rji , for all pji . The notion of proof in E(G) is defined as usual. For any modal theory Γ and formula ϕ, we write Γ `E(G) ϕ to denote that ϕ is a consequence of Γ in E(G). Completeness Before we prove completeness for E(G) we provide an axiomatic characterisation for the expectation of mixed strategies over mLck -formulas. Theorem 1. Let G be a class of Lukasiewicz games on Lck and let mLck be the m-variable fragment of Lck . The following statements are equivalent: 1. There exists a state σ : mLck → [0, 1]. such that for all tuples r11 , . . . , rm1 , . . . , r1n , . . . , rmn ∈ (Lk )m n ^ mi ^ Yn mi ^ σ (∆ (pji ↔ rji )) = σ ∆ (pji ↔ rji ) , i=1 ji =1i i=1 ji =1i where p1i , . . . , pmi is the tuple of variables assigned to Pi . 2. There exists a probability distribution πi : Si → [0, 1] for each Pi , such that, for all ϕ ∈ mLck , σ(ϕ) = expϕ (π1 , . . . , πn ), i.e.: X Yn σ(ϕ) = πj (sj ) · fϕ (s) , s=(s1 ,...,sn )∈S j=1 where fϕ is the function associated to ϕ. Proof. It is easy to check that (2) implies (1). To prove the converse, suppose that there exists a state σ : mLck → [0, 1]. As shown by Paris in [18, Appendix 2] there exists a probability distribution π : S1 × · · · × Sn → [0, 1] such that X σ(ϕ) = π(s) · fϕ (s), s∈S for all ϕ ∈ mLck . Now, let mi Lck [pi ] be the mi -variable fragment of Lck in the variables pi = {p1i , . . . , pmi }, i.e. the variables assigned to Pi . Let σ↓i be the restrictions of σ to mi Lck [pi ]. It is clear that each σ↓i is still a probability measure. It follows, again, from [18, Appendix 2] that, for each i, there exists a probability distribution πi : Si → [0, 1] such that, for all ψ ∈ mi Lck [pi ] X σ(ψ) = σ↓1 (ψ) = πi (si ) · fψ (si ). si ∈Si By assumption, σ satisfies n ^ mi ^ n Y mi ^ σ (∆ (pji ↔ rji )) = σ ∆ (pji ↔ rji ) , i=1 ji =1i i=1 ji =1i for all tuples r11 , . . . , rm1 , . . . , r1n , . . . , rmn ∈ (Lk )m . It is possible to check that the above condition guarantees the fact that all probability distributions πi are independent, and so for all s ∈ S n Y π(s) = πi (si ). i=1 Therefore X X n Y σ(ϕ) = π(s) · fϕ (s) = πj (sj ) · fϕ (s) . s∈S s=(s1 ,...,sn )∈S j=1 We can now proceed to proving the Completeness Theorem. Theorem 2 (Completeness). Let Γ and Φ be a finite modal theory and a modal formula in E(G). Then, Γ `E(G) Φ if and only if for every model M such that, for each Ψ ∈ Γ , kΨ kM = 1, also kΦkM = 1. Proof. The proof follows from an adaptation of the strategy laid out in [10] and generalised in [7]. We now study the computational complexity of certain kinds of satisfiability problems for E(G). Let r ∈ Q ∩ [0, 1] and let [ ∈ {<, >, ≤, ≥, =}. We call an E(G)-modal formula Φ [r-satisfiable if there is a model M such that kΦkM [r. Following a strategy similar to the one laid out by Hájek in [11] for probability logics, satisfiability of an E(G)-formula Φ can be translated into an existential formula in the theory of the reals whose size is exponential in the number of non-modal propositional variables in Φ. Decidability for the existential theory of the reals Th∃ (R) was shown by Canny to be in PSPACE [2]. Therefore, we obtain the following result: Theorem 3. Checking [r-satisfiability for E(G) is in EXPSPACE. 4 Games with Expectations In this section we introduce a class of games with polynomial constraints over ex- pectations. These games expand Lukasiewicz games by assigning to each player a formula of E(G), which is a piecewise rational polynomial function whose vari- ables correspond to expected values. The idea is that in a situation of strategic interaction players might be interested not only in maximizing their own ex- pectation, but also in influencing others’. The modal formula assigned to each player is then meant to represent a player’s goal concerning the relation between her and other players’ expectations. A game with expectations EG on E(G) is a tuple EG = hP, V, {Vi }, {Si }, {ϕi }, {Mi }, {Φi }i, where: 1. G = hP, V, {Vi }, {Si }, {ϕi }i is a Lukasiewicz game on Lck , with G ∈ G, 2. for each i ∈ {1, . . . , n}, Mi is the set of all mixed strategies on Si of player Pi , 3. for each i ∈ {1, . . . , n}, Φi is an E(G)-formula such that every atomic modal formula occurring in Φi has the form Eψ, with ψ ∈ {ϕ1 , . . . , ϕn }. Let EG be a game with expectations on E(G). A model M = hS, e, {πi }i for E(G) is called a best response model for a player Pi whenever, for all models M0 = hS, e, {πi0 }i with π−i 0 = π−i , kΦi kM0 ≤ kΦi kM . Definition 3 (Equilibrium). A game with expectations EG on E(G) is said to have a Nash Equilibrium, whenever there exists a model M∗ that is a best response model for each player Pi . Example 1. Let EG be any game with expectations where each Pi is simply assigned the formula Φi := Eϕi . This game corresponds to the the situation where each player cares only about her own expectation and whose goal is its maximisation. Clearly, by Nash’s Theorem [17], every EG of this form admits an Equilibrium, since it offers a formalisation of the classical case where equilibria are given by tuples of mixed strategies over valuations in a Lukasiewicz game. Example 2. Not every game with expectations has an equilibrium. In fact, consider the following game EG = hP, V, {Vi }, {Si }, {ϕi }, {Mi }, {Φi }i, with i ∈ {1, 2}, where: 1. ϕ1 := p1 and ϕ2 := p2 , and 2. Φ1 := ¬d(E(p1 ), E(p2 )) and Φ2 := d(E(p1 ), E(p2 )). The above game can be regarded as a particular version of Matching Pennies with expectations. In fact, while P1 aims at matching P2 ’s expectation, P2 ’s goal is quite the opposite, since she wants their expectations to be as far as possible. It is easy to see that there is no model M that gives an equilibrium for EG . Proposition 1. There exist games with expectations on E(G) that do not admit a Nash Equilibrum. As mentioned above, the satisfiability of every E(G)-formula can be trans- lated into the validity of an existential formula of the theory Th(R) of real closed fields. In a similar way, we can express the existence of an equilibrium in a game with expectations EG through a first-order sentence ξ of Th(R) having exponen- tially many variables and a fixed alternation of quantifiers. By using the fact that the general decidability problem for Th(R) is singly exponential in the number of variables when the alternation of quantifiers is fixed [8], we obtain the following result. Theorem 4 (Complexity). Checking the existence of a Nash Equilibrium in a game with expectations EG on E(G) is in 2-EXPTIME. By exploiting the connection between LΠ 21 and real closed fields it is also possible to express the existence of an equilibrium through an LΠ 12 -formula (see [4]). Therefore, we obtain the following logical characterisation. Theorem 5. For every game with expectations EG on E(G) there exists an LΠ 12 - formula φ such that EG admits a Nash Equilibrium if and only if φ is satisfiable. 5 Final Remarks In this work, we presented a logic E(G) for reasoning about expectations in a class G of Lukasiewicz games. We have also introduced a new class of games based on E(G) that expand Lukasiewicz games. These games capture strategic interactions in which players randomise their choices in order to influence the expectations of the other players as well as their own. While our approach to representing expectation in games in a logical frame- work is certainly novel, other works have dealt with similar topics. In [9], Halpern and Pucella introduced a propositional logic for reasoning about probabilistic ex- pectation. Their work, though, is mainly concerned with modelling expectation in general and not in a game-theoretic setting. Certainly closer to our paper is the work by Sack and van der Hoek [19], where the authors study a modal logic to reason about mixed strategies in games. Although (each instance of) their logic is based on a fixed game and a fixed set of mixed strategies, it makes it possible to represent the concept of a Nash Equilibrium through logical formulas: a feature that is not possible in our logic. In our future work, we plan to extend the logical study of expectation for Lukasiewicz games to those situations where payoff formulas are functions de- fined over the whole unit cube [0, 1]n , and, consequently, players have an infinite strategy space. Acknowledgments Godo acknowledges support from the Spanish projects EdeTRI (TIN2012-39348- C02-01) and AT (CONSOLIDER CSD 2007-0022). Marchioni acknowledges sup- port from the Marie Curie Project NAAMSI (FP7-PEOPLE-2011-IEF). References 1. E. Bonzon, M.C. Lagasquie-Schiex, J. Lang, B. Zanuttini. Boolean games revisited. In Proceedings of the Seventeenth European Conference on Artificial Intelligence (ECAI-2006), Riva del Garda, Italy, 265–269, 2006. 2. J.F. Canny. Some algebraic and geometric computations in PSPACE. In Proceedings of the 20th ACM Symposium on Theory of Computing, 460–467, 1988. 3. R. Cignoli, I. M. L. D’Ottaviano, D. Mundici. Algebraic Foundations of Many-alued Reasoning, Volume 7 of Trends in Logic, Kluwer Academic Publishers, Dordrecht, 2000. 4. F. Esteva, L. Godo, E. Marchioni. Fuzzy logics with enriched language. In Handbook of Mathematical Fuzzy Logic, Cintula P., Hájek P., and Noguera C. (Editors), College Publications, 627–712, 2011. 5. F. Esteva, L. Godo, F. Montagna. The LΠ and LΠ 21 logics: two complete fuzzy logics joining Lukasiewicz and product logic. Archive for Mathematical Logic, 40: 39–67, 2001. 6. T. Flaminio, L. Godo. A logic for reasoning about the probability of fuzzy events. Fuzzy Sets and Systems, 158: 625–638, 2007. 7. T. Flaminio, L. Godo, E. Marchioni. Reasoning about uncertainty of fuzzy events: an overview. In Reasoning under Vagueness - Logical, Philosophical, and Linguistic Perspectives, Cintula P., Fermüller C., Godo L., Hájek P. (Editors), Studies in Logic, College Publications, 367–400, 2011. 8. D. Y. Grigor’ev. Complexity of deciding Tarski algebra. Journal of Symbolic Com- putation, 5: 65–108, 1988. 9. J.Y. Halpern, R. Pucella. Characterizing and reasoning about probabilistic and non- probabilistic expectation. Journal of the ACM, Article No. 15, 54(3), 2007. 10. P. Hájek Metamathematics of Fuzzy Logic. Volume 4 of Trends in Logic, Kluwer Academic Publishers, Dordrecht, 1998. 11. P. Hájek. Complexity of fuzzy probability logics II. Fuzzy Sets and Systems, 158(23): 2605–2611, 2007. 12. P. Harrenstein, W. van der Hoek, J.J.Ch. Meyer, C. Witteveen. Boolean games. In Proceeding of the Eighth Conference on Theoretical Aspects of Rationality and Knowledge (TARK VIII), J. van Benthem (Ed.), Siena, Italy, 287–298, 2001. 13. T. Kroupa, O. Majer. Nash Equilibria in a class of constant-sum games represented by McNaughton functions. In Logic, Algebra and Truth Degrees 2012, Book of Abstracts, K. Terui and N. Preining (Eds.), 2012. 14. E. Marchioni, M. Wooldridge. Lukasiewicz Games. Submitted. 15. F. Montagna, G. Panti. Adding structures to MV-algebras. Journal of Pure and Applied Algebra, 164: 365–387, 2001. 16. D. Mundici, Averaging the truth-value in Lukasiewicz logic. Studia Logica 55(1): 113–127, 1995. 17. J. Nash. Non-cooperative games. The Annals of Mathematics, Second Series, 54(2): 286–295, 1951. 18. J. B. Paris. A note on the Dutch Book method (revised version). In Second Interna- tional Symposium on Imprecise Probability and their Application, Cornell University, Ithaca, NY (USA), 2001. 19. J. Sack, W. van der Hoek. A Modal Logic for Mixed Strategies. Studia Logica, accepted.