-

Implementable strategies for a two-player asynchronous game on Petri nets

Federica Adobbati

Luca Bernardinello

Lucia Pomello

Riccardo Stramare

Università degli studi di Milano-Bicocca

DISCo

2022

20 21

We consider a two-player game on Petri nets, in which each player controls a subset of transitions. The players are called 'user' and 'environment'; we assume that the environment must guarantee progress on its transitions. A play of this game is a run in the unfolding of the net, satisfying the progress assumption. In general, we define a strategy for the user as a map from the set of 'observations' to subsets of transitions owned by the user. Diferent restrictions on strategies can be used to encode observability assumptions. We say that a given strategy is implementable if the net can be endowed with new places so that the runs of the new net coincide with the plays of the original net, complying with the strategy. We propose an algorithm based on the search of regions to decide whether a strategy is implementable.

markings as sets of places. We denote with MG(Σ) = (, , , 0) the sequential marking graph of Σ, where = {(, , ′) : , ′ ∈ , ∈ [⟩′} is the set of labelled arcs. We denote with the set of transitions controlled by the user, and with the set of transitions uncontrollable for him. We assume that ∪ = , ∩ = ∅.

We will also refer to the unfolding of the net (see [2] for the formal definitions). The unfolding is an acyclic, possibly infinite, net representing all the possible histories of the executions of a net system. In the unfolding of a net system, only forward conflicts are allowed: two events can share a precondition, but no postcondition. Since the unfolding is acyclic, the reflexive and transitive closure of the flow relation is a partial order. If two events 1 and 2 are in conflict, then we say that every descendant of 1 is in conflict with every descendant of 2. Two elements are concurrent if they are neither ordered nor in conflict. The events of the unfolding are partitioned into controllable and uncontrollable events depending on their correspondence to occurrences of controllable or uncontrollable transitions, respectively.

A run is a subnet of the unfolding representing an execution, i.e. it is a net without conflicts and close with respect to the past of its elements. A run is maximal if no event can be added without creating a conflict.

A play in the game is formally defined as a run in the unfolding of the net, maximal with respect to uncontrollable transitions. The winning condition for the user is a set of plays. The user wins a play if the play belongs to the winning condition. In the example discussed above, the winning condition is formed by all the maximal runs with an infinite number of occurrences of the place . Figure 2 shows, on the left, a play ending in a deadlock (the environment wins) and, on the right, the initial segment of an infinite play (the user wins).

2. Strategies

In order to reach his goal, the user can apply a strategy. We assume that the user has no memory and only a partial knowledge of the current state of the net. This is formalized by assuming an equivalence relation, denoted by ≡ , on the set of reachable markings. The equivalence classes of this relation will be called observations, and a strategy is defined as a map from observations to sets of controllable transitions. For example, if we assume the user may observe only some places, two markings are considered equivalent if they share the same observable places. An example of equivalence relation between partially observable markings has been considered in the game presented in [3].

Let Obs be the set of observations. Then a strategy is a map : Obs → 2 . The notion of observation can be extended to unfoldings: a maximal set of pairwise concurrent places in the unfolding is called a B-cut. Every B-cut corresponds to a reachable marking. We say that two B-cuts are equivalent if their corresponding markings are equivalent.

Two B-cuts of a run , and ′, are ordered, denoted < ′, if ̸= ′ and ∀ ∈ , ∃′ ∈ ′ such that: ≤ ′. A sequence 1 2... ... of B-cuts is increasing if < for each , ∈ N with < .

A play complies with a strategy if ( 1 ) there is a sequence of observations which is the projection of an increasing sequence of B-cuts in , and ( 2 ) every controllable event in is chosen by the strategy in one of the observations in .

Given a strategy , we can construct the reduced version of MG(Σ), by removing arcs corresponding to controllable transitions not chosen by the strategy, and then removing all states and transitions unreachable from the initial marking. The reduced version will be denoted by MG (Σ), and the operation will be referred to as the -reduction of MG(Σ). Fig. 3 shows the reduced version of the marking graph of the net in Fig. 1, when is the strategy described above. 2.1. Implementable strategies As defined above, the notion of strategy is not encoded in the net system itself. We will now investigate the possibility of defining a strategy within the net.

The idea we pursue is this: a strategy is encodable within a net if its decisions can be represented as extra places, corresponding to causal relations from uncontrollable transitions to controllable transitions. We will then say that a strategy is implementable if we can add new places to the given net, so that the runs of the augmented nets are exactly the plays complying with the strategy in the original net. Formally, we need a notion of “augmented” net system, in which we add places that restrict the possible behaviours.

For Σ = (, , , 0), with = 1, 2, we write Σ1 ⊑ Σ2 if Σ2 is obtained by adding new places to Σ1, hence 1 ⊆ 2, 1 = 2, 1 ⊆ 2 and new arcs in 2 have one end in 2 ∖ 1, 01 ⊆ 02 and 02 ∖ 01 ⊆ 2 ∖ 1.

A given strategy might prohibit any occurrence of a given controllable transition, and might make some uncontrollable transition unreachable in complying plays. Let be the set of reachable transitions in complying plays, and let Σ be the net system whose underlying net is the subnet generated by , i.e.: Σ = ( , , , 0 ), where = ∙ ∪ ∙ , is restricted to ( × ) ∪ ( × ) and 0 = 0 ∩ .

Definition 1. A strategy for a game on Σ is implementable if there exists a 1-safe net system Σ′ = ( ′, , ′, ′0) such that ( 1 ) Σ ⊑ Σ′; ( 2 ) the marking graph of Σ′ is isomorphic to MG (Σ).

Consider again the net system in Fig. 1. Suppose that the user can observe all the places in the net, so that it has full knowledge of the current marking. If his goal is to reach infinitely often the place , then a winning strategy is defined by the following clauses: ( 1 ) ({0, 0}) = ∅; ( 2 ) ({0, 1}) = {1}; ( 3 ) ({0, 2}) = {2}. In this case, the strategy does not prevent the occurrence of any controllable transition, it only selects which one between 1 and 2 to choose, depending on the observation of {0, 1} or {0, 2}; therefore the set of reachable transitions in plays complying with , , is itself, and then Σ = Σ.

The marking graph of Σ, MG(Σ), is shown on the left side of Fig. 3, whereas the reduced version MG (Σ) is on the right side of the same figure. The information needed to take the right decision in accordance with , in this simple case, can be encoded by adding to Σ a couple of places, one going from 1 to 1, the other from 2 to 2, as shown in Fig. 4, on the right, where the new places are drawn in dark grey. It is easy to see that the obtained system has a marking graph isomorphic to MG (Σ).

3. An algorithm to decide if a strategy is implementable

In this section, we propose a simple algorithm to decide whether a given strategy is implementable. The algorithm checks whether the reduced marking graph is isomorphic to the marking graph of a 1-safe Petri net, and relies on the theory of regions [4]. We first recall some basic notions related to regions of transition systems.

Region theory and separation problems As remarked above, each node of the marking graph of a 1-safe Petri net is a set of places. Hence, we can associate to each place its extension, namely the set of reachable markings to which it belongs: ∀ ∈ () = { ∈ | ∈ }. The extension of a place satisfies the uniform crossing property: for any given transition , if one occurrence of in the marking graph enters (), then all occurrences of do the same, and analogously when an occurrence leaves (). This property can be defined more abstractly for general labelled transition systems, giving the notion of a region of a transition system.

The synthesis problem consists in deciding if a given labelled transition system is isomorphic to the marking graph of a net system, and, if so, in constructing such a net. The problem admits a solution if the set of regions satisfies the so-called state separation problems and the event-state separation problems. In this case, we say that the transition system is separated.

Consider now a marking graph MG(Σ), a strategy , and the reduced transition system MG (Σ) = (, , , 0). Then, it is easy to prove that, for each region of MG(Σ), the set ∩ is a region of MG (Σ). This implies that all state separation problems of MG (Σ) are solved by those regions, since MG(Σ) is separated by definition. This is not the case for the event-state separation problems: for each edge labelled with removed from MG(Σ) in the state , we need to check if there is a region without state from which leaves. The algorithm Given a strategy for a game on Σ, we can now describe an algorithm to decide whether is implementable. The algorithm first computes MG (Σ), and then checks if it is separated by trying to solve each new event-state separation problem. The new regions solving those problems, if any, encode the flow of information from observations to the controllable part of the system.

4. Conclusion

In this work we presented a notion of implementable strategy on 1-safe nets, and we provided an algorithm to decide whether the strategy is implementable. In future works, we plan to broad the definition of implementable strategy by allowing for strategies modelled by adding general bounded places. As an example, consider the net on the left of Fig. 5 where the user can observe the places {3, 4, 5, 6, 10}, and his goal is to reach place 12. A winning strategy is: ({4, 6, 10}) =H, ({3, 5, 10}) = ({3, 6, 10}) = ({4, 5, 10}) = I. The choice of H can be represented by adding a place from B to H, and a place from D to H. Instead, the choice of I depends on the occurrence of either A or C, hence it may be represented with a place allowing for two tokens, making the net not 1-safe. Although the strategy is not implementable according to the given definition, the net on the right of Fig. 5 realizes the strategy.

Furthermore, we plan to relax the definition of implementable strategy, by allowing for implementations in which only parts of the behaviours in the reduced marking graph are reproducible on the net. Specifically, a strategy is implementable if we can add places to the initial net so that there is at least a maximal run in the unfolding of the augmented net, and all its maximal runs are associated to maximal paths of the reduced marking graph.

A similar use of regions can be found in applications to problems of control: in [5] and [6] regions are used to synthesize maximally permissive controllers avoiding unwanted states; [7] proposes an implementation based on region theory of a controller, applied to flexible manufacturing systems.

A general overview of works and approaches to automatic control using Petri nets can be found in [8].

Acknowledgments

This work is supported by the Italian MUR. games on Petri nets with partial observability, 2022. URL: https://arxiv.org/abs/2204.01603. doi:10.48550/ARXIV.2204.01603. [4] E. Badouel, L. Bernardinello, P. Darondeau, Petri Net Synthesis, Texts in Theoretical Computer Science. An EATCS Series, Springer, 2015. URL: http://dx.doi.org/10.1007/ 978-3-662-47967-4. doi:10.1007/978-3-662-47967-4. [5] A. Ghafari, N. Rezg, X. Xie, Design of a live and maximally permissive Petri net controller using the theory of regions, IEEE Transactions on Robotics and Automation 19 (2003) 137–141. doi:10.1109/TRA.2002.807555. [6] Z. Li, M. Zhou, M. Jeng, A maximally permissive deadlock prevention policy for fms based on petri net siphon control and the theory of regions, IEEE Transactions on Automation Science and Engineering 5 (2008) 182–188. [7] S. Rezig, C. Ghorbel, Z. Achour, N. Rezg, PLC-based implementation of supervisory control for flexible manufacturing systems using theory of regions, International Journal of Automation and Control 13 (2019) 619–640. [8] A. Giua, M. Silva, Petri nets and automatic control: A historical perspective, Annual Reviews in Control 45 (2018) 223–239.

[1]

Adobbati ,

Bernardinello ,

Pomello , A two-player asynchronous game on fully observable Petri nets ., Trans. Petri Nets Other Model. Concurr . 15 ( 2021 ) 126 - 149 .

[2]

Engelfriet , Branching processes of Petri nets , Acta Inf . 28 ( 1991 ) 575 - 591 . URL: https: //doi.org/10.1007/BF01463946. doi: 10 .1007/BF01463946.

[3]

Adobbati ,

Bernardinello , L. Pomello, Looking for winning strategies in two-player