Norm Refinement and Design through Inductive Learning

Domenico Corapi1, Marina De Vos2, Julian Padget2, Alessandra Russo1, and Ken Satoh3

1 Department of Computing, Imperial College London {d.corapi,a.russo}@imperial.ac.uk
2 Department of Computer Science, University of Bath {mdv,jap}@cs.bath.ac.uk
3 National Institute of Informatics ksatoh@nii.ac.jp

Abstract. In the physical world, the rules governing behaviour are debugged by observing an outcome that was not intended and by adding new constraints intended to prevent the attainment of that outcome. We propose a similar approach to support the incremental development of normative frameworks (also called institutions) and demonstrate how it works through the validation and synthesis of normative rules using model generation and inductive learning. This is achieved by the designer providing a set of use cases, comprising collections of event traces that describe how the system is used, along with the desired outcome with respect to the normative framework. The model generator encodes the description of the current behaviour of the system. The current specification and the traces for which current and expected behaviour do not match are given to the learning framework, which proposes new rules that revise the existing norm set in order to inhibit the unwanted behaviour. The elaboration of a normative system can then be viewed as a semi-automatic, iterative process for the detection of incompleteness or incorrectness of the existing normative rules, with respect to desired properties, and the construction of potential additional rules for the normative system.

1 Introduction

Norms and regulations play an important role in the governance of human society. Social rules such as laws, conventions and contracts prescribe and regulate our behaviour; however, it is possible for us to break these rules at our discretion and face the consequences.
By providing the means to describe and reason about norms in a computational context, normative frameworks (also called institutions or virtual organisations) may be applied to software systems, allowing for automated reasoning about the consequences of socially acceptable and unacceptable behaviour by monitoring the permissions, empowerment and obligations of the participants and generating violations when norms are not followed.

This work is partially supported through the EU Framework 7 project ALIVE (FP7-IST-215890) and the EPSRC PRiMMA project (EP/F023294/1).

The formal model put forward in [9] and its corresponding operationalisation through Answer Set Programming (ASP) [3, 18] aim to support the top-down design of normative frameworks. AnsProlog is a knowledge representation language that allows the programmer to describe a problem, and the required properties of its solutions, in an intuitive way. Programs consist of rules interpreted under the answer set semantics. Answer set solvers, like CLASP [17] or SMODELS [25], can be used to reason about a given AnsProlog specification by returning acceptable solutions, in the form of traces, as answer sets. In a similar way, the correctness of the specification with respect to given properties can be verified.

Currently, the elaboration of behavioural rules and norms is an error-prone process that relies on the manual efforts of the designer and would, therefore, benefit from automated support. In this paper, we present an inductive logic programming (ILP) [24] approach for the extraction of norms and behaviour rules from a set of use cases. The approach is intended as a design support tool for normative frameworks. Complex systems are hard to model, and even when testing of properties is possible, it can be hard to identify missing or incorrect rules. In some cases, e.g.
legal reasoning, the abstract specification of the system can be given partly in terms of specific instances and use cases that ultimately drive the design process and are used to assess it. We propose a design support tool that employs use cases, i.e. traces together with their expected normative behaviour, to assist in the revision of a normative framework. The system is correct when none of the traces is considered dysfunctional, i.e. they all match the expected normative behaviour. When a dysfunctional trace is encountered, the normative specification needs to be adjusted: the task is to refine the given description by learning missing norms and/or behavioural rules that, added to the description, entail the expected behaviour over the traces. We show how this task can be naturally represented as a non-monotonic ILP problem in which the partial description of the normative system provides the background knowledge and the expected behaviour comprises the examples. In particular, we show how a given AnsProlog program and traces can be reformulated into an ILP representation that makes essential use of negation in inducing missing parts of the specification. As the resulting learning problem is inherently non-monotonic, we use a non-monotonic ILP system, called TAL [12], to compute the missing specification from the traces and the initial description.

Given the declarative nature of ASP, the computational paradigm used for our normative frameworks, we needed to adopt a declarative learning approach, as we aim to learn declarative specifications. This differs from other approaches, such as reinforcement learning, whereby norms or policies are learned as the outcome of estimation and optimisation processes. Such policies are not directly representable in a declarative format and are quite different in nature from the work reported here.

The paper is organised as follows.
Section 2 presents some background material on the normative framework, while Section 3 introduces the non-monotonic ILP system used in our proposed approach. Section 4 describes the AnsProlog modelling of normative frameworks. Section 5 illustrates how the revision task can be formulated as an ILP problem, and how the generated ILP hypothesis can be reformulated as norms and behaviour rules within the AnsProlog representation. In Section 6 we illustrate the flexibility and expressiveness of our approach through a number of different partial specifications of a reciprocal file-sharing normative framework. Section 7 relates our approach to existing work on learning norms with respect to changing/improved requirements. We conclude with a summary and remarks about future work.

2 Normative Frameworks

The concept of normative framework has become firmly embedded in the agent community as a necessary foil to the essential autonomy of agents, in just the same way as societal conventions and legal frameworks have grown up to constrain people. In both the physical and the virtual worlds, and the emerging combination of the two, the arguments in favour centre on the minimisation of disruptive behaviour and on supporting the achievement of the goals for which the normative framework has been conceived, which is thus also the motivation for the participants' submission to its governance. While the concept remains attractive, its realisation in a computational setting remains a subject for research, with a wide range of existing logics [29, 1, 7, 9, 32] and tools [26, 14, 19].

2.1 Formal Model

To provide context for this paper, we give an outline of a formal event-based model for the specification of normative frameworks that captures all the essential properties, namely empowerment, permission, obligation and violation. Extended presentations appear in [9] and [10].
The essential elements of our normative framework are: (i) events (E), which bring about changes in state, and (ii) fluents (F), which characterise the state at a given instant. The function of the framework is to define the interplay between these concepts over time, in order to capture the evolution of a particular institution through the interaction of its participants. We distinguish two kinds of events: normative events (Enorm), which are the events defined by the framework, and exogenous events (Eex), which are outside its scope but whose occurrence triggers normative events in a direct reflection of the "counts-as" principle [21]. We further partition normative events into normative actions (Eact), which denote changes in normative state, and violation events (Eviol), which signal the occurrence of violations. Violations may arise either from explicit generation, from the occurrence of a non-permitted event, or from the failure to fulfil an obligation. We also distinguish two kinds of fluents: normative fluents, which denote normative properties of the state such as permissions P, powers W and obligations O, and domain fluents D, which correspond to properties specific to the normative framework itself. The set of all fluents is denoted F. A normative state is represented by the fluents that hold true in that state; fluents that are not present are considered to be false. Conditions on a state are therefore expressed by a set of fluents that should be true or false. The set of possible conditions is referred to as X = 2^(F ∪ ¬F). Changes in state are achieved through the definition of two relations: (i) the generation relation, which implements counts-as by specifying how the occurrence of one (exogenous or normative) event generates another (normative) event, subject to the empowerment of the actor and the conditions on the state, and (ii) the consequence relation.
This latter specifies the initiation and termination of fluents subject to the performance of some action in a state matching some expression. The generation relation is formally defined as G : X × E → 2^Enorm, and the consequence relation as C : X × E → 2^F × 2^F. The fluents to be initiated as a result of an event E are denoted by C↑(φ, E), while those to be terminated are denoted by C↓(φ, E).

The semantics of our normative framework is defined over a sequence, called a trace, of exogenous events. Starting from the initial state, each exogenous event is responsible for a state change, through the initiation and termination of fluents. This is achieved by a three-step process: (i) the transitive closure of G with respect to a given exogenous event determines all the generated (normative) events; (ii) to these are added all violations of events not permitted and obligations not fulfilled, giving the set of all events whose consequences determine the new state; (iii) the application of C to this set of events identifies all fluents that are initiated and terminated with respect to the current state, so giving the next state. For each trace, we can therefore compute a sequence of states that constitutes the model of the normative framework for that trace. This process is realised as a computational model through Answer Set Programming (see Section 4), and it is this representation that is the subject of the learning process described in Section 5.

3 Learning

Inductive Logic Programming (ILP) [24] is a machine learning technique concerned with the induction of logic theories from (positive and negative) examples and has been successfully applied to a wide range of problems [15]. Automatic induction of hypotheses represented as logic programs is one of the distinctive features of ILP. Moreover, the use of logic programming as the representation language allows a principled representation of background information relevant to the learning.
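The three-step transition of Section 2.1 lends itself to a direct operational reading. The following Python sketch is our own illustration only: the set-based encodings of G, C and the permitted events are assumptions for exposition, not part of the formal model.

```python
def transition(state, exo_event, G, C, permitted):
    """One state change: close G, add violations, apply C."""
    # (i) transitive closure of G: collect all generated normative events
    events = {exo_event}
    frontier = {exo_event}
    while frontier:
        new = {g for e in frontier for g in G.get(e, set())} - events
        events |= new
        frontier = new
    # (ii) add a violation event for every non-permitted event
    events |= {("viol", e) for e in events if e not in permitted}
    # (iii) apply C: initiate and terminate fluents w.r.t. the current state
    initiated, terminated = set(), set()
    for e in events:
        up, down = C.get(e, (set(), set()))
        initiated |= up
        terminated |= down
    return (state - terminated) | initiated
```

For instance, with G mapping a download to the normative event it counts as, and C recording its consequences, one exogenous event yields the next state in a single call.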
To refine normative theories we employ an ILP learning system, called TAL [12], that is able to learn non-monotonic theories and can be employed both to learn new rules and to revise existing ones. The TAL approach is based on mapping a given inductive problem into an abductive reasoning process. The current implementation of TAL relies on an extension of the abductive procedure SLDNFA [13] and preserves its semantics.

Definition 1. A non-monotonic ILP task is defined as ⟨E, B, S⟩ where E is a set of ground positive or negative literals, called examples, B is a background normal theory and S is a set of clauses called the language bias. A normal theory H ∈ ILP⟨E, B, S⟩, called a hypothesis, is an inductive solution for the task ⟨E, B, S⟩ if H ⊆ S, H is consistent with B and B ∪ H |= E.

B and H are normal theories and thus support negation as failure. The choice of an appropriate language bias is critical. In TAL the language bias S is specified by means of mode declarations [?].

Definition 2. A mode declaration is either a head or body declaration, respectively modeh(s) and modeb(s), where s is called a scheme. A scheme s is a ground literal containing place-markers. A place-marker is a ground function whose functor is one of the three symbols '+' (input), '-' (output), '#' (constant) and whose argument is a constant called the type.

Given a scheme s, s* is the literal obtained from s by replacing all place-markers with distinct variables X1, ..., Xn.
A rule r is compatible with a set M of mode declarations iff: (a) there is a mapping from each head/body literal l in r to a head/body declaration m ∈ M with scheme s such that each literal is subsumed by its corresponding s*; (b) each output place-marker is bound to an output variable; (c) each input place-marker is bound to an output variable appearing in the body or to a variable in the head; (d) every constant place-marker is bound to a constant; (e) all variables and constants are of the corresponding type.

From a user perspective, mode declarations establish how rules in the final hypotheses are structured, defining the literals that can be used in the head and in the body of a well-formed hypothesis. Although we show M in the running example of this paper for reference, the mode declarations can be concealed from the user and derived automatically. They can optionally be refined to constrain the search whenever the designer wants to employ useful information on the outcome of the learning, to reduce the number of alternative hypotheses or to improve performance.

4 Modelling Normative Frameworks

While the formal model of a normative framework allows for the clear specification of a normative system, it is of little support to designers or users of these systems. For that, computational tools are needed. The first step is a computational model equivalent to the formal model. We have opted for a form of logic programming called Answer Set Programming (ASP) [18]. Here we only present a short flavour of the language AnsProlog; the interested reader is referred to [3] for in-depth coverage. AnsProlog is a knowledge representation language that allows the programmer to describe a problem, and the requirements on its solutions, in an intuitive way, rather than the algorithm to find the solutions to the problem. The basic components of the language are atoms, elements that can be assigned a truth value.
An atom can be negated using negation as failure, creating the literal not a. We say that not a is true if we cannot find evidence supporting the truth of a. If a is true then not a is false, and vice versa. Atoms and literals are used to create rules of the general form a ← B, not C, where a is an atom and B and C are sets of atoms. Intuitively, this means: if all elements of B are known/true and no element of C is known/true, then a must be known/true. We refer to a as the head and B ∪ not C as the body of the rule. Rules with an empty body are called facts. A program in AnsProlog is a finite set of rules. The semantics of AnsProlog is defined in terms of answer sets, i.e. assignments of true and false to all atoms in the program that satisfy the rules in a minimal and consistent fashion. A program has zero or more answer sets, each corresponding to a solution.

4.1 Mapping the formal model into AnsProlog

In this section we provide only a summary description of how the formal institutional model is translated into AnsProlog. A full description of the model can be found in [9], together with the completeness and correctness of the model with respect to traces. Each program models the semantics of the normative framework over a sequence of n time instants ti : 0 ≤ i ≤ n. Events are considered to occur between these snapshots; for simplicity we do not define the intervals at which events occur explicitly, and instead refer to the time instant at the start of the interval at which an event is considered to occur. Fluents may be true or false at any given instant of time, so we use atoms of the form holdsat(f, ti) to indicate that fluent f holds at time instant ti. In order to represent changes in the state of fluents over time, we use atoms of the form initiated(f, ti) and terminated(f, ti) to denote the fact that fluent f was initiated or terminated, respectively, between time instants ti and ti+1.
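The intended reading of holdsat, initiated and terminated is inertial: a fluent holds at the next instant if it was initiated now, or if it held now and was not terminated. A small Python sketch of that reading (our own encoding, purely illustrative) over a run of n instants:

```python
def next_state(holds, initiated, terminated):
    # fluent inertia: survive unless terminated, plus newly initiated fluents
    return (holds - terminated) | initiated

def run(initial, changes, n):
    """Compute the state at each instant 0..n.

    `changes` maps instant i to a pair (initiated, terminated) of fluent sets
    describing what happens between instants i and i+1.
    """
    states = [set(initial)]
    for i in range(n):
        init, term = changes.get(i, (set(), set()))
        states.append(next_state(states[-1], init, term))
    return states
```

For example, a permission fluent terminated at instant 0 and re-initiated at instant 1 is absent at instant 1 and present again at instant 2, mirroring the file-sharing traces of Section 6.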
We use atoms of the form occurred(e, ti) to indicate that event e ∈ E is considered to have occurred between instants ti and ti+1. These atoms denote events that occur in an external context or are generated by the normative framework. For exogenous events we additionally use atoms of the form observed(e, ti) to denote the fact that e has been observed.

The mapping of a normative framework consists of three parts: a base component, which is independent of the framework being modelled; the time model; and the framework-specific component. The independent component deals with the inertia of fluents and the generation of violation events for un-permitted actions and unsatisfied obligations. The time model defines the predicates for time and is responsible for generating a single observed event at every time instant. In this paper we focus solely on the representation of the specific features of the normative framework.

In order to translate the rules in the normative framework relations G and C, we must first define a translation for the expressions which may appear in these rules. The valuation of a given expression taken from the set X depends on which fluents may be held to be true or false in the current state (at a given time instant). We translate expressions into ASP rule bodies as conjunctions of extended literals, using negation as failure for negated expressions.

With all these atoms defined, mapping the generation function and the consequence relation of a specific normative framework becomes rather straightforward. The generation function specifies that a normative event e occurs at a certain instant (occurred(e, t)) when another event e′ occurs, the event e is empowered (holdsat(pow(e), t)) and a set of conditions on the state is satisfied (holdsat(f, t) or not holdsat(f, t)).
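This translation step is mechanical, which the following hypothetical helper illustrates (it is not the authors' tooling; the textual encoding, the simplified pow(e) form and the positive-only condition are assumptions for exposition). It emits one occurred/2 rule per generated event:

```python
def generation_rules(event, condition, generated):
    """Render a G entry G(phi, event) = generated as AnsProlog rule text.

    `condition` lists the fluents of phi (positive literals only here);
    each generated event yields one rule guarded by empowerment and phi.
    """
    body = [f"occurred({event}, T)",
            f"holdsat(pow({event}), T)"] + \
           [f"holdsat({f}, T)" for f in condition]
    return [f"occurred({g}, T) :- {', '.join(body)}."
            for g in generated]
```

Applied to the file-sharing example of Section 6, a download event generates the corresponding myDownload normative event, guarded by possession of the block and by empowerment.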
The rules for the initiation (initiated(f, t)) and termination (terminated(f, t)) of a fluent f are triggered when a certain event e occurs (occurred(e, t)) and a set of conditions on the state is fulfilled. The initial state of our normative framework is encoded as simple facts (holdsat(f, i00)). Figure 1 gives a summary of all the AnsProlog rules that are generated for a specific normative framework, including the definition of all the fluents and events as facts. For a given expression φ ∈ X, we use the term EX(φ, T) to denote the translation of φ into a set of ASP literals of the form holdsat(f, T) or not holdsat(f, T).

p ∈ F ⇔ ifluent(p).
e ∈ E ⇔ event(e).
e ∈ Eex ⇔ evtype(e, obs).
e ∈ Eact ⇔ evtype(e, act).
e ∈ Eviol ⇔ evtype(e, viol).
C↑(φ, e) = P ⇔ ∀p ∈ P : initiated(p, T) ← occurred(e, T), EX(φ, T).
C↓(φ, e) = P ⇔ ∀p ∈ P : terminated(p, T) ← occurred(e, T), EX(φ, T).
G(φ, e) = E ⇔ ∀g ∈ E : occurred(g, T) ← occurred(e, T), holdsat(pow(e), T), EX(φ, T).
p ∈ S0 ⇔ holdsat(p, i00).

Fig. 1. The translation of normative framework specific rules into AnsProlog

In situations where the normative system consists of a number of agents whose actions can be treated in the same way (e.g. the rules for borrowing a book are the same for every member of a library) or where the state consists of fluents that can be treated in a similar way (e.g. the status of a book), we can parameterise the events and fluents. This is represented in the AnsProlog program by function symbols (e.g. borrowed(Agent, Book)) rather than terms. To allow for grounding, extra atoms to ground these variables need to be added. Grounded versions of the atoms also need to be added to the program. An example of this can be found in Section 6.

5 Learning Normative Rules

5.1 Methodology

The development process is supported by a set of use cases U. Use cases represent instances of executions that are known to the designer and that drive the elaboration of the normative system.
If the current formalisation of the system does not match the intended behaviour in a use case, then the formalisation is either incomplete or incorrect. Each use case u ∈ U is a tuple ⟨T, C, O⟩ where: T is a trace that specifies all the exogenous events occurring at the time points considered (observed(e, T)); C is a set of ground holdsat or occurred facts that the designer believes to be important, representing the conditional expected output; and O is a set of ground holdsat and occurred literals that represent the expected output of the use case.

The design process is iterative. The current formalisation of the model in AnsProlog is tested against a set of use cases. To the AnsProlog specification of the normative framework we add the observed events and a constraint indicating that no answer set that does not satisfy O is acceptable. The latter is done by adding a constraint containing the negation of all the elements of O. If for some use cases the solver is not able to find an answer set (it returns unsatisfiable), then a revision step is performed. All the use cases and the current formalisation are given as input to TAL. Possible revisions are presented to the designer, who ultimately chooses which is the most appropriate. The success of the revision step depends on the state of the formalisation of the model. The set of supporting use cases can be extended as the design progresses towards more accurate models.

In this paper we focus on the learning step and show how a non-monotonic ILP system can be used to derive new rules. Refining existing rules (i.e. deleting rules, or adding and deleting conditions in rules) is a straightforward extension of the current framework. Though we do not discuss it in this paper, revision can be performed by extending the original rules with additional predicates that extend the search to the deletion of conditions in rules and to exceptions, as shown in [11].
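The validation step of this iterative process can be sketched as follows, abstracting the answer set solver behind a function supplied by the caller; all names here are illustrative assumptions rather than part of any actual tool.

```python
def dysfunctional_cases(use_cases, solve):
    """Return the use cases whose expected output O is not entailed.

    Each use case is a tuple (T, C, O). `solve` stands in for the answer
    set solver: it returns the answer set of the specification extended
    with trace T and facts C, or None if the program (including the
    constraint on O) is unsatisfiable.
    """
    failures = []
    for trace, conditions, expected in use_cases:
        answer_set = solve(trace, conditions)
        # a use case fails when no answer set exists or some expected
        # literal is missing from the computed answer set
        if answer_set is None or not expected <= answer_set:
            failures.append((trace, conditions, expected))
    return failures
```

The use cases returned by this check are exactly those handed, together with the current formalisation, to the learner in the revision step.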
Fig. 2. Iterative design driven by use cases (cycle linking the normative framework, its AnsProlog formalisation, learning, the designer, use cases and suggested revisions).

5.2 Mapping ASP to ILP

The difference between the AnsProlog program and its translation into a representation suitable for TAL is procedural and involves only syntactic transformations; the two representations differ only in how inference is performed. The two semantics coincide, since the same logic program is encoded, and the mapping of a normative framework has exactly one answer set when given a trace (if constraints are added, this can be reduced to zero). A normative model F corresponds to an AnsProlog program PF as described in Section 4. All the normal clauses contained in PF are part of B; the only differences involve time points, which are handled in B by means of a finite-domain constraint solver. B also contains all the facts in C and T (negated facts are encoded by adding exceptions to the definitions of holdsat and occurred). The set of examples E contains the literals in O. Each H ∈ ILP⟨E, B, S⟩ represents a possible revision of PF and thus of the original normative model.

6 Example

To illustrate the capabilities of the norm-learning mechanism, we have developed a relatively simple scenario that is, at the same time, complicated enough to demonstrate the key properties with little extraneous detail. The active parties—agents—of the scenario each find themselves initially in the situation of owning several (digital) objects—the blocks—that form part of some larger composite (digital) entity—a file. An agent may give a copy of one of its blocks in exchange for a copy of another block, with the aim of acquiring a complete set of all the blocks. For simplicity, in the situation we analyse here, we assume that initially each agent holds the only copy of a given block, and that there is only one copy of each block in the agent population.
Furthermore, we do not take into account the possibility of exchanging a block for one that the agent already has. We believe that neither of these issues does more than complicate the situation by adding more states that would obscure the essential properties we seek to demonstrate. Thus, we arrive at a statement of the example: two agents, Alice and Bob, each holding two blocks from a set of four, and each having the goal of owning all four by downloading the blocks they miss from the other, while sharing with the other agent the ones they hold.

% Normative and Domain Rules
initiated(hasBlock(Agent, Block), I) ←
    occurred(myDownload(Agent, Block), I), holdsat(live(filesharing), I).
initiated(perm(myDownload(Agent, Block)), I) ←
    occurred(myShare(Agent), I), holdsat(live(filesharing), I).
terminated(pow(filesharing, myDownload(Agent, Block)), I) ←
    occurred(myDownload(Agent, Block), I), holdsat(live(filesharing), I).
terminated(needsBlock(Agent, Block), I) ←
    occurred(myDownload(Agent, Block), I), holdsat(live(filesharing), I).
terminated(pow(filesharing, myDownload(Agent, Block)), I) ←
    occurred(misuse(Agent), I), holdsat(live(filesharing), I).
terminated(perm(myDownload(Agent, Block)), I) ←
    occurred(myDownload(Agent, Block), I), holdsat(live(filesharing), I).
occurred(myDownload(AgentA, Block), I) ←
    occurred(download(AgentA, AgentB, Block), I),
    holdsat(hasBlock(AgentB, Block), I),
    holdsat(pow(filesharing, myDownload(AgentA, Block)), I),
    AgentA != AgentB.
occurred(myShare(AgentB), I) ←
    occurred(download(AgentA, AgentB, Block), I),
    holdsat(hasBlock(AgentB, Block), I),
    holdsat(pow(filesharing, myDownload(AgentA, Block)), I),
    AgentA != AgentB.
occurred(misuse(Agent), I) ←
    occurred(viol(myDownload(Agent, Block)), I).
% Initial state
holdsat(pow(filesharing, myDownload(Agent, Block)), i0).
holdsat(pow(filesharing, myShare(Agent)), i0).
holdsat(perm(download(AgentA, AgentB, Block)), i0).
holdsat(perm(myDownload(Agent, Block)), i0).
holdsat(perm(myShare(Agent)), i0).
holdsat(hasBlock(alice, x1), i0).
holdsat(hasBlock(alice, x2), i0).
holdsat(hasBlock(bob, x3), i0).
holdsat(hasBlock(bob, x4), i0).
holdsat(needsBlock(alice, x3), i0).
holdsat(needsBlock(alice, x4), i0).
holdsat(needsBlock(bob, x1), i0).
holdsat(needsBlock(bob, x2), i0).
holdsat(live(filesharing), i0).
% fluent rules
holdsat(P, J) ← holdsat(P, I), not terminated(P, I), next(I, J).
holdsat(P, J) ← initiated(P, I), next(I, J).
occurred(E, I) ← evtype(E, ex), observed(E, I).
occurred(viol(E), I) ←
    occurred(E, I), not holdsat(perm(E), I),
    holdsat(live(X), I), evinst(E, X).
occurred(viol(E), I) ←
    occurred(E, I), evtype(E, inst),
    not holdsat(perm(E), I), event(viol(E)).

Fig. 3. Translation of the "sharing" normative framework into AnsProlog (types omitted).

We model this as a simple normative framework, where the brute event [20] of downloading a block initiates several normative events, but the act of downloading revokes the permission of that agent to download another block until it has shared a block with another agent (sharing is the complementary action to downloading). Violation of this norm results in the download power being revoked permanently. In this way reciprocity is assured by the normative framework. Initially, each agent is empowered and permitted to share and to download, so that either agent may initiate a download operation. Fig. 3 shows the AnsProlog representation of the complete normative framework for this scenario. In the following examples a variety of normative rules will be deliberately removed and re-learned.

6.1 Learning Setting

To show how different parts of the formal model can be learned, we start from a correct specification and, after deleting some of the rules, use TAL to reconstruct the missing parts based on a single use case. In our example TAL is set to learn hypotheses of at most three rules with at most three conditions.
The choice of an upper bound on the complexity (number of literals) of the rules ultimately rests with the final user. Alternatively, TAL can iterate on the complexity, or perform a best-first search that returns increasingly complex solutions. We use the following mode declarations M:

m1 : modeh(terminated(perm(myDownload(+agent, +block)), +instant)).
m2 : modeh(initiated(perm(myDownload(+agent, +block)), +instant)).
m3 : modeb(occurred(myDownload(+agent, +block), +instant)).
m4 : modeb(occurred(myDownload(+agent, -block), +instant)).
m5 : modeb(occurred(myShare(+agent), +instant)).
m6 : modeb((+agent != +agent)).
m7 : modeb(holdsat(hasblock(+agent, +block), +instant)).
m8 : modeb(holdsat(pow(filesharing, myDownload(+agent, +block)), +instant)).

The first two mode declarations state that terminate and initiate permission rules for the normative fluent myDownload can be learned. The other declarations constrain the structure of the body. The difference between m3 and m4 is that the former must refer to the same block as the one in the head of the rule, while the latter introduces a possibly different block. m6 is an inequality constraint between agents. In general more mode declarations should be considered (e.g. initiation and termination of all types of fluents should be included), but the revision can be guided by the designer. For example, new changes to a stable theory are more likely to contain errors and can thus be isolated in the revision process. The time to compute all the reported hypotheses ranges from 30 to 500 milliseconds on a 2.8 GHz Intel Core 2 Duo iMac with 2 GB of RAM.

The background knowledge B contains the rules in Fig. 3 together with the traces T given in the use cases. C in this example is empty, to allow for the demonstration of the most general types of learning.
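Following the mapping of Section 5.2, the inputs handed to TAL in each scenario below can be assembled along these lines (a schematic sketch; the list-of-strings encoding is our own simplification, not TAL's actual input format):

```python
def learning_task(program, trace, conditions, expected):
    """Assemble a TAL-style task: background B and examples E.

    `program` lists the AnsProlog rules (Fig. 3), `trace` the observed/2
    facts T, `conditions` the facts C, and `expected` the expected-output
    literals O of the use case.
    """
    background = list(program) + list(trace) + list(conditions)
    examples = list(expected)
    return background, examples
```

Each hypothesis H returned for the task ⟨E, B, S⟩ is then a candidate revision of the normative specification.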
Learning a single terminate/initiate rule. We suppose one of the initiate rules is missing from the current specification:

initiated(perm(myDownload(Agent, Block)), I) ← occurred(myShare(Agent), I).

The designer inputs the following observed events, which show how, in a two-agent scenario, one of the agents loses the permission to download after downloading a block and reacquires it after providing a block for another agent. The trace T is:

observed(download(alice, bob, x3), 0).
observed(download(bob, alice, x1), 1).

The expected output O is:

not holdsat(perm(myDownload(alice, x4)), 1).
holdsat(perm(myDownload(alice, x4)), 2).

The trace is dysfunctional if the expected output is not true in the answer set of T ∪ B. The ILP task is thus to find a set of rules H, within the language bias specified by the mode declarations in M, such that, given the background knowledge B in Fig. 3 and the expected output O as a conjunction of literals, O is true in the only answer set of B ∪ T ∪ H (if one exists). TAL produces the following hypotheses:

(H1)
initiated(perm(myDownload(A, _)), C) ← occurred(myShare(A), C).

and

(H2)
terminated(perm(myDownload(_, _)), _).
initiated(perm(myDownload(A, _)), C) ← occurred(myShare(A), C).

The second solution is not the one intended, but it still supports the use case. Note that, according to the current implementation, whenever a fluent f is both initiated and terminated at the same time point, f still holds at the subsequent time point.

Learning multiple rules. In this scenario two rules are missing from the specification:

initiated(perm(myDownload(Agent, Block)), I) ← occurred(myShare(Agent), I).
terminated(perm(myDownload(Agent, Block2)), I) ← occurred(myDownload(Agent, Block1), I).

We use the same T and O as previously. TAL produces the following hypotheses:

(H1)
terminated(perm(myDownload(A, _)), C) ← occurred(myDownload(A, _), C).
initiated(perm(myDownload(A, _)), C) ← occurred(myShare(A), C).

terminated(perm(myDownload(_, _)), _).
(H2) initiated(perm(myDownload(A, _)), C) ←
         occurred(myShare(A), C).

The second solution is consistent with the use case, but the designer can easily discard it, since the unconditional terminated rule is not syntactically valid with respect to the normative framework: a fluent can only be terminated as a consequence of the occurrence of an event. Using more advanced techniques for the language bias specification, it would be possible to rule out such a hypothesis.

Learning of an undesired violation. We assume the following rule is missing:

initiated(perm(myDownload(Agent, Block)), I) ←
    occurred(myShare(Agent), I).

This time we provide a different trace T:

observed(download(alice, bob, x3), 0).
observed(download(bob, alice, x1), 1).
observed(download(alice, bob, x4), 2).

As a result of the trace, a violation at time point 2 is implied that the designer knows to be undesired. The expected output is:

not occurred(viol(myDownload(alice, x4)), 2).

(H1) occurred(myShare(A), B) ←
         occurred(download(C, A, E), B), A != C,
         holdsat(pow(filesharing, myDownload(A, E)), B).
(H2) occurred(myShare(A), B) ←
         occurred(download(C, A, E), B), A != C,
         holdsat(pow(filesharing, myDownload(A, E)), B),
         holdsat(hasblock(A, E), B).
(H3) occurred(myShare(A), B) ←
         occurred(download(C, A, E), B), A != C,
         holdsat(pow(filesharing, myDownload(C, E)), B).
(H4) occurred(myShare(A), B) ←
         occurred(download(C, A, E), B), A != C,
         holdsat(pow(filesharing, myDownload(C, E)), B),
         holdsat(hasblock(A, E), B).
(H5) occurred(myShare(A), B) ←
         occurred(download(C, A, E), B), A != C,
         holdsat(hasblock(A, E), B).
(H6) occurred(myShare(A), B) ←
         occurred(download(C, A, E), B),
         holdsat(pow(filesharing, myDownload(C, E)), B).

Fig. 4. Proposals to revise the generate rule

The outcome of the learning consists of the following two possible solutions:

(H1) initiated(perm(myDownload(A, _)), C) ←
         occurred(myShare(A), C).

(H2) initiated(perm(myDownload(_, _)), _).

which show how the missing rule is derived from the undesired violation.
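The effect of the missing initiate rule on this trace can be mimicked in a toy simulation (an assumed simplification of the ASP model: blocks are ignored and permission is treated as a single per-agent fluent). Downloading terminates the downloader's permission, being downloaded from re-initiates it, and, as noted earlier, initiation wins when both happen at the same instant.

```python
# Toy re-enactment of the use case, NOT the ASP model: with the initiate
# rule present, alice's download at instant 2 is permitted; without it,
# a violation occurs at instant 2, matching the expected output above.

def holds_next(holds, initiated, terminated):
    # Inertia with initiation winning over simultaneous termination.
    return initiated or (holds and not terminated)

def run(trace, restore_on_share=True):
    perm = {"alice": True, "bob": True}
    violations = []
    for t, (downloader, provider) in enumerate(trace):
        if not perm[downloader]:
            violations.append((downloader, t))
        for agent in perm:
            init = restore_on_share and agent == provider   # shared a block
            term = agent == downloader                      # used permission
            perm[agent] = holds_next(perm[agent], init, term)
    return violations

trace = [("alice", "bob"), ("bob", "alice"), ("alice", "bob")]
assert run(trace) == []                                  # full spec
assert run(trace, restore_on_share=False) == [("alice", 2)]
```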
As in the previous scenario, the designer can easily dismiss the second candidate.

Learning a generate rule. To account for the different types of rules that need to be learned, the language bias is extended to consider the learning of generate rules. The new mode declarations are:

modeh(occurred(myShare(+agent), +instant)).
modeb(occurred(download(-agent, +agent, -block), +instant)).

We use the same trace and expected output as in the previous scenario (three observed events). The following rule is eliminated from the specification:

occurred(myShare(AgentB), I) ←
    AgentA != AgentB,
    occurred(download(AgentA, AgentB, Block), I),
    holdsat(hasblock(AgentB, Block), I),
    holdsat(pow(filesharing, myDownload(AgentA, Block)), I).

This is the most complicated case for the designer, as a set of six different hypotheses is returned by TAL (see Fig. 4). Knowing the semantics of the function symbol download(AgentA, AgentB, Block), namely that AgentA downloads Block from AgentB, the designer should be able to select the most appropriate rule.

7 Related Work

The motivation behind this paper is the problem of how to converge upon a complete and correct normative framework with respect to the intended range of application, where in practice these properties may be manifested by incorrect or unexpected behaviour in use. Additionally, we would observe, from practical experience with our particular framework, that it is often desirable, as with much software development, to be able to develop and test incrementally (and regressively) rather than attempt verification once the system is (notionally) complete.
The literature seems to fall broadly into three categories: (a) concrete language frameworks (OMASE, Operetta, InstSuite, MOISE, Islander, OCeAN and the constraint approach of Garcia-Camino (full references to these are currently omitted because of page limitations)) for the specification of normative systems, which are typically supported by some form of model-checking and in some cases allow for change in the normative structure; (b) logical formalisms, such as [16], that capture consistency and completeness via modalities, and other formalisms, like [5], that capture the concept of norm change, or [?] and [?]; (c) mechanisms that look out for (new) conventions and handle their assimilation into the normative framework over time, subject to the current normative state and the position of other agents [2, 8]. Essentially, the objective of each of the above is to realize a transformation of the normative framework to accommodate some form of shortcoming. These shortcomings can be identified in several ways: (a) by observing that a particular state is rarely achieved, which can indicate that there is insufficient normative guidance for participants; or (b) a norm conflict occurs, such that an agent is unable to act consistently under the governing norms [23]; or (c) a particular violation occurs frequently, which may indicate that the violation conflicts with an effective course of action that agents prefer to take, the penalty notwithstanding. All of these can be viewed as characterising emergent [28] approaches to the evolution of normative frameworks, where some mechanism, either in the framework or in the environment, is used to revise the norms. In the approach taken here, the designer presents use cases that effectively capture their behavioural requirements for the system, in order to 'fix' bad states.
This has an interesting parallel with the scheme put forward by Serrano and Saugar [30], who propose the specification of incomplete theories and their management through incomplete normative states identified as "pending". The framework lets designated agents resolve this category through the speech acts allow and forbid, and the scheme is formalised using an action language. A useful categorisation of normative frameworks appears in [6]. Whether the norms here are 'strong' or 'weak' (the first guideline) depends on whether the purpose of the normative model is to develop the system specification or, additionally, to provide an explicit representation for run-time reference. Likewise, in respect of the remaining guidelines, it all depends on how the framework we have developed is actually used: we have chosen, for the purpose of this presentation, to stage norm refinement so that it is an off-line (in the sense of prior to deployment) process, while much of the discussion in [6] addresses run-time issues. Whether the process we have outlined here could effectively be a means for on-line mechanism design is something we have yet to explore.

From an ILP perspective, we employ an ILP system that can learn logic programs with negation (stratified or otherwise). Though recently introduced and in the early stages of its development, TAL is the most appropriate choice to support this work, for two main reasons: it is supported by completeness results, unlike other existing non-monotonic ILP systems ([27], [22]), and it can be tailored to particular requirements (e.g. different search strategies can address performance requirements). The approach presented in this paper is related to other recently proposed frameworks for the elaboration of formal specifications via inductive learning. Within the context of software engineering, [?]
has shown how examples of desirable and undesirable behaviour of a software system can be used by an ILP system, together with an incomplete background knowledge of the envisioned system and its environment, to compute missing requirements specifications. A more general framework has been proposed [?] in which desirable and undesirable behaviours are generated from counterexamples produced by model checking a given (incomplete) requirements specification with respect to given system properties. The learning of missing requirements has in this case the effect of eliminating the counterexamples by elaborating the specification further.

8 Conclusions and Future Work

We have presented an approach for learning norms and behavioural rules, via inductive logic programming, from example traces, in order to guide and support the synthesis of a normative framework. This addresses a crucial problem in normative systems, as the development of such specifications is in general a manual and error-prone task. The approach deploys an established inductive logic programming system [12] that takes as input an initial (partial) description of a normative system and use cases of expected behaviours provided by the designer, and generates hypotheses in the form of missing norms and behavioural rules that, together with the given description, explain the use cases. Although the approach presented in this paper has been tailored to learning missing information, it can also be applied to computing revisions of the existing description. In principle this can be achieved by transforming the existing normative rules into defeasible rules with exceptions and using the same ILP system to compute exception rules. These exceptions would in essence be prescriptions for changes (i.e. addition and/or deletion of literals in the body of existing rules) in the current specification.
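A minimal sketch of this defeasible transformation, assuming a naive string-based rule representation and an invented exception/1 guard (the actual encoding would be in AnsProlog):

```python
# Hypothetical illustration: guard each existing rule with a negated
# exception literal, so the ILP step can revise a rule by learning the
# conditions under which its exception holds rather than rewriting it.

def make_defeasible(rules):
    """rules: list of (rule_id, head, [body literals])."""
    return [(rid, head, body + ["not exception(" + rid + ")"])
            for rid, head, body in rules]

rules = [("r1",
          "initiated(perm(myDownload(A, B)), I)",
          ["occurred(myShare(A), I)"])]

defeasible = make_defeasible(rules)
assert defeasible[0][2] == ["occurred(myShare(A), I)",
                            "not exception(r1)"]
```

Any learned exception(r1) rules then describe when the original rule should not fire, which translates into additions or deletions of body literals in the final refactored specification.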
An appropriate refactoring of the defeasible rules based on the learned exception rules would give a revised (non-defeasible) specification. In this case, the revision would be in terms of changes over the rules of a normative framework instead of changes over its belief state, as would be the case if a TMS approach were adopted.

There are several criticisms that can be levelled at the approach as it stands. Firstly, the design language is somewhat unfriendly: a proper tool would have a problem-oriented language, like InstAL/QL [10, 19]. A system designer would then start from an initial description of their normative framework with some use cases, and receive automated suggestions of additional norms to include in the framework, written in the same high-level language. The machinery described here, based on ASP syntax and the ILP formulation, would then be used as a sound "back-end" computation for a formalism familiar to the system designer. Secondly, better control is needed over the rules that are learned and over the filtering of incorrect rules; at present this depends on specialised knowledge of the learning process. This can to some extent be controlled through careful choice of, and limits on the size of, use cases (probably involving heuristics) to improve the effectiveness of the learning process in the search for relevant hypotheses and the pruning of those potential solutions that cannot be translated back into the canonical form of the normative framework. Despite these issues, we believe we have identified an interesting path for automating the development and debugging of practical normative specifications and perhaps, in the long term, a mechanism for on-line norm evolution.

References

1. A. Artikis, M. Sergot, and J. Pitt. Specifying electronic societies with the Causal Calculator. In Proceedings of the Workshop on Agent-Oriented Software Engineering III (AOSE), LNCS 2585. Springer, 2003.
2. Alexander Artikis. Dynamic protocols for open agent systems.
In Sierra et al. [31], pages 97–104.
3. Chitta Baral. Knowledge Representation, Reasoning and Declarative Problem Solving. Cambridge University Press, 2003.
4. Guido Boella, Pablo Noriega, Gabriella Pigozzi, and Harko Verhagen, editors. Normative Multi-Agent Systems, number 09121 in Dagstuhl Seminar Proceedings. Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, Germany, 2009.
5. Guido Boella, Gabriella Pigozzi, and Leendert van der Torre. Normative framework for normative system change. In Sierra et al. [31], pages 169–176.
6. Guido Boella, Gabriella Pigozzi, and Leendert van der Torre. Normative systems in computer science - ten guidelines for normative multiagent systems. In Normative Multi-Agent Systems, 2009.
7. Guido Boella and Leendert van der Torre. Constitutive norms in the design of normative multiagent systems. In Proceedings of the Sixth International Workshop on Computational Logic in Multi-Agent Systems (CLIMA-VI), City College, June 2005.
8. George Christelis and Michael Rovatsos. Automated norm synthesis in an agent-based planning environment. In Sierra et al. [31], pages 161–168.
9. Owen Cliffe, Marina De Vos, and Julian Padget. Answer set programming for representing and reasoning about virtual institutions. In Computational Logic for Multi-Agents (CLIMA VII), volume 4371 of LNAI, pages 60–79. Springer, May 2006.
10. Owen Cliffe, Marina De Vos, and Julian A. Padget. Embedding landmarks and scenes in a computational model of institutions. In Coordination, Organizations, Institutions, and Norms in Agent Systems III, volume 4870 of LNCS, pages 41–57, September 2008.
11. D. Corapi, O. Ray, A. Russo, A. K. Bandara, and E. C. Lupu. Learning rules from user behaviour. In 5th Artificial Intelligence Applications and Innovations (AIAI 2009), April 2009.
12. D. Corapi, A. Russo, and E. Lupu. Inductive logic programming as abductive search. In 26th International Conference on Logic Programming, Leibniz International Proceedings in Informatics.
Schloss Dagstuhl Research Online Publication Server, 2010.
13. Marc Denecker and Danny De Schreye. SLDNFA: an abductive procedure for abductive logic programs. J. Log. Program., 34(2):111–167, 1998.
14. V. Dignum. A model for organizational interaction: based on agents, founded in logic. PhD thesis, University of Utrecht, 2004.
15. Sašo Džeroski. Relational data mining applications: an overview. pages 339–360, 2000.
16. Christophe Garion, Stéphanie Roussel, and Laurence Cholvy. A modal logic for reasoning on consistency and completeness of regulations. In Boella et al. [4].
17. M. Gebser, B. Kaufmann, A. Neumann, and T. Schaub. Conflict-driven answer set solving. In Proceedings of IJCAI'07, pages 386–392, 2007.
18. Michael Gelfond and Vladimir Lifschitz. Classical negation in logic programs and disjunctive databases. New Generation Computing, 9(3-4):365–386, 1991.
19. Luke Hopton, Owen Cliffe, Marina De Vos, and Julian Padget. InstQL: a query language for virtual institutions using answer set programming. In Proceedings of the 10th International Workshop on Computational Logic in Multi-Agent Systems (ClimaX), IfI Technical Report Series, pages 87–104, September 2009.
20. John R. Searle. The Construction of Social Reality. Allen Lane, The Penguin Press, 1995.
21. Andrew J. I. Jones and Marek Sergot. A formal characterisation of institutionalised power. ACM Computing Surveys, 28(4es):121, 1996.
22. Tim Kimber, Krysia Broda, and Alessandra Russo. Induction on failure: learning connected Horn theories. In LPNMR, pages 169–181, 2009.
23. Martin Kollingbaum, Timothy Norman, Alun Preece, and Derek Sleeman. Norm conflicts and inconsistencies in virtual organisations. In Proceedings of COIN 2006, volume 4386 of LNCS, pages 245–258. Springer, 2007.
24. N. Lavrač and S. Džeroski. Inductive Logic Programming: Techniques and Applications. Ellis Horwood, 1994.
25. I. Niemelä and P. Simons.
Smodels: an implementation of the stable model and well-founded semantics for normal logic programs. In LPNMR, volume 1265 of LNAI, pages 420–429. Springer, July 1997.
26. Juan A. Rodriguez-Aguilar. On the Design and Construction of Agent-mediated Institutions. PhD thesis, Universitat Autònoma de Barcelona, 2001.
27. Chiaki Sakama. Nonmonotonic inductive logic programming. In LPNMR, page 62, 2001.
28. Bastin Tony Roy Savarimuthu and Stephen Cranefield. A categorization of simulation works on norms. In Boella et al. [4].
29. Marek Sergot. (C+)++: An action language for representing norms and institutions. Technical report, Imperial College, London, August 2004.
30. Juan-Manuel Serrano and Sergio Saugar. Dealing with incomplete normative states. In Proceedings of COIN 2009, volume 6069 of LNCS. Springer, 2010. In press.
31. Carles Sierra, Cristiano Castelfranchi, Keith S. Decker, and Jaime Simão Sichman, editors. 8th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), Budapest, Hungary, May 10–15, 2009, Volume 1. IFAAMAS, 2009.
32. Munindar P. Singh. A social semantics for agent communication languages. In Issues in Agent Communication, pages 31–45. Springer-Verlag: Heidelberg, Germany, 2000.