<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Defining and Identifying the Legal Culpability of Side Effects Using Causal Graphs</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Hal Ashton University College London London</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Deployed algorithms can cause certain negative side effects on the world in pursuit of their objective. It is important to define precisely what an algorithmic side-effect is in a way which is compatible with the wider folk concept to avoid future misunderstandings and to aid analysis in the event of harm being caused. This article argues that current treatments of side-effects in AI research are often not sufficiently precise. By considering the medical idea of side effect, this article will argue that the concept of algorithm side effect can only exist once the intent or purpose of the algorithm is known and the relevant causal mechanisms are understood and mapped. It presents a method to apply widely accepted legal concepts (The Model Penal Code or MPC) along with causal reasoning to identify side effects and then determine their associated culpability.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>When certain types of Algorithms are deployed in the wider
world, they can cause changes (effects) to that world. We
can divide those effects into things which are the purpose of
the algorithm (and its creators) and those that are not. We
can augment this by understanding which effects are
necessary for the algorithm to fulfil its purpose and which are not
using the concept of means-end intent. Often risk analysis
concentrates on those effects which might be caused if the
algorithm fails to achieve its purpose. Side effects concern
those effects which are caused by the algorithm, but whose
occurrence does not affect the purpose of the algorithm and
its creators. Often side effects have a cost not born by the
person who caused them. Such costs are called negative
externalities by economists. Analysis after the event can
identify those effects of the algorithm which were foreseeable to
the algorithm and its designers and those which should have
been. Questions of Intent, Causation and Foreseeability are
asked when courts decide on the culpability of algorithm
designers when actionable harm has been caused. This article
will use the culpability definitions as found in the US Model
Penal Code and use causal reasoning coupled with Causal
Inference Diagrams to provide a way of identifying and
reasoning about side effects. We will use a running example of
a recommender system as an illustrative example of a system
which may display ill side effects.</p>
      <p>2</p>
    </sec>
    <sec id="sec-2">
      <title>The existing definition in Safe AI</title>
      <p>
        <xref ref-type="bibr" rid="ref3">Amodei et al. (2016)</xref>
        identify ’Avoiding Negative Side
Effects’ as one of their vfie key problems of AI Safety. Whilst
they do not formally present a definition of side effect, to
paraphrase they are seen as any negative effect that might
be caused by a policy which is not explicitly represented in
the agent’s reward function. I argue that defining side-effects
solely in terms of an agent’s reward function with no
reference to either the underlying causal processes, or the agent’s
policy, is the wrong way to proceed. The very general
problem of side-effects in Amodei et al has been relabelled as
one of value alignment
        <xref ref-type="bibr" rid="ref30">(Russell 2019)</xref>
        ; the general problem
of describing a reward function for a task which allows an
AI to solve tasks without causing harm directly or indirectly
by following a strategy that would be obviously
unacceptable for a human. Whilst there is overlap between the
sideeffect and value alignment problems, as
        <xref ref-type="bibr" rid="ref8">Ashton and Franklin
(2022)</xref>
        and Saisubramanian, Zilberstein, and Kamar (2021)
point out, sometimes side-effects align with the objectives
of the AI designer as with the case of recommender systems
and polarisation.
      </p>
      <p>The problem with defining side-effects solely in terms of
the reward function is that effects necessary for a strategy
to succeed, brought about through a policy are mislabelled
as unintentional. The danger with Computer Scientists
proceeding with an overly liberal definition of side-effect is that
labelling caused harms as such implies they are not
intentional. This in turn is important because intentional harms
attract the highest criminal sanctions. Intent is not the sole
determinant in determining culpability and this will article will
go on to show how unintentional yet foreseen side-effects
also attract criminal sanctions. Nevertheless certain crimes
(such as attempt crimes) cannot be committed without
intent. A definition of side-effect solely dependent on agent
reward function risks incentivising myopia so as to avoid
responsibility for harm.</p>
      <p>Consider the story of the AI Physician tasked with curing
cancer in a human patient. It comes up with a novel solution
and proceeds and the result is that the patient is killed by
the intervention. Note that it succeeds in its task because the
patient does not die of cancer. Since patient survival was not
in its objective function, patient death is understood as a side
effect according to the definition above. Think of 3 surgical
procedures:
T1 The AI Physician removes the brain of the patient so that
they could not subsequently die of cancer, but die from
having no brain.</p>
      <p>T2 The AI physician diverts all of the patient’s immune
system to destroying the tumour but that necessarily made
the immune system attack some other vital part of the
body leading to death.</p>
      <p>T3 The AI physician came up with a genuinely novel
procedure with a p% recovery rate but the patient fails to
recover.</p>
      <p>These are all different causal mechanisms and we will
later see that in Treatment 1, since death is necessary for the
procedure to work it is most definitely not a side-effect.
Additionally do we think about the side-effect status of patient
differently in the contrasting cases of when the AI physician
understands the outcomes of its actions and when it does
not?</p>
      <p>Rather than choosing a definition of side-effects
ourselves, I argue that computer scientists are better off looking
in other domains for one. That way we can borrow accrued
wisdom, avoid a proliferation of conflicting definitions
between sciences and deflect any accusations that an overly
generous definition of side effect is a device to insulate
ourselves from blame for harm.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Sourcing an independent definition of side effect from medicine and law</title>
      <p>In common speech what do we mean by the term side
effect? Firstly to disambiguate, I should say that a concept of
side effect does exist in programming and it has a formal
definition fit for its intended purpose. A hazard with terms
that have domain specific meanings is to assume that those
meanings are shared outside the discipline. Instead we want
to take the idea of side effect that exists outside computer
science and bring it into the discipline in a process that does
not alter it. As with most primitive or folklore concepts,
people intuitively know what a side effect is, but pinning down
a decent definition of one takes some effort. The benefit of
such an endeavour is twofold. It aids cross-disciplinary
communication for when a regulator and a computer scientist
discuss side effects it is preferable that both mean the same
thing. From a programming perspective, a formal definition
of side effects written in such a way as an algorithm would
be able to understand, can prevent algorithms from causing
harm.</p>
      <p>
        The most common place that people see the term side
effects is in a medical setting, so it is intuitive to start the
process of definition here. It is also an appropriate source
given that medicines and the medical profession are strictly
regulated. The APA (American Psychological Association)
defines a side effect as follows
        <xref ref-type="bibr" rid="ref4">(APA 2021)</xref>
        :
      </p>
      <p>Any reaction secondary to the intended therapeutic
effect that may occur following administration of a
drug or other treatment
This definition makes the distinction between effects which
are intended and those which are not, with side effects
appearing in the latter class. The term secondary requires
further unpacking which we will do once we have introduced
some causal mechanisms.</p>
      <p>
        The concepts of intentionality and foreseeability are
commonly used in the legal world and it is from here that we
will source their definitions. By looking to the law for a
definition of intent we can borrow centuries of legal thought
and endeavour. One can consider legal definitions as
opensource in the sense that they are accessible to public scrutiny
and have been democratically tested over time. As
        <xref ref-type="bibr" rid="ref21">Hildebrandt (2019)</xref>
        states, legal questions enjoy closure, that is to
say, within any jurisdiction, definitions and questions have
answers.
      </p>
      <p>
        Despite its key role in Criminal law amongst others 1, for
various reasons a singular definition of what constitutes
intent is elusive. We will use the US Model Penal Code (MPC)
        <xref ref-type="bibr" rid="ref40">(The American Law Insitute 2017)</xref>
        which does provides
definitions of the four levels of mens-rea or criminal
culpability; Purpose (aka Intent), Knowledge, Recklessness and
Negligence. The MPC was drafted in the 1960s in an effort
to unite US state law and has been adopted at least partially
by most states since. These definitions also invoke the term
foreseeability; importing definitions of intent from law also
implicitly brings conventions concerning foreseeabilty. The
MPC defines Purpose 2 (Intent) as follows:
      </p>
      <p>A person acts purposely with respect to a material
element of an offense when... if the element involves the
nature of his conduct or a result thereof, it is his
conscious object to engage in conduct of that nature or to
cause such a result</p>
      <p>In essence this definition says that someone intends
something if it is the object outcome of their actions. This
definition largely corresponds to the folk-definition of intent.
Agent Ag intends X if they choose to do action ψ in order
to cause X. This implies an epistemic condition on the
outcome; Ag can foresee that X could be an outcome of them
ψ -ing. However this definition does not mention probability
of outcome so it follows that long-shot type outcomes can
be intended.</p>
      <p>At this point we will introduce some causal modelling
terminology to illustrate more easily things which are intended
and things which could be called side effects.</p>
      <p>4</p>
    </sec>
    <sec id="sec-4">
      <title>Causal Modelling Approach</title>
      <p>Consider a directed acyclic graph G with vertices V and
edges E, for A, B ∈ V we will adopt the convention that
A is a cause of B iff there is a directed edge in E from A
to B. That is to say, all other things constant, a change in
A will imply some change in B. We shall call this a Causal
DAG.</p>
      <p>1Intent appears in almost aspect of law - contract, tort,
regulatory. Sometimes in obvious ways, sometimes not</p>
      <p>2When drafting the MPC, the authors took the approach that
defining concepts such as intent could be better done by not
mentioning them so as not to conflate with possibly wrong folk
concepts of the word.</p>
      <p>
        We will use the Structural Causal Influence Models
(SCIM)
        <xref ref-type="bibr" rid="ref13">(Everitt et al. 2021)</xref>
        to make the Causal Dag more
applicable to intent. An influence diagram named I D is
causal DAG such that the vertices are divided into disjunct
groups - Decision vertices VD (represented with rectangles),
outcome vertices VO (represented by circles) and utility
vertices VU (octagons or diamonds) with VD ∪ V0 ∪ VU = V .
Let R(Y ) denote the full set of realisations that vertex Y
can take. Structural equations determine the relationship
between parent outcome or decision vertices and child
outcome or utility vertices. Thus the structural equation
associated with arc AB for A ∈ VO, B ∈ V is a function
fAB : R(A) → R(B). A policy π ID is a set of
structural equations with decision vertices as children such that
the parents of any decision vertex D denoted P a(D)
determine a distribution over R(D); the possible realisations of
D. A non-deterministic policy would apply unit mass to
single element of RD for every possible realisation of P a(D).
      </p>
      <p>
        Additionally without any loss of generality we can
enforce the restriction that all stochastic elements of the I D
do not have a parent. Practically this just means the
rewriting of non-deterministic structural equations to separate
deterministic variables from (possibly new) non-deterministic
variables which themselves have no parents. This ensures
that every vertex that is a descendent of a decision vertex is
deterministic and the I D is said to be in Howard Canonical
Form
        <xref ref-type="bibr" rid="ref19">(Heckerman and Shachter 1994)</xref>
        . Once the policy is
set and the non-deterministic variables are set, all other
variables are uniquely realised. This form also allows us to
interpet the SCIMas a Structural Causal Model (SCM) and use
the accompanying definitions of Actual-causality
        <xref ref-type="bibr" rid="ref17">(Halpern
2016)</xref>
        and Do-algebra
        <xref ref-type="bibr" rid="ref28">(Pearl 2000)</xref>
        should we wish.
5
      </p>
    </sec>
    <sec id="sec-5">
      <title>Basic properties of Intent and Side Effects in Causal Models</title>
      <p>
        Within the framework of SCIMs and related Causal
Analysis, a number of definitions of intent might exist which are
arguably compatible with the MPC’s definition of intent or
Purpose but might require assumptions about the agent. For
example
        <xref ref-type="bibr" rid="ref22">Kleiman-Weiner et al. (2015)</xref>
        present an account
of intent using Influence Diagrams which assumes a utility
maximising agent. Ashton (2021a) does not make the
assumption but presents a definition of intent without a
formal causal framework. For this article I will assume intended
outcomes are given. This should not be problematical for a
system designer, since it is good practice to identify what the
intended purpose of an algorithm is before creating it. For
clarity we will use this very general definition of Intended
realisations (outcomes) and Intended Variables:
Definition 1 (Intended Realisation, Intended Variables). For
an Influence Diagram ID, an intended realisation is a finite
set of realisations for outcome and utility variables under a
ifxed policy π . The intended variables are those variables
which occur have an intended realisation
      </p>
      <p>Whilst for the purposes of identifying side effects we can
simply assume that it is known which things are intended
and which are not but we must still ensure intentional
knowledge is consistent with the legal concept of intent. For this
we need to make three assumptions.
1. An outcome can only be intended if its realisation is
dependent on an action realising a certain value.
2. Actions made by an algorithm are done so intentionally
only if their is choice not to act in that way (the action set
has more than one member)
3. The concept of means-end consistent intent is respected.</p>
      <p>
        Condition one just says that an outcome can only be
intended if a decision variable is able to affect the it. We
cannot intend our football team to win by attending the match
as a spectator. This rules out any parental chance or
outcome variables from being intended and stops any foregone
conclusions from being intended. Condition two says that
the action-decisions that a decision maker makes are not
coerced (there is always a choice of policy design).
Condition three requires explanation. There exists in law
        <xref ref-type="bibr" rid="ref37">(Simester
et al. 2019)</xref>
        and philosophy
        <xref ref-type="bibr" rid="ref10">(Bratman 2009)</xref>
        a concept called
means-end intent which is culpably equivalent to intent or
MPC Purpose.
      </p>
      <p>Suppose we know outcome O = o for O ∈ VO is intended
in an influence diagram. Then for any outcome variable O′ ∈
Anc(O) ∩ Vo, if it is necessary for O′ = o′ for the intended
outcome O = o to occur then O′ = o is also an intended
outcome.</p>
      <p>Definition 2 (Side Effect and Unintended outcome).
Consider an influence diagram SCIMwith outcome vertices VO,
decision vertices VD and utility vertices VU , a policy
function π ID and a set of intended variables VE ⊂ V and for
each intended variable an intended realisation x ∈ RX for
every x ∈ VE .
• A Side Effect Variable of a Decision is any descendent
variable of that decision vertex which is not an intended
variable. A Side effect of a Decision is a realisation of a
Side Effect Variable of a decision.
• The Policy Side Effects and Policy Side Effects
Variables are analogously defined for sets of intended
realisations and variables according to some policy.
• An Unintended outcome is a realisation of an intended
variable other than the intended realisation</p>
      <p>The definition requires that side effects are caused by a
policy since they are descendants of action variables in a
causal DAG; the policy has an influence on their outcome.
We can restate Definition 2 by saying Side Effects of a
decision are those vertices which are descendants of the
decision but are not themselves ancestors of any vertices with
intended realisations according to that decision. This
definition incorporates the idea of means-end consistency into it
assumed in Definition 1.</p>
      <p>The definition of Unintended outcomes separates the
analysis of when a certain policy fails in achieving its objective
(chance of failure) from the analysis of side effects whose
occurrence is not dependent a policy’s success. The legality
or culpability of unintended outcomes is not straightforward.
A doctor performing a life saving treatment which will either
result in the death of their patient or their saviour is not
usually punished in the event of failure even though the chances
of success might be exceptionally slim. On the other hand a
fund-manager who loses all of their investor’s money on a
risky bet might be punished for their reckless actions.
6</p>
    </sec>
    <sec id="sec-6">
      <title>Illustrative example</title>
      <p>
        In this section we will explore some of the previously
discussed features of side effects and intent using the example
of a company wishing to deploy a new recommender
system on its users. I recognise this is a departure from issues
of robot induced vase breaking as is typically considered
in related side-effect literature, nevertheless I think this
setting is pertinent. Article 5(1)(a-b) of The Draft EU AI Act
        <xref ref-type="bibr" rid="ref11">(CNECT 2021)</xref>
        prohibits AI systems from using techniques
which manipulate the behaviour of users to their detriment
with techniques beyond their consciousness.
      </p>
      <sec id="sec-6-1">
        <title>Serve</title>
      </sec>
      <sec id="sec-6-2">
        <title>Content uC</title>
      </sec>
      <sec id="sec-6-3">
        <title>Preferences</title>
      </sec>
      <sec id="sec-6-4">
        <title>Changes</title>
      </sec>
      <sec id="sec-6-5">
        <title>Preferences uR</title>
      </sec>
      <sec id="sec-6-6">
        <title>Watches</title>
      </sec>
      <sec id="sec-6-7">
        <title>Post</title>
      </sec>
      <sec id="sec-6-8">
        <title>Radicalises</title>
        <p>Suppose a company has the choice between serving two
items of video content, one normal and one which has been
designed to intoxicate its viewer into watching all of the
video. If the user views this intoxicating content there is a
chance that they become radicalised in some way. Hidden
from the company are the user’s preferences which dictate
what type of content they would want to watch, preferences
undisturbed.
• Let Serve Content decision be represented by variable</p>
        <p>S ∈ {0, 1}
• Let the Change Preference outcome be represented by
variable C ∈ {0, 1}
• Let the Radicalised outcome be represented by variable</p>
        <p>R ∈ {0, 1}
• Let Watches Post utility vertex be represented by variable</p>
        <p>W ∈ {0, 1}
• Let the user’s Preferences be represented by variable P ∈
{0, 1}. This is an exogenous random variable unknown
at the time of decision S. Let P be Bernoulli distributed
with chance of success 0 ≤ µ P ≤ 1
• UC and UR are two exogenous random variables which
help determine the chance of Preference change and
Radicalisation respectively. Let them both be Bernoulli
distributed with probabilities of success µ C , µ R ∈ [0, 1]
respectively.</p>
        <p>The structural equations are as follows:</p>
        <p>R =
C =
W =
1 if C = 1 and Ur = 1
0 else
1 if S = 1 and US = 1
0 else
1 if S = P or C = 1
0 else</p>
        <p>Content can be one of two types as can a user’s
preferences; p(P = 1) = µ P . If the two types match then the user
will watch the post and the recommender is rewarded with
a unit of advertising revenue. Additionally if the content is
of the intoxicating type (S = 1) then there is chance µ C
that the user will have their preferences changed C = 1. If
that is the case then they will definitely watch the post.
Additionally there is chance the user will be radicalised with
probability µ R
Example 1. Let W = 1 be the intended outcome under the
Policy S = 1. The first question is whether C must have
intended realisation under means-end consistency. As long
as µ P &gt; 0 (the chance that the user would actually like
content type 1) W = 1 is over-determined; either P = 1
or C = 1 is sufficient for W = 1. C could be be both an
intended variable and not.</p>
        <p>Suppose C = 1 is the intended realisation. The set of
intended variables is {S, C, W } and the set of possible side
effects is {R} since there are no other descendant.</p>
        <p>Alternatively suppose C has no intentional status. The set
of intended variables is {S, W } and the set of possible side
effects is {C, R}.</p>
        <p>
          <xref ref-type="bibr" rid="ref22">Kleiman-Weiner et al. (2015)</xref>
          and
          <xref ref-type="bibr" rid="ref18">Halpern and
KleimanWeiner (2018)</xref>
          use a counterfactual type test for intent. If
we set variables to their expected reward maximising
realisations, and then swap the realisation of C and adjust
the rewards of its descendants accordingly, would the
policy change? In other words, is the policy dependent on the
causal relationship between S and C? The expected reward
from choosing S=1 is µ P + (1 − µ P )(µ C ) and the expected
reward from choosing S=0 is (1 − µ P ). Assuming a reward
maximising agent this implies µ c &gt; 1− 2µ P Breaking the
1− µ P
causal link between S and C and setting C = 0 whilst
continuing to choose S = 1 would give an expected reward of
µ P - the strategy would change if µ p &lt; 0.5. In the next
section we will look at the possible culpability classification of
the side effects in this example.
        </p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>7 Culpability of side effects: The role of knowledge</title>
      <p>
        The subject of culpability as to side effects of actions has
received a lot of attention in Psychology since
        <xref ref-type="bibr" rid="ref24">Knobe (2003)</xref>
        discovered the Side-effect effect, the phenomenon where
people are judged to have intended negative side effects
which they foreseeably cause but not for any positive side
effects that they cause.
      </p>
      <p>
        Legal systems have a lot to say about side effects and
culpability3 and provide us with an independent framework to
reason about them. Conversely, courts currently have very
little to say about harms caused by algorithms and who
should take responsibility for them. Only legal persons can
commit crimes and so harms caused by algorithm might
have an indeterminate status
        <xref ref-type="bibr" rid="ref1">Abbott and Sarch (2020)</xref>
        . In
this section we will refer to the agent and actor and take
that to mean the algorithm, algorithm designer and owner
together. We will assume knowledge available to one is
available to the other.
      </p>
      <p>Just as with our use of the MPC to find a definition of
Purpose (intent), we can use its definitions of Knowledge,
Recklessness and Negligence to analyse the culpability of
side effects. These four concepts are in descending order of
culpability. Recklessness is typically the minimum level of
culpability required for criminal charges. Negligence is the
benchmark required for most civil-damages cases. The key
features of these definitions are summarised in Table 1. The
table also includes the features of Intent or Purpose for ease
of comparison.</p>
      <p>All four definitions of culpability in the table require
someone to be able to foresee a bad outcome occurring as
a result of an action or policy. It is here that the MPC’s
decision to define the second most serious level of
culpability ’Knowledge’ problematic since the word is useful to
describe the epistemic properties of all of the definitions. We
will refer to this as ’Culpable Knowledge’ to disambiguate.</p>
      <p>
        The table shows aim or desire is only required for
Purpose/Intent which is consistent with our prior definition of
side effects being. It makes a distinction between two types
of knowledge - subjective knowledge - things which are
known to the actor and objective - things which should be
known to the actor. We can view the Influence Diagram as
a distillation of the actor’s causal knowledge of the world.
A side effect caused with Culpable Knowledge concerns the
case when a bad outcome occurs with almost certainty
according to the SCIMand the algorithm’s policy.
Recklessness and Negligence have been termed culpable
carelessness by
        <xref ref-type="bibr" rid="ref38">(Stark 2017)</xref>
        ; they correspond to cases of model
misspecification and require a judgement about what a
reasonable actor should have had as a model of the world. In the
case of Recklessness, the actor recognised some chance of a
bad outcome happening but continued anyway. If the actual
chance of that thing happening was unreasonably high and
that was knowable to an external reasonable actor, then the
side effect was caused with Recklessness. Negligence
covers the case where the algorithm didn’t even countenance the
risk of something bad happening, and the risk was in
foreseeably unreasonable according to some external reasonable
actor. Side effects caused with negligence are likely to
in3Else the ’I didn’t mean to shoot him your honour, I was
intending to shoot the pigeon behind him’ defence would work really
well.
volve variables not included in the algorithm or its
designers’ model of the world. These should’ve known unknowns
are particularly dangerous since no planning algorithm can
avoid them and yet they will not be viewed as accidents by
society and admit liability to the algorithm owner.
      </p>
      <p>Whilst this table is focussed on the culpability of side
effects as previously defined, Recklessness and Negligence
can also apply to Unintended Outcomes. That is to say
outcomes that may be caused when failing to achieve an
intended outcome. In the case of Knowledge there is debate
about whether someone can commit something with
Culpable Knowledge if their action was intended to obtain some
other result.</p>
      <p>Figure 2 distils Table 1 into a decision process with which
to identify the possible culpability of any caused outcomes.
The grey decision vertices concern questions of subjective
knowledge - information known to the actor at the point of
commission. The white vertices concern questions of
objective knowledge - information that should have been known
to the actor at the point of commission. We will use it in the
following example.</p>
      <p>
        Example 2. Continuing Example 1 we consider the
culpability of the possible side effects caused. Since the outcome
of getting a user to watch a video of type 1 was
overdetermined, there was some uncertainty about the intentional
state of changing preferences - C by choosing action S = 1.
Since causing a change in preferences is not in itself an
unambiguous harm, we will concentrate analysis on the
Radicalisation outcome R which is in the set of side effects
regardless of the intentional status of preference change.
Given that content of type one is chosen, the probability of
radicalisation is µ p.µ R. R is not an intended variable and
has no intended realisation. Consider the case where
Radicalisation does occur. We will assume that the actor has
the same model available to them as in Figure 1. The first
question would be to consider whether harm is foreseeable,
and whether that harm was ’substantial and unjustifiable’,
that is whether the µ p.µ R &gt;&gt; 0. If this wasn’t the case,
then the Radicalisation could be said to be an accident. At
this point, the estimated quantity E[µ P ∗ µ R] must be
assessed. If the actor estimated this as negligible then
Radicalisation would have been caused with negligence. If the
estimate E[µ P ∗ µ R] ≈ 1 then Radicalisation would have
been caused with Culpable Knowledge. The remaining case,
E[µ P ∗ µ R] &gt;&gt; 0 means that the actor caused Radicalisation
with Recklessness. It should be stated that the definition of
’substantial and unjustifiable’ is not straightforward and may
be dependent on the degree of harm caused
        <xref ref-type="bibr" rid="ref38">(Stark 2017)</xref>
        .
      </p>
      <p>
        In the case where harm has not been caused, culpability
can still arise if the actor were to believe their actions were
risky and they were substantially so. For a given crime, it
does not always follow that there is an analogous crime of
reckless endangerment unlike the general existence of
attempt crimes for every crime.
        <xref ref-type="bibr" rid="ref39">Stark (2020)</xref>
        differentiates
between two situations; firstly where there was a risk of
some harm occurring ”Concrete Endangerment” and
secondly where there was in actual fact no risk of
endangerment, but there could have been.
      </p>
      <p>Outcome
Intention</p>
      <p>Exists?</p>
      <p>Yes
Intended
Harm?
Yes</p>
      <p>No</p>
      <p>No
Harm
Caused</p>
      <p>Yes
Harm
Achieved
Yes No</p>
      <p>Attempt
No
Culpable
Purpose</p>
      <p>Harm
Foreseeable</p>
      <p>Harm</p>
      <p>Foreseeable
Else</p>
      <p>Substantial and</p>
      <p>unjustifiable</p>
      <p>Accident /
Strict Negligence</p>
      <p>Harm
Foreseen</p>
      <p>Else</p>
      <p>None Some
Negligence</p>
      <p>
        As well as applying negligence to instances when an
actor should have known about the chance of some harm
being caused, US and other Common Law jurisdictions allow
higher levels of culpability to be imputed in the case when an
agent actively avoids acquiring knowledge in an effort not to
inculpate their behaviour. This is known as the Willful
Ignorance Doctrine
        <xref ref-type="bibr" rid="ref36">(Sarch 2019)</xref>
        and allows culpable knowledge
to be ascribed to an actor in the event of harm.
      </p>
      <p>8</p>
    </sec>
    <sec id="sec-8">
      <title>Related work</title>
      <p>The following review of related work is divided between
subject area.</p>
      <p>
        Side Effects in AI Whilst I argued in that the definition
of side effect in
        <xref ref-type="bibr" rid="ref3">Amodei et al. (2016)</xref>
        captures many more
things than side-effects, the putative solutions presented by
the authors and in descendent research are nevertheless
useful. One general approach is that of minimising impact; an
AI should complete its task as best its can whilst exerting as
small an impact on the world as possible, this is suggested
in Amodei et al and formalised in
        <xref ref-type="bibr" rid="ref23 ref5">(Armstrong and
Levinstein 2017)</xref>
        . A related approach is requiring the reversibility
of all effects caused by a policy
        <xref ref-type="bibr" rid="ref15">(Eysenbach et al. 2017)</xref>
        ,
the assumption being that effects that are reversible are less
harmful. Here an agent learns both policies to achieve things
and policies to undo those things. A value function derived
from the latter can be used to guide the former. This
approach is not suitable for all tasks, since agents will be
required to perform actions with permanent impact.
        <xref ref-type="bibr" rid="ref25">Krakovna
et al. (2020)</xref>
        approach the problem differently by asking the
AI agent to consider completion of future tasks (expressed
as a uniform distribution over all possible goal states) as well
as the present one in an environment that is not reset after a
task is completed. The authors show that this is a
generalisation of the reversible approach.
        <xref ref-type="bibr" rid="ref41">Turner, Ratzlaff, and
Tadepalli (2020</xref>
        ) develop a related method called Attainable
Utility Preservation (AUP), which penalises policies that
prevent the maximisation of a true, complex, yet unseen reward
function, which encodes our preferences regarding bad side
effects not occurring.
      </p>
      <p>
        In a recent review of the subject in AI, Saisubramanian,
Zilberstein, and Kamar (2021) state that Negative Side
Effects (NSE) are ”Undesired effects of an agents actions that
occur in addition to the agent’s intended effects”. This is
close in spirit to the APA definition in Section 3, though
there is no subsequent clarification as to how intent is
assessed. The authors present a taxonomy in which to classify
recent side-effect mitigation techniques consisting of the
following
• Severity - Serious harms are more obvious and likely to
be designed out at an early stage. Saisubramanian et al
observe that less serious harms are the ones that manifest
which they show in related work reduce confidence in AI
systems
        <xref ref-type="bibr" rid="ref29 ref31 ref34">(Saisubramanian, Roberts, and Zilberstein 2021)</xref>
        • Reversibility As discussed in the previous paragraph,
some effects are permanent.
• Avoidability Related to our discussion of means-end
effects, some effects are required for an objective to be
fuliflled. These are therefore intended and cannot be
sideeffects.
• Frequency The occurrence of side-effects might be
generally uncommon but common in a certain situation. For
examples in the medical domain see
        <xref ref-type="bibr" rid="ref26">Leslie et al. (2021)</xref>
        .
• Stochasticity In our causal setting, some side effects
might have a stochastic parent meaning that their
occurrence is not purely a function of the agents actions.
Nearly all of the methods they survey assume
deterministic side-effects.
• Observability Side effects might not be fully observable
according to the agent. Even if they are, they might not
be reflected in the agent’s reward function as penalties.
      </p>
      <p>
        An alternative to the reward (or penalty) based approaches
so far mentioned is constrained optimisation
        <xref ref-type="bibr" rid="ref2">(Achiam et al.
2017)</xref>
        , that is to say policy search within a ’safe’ subset
of policies.
        <xref ref-type="bibr" rid="ref42">Zhang, Durfee, and Singh (2020</xref>
        ) consider a
scenario where a robot agent is given a task but is unsure
about the various side-effects that may occur as a result
of various strategies that satisfy the task. The agent
partitions its state variables into ’free-features’ it knows it can
change, ’locked-features’ it knows it should not change and
’unknown-features’ it is unsure about (yet to be classified).
The agent will proceed to complete a task affecting only
free-features using linear programming, but failing this will
sparingly query an oracle as to the status of an
unknownfeature. An interesting advantage of such an approach is that
side-effects not previously considered by the oracle can be
safely negotiated. More generally side-effects might occur
because of sequences or combinations of states and actions.
      </p>
      <p>
        Many approaches to the side-effect problem assume that
they are a result of underspeciefid reward functions or
nonobservability. It could be that even with an adequate reward
function and state space, undesirable and unnecessary
sideeffects are still incurred due to mis-inference on the part of
the algorithm. It is tempting to believe that causal reasoning
is not needed when using Reinforcement Learning. Often
this is because expert knowledge about the data generation
process has been embedded into a simulation environment
        <xref ref-type="bibr" rid="ref20 ref36">(Herna´n, Hsu, and Healy 2019)</xref>
        , which can be extracted by
the learner through exploration.
      </p>
      <p>
        Side Effects in Philosophy and Psychology Research
considering the moral judgement of side effects in
experimental psychology has been popular since
        <xref ref-type="bibr" rid="ref24">(Knobe 2003)</xref>
        which first identified the Side-effect (or Knobe) effect,
whereby people consistently rate negative side effects as
more intentional than those with positive side effects. This
has since been shown to be the case with related judgements
of causality and blame amongst others. See
        <xref ref-type="bibr" rid="ref16">Feltz (2007)</xref>
        and
        <xref ref-type="bibr" rid="ref23">Kneer and Bourgeois-Gironde (2017)</xref>
        for an overview.
Given the overwhelming about of research written about the
effect, it is surprising that the concept of side effect hasn’t
been more formally identified. The finding that certain side
effects are deemed intentional is consistent with the
definitions of culpable mental states found in common law and
considered in this article.
      </p>
      <p>
        Formal accounts of intent are not hugely common. The
aforementioned,
        <xref ref-type="bibr" rid="ref22">(Kleiman-Weiner et al. 2015)</xref>
        define
intent in Influence diagrams and
        <xref ref-type="bibr" rid="ref18">(Halpern and
KleimanWeiner 2018)</xref>
        define intent using a modified Structural
Causal Model which includes agent utility. In both cases,
an outcome is intended if the agent’s policy is
counterfactually dependent on it.
        <xref ref-type="bibr" rid="ref6 ref7">(Ashton 2021b)</xref>
        extends both
models to consider Oblique Intent, which is similar to Culpable
Knowledge. The Belief Desire Intent (BDI) model of
multiagent programming originated from the Theoretical work of
Philosopher Michael Bratman in the 1980s
        <xref ref-type="bibr" rid="ref9">(Bratman 1999)</xref>
        .
        <xref ref-type="bibr" rid="ref12">Cohen and Levesque (1990)</xref>
        present a formal temporal logic
incorporating intent and the spirit of Bratman’s work.
Recently in the field of causal cognition
        <xref ref-type="bibr" rid="ref29">Quillien and German
(2021)</xref>
        have defined and tested intent as the degree to which
someone’s desire caused something to happen.
      </p>
      <p>
        Side effects in law Law prioritises the establishment of
the various levels of culpability or mens-rea which make
the concept of side effect redundant. At the levels of
culpable carelessness, it isn’t so concerned whether an adverse
outcome was a side effect or an unintended outcome.
Bratman’s intuition that means-end intent should be equivalent to
purpose, expressed in
        <xref ref-type="bibr" rid="ref10">Bratman (2009)</xref>
        as means-end
coherence, is supported by
        <xref ref-type="bibr" rid="ref37">Simester et al. (2019)</xref>
        who quotes the
case of Smith [1960] 2 QB 423 (CA), where the Defendant
was accused of bribing public official. D says that they only
intended to expose public corruption, but the court found
that he necessarily meant to bribe the mayor as a necessary
part of his plan. An argument that the Law reflects the
folkattribution of blame to foreseen negative side effects can be
made by the presence of Culpable Purpose whose definition
does not contain any reference to desire, aim or purpose.
The law is also interested in cases when intended outcomes
did not realise when the intended outcome is prohibited -
attempts to commit crimes are prohibited4. A special case of
failed attempts concerns actions intended to do something
good but which end up causing some harm. These follow
the dotted arc in Figure 2. In such cases culpability might
be waived if the intended outcome of the actor was to
opposite to the actual cause - a surgeon performing life-saving
treatment might know that the chance of death is almost
certain but continue anyway. A related issue is the doctrine of
double effect
        <xref ref-type="bibr" rid="ref27">(McIntyre 2019)</xref>
        which can prevent
culpability of intermediate or successive harmful outcomes as long
as the ’primary’ intended outcome is morally sound. These
are complex issues and do not fall neatly into the culpability
decision rules presented here.
      </p>
      <p>9</p>
    </sec>
    <sec id="sec-9">
      <title>Conclusion</title>
      <p>This article presents a formal definition of what
constitutes side effects sourcing the definition of side effect from
medicine and the necessary definition of intent from law. It
does this with the use of a Structural Causal Incentive Model
or SCIM, itself an extension of a Structural Causal Model
(SCM) and an exogenous definition of intent. Side Effects
of an action are those vertices which are descendants of the
action but are not themselves ancestors of any vertices with
intended realisations.</p>
      <p>Any definition of side effects taken up by the Safe-AI
community should be based on principles agreed by society
rather than computer scientists. Such an approach defuses
the accusation that definitions of concepts important to
society are created to be convenient or progress the objectives of
the engineer or their employer.</p>
      <p>Despite their name, side effects can still attract severe
criminal liability. I have used the standard MPC definitions
of culpability to create a decision process which can be used
to systematically determine culpability for harms caused or
potentially caused. In the event that an algorithm causes
some tangible harm to the world through a side effect,
computer scientists are not going to be surprised about the
conditions under which a side effect might make them liable in
a civil or criminal sense.</p>
      <p>Algorithms or their designers when equipped with model
of the world can use such a definition and decision process
to discriminate between directly intended outcomes and side
effects and then identify what degree of culpability can be
attached to side effects caused or endangered. In particular,
the article should impress upon the reader the importance
of foresight knowledge and its relation to culpability. Harms
can be caused accidentally with no liability, but after their
4Attempts can be divided between those that are interrupted
before commission after passing some threshold of culpable
preparation and those that attempts which are completed but fail in their
aim. We are talking about the latter here
ifrst instance they become foreseeable, at which point if they
reoccur they can no longer be termed accidents and causing
them becomes a culpable action.</p>
      <p>A</p>
    </sec>
    <sec id="sec-10">
      <title>Appendix A: MPC Culpability</title>
      <p>
        The following definitions of culpability are taken from the
MPC
        <xref ref-type="bibr" rid="ref40">(The American Law Insitute 2017)</xref>
        . The fourth level
Purpose (or direct intent) is quoted within the text.
      </p>
      <sec id="sec-10-1">
        <title>Knowledge</title>
        <p>A person acts knowingly with respect to a material
element of an offense when: (i) if the element involves
the nature of his conduct or the attendant
circumstances, he is aware that his conduct is of that
nature or that such circumstances exist; and (ii) if the
element involves a result of his conduct, he is aware
that it is practically certain that his conduct will cause
such a result.</p>
      </sec>
      <sec id="sec-10-2">
        <title>Recklessness</title>
        <p>A person acts recklessly with respect to a material
element of an offense when he consciously disregards a
substantial and unjustifiable risk that the material
element exists or will result from his conduct. The risk
must be of such a nature and degree that, considering
the nature and purpose of the actor’s conduct and the
circumstances known to him, its disregard involves a
gross deviation from the standard of conduct that a
law-abiding person would observe in the actor’s
situation.</p>
      </sec>
      <sec id="sec-10-3">
        <title>Negligence</title>
        <p>A person acts negligently with respect to a material
element of an offense when he should be aware of a
substantial and unjustifiable risk that the material
element exists or will result from his conduct. The risk
must be of such a nature and degree that the actor’s
failure to perceive it, considering the nature and
purpose of his conduct and the circumstances known to
him, involves a gross deviation from the standard of
care that a reasonable person would observe in the
actor’s situation.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Abbott</surname>
          </string-name>
          , R.; and
          <string-name>
            <surname>Sarch</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>Punishing Artificial Intelligence: Legal Fiction or Science Fiction</article-title>
          . Is Law Computable?,
          <fpage>323</fpage>
          -
          <lpage>384</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Achiam</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Held</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Tamar</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ; and Abbeel,
          <string-name>
            <surname>P.</surname>
          </string-name>
          <year>2017</year>
          .
          <article-title>Constrained Policy Optimization</article-title>
          . arXiv:
          <volume>1705</volume>
          .10528 [cs].
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Amodei</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Olah</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Steinhardt</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ; Christiano,
          <string-name>
            <given-names>P.</given-names>
            ;
            <surname>Schulman</surname>
          </string-name>
          , J.; and Mane´,
          <string-name>
            <surname>D.</surname>
          </string-name>
          <year>2016</year>
          .
          <article-title>Concrete Problems in AI Safety</article-title>
          . arXiv:
          <volume>1606</volume>
          .06565 [cs].
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>APA. 2021. Side</given-names>
            <surname>Effect</surname>
          </string-name>
          . Date accessed:
          <volume>29</volume>
          /12/2021 https://dictionary.apa.org/side-effect.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Armstrong</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Levinstein</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <year>2017</year>
          .
          <article-title>Low Impact Artificial Intelligences</article-title>
          . arXiv:
          <volume>1705</volume>
          .10720 [cs].
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Ashton</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <year>2021a</year>
          .
          <article-title>Definitions of intent suitable for algorithms</article-title>
          .
          <source>arXiv:2106</source>
          .04235 [cs.
          <source>AI].</source>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Ashton</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <year>2021b</year>
          .
          <article-title>Extending counterfactual accounts of intent to include oblique intent</article-title>
          .
          <source>arXiv:2106</source>
          .03684 [cs.
          <source>AI].</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Ashton</surname>
            , H.; and Franklin,
            <given-names>M.</given-names>
          </string-name>
          <year>2022</year>
          .
          <article-title>The problem of behaviour and preference manipulation in AI systems</article-title>
          .
          <source>In The AAAI-22 Workshop on Artificial Intelligence Safety (SafeAI</source>
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Bratman</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>1999</year>
          . Intention, plans, and practical reason.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Bratman</surname>
            ,
            <given-names>M. E.</given-names>
          </string-name>
          <year>2009</year>
          . Intention, Practical Rationality, and
          <string-name>
            <surname>Self-Governance</surname>
          </string-name>
          . Ethics,
          <volume>119</volume>
          (April):
          <fpage>411</fpage>
          -
          <lpage>443</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>CNECT.</surname>
          </string-name>
          <year>2021</year>
          .
          <article-title>Proposal for a regulation of the European parliament and of the Council laying down harmonised rules on Artificial Intelligence (Artificial Intelligence Act) and amending certain union legislative acts</article-title>
          .
          <source>Technical Report COM/</source>
          <year>2021</year>
          /206,
          <string-name>
            <surname>European</surname>
            <given-names>Commission</given-names>
          </string-name>
          ,
          <article-title>DirectorateGeneral for Communications Networks, Content and Technology</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>P. R.</given-names>
          </string-name>
          ; and Levesque,
          <string-name>
            <surname>H. J.</surname>
          </string-name>
          <year>1990</year>
          .
          <article-title>Intention is choice with commitment</article-title>
          .
          <source>Artificial Intelligence</source>
          ,
          <volume>42</volume>
          (
          <issue>2-3</issue>
          ):
          <fpage>213</fpage>
          -
          <lpage>261</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Everitt</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Carey</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Langlois</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ortega</surname>
            ,
            <given-names>P. A.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Legg</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <year>2021</year>
          .
          <article-title>Agent Incentives: A Causal Perspective</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <source>arXiv:2102</source>
          .
          <fpage>01685</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Eysenbach</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Gu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ibarz</surname>
          </string-name>
          , J.; and
          <string-name>
            <surname>Levine</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <year>2017</year>
          .
          <article-title>Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning</article-title>
          .
          <source>arXiv:1711</source>
          .06782 [cs].
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Feltz</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2007</year>
          .
          <article-title>The Knobe Effect: A Brief Overview</article-title>
          .
          <source>The Journal of Mind and Behavior</source>
          ,
          <volume>28</volume>
          (
          <issue>3</issue>
          /4).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Halpern</surname>
            ,
            <given-names>J. Y.</given-names>
          </string-name>
          <year>2016</year>
          .
          <article-title>Actual causality</article-title>
          . Cambridge, Massachusetts: The MIT Press.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Halpern</surname>
          </string-name>
          , J. Y.; and
          <string-name>
            <surname>Kleiman-Weiner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2018</year>
          .
          <article-title>Towards formal definitions of blameworthiness, intention, and moral responsibility</article-title>
          .
          <source>32nd AAAI Conference on Artificial Intelligence</source>
          ,
          <source>AAAI</source>
          <year>2018</year>
          ,
          <year>1853</year>
          -
          <fpage>1860</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Heckerman</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ; and Shachter,
          <string-name>
            <surname>R.</surname>
          </string-name>
          <year>1994</year>
          .
          <article-title>A Decision-Based View of Causality</article-title>
          .
          <source>Uncertainty Proceedings</source>
          <year>1994</year>
          ,
          <fpage>302</fpage>
          -
          <lpage>310</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Herna´n</surname>
          </string-name>
          , M. A.;
          <string-name>
            <surname>Hsu</surname>
          </string-name>
          , J.; and
          <string-name>
            <surname>Healy</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>A Second Chance to Get Causal Inference Right: A Classification of Data Science Tasks</article-title>
          . CHANCE,
          <volume>32</volume>
          (
          <issue>1</issue>
          ):
          <fpage>42</fpage>
          -
          <lpage>49</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Hildebrandt</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>Closure: on ethics, code and law</article-title>
          . In Law for Computer Scientists, chapter
          <volume>11</volume>
          . Oxford University Press.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>Kleiman-Weiner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Gerstenberg</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Levine</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Tenenbaum</surname>
            ,
            <given-names>J. B.</given-names>
          </string-name>
          <year>2015</year>
          .
          <article-title>Inference of intention and permissibility in moral decision making</article-title>
          .
          <source>Proceedings of the 37th Annual Conference of the Cognitive Science Society</source>
          ,
          <volume>1</volume>
          (
          <year>1987</year>
          ):
          <fpage>1123</fpage>
          -
          <lpage>1128</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Kneer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Bourgeois-Gironde</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <year>2017</year>
          .
          <article-title>Mens rea ascription, expertise and outcome effects: Professional judges surveyed</article-title>
          .
          <source>Cognition</source>
          ,
          <volume>169</volume>
          (
          <year>August</year>
          ):
          <fpage>139</fpage>
          -
          <lpage>146</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>Knobe</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <year>2003</year>
          .
          <article-title>Intentional action and side effects in ordinary language</article-title>
          .
          <source>Analysis</source>
          ,
          <volume>63</volume>
          :
          <fpage>190</fpage>
          -
          <lpage>194</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <surname>Krakovna</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Orseau</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ngo</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ; Martic,
          <string-name>
            <given-names>M.</given-names>
            ; and
            <surname>Legg</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          <year>2020</year>
          .
          <article-title>Avoiding Side Effects By Considering Future Tasks</article-title>
          .
          <source>Advances in Neural Information Processing Systems (NeuIPS)</source>
          ,
          <volume>33</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <surname>Leslie</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Mazumder</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Peppin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Wolters</surname>
            ,
            <given-names>M. K.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Hagerty</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2021</year>
          .
          <article-title>Does “AI” stand for augmenting inequality in the era of covid-19 healthcare? BMJ.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <surname>McIntyre</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>Doctrine of Double Effect</article-title>
          . In
          <string-name>
            <surname>Zalta</surname>
          </string-name>
          , E. N., ed.,
          <source>Stanford Encyclopedia of Philosophy</source>
          . Metaphysics Research Lab, Stanford University, spring
          <volume>201</volume>
          edition.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <string-name>
            <surname>Pearl</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <year>2000</year>
          .
          <article-title>Causality: Models, reasoning and inference</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          <string-name>
            <surname>Quillien</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ; and German,
          <string-name>
            <surname>T. C.</surname>
          </string-name>
          <year>2021</year>
          .
          <article-title>A simple definition of 'intentionally'</article-title>
          .
          <source>Cognition</source>
          ,
          <volume>214</volume>
          (June).
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          <string-name>
            <surname>Russell</surname>
            ,
            <given-names>S. J.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>Human compatible: artificial intelligence and the problem of control. London: Allen Lane, an imprint of Penguin Books</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          <string-name>
            <surname>Saisubramanian</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Roberts</surname>
            ,
            <given-names>S. C.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Zilberstein</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          <string-name>
            <given-names>Understanding</given-names>
            <surname>User Attitudes Towards Negative Side</surname>
          </string-name>
          <article-title>Effects of AI Systems</article-title>
          .
          <source>In Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems</source>
          ,
          <volume>1</volume>
          -
          <fpage>6</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          <source>Yokohama Japan: ACM. ISBN 978-1-4503-8095-9.</source>
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          <string-name>
            <surname>Saisubramanian</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Zilberstein</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Kamar</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          <string-name>
            <given-names>Avoiding</given-names>
            <surname>Negative Side</surname>
          </string-name>
          <article-title>Effects due to Incomplete Knowledge of AI Systems</article-title>
          . arXiv:
          <year>2008</year>
          .12146 [cs].
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          <string-name>
            <surname>Sarch</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>Criminal Law Basics and the Willful Ignorance Doctrine</article-title>
          .
          <source>In Criminally Ignorant</source>
          ,
          <fpage>7</fpage>
          -
          <lpage>26</lpage>
          . Oxford University Press.
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          <string-name>
            <surname>Simester</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Spencer</surname>
            ,
            <given-names>J. R.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Stark</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Sullivan</surname>
            ,
            <given-names>G. R.</given-names>
          </string-name>
          ; and Virgo,
          <string-name>
            <surname>G. J.</surname>
          </string-name>
          <year>2019</year>
          .
          <article-title>Mens Rea. In Simester and Sullivan's Criminal Law</article-title>
          , chapter
          <volume>5</volume>
          ,
          <fpage>137</fpage>
          -
          <lpage>190</lpage>
          . Hart,
          <volume>7</volume>
          <fpage>edition</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          <string-name>
            <surname>Stark</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <year>2017</year>
          .
          <article-title>Introduction</article-title>
          . In Culpable Carelessness:
          <article-title>Recklessness and Negligence in the Criminal Law</article-title>
          , chapter
          <volume>1</volume>
          ,
          <fpage>1</fpage>
          -
          <lpage>25</lpage>
          . Cambridge University Press.
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          <string-name>
            <surname>Stark</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>The Reasonableness in Recklessness</article-title>
          .
          <source>Criminal Law and Philosophy</source>
          ,
          <volume>14</volume>
          (
          <issue>1</issue>
          ):
          <fpage>9</fpage>
          -
          <lpage>29</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          <string-name>
            <given-names>The</given-names>
            <surname>American Law Insitute</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>General Requirements of Culpability.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          <string-name>
            <surname>Turner</surname>
            ,
            <given-names>A. M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ratzlaff</surname>
            , N.; and Tadepalli,
            <given-names>P.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>Avoiding Side Effects in Complex Environments</article-title>
          . arXiv:
          <year>2006</year>
          .06547 [cs].
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Durfee</surname>
          </string-name>
          , E.; and
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>Querying to Find a Safe Policy under Uncertain Safety Constraints in Markov Decision Processes</article-title>
          .
          <source>Proceedings of the AAAI Conference on Artificial Intelligence</source>
          ,
          <volume>34</volume>
          (
          <issue>03</issue>
          ):
          <fpage>2552</fpage>
          -
          <lpage>2559</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>