<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Categorizing Identi ed Deviations for Auditing</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marzie Hosseinpour</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mieke Jans</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Hasselt University</institution>
          ,
          <addr-line>Agoralaan Gebouw D, 3590 Diepenbeek</addr-line>
          ,
          <country country="BE">Belgium</country>
        </aff>
      </contrib-group>
      <fpage>125</fpage>
      <lpage>129</lpage>
      <abstract>
        <p>Currently, nancial statements auditors perform the tests of controls based on sampling. However, when using a sampling approach, information is lost. To counter this drawback, data analytics has been applied as a method for auditors to provide assurance while using all data. Speci cally for testing controls, the potential of process mining has been explained in literature. Indeed, conformance checking can be used to compare real process executions with a normative model. However, the outcome of current conformance checking techniques is too vast for an auditor to inspect further. The identi ed deviations are at an atomic level (skipped and inserted tasks) and there is no feasible approach to gain a quick overview of the deviations. In this paper, we propose an approach to categorize deviations, which enables auditors to quickly gain an overview of di erent types of existing deviations along with their frequencies. Categorizing deviating process instances can also give an insight for assessing the risk at case level.</p>
      </abstract>
      <kwd-group>
        <kwd>Deviation Identi cation</kwd>
        <kwd>Financial Statemenets Auditing</kwd>
        <kwd>Process Mining</kwd>
        <kwd>Conformance Checking</kwd>
        <kwd>Risk Assessment</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The objective of nancial audit practice, for auditors is to express their opinion
on correctness and fairness of organizations' nancial statements. To achieve this
goal, auditors test business process's controls e ectiveness and perform
substantive tests of balances and accounts which impact the nancial reporting. Test
of process controls should be done before investigation of balance sheets and
accounts. The rationale is that if the control settings are rigid, there is
reasonable assurance, for the auditor, to rely on the organization's internal controls.
If the results of tests of controls are not satisfactory, more substantive evidence
should be gathered. That means more time and e ort should be dedicated while
testing the balance sheets and statements. To test the controls, as required by
Audit Standard No.5 [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and ISA 315 [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] a general approach is to rst collect
information on the process by interviewing process experts and by studying the
normative model (or creating one, in case it does not exist). After a general
understanding of the normative model, auditors test the e ectiveness of the
associated control setting. Currently, this is tested by taking a sample of process
executions. The sample is compared with the business model manually to check
the conformity of the selected cases. If there are no deviating cases among this
selection, the control settings are assumed to be reliable. Otherwise, when
deviating cases are discovered they should be reported. However, before reporting,
the deviations should be studied by auditors and further discussed with process
experts. The reason is that some of the deviating cases can be 'cleared' due
to some implicit or explicit exceptional rules and be labeled as normal cases.
The deviations which have a higher risk level should be prioritized for
followup. Investigating deviations in such a way is a roughly feasible task owing to the
currently used sampling approach. However, some information is lost and results
may be inaccurate using sampling.
      </p>
      <p>
        The advent of process mining techniques [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], in the last decade, can be
promising for auditing, not in the least because of its full-population testing ( [
        <xref ref-type="bibr" rid="ref8 ref9">8,9</xref>
        ]). By
applying a conformance checking technique (such as [
        <xref ref-type="bibr" rid="ref10 ref11 ref3 ref4 ref6">3, 4, 6, 10, 11</xref>
        ]), the entire
set of real process executions (aka event log) can be compared with the process
model to distinguish deviating cases from normal cases. Although these
techniques are able to locate the root cause of each deviation, for auditing purpose
there are some shortcomings. First of all, the output of detected deviations is too
immense for a full follow-up. There are too many variants of deviations. This is
because the normative model does not cover all possible exceptional or exible
behavior of processes. The second problem is that almost all of the conformance
checking techniques discover deviations in a very atomic level, which is not
pragmatic to work with. For example, consider a procurement process with the model
&lt;Create PO, Sign, Release, IR, GR, Pay&gt; where IR (Invoice Receipt) and GR
(Goods Receipt) are concurrent activities and can change order. Take &lt;Create
PO, IR, Release, Pay&gt; as an executed trace. If we check the conformity of this
trace manually, we would intuitively notice that the activities Sign and GR are
missing in this execution and Release and IR have changed order (since IR is
expected to occur after Release, based on the model). By giving the model and the
trace to the mostly employed conformance checking tool [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], the output will be
as follow: Skipped(Sign), Skipped(Release), Inserted(Release), Skipped(GR). In a
real event log, with thousands di erent cases, this leaves auditors with hundreds
combination of low-level deviations, isolated from the variants and the context
where they took place.
      </p>
      <p>The question is how the idea of conformance checking (comparing a log with
a model) can be made actionable for auditors. The approach we consider in
this paper is to create a di erent layer of deviations that would allow us to
categorize those deviations in sets that are meaningful and manageable for an
auditor. Concretely, we would like to develop (or alter) a conformance checking
algorithm that identi es deviations in such a way that i) they will be meaningful
for auditors, ii) gives them an insight of di erent types of existing deviations and
their frequencies in the executed behavior, and iii) helps auditors to prioritize
deviations based on their risk level.</p>
      <p>In the remainder of this paper, we propose an e cient interpretation of
deviations in section 2, which can answer the above question. In section 3, we explain
how we will implement the idea and we conclude in section 4.</p>
    </sec>
    <sec id="sec-2">
      <title>Proposed Approach</title>
      <p>To address the problem of too ne-grained deviation types (skipped and
inserted) mentioned in the previous section, we de ne a set of six deviation types
that we presume would be meaningful to an auditor, even without the context of
the complete trace. These deviation types interpret deviations at a higher level,
while keeping the location and root cause of the mismatches in each case. The six
types that we propose are: missing a sequence, existence of an extra sequence,
loop on a sequence, repetition of a sequence, swapping two sequences, replacing
one sequence by another sequence. Note that a sequence can contain one or more
activities. These proposed types are explained in table 1 due to lack of space. For
each deviation type an example is provided, based on the procurement model
presented in section 1.</p>
      <p>Type Description Example
1 Missing a se- A sequence of events &lt;Create PO, Release, IR, GR,
quence that should have taken Pay&gt;</p>
      <p>place, is not executed Sign is missing
2 Existence of ex- A sequence of events &lt;Create PO, Sign, Change Line,
tra sequence is executed that it was Release, IR, GR, Pay&gt;)
not designed Extra event Change Line exists
3 Loop on a se- A sequence of events &lt;Create PO, Sign, Release, IR, GR,
quence is repeated while only Pay, Pay)&gt;
a single occurrence was Loop on Pay
designed
4 Repetition of a An executed sequence &lt;Create PO, Sign, Release, IR, GR,
sequence is repeated later in an- Sign, Pay)&gt;</p>
      <p>other part of trace Repetition of Sign after GR
5 Swapping two Two sequences have &lt;Create PO, IR, Sign, Release,
sequences changed their order GR, Pay)&gt;
&lt;Sign, Release&gt; and IR are
swapped
6 Replacing one A sequence takes place &lt;Create PO, Sign, Pay, IR, GR,
sequence by an- instead of another Pay&gt;
other sequence missing sequence. Release is replaced by Pay</p>
      <p>The idea is to develop an algorithm that provides auditors with an overview of
deviations according to their categories. Next, it should be feasible to drill down
to the sequences that were subject to the deviations. For instance, the deviating
trace &lt;Create PO, IR, Release, Pay&gt; will be described as follow: Missing an
event (with two sub-categories: Sign is missed and GR is missed ) and Swapping
events (Release and IR are swapped ). On a log level, for example, this could
give: "This logs shows 500 times a "Repetition of a sequence", this is stemming
from 150 repetition of &lt;Sign, Release&gt;, 150 repetition of IR, and 200 repetition
of GR." which describes existing deviations in the log in categories along with
their frequencies.</p>
      <p>Therefore, the contribution of this paper is to interpret deviations using these
types, which enables auditors to perceive di erent types of devisions and
possibly related risk, as one sees them intuitively (rather than on an atomic level).
3</p>
    </sec>
    <sec id="sec-3">
      <title>Methodology and Implementation</title>
      <p>Before the development of the desired algorithm, we will perform a eld research.
The methodology is to interview auditors to test our proposed deviation types
in their approach. The eld research will be executed to gain insights in how
complex deviation types might be interpreted by human experts (as apposed to
our assumptions). Consider again, the deviating example in section 1, &lt;Create
PO, IR, Release, Pay&gt;. The deviation type 'swapped Release and IR' can be
interpreted in a di erent way like 'Release is postponed after IR' or even 'IR is
advanced before Release'.</p>
      <p>
        When various deviation types exist in one trace, the combination of them also
can be interpreted di erently. Avoiding di erent interpretation of deviations is
important because it may lead to assigning wrong risk level to them. Hence, we
believe performing the eld work research is necessary before implementation
of the idea. After the eld research, we plan to build the algorithm with the
desired requirements. Current known algorithms have already been investigated
to what extend they could help in achieving our goal. The shortcoming of
costbased conformance checking tool, proposed by Adriansyah et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] is discussed
in section 1. The conformance technique proposed by Garcia et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], to the
best of our knowledge, is the only conformance checking tool which provides
the deviations in natural language statements. The tool nds deviations in both
model and event log. The output is suitable to improve the model or see what
types of deviations exist in the event log in general. Nevertheless, their approach
is not fully compatible with auditing purpose of testing the controls. The reason
is that it does not have the means for nding which cases or traces cause the
discovered deviations. Hence, one is not able to nd the deviating cases or even
the frequency of each deviation type. After the eld research, between these two
techniques, we will choose the one which is closer to our objective of capturing
and categorizing deviations and will adapt it to accomplish our goal.
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Conclusion and Future Work</title>
      <p>This paper introduces the idea of developing a technique for organizing the
identi ed deviations in event logs into certain categories. A set of six di erent
deviation types are proposed to enable auditors to gain an overview of existing
deviations.
During our eld research, we will also study what type of information auditors
use to assess the risk of deviation types. This insight will be used in a
followup phase to go from deviation to risk classi cation. Moreover, the correlation
between risk level and the complexity of each deviating trace (i.e., the number
and the variety of the mismatches in each deviating trace), in a real life setting
will be investigated.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. Iaasb: International international federation of accountants,
          <source>international standard on auditing 315</source>
          , http://www.ifac.org/system/files/downloads/ a017-2010
          <string-name>
            <surname>-</surname>
          </string-name>
          iaasb-handbook-isa-
          <volume>315</volume>
          .pdf
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. Pcaob:
          <article-title>The public company accounting oversight board, auditing standard no. 5</article-title>
          , https://pcaobus.org/Standards/Auditing/pages/auditing_standard_5.aspx
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>van der Aalst</surname>
          </string-name>
          , W.M.P., de Beer, H.T.,
          <string-name>
            <surname>van Dongen</surname>
            ,
            <given-names>B.F.</given-names>
          </string-name>
          :
          <article-title>Process mining and veri cation of properties: An approach based on temporal logic</article-title>
          .
          <source>In: Proceedings of the 2005 Confederated International Conference on On the Move to Meaningful Internet Systems</source>
          . pp.
          <volume>130</volume>
          {
          <fpage>147</fpage>
          . OTM'
          <volume>05</volume>
          , Springer-Verlag (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>van der Aalst</surname>
          </string-name>
          , W.M.P.,
          <string-name>
            <surname>de Medeiros</surname>
            ,
            <given-names>A.K.A.</given-names>
          </string-name>
          :
          <article-title>Process mining and security: Detecting anomalous process executions and checking process conformance</article-title>
          .
          <source>Electron. Notes Theor. Comput. Sci. 121</source>
          ,
          <issue>3</issue>
          {
          <fpage>21</fpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>van der Aalst</surname>
            ,
            <given-names>W.M.P.</given-names>
          </string-name>
          : Process Mining: Discovery, Conformance and Enhancement of Business Processes. Springer Publishing Company, Incorporated, 1st edn. (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Adriansyah</surname>
          </string-name>
          , A.,
          <string-name>
            <surname>van Dongen</surname>
            ,
            <given-names>B.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>van der Aalst</surname>
            ,
            <given-names>W.M.P.</given-names>
          </string-name>
          :
          <article-title>Conformance checking using cost-based tness analysis</article-title>
          .
          <source>In: Proceedings of the 2011 IEEE 15th International Enterprise Distributed Object Computing Conference</source>
          . pp.
          <volume>55</volume>
          {
          <fpage>64</fpage>
          . EDOC '11, IEEE Computer Society, Washington, DC, USA (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Garcia-Banuelos</surname>
          </string-name>
          , L.and
          <string-name>
            <surname>van Beest</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>La</given-names>
            <surname>Rosa</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          :
          <article-title>Complete and interpretable conformance checking of business processes</article-title>
          .
          <source>BPM center</source>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Jans</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alles</surname>
            ,
            <given-names>M.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vasarhelyi</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          :
          <article-title>A eld study on the use of process mining of event logs as an analytical procedure in auditing</article-title>
          .
          <source>The Accounting Review</source>
          <volume>89</volume>
          (
          <issue>5</issue>
          ),
          <volume>1751</volume>
          {1773 (
          <year>September 2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Jans</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alles</surname>
            ,
            <given-names>M.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vasarhelyi</surname>
            ,
            <given-names>M.A.:</given-names>
          </string-name>
          <article-title>The case for process mining in auditing: Sources of value added and areas of application</article-title>
          .
          <source>International Journal of Accounting Information Systems 14 14</source>
          ,
          <issue>1</issue>
          {20 (March
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Ramezani</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fahland</surname>
          </string-name>
          , D., van der Aalst, W.:
          <article-title>Where did i misbehave? diagnostic information in compliance checking</article-title>
          . In: Barros,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Gal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Kindler</surname>
          </string-name>
          , E. (eds.)
          <source>International Conference on Business Process Management (BPM 2012). Lecture Notes in Computer Science</source>
          , vol.
          <volume>7481</volume>
          , pp.
          <volume>262</volume>
          {
          <fpage>278</fpage>
          . Springer-Verlag, Berlin (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Rozinat</surname>
          </string-name>
          , A.,
          <string-name>
            <surname>van der Aalst</surname>
            ,
            <given-names>W.M.P.</given-names>
          </string-name>
          :
          <article-title>Conformance checking of processes based on monitoring real behavior</article-title>
          .
          <source>Information Systems</source>
          <volume>33</volume>
          (
          <issue>1</issue>
          ),
          <volume>64</volume>
          {
          <fpage>95</fpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>