<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Workflow Support for Mobile Data Collection</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Peter K. Wakholi</string-name>
          <email>peterokhisa@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Bergen</institution>
          ,
          <country country="NO">Norway</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Mobile devices have become popular for electronic data collection, management and dissemination in low income countries. Many organizations are considering the use of electronic devices rather than paper-based routines for data collection. In data collection, time dependent and cumulative data, can be regarded as forming a workflow process. Paper-based routines support this workflow process and the transition to a paperless environment poses some workflow challenges that if not addressed could lead to failure to effectively use mobile devices and to meet process-related requirements. The research seeks to explore how challenges related to the use of workflows in mobile, disconnected and distributed environments can be addressed. The research uses an action research approach to propose solutions based on case studies related to Mobile Health (MHealth) and Clinical Trials. This paper presents the current status of research by discussing ideas, methods and frameworks for workflows that provide the flexibility and functionality modeled on paper-based routines.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        In organizations, the daily operation is governed by a set of cooperative business
processes, in which interactions with humans and information systems are involved.
Workflow systems aim to automate a business processes in whole or part, during which
documents, information and tasks are passed from one participant to another for action
according to preset procedural rules [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Workflow management provides the ability
to improve the efficiency of an organization by streamlining and automating
coordinated work activities over distributed environments. Organisations using mobile data
collection (MDC) aim to achieve this goal by replacing paper-based routines with
mobile devices. This research is based on MDC in clinical trials, which provides a good
use-case for replacing paper-based routines with mobile devices.
      </p>
      <p>
        A Clinical Trial can be understood as a workflow process. The workflows are
defined based on study protocols, which specify the duration and structure of the study
and the standard guidelines which must be followed by all participants [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The data
collection process is most difficult as it involves field studies which could be in
single or multiple sites, often in remote rural environments. One of the core documents
in clinical trial for data collection is the Case Report Form (CRF). The CRF is a form
where the investigator enters all patients’ clinical and non-clinical data related to the
trial. There are three types of data: non-time dependent, time dependent and cumulative
data [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Non-time dependent data is the data collected at a snapshot in time. Such data
include subject demographics and medical history. Time dependent data is data
collected repeatedly over time through multiple visits. Cumulative data is data collected
over time but not linked to a specific visit.
      </p>
      <p>
        If workflows using mobile-based routines are to replace paper-based routines, some
challenges need to be overcome. First, MDC in mobile/wireless computing environment
posts new challenges of the such as disconnections, slow connection links, and
limited computing power which must be addressed when designing a mobile application.
A workflow solution for MDC should take into consideration these characteristics and
limitations by employing new computing paradigms. Second, paper-based routines
provide the flexibility to work in remote and disconnected environments. Work conducted
in the field is often done on paper and filed at the end of a specified time. The workflow
system therefore needs to allow for this flexibility by enabling execution of multiple
tasks before connection to the server is required. This is a shift from the traditional
workflow systems provide for client-server architecture that requires synchronisation
before the next task is executed and in some cases web-based-distributed architecture
which require constant connection links [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        This research focuses on the development of mobile systems for EDC by addressing
workflow-related problems in their use for clinical trials and mhealth projects. The
action research approach is used to develop appropriate methods, models and frameworks
for deployment and utilization of Workflow Management System (WFMS) for mobile
data collection. The research will be based on the OMEVAC (Open Mobile Electronic
Vaccine Trials) project [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] - a new mobile platform for conducting clinical trials, funded
by the Norwegian Research Council led by the Centre for International Health,
University of Bergen. Case studies will be undertaken by testing and evaluating the prototypes
and tools in clinical trials being conducted in rural parts of Africa.
1.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>Research Questions</title>
      <p>The main research question is to investigate ”how advances in mobile computing,
process-aware information systems can combine to enable the use of mobile devices for
data collection, management and dissemination in place of paper-based routines used
for field activities” In order to achieve new knowledge in this area, the following sub
questions will be investigated.</p>
      <p>1 How can workflow systems be implemented in the highly constrained and
resourcelimited mobile environments?
2 How can mobile workflows allow for the flexibility of paper-based routines in field
activities?
3 How can distributed workflows be enacted and synchronized in disconnected
environments?
1.2</p>
    </sec>
    <sec id="sec-3">
      <title>Research Objectives</title>
      <p>The research objectives
are:1 To develop a framework for deployment of generic workflow systems to support
mobile data collection in resource constrained environments.
2 To develop methods for execution of multiple tasks before connection to a server is
required so as to allow for flexibility mirrored on paper based routines.
3 To test the concepts above by implementing an actual workflow based mobile data
collection, management and dissemination system for the OMEVAC project.
4 To demonstrate the value and potential of the mobile data collection in low-income
countries as a replacement for paper-based routines.
2
2.1</p>
      <sec id="sec-3-1">
        <title>Current Research and Preliminary Ideas</title>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Workflow Support for Mobile Data Collection</title>
      <p>This research (submitted and accepted by CAISE BPMDS 2011) aims to answer the
first research question identified. It examines the challenges of deploying (WFMS) in
generic data collection tools and proposes a framework for integration. The WFMS is
based on YAWL [6] with a generic MDC tool called openXdata [7], [8]. It describes
the design and implementation of a workflow adapter that acts as a bridge between
the mobile device, data processing applications and workflow engine. Through this
implementation, we have been able to provide a distributed architecture that enables the
ordering of tasks linked to mobile devices and web-based applications. In addition, we
provide an example where this framework has been used in an MHealth project of a
vaccination registry that uses mobile devices to collect data on child immunisations.
2.2</p>
    </sec>
    <sec id="sec-5">
      <title>Allowing for Offline Behaviour</title>
      <p>In order for mobile devices to be used for data collection in disconnected environments,
there is need to allow for offline behaviour. This can be achieved by reducing the number
of connections required in-between tasks by assigning as many tasks as possible for
each connection. Therefore tasks need to be grouped and assigned to a user - assuming
that there are no outside data and resource dependencies for execution of the next task.</p>
      <p>In WFMS, multiple execution of tasks have been implemented using worklets [9].
Worklets are subnets that are executed like subroutines of a main program and are
created at design time. In this research we explore the possibility of dynamically
identifying and generating worklets which can then be a assigned as a bundle of tasks to be
executed by the mobile user. In order to achieve this, there is need to (1) determine
which set of tasks to group; (2) ensure that the tasks assigned are performed as required
and (3) synchronise with the server once tasks are completed. In order to determine
which set of tasks to group we propose a framework for multiple task assignment that
can be used to check the process model to create worklets. To ensure that the worklets
are executed on a mobile phone as intended, we propose designing a mobile client to
have a light workflow engine that receives a worklet. Tasks in a worklet may be fully or
partially completed. We therefore propose a protocol that tracks the status of execution
on the mobile client and updates server. This synchronisation protocol would be run
every time a connection is established.</p>
      <p>B
A</p>
      <p>T1</p>
      <p>T2</p>
      <p>H
T7</p>
      <p>T8</p>
      <p>I</p>
      <p>T5
D
T6
T3</p>
      <sec id="sec-5-1">
        <title>Theoretical Basis for Proposed Solutions</title>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Dynamic Generation of Worklets</title>
      <p>We propose to use model checking to automatically test a process model and determine
whether this model meets a given property. Model checking aims to simulate every
execution of the model of the system in order to obtain a labelled reachability graph
describing all its behaviors. The graph is the checked against a set of properties to
determine if the system performs according to a specified property. We use the model
unfolding technique as proposed by [10], [11] to determine the causality, conflicts and
concurrency properties. [10] define these properties as; two nodes x and y of a net,
are causally retated denoted by x y , if there is a path of arrows from x to y. The
nodes are in conflict, denoted by x 6= y, if there is a place z, different from x and y,
from which one can reach x and y, exiting z by different arrows. Figure 2 provides an
illustration of this approach.</p>
      <p>Figure 1 gives a workflow net showing tasks T1 to T6 that are executed using the
basic control flow patterns initially proposed by the WFMC [12] as Sequence, Parallel
Split, Synchronization, Exclusive Choice and Simple Merge. The net presented can be
unfolded as shown in 2. The groups 1,2 and 3 present the possible sets of tasks whose
grouping would not affect the execution of the patterns above. Based on these groups,
the following behavioral observations of the nets can be made:
1 The nodes in group 1 (C,D,E) are in causal relationship and connected by more
than one event.There are no arc emerging or entering this group.
2 The nodes in group 2 and 3 (B,C,D,E,G,H) (B,H,I) are a set on a path that has
exactly one input condition and one output condition. There are no arcs leaving or
entering the group except at node B (the initial node).</p>
      <p>From these observations one can conclude that in as long as there exists one input arc
and one output arc into a set of nodes from the initial node to the final node, then
a worklet is possible. Input or output arcs are permisible on the initial or final node.
Therefore given an unfolded model, a worklet can be created from a set of nodes that
are not in contradiction with any node outside the set. In addition, for the set of tasks,
all the conditions must be causal (i.e. system should transition to another state in the
group) and all events must be strongly connected (i.e. There exists a path from first
event to the last event).</p>
      <p>S = fsi; si+1; ::::si+ng
Proposition 1. Given an unfolding of a workflow net A = hS; T ; ; ; isi, such that
S = fs0; s1; :::::; smg is a set of states and T = ft0; t1; :::::; tng is a set of
transitions, a worklet W = hS; T ; ; ; isi where T = fti; ti+1; ::::ti+mg T and</p>
      <p>S is possible if:
1. 8S9 fsi si+1g where 0 i m (Causality)
2. 8S9si such that si: 6= fS 2= Sg (no conflict)
3.2</p>
    </sec>
    <sec id="sec-7">
      <title>Event-log based Synchronization</title>
      <p>Synchronisation takes place once a connection between a server and mobile client is
established. Any synchronisation solution should aim to ensure appropriate utilisation
mobile phone processing and memory. We therefore propose keeping a log of all the
events on a mobile device and synchronising with the server. For synchronisation,
reachability analysis can be used to compare the workflow net and event log. For a
transition system like a workflow net, we provide a definition of a state called reached
marking as a basis for tracking current work status.
and pn 6= sm+1
Definition 1 Reached Marking: Let p be a state in P and</p>
      <p>= (a1; a2:::::::am) 2 A be a set of executed tasks for the process definition of P . State
p0 is the initial state of the process and pn is the final state. are reached markings if
and only if there exists states s0; s1; :::::; sm such that p0 = s0 and si ai+!1 si+1 for
all for all 0 i m. The next execution task satisfies the relation sm am+!1 sm+1</p>
      <p>Figure 3 illustrates an example of a simple workflow for patient visiting a health
clinic in a clinical trial. The reachability graph shows all the possible states of the
system and transitions that transform these states. Note that the states associated with the
We have presented in this paper ongoing research about workflow support for mobile
data collection. The use of worklets to enable execution of multiple tasks on a mobile
device reduces the need for many connections thereby allowing for more flexibility in
a disconnected environment. Moreover, it would be neccessary to have these worklets
dynamically generated in order impose less restrictions on the system. We proposed a
model checking technique based on model unfolding and genetic algorithms to
generate these worklets. In order to enable synchronisation between the workflow server and
worklet client, we propose event-log based synchronisation. The use of this approach
greatly reduces the memory usage and application footprint - a key requirement for the
limited resource mobile environment. We thus provide clear progress towards achieving
the objectives of the research as highlighted by the practical ideas and theory presented.
The mobile data collection tool will be enhanced to allow for offline behaviour using
these ideas. It is anticipated that all methods and frameworks proposed will be
empirically tested in MHealth projects to validate the research.
6. W.M.P. Van Der Aalst and A.H.M. Ter Hofstede. YAWL: yet another workflow language.</p>
      <p>Information Systems, 30(4):245–275, 2005.
7. Y. Anokwa, C. Hartung, A. Lerer, B. DeRenzi, and G. Borriello. A new generation of open
source data collection tools. In Information and Communication Technologies and
Development (ICTD), 2009 International Conference on, page 493. IEEE, 2010.
8. J. Klungsoyr, P. Wakholi, B. Macleod, A. Escudero-Pascual, and N. Lesh. OpenROSA,
JavaROSA, GloballyMobile–Collaborations around Open Standards for Mobile
Applications. M4D 2008, General Tracks.
9. M. Adams, A. ter Hofstede, D. Edmond, and W. van der Aalst. Worklets: A service-oriented
implementation of dynamic flexibility in workflows. On the Move to Meaningful Internet
Systems 2006: CoopIS, DOA, GADA, and ODBASE, pages 291–308, 2006.
10. J. Esparza and K. Heljanko. Unfoldings: a partial-order approach to model checking.</p>
      <p>Springer-Verlag New York Inc, 2008.
11. J. Esparza and K. Heljanko. Implementing LTL model checking with net unfoldings. Model</p>
      <p>Checking Software, pages 37–56, 2001.
12. D. Hollingsworth. Workflow Management Coalition-Terminology &amp; Glossary,[online],
1999, verf
”ugbar im World Wide Web: http://www. wfmc. org/standards/docs.
TC1011 term glossary v3. pdf.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>M.</given-names>
            <surname>Weske</surname>
          </string-name>
          .
          <article-title>Business process management: concepts, languages, architectures</article-title>
          . SpringerVerlag New York Inc,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>US</given-names>
            <surname>Food. Drug Administration</surname>
          </string-name>
          .
          <article-title>Guidance for industry, E6 good clinical practice: consolidated guidance</article-title>
          .
          <source>Federal Register</source>
          ,
          <volume>10</volume>
          :
          <fpage>691</fpage>
          -
          <lpage>709</lpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>K.K. Moon</surname>
          </string-name>
          .
          <source>Techniques for Designing Case Report Forms in Clinical Trials. ScianNews Volume, DOI</source>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>R.S.H.</given-names>
            <surname>Istepanian</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.S.</given-names>
            <surname>Pattichis</surname>
          </string-name>
          .
          <article-title>M-health: Emerging mobile health systems</article-title>
          . SpringerVerlag New York Inc,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>J.</given-names>
            <surname>Klungsoyr</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Tylleskar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Macleod</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bagyenda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Chen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Wakholi. OMEVACOpen Mobile Electronic Vaccine Trials</surname>
          </string-name>
          ,
          <article-title>an interdisciplinary project to improve quality of vaccine trials in low-resource settings</article-title>
          .
          <source>M4D</source>
          <year>2008</year>
          ,
          <string-name>
            <given-names>General</given-names>
            <surname>Tracks</surname>
          </string-name>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>