<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards event-based traceability in provenance-aware supply chains</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Monika Solanki</string-name>
          <email>monika.solanki@cs.ox.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Oxford</institution>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The sharing of product information plays a central role in coordinating the actions and decisions undertaken by supply chain trading partners. The Electronic Product Code Information Service (EPCIS) is an EPCglobal standard, that aims to enable the sharing of serial-level product related information via the Electronic Product Code (EPC). Central to the EPCIS data model are events that describe speci c occurrences in the supply chain. This paper presents a uni ed, provenance-aware framework, driven by Semantic Web standards and Linked data principles for representing and sharing EPCIS events on the Web of data. Event-based linked pedigrees are utilised as the basic artifact for exchanging knowledge in supply chains. Traceability is implemented through the automated generation and validation of linked pedigrees. We exemplify our approach using the pharmaceuticals supply chain and show how counterfeit detection is an implicit part of our traceability framework.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        One of the most important challenges in logistics and supply chains is
information integration. Sharing of data and knowledge in a standardised manner among
the various stakeholders, is crucial to enable e cient management. Further,
recent advances in sensor technology has resulted in wide scale deployment of RFID
enabled devices in supply chains. Timely processing of RFID data facilitates
efcient analysis of product movement, shipment delays, inventory shrinkage and
out-of-stock situation in end-to-end supply chain processes [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The scanning of
RFID tags in production and storage facilities generates unprecedented volumes
of events as data streams, when trading partners exchange and handle products
from inception through to the end-of-life phase.
      </p>
      <p>This paper presents a uni ed, provenance-aware framework, driven by
Semantic Web standards and Linked data principles for representing and sharing
of supply chain data, speci cally for the purposes of tracing and tracking.
Traceability data in supply chains is generated when barcode and RFID readers record
traces of products tagged with an EPC (Electronic Product Code), monitoring
their movement across the supply chain as speci c occurrences of \events". In the
proposed framework, description of events is facilitated using EPCIS1(Electronic</p>
      <sec id="sec-1-1">
        <title>1 http://www.gs1.org/gsmp/kc/epcglobal/epcis</title>
        <p>
          Product Code Information Services), a standardised event oriented speci cations
prescribed by GS12 for enabling traceability [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] in supply chains. We exploit two
information models: The EPCIS Event Model (EEM)3 based on the EPCIS
speci cation, that enables the sharing and semantic interpretation of event data and
CBVVocab4 a companion ontology to EEM for annotating the business context
associated with events.
        </p>
        <p>The abstraction we use for encoding and sharing traceability data is a \linked
pedigree". Event-based linked pedigrees are utilised as the basic artifact for
exchanging knowledge in supply chains. We propose, OntoPedigree5 a content
ontology design pattern for generating the linked pedigrees. We represent supply
chain events as streams of RDF encoded linked data, while complex event
patterns are declaratively speci ed through extended SPARQL queries.
Traceability is implemented through the automated generation and validation of linked
pedigrees. We exemplify our approach using the pharmaceuticals supply chain.
Counterfeiting has increasingly become one of the major problems prevalent in
these chains. The WHO estimates that between ve and eight percent of the
worldwide trade in pharmaceuticals is counterfeit [9]. Counterfeit detection is
an implicit part of our traceability framework.</p>
        <p>The paper is structured as follows: Section 2 presents our motivating
scenario from the pharmaceuticals supply chain. Section 3 highlights the contextual
background for the proposed framework. Section 4 provides a brief overview of
various elements of the traceability framework with references to more detailed
literature. Section 5 presents conclusions.
2</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Motivating scenario</title>
      <p>We outline the scenario of a pharmaceutical supply chain, where trading partners
exchange product track and trace data using linked pedigrees. Figure 1 illustrates
the ow of data for four of the key partners in the chain.</p>
      <p>The Manufacturer commissions6, i.e., assigns an EPC (Electronic Product
Code) to the items, cases and pallets. The items are packed in cases, cases are
loaded onto pallets and pallets are shipped. At the Warehouse for the Wholesaler,
the pallets are received and the cases are unloaded. The cases are then shipped
to the various Distribution centers. From the Distribution centers the cases are
sent to retail Dispenser outlets, where they are received and unpacked. Finally,
the items are stacked on shelves for dispensing, thereby reaching their end-of-life
in the product lifecycle.</p>
      <p>EPCIS events are internally recorded for various business steps at each of
the trading partner's premises and used for the generation of linked pedigrees.
When the pallets with the cases are shipped from the manufacturer's premises to</p>
      <sec id="sec-2-1">
        <title>2 http://www.gs1.org/</title>
        <p>3 http://purl.org/eem#
4 http://purl.org/cbv#
5 http://purl.org/pedigree#
6 associates the serial number with the physical product
the warehouse, pedigrees encapsulating the set of EPCIS events encoding
traceability data are published at an IRI based on a prede ned IRI scheme. At the
warehouse, when the shipment is received, internal EPCIS events corresponding
to the receipt of the shipment are recorded. The IRI of the pedigree sent by
the manufacturer is dereferenced to retrieve the pedigree. IRIs of the events
corresponding to the transaction (shipping) and consignment (goods) information
encapsulated in the pedigree are also dereferenced to retrieve the event speci c
information for the corresponding business steps. When the warehouse ships the
cases to the distribution center, it incorporates the IRI of the manufacturer's
pedigree in its own pedigree de nition. As the product moves, pedigrees are
generated with receiving pedigrees being dereferenced and incorporated, till the
product reaches its end-of-life stage. Note that pedigrees sent by a distributor
may include references to the pedigrees sent by more than one warehouse.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Preliminaries</title>
      <sec id="sec-3-1">
        <title>EPCIS</title>
        <p>An Electronic Product Code (EPC)7 is a universal identi er that gives a unique,
serialised identity to a physical object. EPCIS is a rati ed EPCglobal8 standard
that provides a set of speci cations for the syntactic capture and informal
semantic interpretation of EPC based product information. As the EPC tagged
object moves through the supply chain, RFID readers record and transmit the
tagged data as \events". Given the scenario in Section 2, we are concerned with
three types of EPCIS9 events: ObjectEvent represents an event that occurred
as a result of some action on one or more entities denoted by EPCs, i.e.,
commissioning of an object AggregationEvent represents an event that happened to
one or more EPC-denoted entities that are physically aggregated (constrained
to be in the same place at the same time, as when cases are aggregated to a
7 http://www.gs1.org/gsmp/kc/epcglobal/tds/tds_1_6-RatifiedStd-20110922.</p>
        <p>pdf
8 http://www.gs1.org/epcglobal
9 Please refer the speci cation for details.
pallet) TransactionEvent represents an event in which one or more entities
denoted by EPCs become associated or disassociated with one or more identi ed
business transactions, i.e., the shipping of a pallet of goods in accordance to the
ful llment of an order.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>The EEM ontology</title>
        <p>
          EEM is an OWL 2 DL ontology for modelling EPCIS events. EEM
conceptualises various primitives of an EPCIS event that need to be asserted for the
purposes of traceability in supply chains. A companion standard to EPCIS is
the Core Business Vocabulary(CBV) standard. The CBV standard supplements
the EPCIS framework by de ning vocabularies and identi ers that may
populate the EPCIS data model. CBVVocab is an OWL ontology that de nes entities
corresponding to the identi ers in CBV. Development of both the ontologies
was informed by a thorough review of the EPCIS and the CBV speci cations
and extensive discussions with trading partners implementing the speci cation.
The modelling decisions [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] behind the conceptual entities in EEM highlight the
EPCIS abstractions included in the ontology. It is worth noting that in previous
work [12] we have already de ned a mapping between EEM and PROV-O10,
the vocabulary for representing provenance of Web resources. This implies that
when a constraint violation is detected, the events in the history can be
interrogated using PROV-O for recovering provenance information associated with the
events.
        </p>
        <p>The EEM ontology structure and its alignment with various external
ontologies is illustrated in Figure 2. The ontology is composed of modules that de ne
10 http://www.w3.org/ns/prov-o
various perspectives on EPCIS. The Temporal module captures timing
properties associated with an EPCIS event. It is aligned with temporal properties
in DOLCE+DnS Ultralite (DUL)11. Entities de ning the EPC, aggregation of
EPCs and quantity lists for transformation events are part of the Product
module. The GoodRelations12 ontology is exploited here for capturing concepts such
as an Individual Product or a lot (collection) of items, SomeItems of a single
type. Information about the business context associated with an EPCIS event
is encoded using the entities and relationships de ned in the Business module.
RFID readers and sensors are de ned in the Sensor module. The de nitions here
are aligned with the SSN13 ontology. The EPCISException module incorporates
the hierarchy of the most commonly observed exceptions [13] occurring in EPCIS
governing supply chains.</p>
        <p>
          For further details on EEM and its applications in real world scenarios, the
interested reader is referred to [
          <xref ref-type="bibr" rid="ref6 ref7">6, 7, 11, 12</xref>
          ].
3.3
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>Linked pedigrees</title>
        <p>
          A Pedigree is an (electronic) audit trail that records the chain of custody and
ownership of a drug as it moves through the supply chain. Each stakeholder
involved in the manufacture or distribution of the drug adds visibility based
data about the product at their end, to the pedigree. Recently the concept of
\Event-based Pedigree"14 has been proposed that utilises the EPCIS speci
cation for capturing events in the supply chain and generating pedigrees based on
a relevant subset of the captured events. In previous work [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] we introduced the
concept of linked pedigrees in the form of a content ontology design pattern,
\OntoPedigree". We proposed a decentralised architecture and presented a
communication protocol for the exchange of linked pedigrees among supply chain
partners. In [11], we extended OntoPedigree to include provenance metadata as
illustrated in Figure 3 and proposed an algorithm for the automated
generation of linked pedigrees. For the purpose of completeness, we brie y recall the
axiomatisation of a linked pedigree in Figure 4.
        </p>
        <p>The de nition highlights the mandatory and optional restrictions on the
relationships and attributes for every pedigree that is exchanged between
stakeholders. Based on these, we de ne the requirements on the constraints to be
validated for the pedigrees.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>The traceability framework</title>
      <p>In this section we provide a brief overview of the traceability framework.
Interested readers are referred to speci c past work that cover the various aspects in
further detail.
11 http://ontologydesignpatterns.org/ont/dul/DUL.owl
12 http://purl.org/goodrelations/v1
13 http://purl.oclc.org/NET/ssnx/ssn
14 http://www.gs1.org/docs/healthcare/Healthcare_Traceability_Pedigree_
Background.pdf</p>
      <sec id="sec-4-1">
        <title>Automated generation of linked pedigrees</title>
        <p>We have proposed a pedigree generation algorithm based on complex processing
of continuous and real time streams of RFID data in supply chains. Our streams
comprise of events annotated using RDF/OWL vocabularies. Event streams are
generated, as products tagged with RFID identi ers are scanned and handled by
diverse trading partners, in various phases of an end-to-end supply chain.
Annotating streams using standardised vocabularies ensures interoperability between
supply chain systems and expands the scope to exploit ontology based reasoning
over continuously evolving knowledge.</p>
        <p>
          In the proposed approach, we represent supply chain events as streams of
RDF encoded linked data, while complex event patterns are declaratively
specied through extended SPARQL queries. In contrast to existing approaches [
          <xref ref-type="bibr" rid="ref3 ref5">3,5,8</xref>
          ]
where an element in a stream is a triple, our streams comprise of events where
each event is represented as a named graph [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. A linked pedigree is considered
as a composition of named graphs, represented as an RDF dataset15.
        </p>
        <p>A detailed explanation of the methodology can be found in [11].
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Validation of traceability artefacts</title>
        <p>Supply chain data is inherently very sensitive to adhoc integration with third
party datasets. For a speci c stakeholder, e ectiveness of the business work ows
and decision support systems utilised within its supply chain operations, that
ultimately govern the timely ful llment of its contractual obligations, is directly
15 http://www.w3.org/TR/rdf11-datasets/
Prefix ped: &lt;http://purl.org/pedigree#&gt;
Prefix prov: &lt;http://www.w3.org/ns/prov-o&gt;
Class: ped:Pedigree</p>
        <p>SubClassOf:
(hasPedigreeStatus exactly 1 ped:PedigreeStatus)
and (hasSerialNumber exactly 1 rdfs:Literal)
and (pedigreeCreationTime exactly 1 xsd:DateTime)
and (prov:wasAttributedTo exactly 1 ped:PedigreeCreator)
and (ped:hasConsignmentInfo someValuesFrom eem:SetOfEPCISEvents)
and (ped:hasTransactionInfo exactly 1 eem:SetOfEPCISEvents)
and (ped:hasProductInfo min 1),
(prov:wasGeneratedBy only ped:PedigreeCreationService),
(ped:hasReceivedPedigree only eem:Pedigree),
prov:Entity
dependent on the quality and authenticity of the data received from other
partners. Before traceability datasets received from external sources and partners
can be incorporated and integrated with the supply chain datasets generated
internally within an organisation, to be further shared downstream, they need to
be validated against information recorded for the physical goods received as well
as against bespoke rules, de ned to ensure the quality, uniformity, consistency
and completeness of datasets exchanged within the supply chain.</p>
        <p>We have proposed a methodology for validating the traceability data sent
from one stakeholder to another in the supply chain. Our approach is motivated
by four main requirements: (1) The validation should be supply chain domain
agnostic, i.e, the constraints must be reusable independently of the goods being
tracked. (2) The representation and sharing of traceability data must conform to
standards most commonly deployed in supply chains (3) The architecture must
be scalable to handle large volumes of streaming traceability data and (4) The
constraints must be formalised using widely used Semantic Web standards that
are t-for-purpose. While constraints can be represented using expressive
formalisms such as temporal logics, adopting a uni ed mechanism for representing
domain knowledge and constraints eliminates impedance mismatch between the
representations, avoids the need for an intermediate mapping language, makes
the addition of new constraints easier and simpli es implementation
requirements.</p>
        <p>In the proposed approach, we show how linked pedigrees received from
external partners can be validated against constraints de ned using SPARQL queries
and SPIN16 rules. To the best of our knowledge, validating constraints on (real
16 http://www.w3.org/Submission/spin-overview/
time) supply chain knowledge has so far not been explored both within the
Semantic Web and supply chain communities.</p>
        <p>A detailed illustration of the proposed approach can be found in [10, 13]
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusions</title>
      <p>Data visibility in supply chains has received considerable attention in recent
years. In the healthcare sector, visibility of datasets that encapsulate track and
trace information is especially important in addressing the problems of drug
counterfeiting. In this paper we have shown how Semantic Web standards,
ontologies and linked data can be utilised to represent and process real time
streams of supply chain knowledge, thereby signi cantly contributing to the
vision. Provenance, which is a critical aspect of supply chain knowledge is an
integral part of our framework. We have shown how we exploit this knowledge
for the validation of constraints that are de ned to ensure the quality,
uniformity, consistency and completeness of datasets exchanged between supply chain
partners. We have performed an exhaustive evaluation of the framework which
have been reported in various works outlined in the paper. Our results provide
very useful insights in improving the overall e ciency of the supply chain. It is
worth noting that while we have chosen the healthcare sector as a case study, our
approach is domain independent and can be widely applied to most scenarios of
traceability.
8. Rinne, Mikko et al. Processing Heterogeneous RDF Events with Standing SPARQL
Update Rules. In OTM Conferences (2), Lecture Notes in Computer Science.</p>
      <p>Springer, 2012.
9. E. W. Schuster and R. Koh. Track and Trace in the Pharmaceutical Supply Chain.</p>
      <p>Auto-ID Labs, Massachusetts Institute of Technology Cambridge, MA.
10. M. Solanki and C. Brewster. A Knowledge Driven Approach towards the Validation
of Externally Acquired Traceability Datasets in Supply Chain Business Processes.
In Proceedings of the 19th International Conference on Knowledge Engineering
and Knowledge Management(EKAW), volume LNAI 8876, pages 503{518. Springer
International Publishing Switzerland, 2014.
11. M. Solanki and C. Brewster. EPCIS event based traceability in pharmaceutical
supply chains via automated generation of linked pedigrees. In Peter Mika et al.,
editor, Proceedings of the 13th International Semantic Web Conference (ISWC).</p>
      <p>Springer-Verlag, 2014.
12. M. Solanki and C. Brewster. Modelling and Linking transformations in EPCIS
governing supply chain business processes. In Hepp, Martin; Ho ner, Yigal (Eds.),
editor, Proceedings of the 15th International Conference on Electronic Commerce
and Web Technologies (EC-Web 2014). Springer LNBIP, 2014.
13. M. Solanki and C. Brewster. Monitoring EPCIS Exceptions in linked traceability
streams across supply chain business processes. In Proceedings of the 10th
International Conference on Semantic Systems(SEMANTiCS). ACM-ICPS, 2014.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>1. F. M. Alexander Ilic</surname>
          </string-name>
          , Thomas Andersen.
          <article-title>EPCIS-based Supply Chain Visualization Tool</article-title>
          .
          <string-name>
            <surname>Auto-ID Labs White Paper</surname>
          </string-name>
          WP-BIZAPP-
          <volume>045</volume>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Carroll</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hayes</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Stickler</surname>
          </string-name>
          .
          <article-title>Named graphs, provenance and trust</article-title>
          .
          <source>In Proceedings of the 14th International Conference on World Wide Web, WWW '05. ACM</source>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Davide</given-names>
            <surname>Francesco</surname>
          </string-name>
          et al.
          <article-title>C-SPARQL: a Continuous Query Language for RDF Data Streams</article-title>
          .
          <source>Int. J. Semantic Computing</source>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. K. Framling,
          <string-name>
            <given-names>S.</given-names>
            <surname>Parmar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Hinkka</surname>
          </string-name>
          , J. Tatila, and
          <string-name>
            <given-names>D.</given-names>
            <surname>Rodgers</surname>
          </string-name>
          .
          <article-title>Assessment of EPCIS Standard for Interoperable Tracking in the Supply Chain</article-title>
          . In T. Borangiu, A. Thomas, and D. Trentesaux, editors,
          <source>Service Orientation in Holonic and Multi Agent Manufacturing and Robotics</source>
          , volume
          <volume>472</volume>
          <source>of Studies in Computational Intelligence</source>
          , pages
          <fpage>119</fpage>
          {
          <fpage>134</fpage>
          . Springer Berlin Heidelberg,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Le-Phuoc</surname>
          </string-name>
          , Danh et al.
          <article-title>A native and adaptive approach for uni ed processing of linked streams and linked data</article-title>
          .
          <source>In Proceedings of the 10th International Conference on The Semantic Web, ISWC'11</source>
          . Springer-Verlag,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Monika</given-names>
            <surname>Solanki</surname>
          </string-name>
          and
          <string-name>
            <given-names>Christopher</given-names>
            <surname>Brewster</surname>
          </string-name>
          .
          <article-title>Consuming Linked data in Supply Chains: Enabling data visibility via Linked Pedigrees</article-title>
          . In Fourth International Workshop on Consuming Linked
          <string-name>
            <surname>Data (COLD2013) at</surname>
            <given-names>ISWC</given-names>
          </string-name>
          , volume Vol-
          <volume>1034</volume>
          . CEUR-WS.
          <source>org proceedings</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Monika</given-names>
            <surname>Solanki</surname>
          </string-name>
          and
          <string-name>
            <given-names>Christopher</given-names>
            <surname>Brewster</surname>
          </string-name>
          .
          <article-title>Representing Supply Chain Events on the Web of Data</article-title>
          . In Workshop on Detection, Representation, and
          <article-title>Exploitation of Events in the Semantic Web (DeRiVE) at ISWC</article-title>
          .
          <article-title>CEUR-WS</article-title>
          .
          <source>org proceedings</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>