<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>An OWL Ontology for the Common Statistical Production Architecture</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Antoine Dreyer</string-name>
          <email>antoine.dreyer@insee.fr</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Franck Cotton</string-name>
          <email>franck.cotton@insee.fr</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Guillaume Du es</string-name>
          <email>es@insee.fr</email>
        </contrib>
      </contrib-group>
      <abstract>
        <p>The Common Statistical Production Architecture (CSPA) is a reference architecture for the statistical industry. In particular, CSPA aims at documenting statistical services in a standard way, in order to ease their exchange and reuse between statistical institutes. This paper describes a suggested formalization of some parts of the CSPA speci cation using OWL. This work led to propose some adjustments that could improve the consistency and clarity of the speci cation.</p>
      </abstract>
      <kwd-group>
        <kwd>CSPA</kwd>
        <kwd>OWL</kwd>
        <kwd>ontology</kwd>
        <kwd>statistical service</kwd>
        <kwd>o cial statistics</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>1.1</p>
    </sec>
    <sec id="sec-2">
      <title>Introduction</title>
      <sec id="sec-2-1">
        <title>The Common Statistical Production Architecture (CSPA)</title>
        <p>
          Since 2010, the United Nations Economic Commission for Europe (UNECE)
leads an global e ort to develop common standards and models and to foster
cooperation in the o cial statistics community. The agship of this e ort is
the Common Statistical Production Architecture (CSPA) [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], a reference
architecture for the statistical industry aiming at promoting collaboration between
national and international organizations.
        </p>
        <p>CSPA comprises in particular: conceptual data and process models1, a
logical information model (LIM), standard templates for documenting statistical
services, standard roles for conceiving, developing and implementing those
services, architectural principles at the application and technology levels, etc.</p>
        <p>
          This di erent components do not exhibit the same level of formalization. The
information models are expressed as UML, and work has already been conducted
to represent the generic process model using OWL [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]; on the other hand, the
templates for documenting the statistical services are just text documents, and
it is obvious from the existing examples of CSPA services that they were and
still are interpreted in various ways. Existing service documentation show also
that there are di erent views on the expected granularity of CSPA services.
        </p>
        <p>To address these problems, a more formal representation of some parts of the
CSPA speci cation is needed. This is what we are aiming at in this paper.
?? Institut National de la Statistique et des Etudes Economiques, http://insee.fr</p>
        <sec id="sec-2-1-1">
          <title>1 Resp. GSIM (http://www1.unece.org/stat/platform/display/gsim/Generic+</title>
          <p>Statistical+Information+Model) and GSBPM (http://www1.unece.org/stat/
platform/display/metis/The+Generic+Statistical+Business+Process+Model).</p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>What do we want to do?</title>
        <p>
          We want to formalize the CSPA speci cation using OWL [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. OWL is based on
RDF and \is designed to represent rich and complex knowledge about things,
groups of things, and relations between things". Several reasons led us to select
OWL rather than other modeling formalisms:
{ As a formal approach of knowledge, OWL brings structure and consistency to
prior knowledge. This helps avoiding misunderstandings, brings uniformity
and facilitates the exchange of information, which is at the core of CSPA.
{ RDF is machine-actionable, so it can be used directly to capture, exchange
or disseminate common information and to build information systems.
{ The main models de ned in CSPA (GSBPM and GSIM) are now expressed
in OWL: using the same formalism is clearly a factor of interoperability.
OWL has well-known shortcomings when it comes to data validation2, but
validation is not an important objective at this stage. We are starting from a material
which is not structured at all, so the rst goals are to ease sharing of semantics,
and to progress towards a uniform representation of CSPA.
        </p>
        <p>We should note that applying formalism on existing material can raise
difculties. Sometimes, adaptations to the CSPA concepts or vocabularies would
ease the process. Even if our main goal is not change to CSPA speci cation
but to foster its use within the statistical community, we had to suggest some
changes and leave open some questions. This will be reported back to the CSPA
management team.
1.3</p>
      </sec>
      <sec id="sec-2-3">
        <title>What is CSPA for?</title>
        <p>
          The goal of CSPA is not unique. On the one hand it is a representation of
processes, on the other hand it is a way to share IT components between statistical
institutes. This duality obliges us to be cautious: if the ontology is too
conceptual, few people will use it; if it is not conceptual enough, it will not allow e cient
exchange of information. In order to evaluate the right level of representation,
we rst studied the CSPA descriptions already provided by several institutions
[
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. We observed that the original CSPA speci cation was not always tailored
to every need, and consequently chose to take a user-driven perspective in order
to provide a way to specify as close as possible to the existing examples. This
led us to add some aspects to the original CSPA speci cation, in particular the
distinction between functions and packages, which is explained below.
2
2.1
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Structure of CSPA</title>
      <sec id="sec-3-1">
        <title>Introduction</title>
        <p>We decribe in this section the di erent notions de ned by the CSPA speci cation,
in order to prepare the development of the ontology, which will be described in
the next section.</p>
        <sec id="sec-3-1-1">
          <title>2 See for example https://www.w3.org/2012/12/rdf-val/</title>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Three levels of description</title>
        <p>
          CSPA distinguishes between three levels of service description, and associates
prede ned organisation roles to them:
Service de nition (SD) is the conceptual level which is human-oriented and
has less details. It is the user's view, independent of physical implementation.
It is made by a Service Designer, using conceptual models like the GSIM [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]
as information model, and like GSBPM [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] for processes.
        </p>
        <p>Service speci cation (SS) is the logical level. It ensures that the service
description meets the requirements of CSPA. It is made by the Service
Designer, using logical models like the CSPA LIM or the information models
associated with DDI3 or SDMX4.</p>
      </sec>
      <sec id="sec-3-3">
        <title>Service implementation description (SID) is the physical level which is</title>
        <p>computer oriented and has most details. It is made by a Service Builder
or Service Assembler, using (among others) DDI or SDMX instances.
2.3</p>
      </sec>
      <sec id="sec-3-4">
        <title>Distinction between functions and packages</title>
        <p>We can see from existing examples that current implementers of CSPA want
to specify two slightly di erent things, namely packages and functions. One
service may actually be a bundle of functions, potentially accessible via di erent
protocols. Thus we need to describe not only the bundle of functions as one entity
but also each and every function in the bundle. Depending on the context, the
bundle can also be seen as an independant software or only a package. To help
our analysis, we studied di erent package description systems, in particular:
JavaDoc5, Debian Control File6 (also used by R) and Python7. As a result, we
introduced a distinction between functions and packages, even if the current
CSPA speci cation does not make this distinction clearly. We split the 3 levels
of CSPA into 2 parts: one for functions and one for packages, as is explained
further below.
2.4</p>
      </sec>
      <sec id="sec-3-5">
        <title>Mixing functions and packages with the levels of description</title>
        <p>We rst look at the original CSPA structure with 3 levels (cf Fig. 1), where one
service de nition points to one service speci cation which points to one or more
service implementation descriptions.</p>
        <p>Then we introduce the distinction between function and package. In Fig. 2,
the 3 levels of description are encapsulated inside package or function containers;
one package container points to several function containers.</p>
        <sec id="sec-3-5-1">
          <title>3 Cf. http://ddialliance.org</title>
        </sec>
        <sec id="sec-3-5-2">
          <title>4 Cf. http://sdmx.org</title>
        </sec>
        <sec id="sec-3-5-3">
          <title>5 Cf. http://www.oracle.com/technetwork/articles/java/index-137868.html</title>
        </sec>
        <sec id="sec-3-5-4">
          <title>6 Cf. https://www.debian.org/doc/debian-policy/ch-controlfields.html 7 Cf. https://pypi.python.org/pypi?%3Aaction=list_classifiers</title>
          <p>Conceptual level</p>
          <p>Service De nition
Logical level</p>
          <p>Service Speci cation</p>
          <p>SD
SS
Implementation level</p>
          <p>Service Implementation
Description</p>
          <p>SID</p>
          <p>SID</p>
          <p>SID</p>
          <p>This is a very theoretic representation, because in a typical situation, not
every level exist for each function or package. Those missing levels (service
speci cation for the CSPA package, for instance) lead to a di culty: the initial
schema imposes sequential links (service de nition linked to service speci
cation linked to service implementation description). That is why we must adopt
a slightly di erent structure (which is exempli ed in Fig. 3), where each level
inside a container is linked to the container itself and not to another level.</p>
          <p>Package</p>
          <p>SD</p>
          <p>SS
SID</p>
          <p>SID
Function</p>
          <p>Function</p>
          <p>Function
SD
SS</p>
          <p>SD</p>
          <p>SS
SID</p>
          <p>SID</p>
          <p>SID</p>
          <p>SID</p>
          <p>SID</p>
          <p>SS</p>
        </sec>
      </sec>
      <sec id="sec-3-6">
        <title>Properties and topics</title>
        <p>The CSPA speci cation de nes di erent properties for the services, many unique
to one level of description. To ease understanding, we group those properties into
8 di erent topics:
Business function (only at conceptual level, mostly for packages) to describe
what the service is for.</p>
        <p>Documentation to document the service (how to use it for instance).
Provenance to describe where the service and its description come from.
Interface (mainly for packages) to describe how to invoke the service, for
instance using Bash or a REST API
Dependencies (mainly for packages) to detail what is needed to invoke the
service, for instance a speci c operating system or database; this can be
linked to an Interface at implementation level.</p>
        <p>Input (only for functions) to describe the inputs of a function.
Output (only for functions) to describe the outputs of a function.
Identi cation (Name and Version), to identify a service unambiguously.</p>
        <p>
          We list below the initial properties contained in the CSPA speci cation,
organized according to the topics that we introduced previously. At the beginning
of the line is the name of the properties topic described earlier; the names of the
initial properties follow. The full initial templates of CSPA with de nitions are
available on the UNECE website [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
        </p>
      </sec>
      <sec id="sec-3-7">
        <title>Service de nition initial properties</title>
      </sec>
      <sec id="sec-3-8">
        <title>Identi cation Name</title>
        <p>Business function Business Function, GSBPM, Outcomes, Restrictions
Dependency Service dependencies
Input GSIM Input
Output GSIM Output</p>
      </sec>
      <sec id="sec-3-9">
        <title>Service speci cation initial properties</title>
      </sec>
      <sec id="sec-3-10">
        <title>Identi cation Name</title>
        <p>Interface Protocol for Invoking the Service
Input Input Message
Output Output Message
Documentation Applicable Methodologies</p>
      </sec>
      <sec id="sec-3-11">
        <title>Service implementation description</title>
      </sec>
      <sec id="sec-3-12">
        <title>Identi cation Name, Version</title>
        <p>Interface Invocation protocols, Service Interface
Input Data-by-Reference protocols
Documentation Additional information, Installation documentation
Provenance Builder Organization
Dependency Technical dependencies
Most of those topics are often present in each of the 3 levels (de nition, speci
cation, implementation).</p>
      </sec>
      <sec id="sec-3-13">
        <title>Roles</title>
        <p>
          As already mentioned, CSPA de nes [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] organizational roles that are associated
with the life-cycle of statistical services. Those eight roles are:
{ Assembler,
{ Builder,
{ Con gurer,
{ Designer,
{ Environment Provider,
{ Investor,
{ Service Provider,
{ User.
3
3.1
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Ontology description</title>
      <sec id="sec-4-1">
        <title>Introduction</title>
        <p>At this stage, we have distinguished in CSPA three main semantic axes related
to the statistical services: level of description, service granularity and property
topics. Consequently, we will organize the global structure of the ontology with
3 levels of description (service de nition, service speci cation and service
implementation description), 2 granularity categories (functions and packages), and 8
topics (Business function, Documentation, Provenance, Interface, Dependency,
Input, Ouput, Identi cation).</p>
        <p>We have also identi ed additional concepts present in the speci cation, like
organizational roles.</p>
        <p>In this section, we decribe how these di erent notions are formalized in the
CSPA ontology that we propose. Due to space constraints, we limit ourselves to
a narrative description: the actual code of the ontology, as well as an illustrative
example on one of the existing CSPA services, can be found on GitHub8.
3.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Classes and properties for levels and categories</title>
        <p>In order to describe granularity categories, we create two classes: Package and
Function, and they are de ned as subclasses of a common PackageOrFunction
class. Similarly, ServiceDe nition, ServiceSpeci cation and
ServiceImplementationDescription classes are used to represent the levels of service documentation,
and they are made subclasses of a common DescriptionLevel class. Since some
properties are only found for one level and one category, we need to create 6
(2x3) classes mixing categories and levels. We also need 6 corresponding
properties to link each category to its 3 description levels. Finally, we need a property
to link a function (or a package) to a package: a package can contain functions
or other packages.</p>
        <sec id="sec-4-2-1">
          <title>8 Cf. https://github.com/FranckCo/Stamina/blob/master/doc/cspa</title>
        </sec>
      </sec>
      <sec id="sec-4-3">
        <title>Properties and topics implementation with OWL</title>
        <p>In an OWL ontology, properties de ne how information is stored (DataType
properties) or how objects (instances of classes) are linked together (Object
properties). Consequently, all CSPA properties are implemented using OWL
DataType properties or Object properties.</p>
        <p>How we can represent topics is not as straightforward. All topics are groups
of properties bringing knowledge on the same theme, but some are more than
that: they also represent objects. For instance, the Input topic represent what a
function receives to operate, but a function also has input parameters which are
objects on their own. It is thus logical to merge the two notions: a function will
be linked to one or more objects, instances of an Input class.</p>
        <p>On the whole, four topics (Interface, Dependencies, Input, Output) can be
seen as objects: an Interface is one of potentially several ways to invoke the
service, a Dependency is one thing out of many needed to use the service, an
Input is an object consumed by a function, an Output is an object produced by
a function). OWL classes are needed for representing each of those 4 topics.</p>
        <p>For the other topics (Business function, Documentation, Provenance,
Identication), there is no real need of OWL classes; properties could be linked directly
to the function or the package class. But since there are many properties, we
think that it is clearer to create classes also for these topics and link
properties of a topic to their topic class, that acts as an intermediary for organizing
the information. This structure intends to inform the user where to look at for
a speci c information, and thus to facilitate the understanding and use of the
ontology. We think the added clarity is worth the small extra complexity in the
structure.</p>
        <p>We make an exception for Identi cation: since identi cation should be fast
and straightforward, properties in the identi cation topic should be accessible
directly, and it is better to attach them directly than through a topic class.
3.4</p>
      </sec>
      <sec id="sec-4-4">
        <title>Classes for topics and levels</title>
        <p>According to the conclusions of the previous section, we created in the ontology
a PropertyTopic class to contain property topics, and we created 7 subclasses
of the PropertyTopic class, for each of the topics except Identi cation. The
identi cation properties are described below.</p>
        <p>We must take into account the interaction between property topics and levels.
The BusinessFunction topic can be applied only to one level: ServiceDe nition.
Provenance and Documentation can be applied to all levels, and no distinction
is made between levels in regard of the content of those topics.</p>
        <p>Dependency, Interface, Input and Output topics, on the other hand, can
be applied to all levels, but properties relative to each level are di erent: the
content of these topics is not the same in the di erent levels, so we have to
create 12 classes combining topics and levels (4 topics and 3 levels). Each class
is a subclass of the topic it refers to, and its name is the concatenation of the level
name (abbreviated as De nition, Speci cation and Implementation respectively)
and of the topic name. For example, the De nitionInput class is a subclass of
the Input class.</p>
        <p>As of today, the subclasses for each level for Dependency and Interface are
exactly the same, but the CSPA speci cation shows that it may not stay true
in the future. Table 1 illustrates the connections between topics and levels of
description in CSPA.</p>
        <p>With these di erent de nitions, we can now link one level, or all the levels, or
the levels concerning only functions, etc., to properties topic. Also, to facilitate
the future use of the ontology, we add a description property to every property
topic, which allows to freely add descriptive text to the topic.
DescriptionLevel</p>
        <p>ServiceDe nition
FunctionDe nition</p>
        <p>ServiceSpeci cation</p>
        <p>FunctionSpeci cation
ServiceImplementationDescription</p>
        <p>FunctionImplementation</p>
        <p>In table 1, the lines are the OWL classes corresponding to level of description
and glanularity categories. The blue cells only indicate the highest-level class to
which a given topic is attached. For example, Provenance and Documentation
are attached to DescriptionLevel, which is a common ancestor of all the others,
and so these topics are de ned for all level and all categories. FunctionDe nition
inherits the topics from DescriptionLevel and ServiceDe nition, and adds Input
and Output. PackageDe nition is not represented because it has only inherited
topics.
3.5</p>
      </sec>
      <sec id="sec-4-5">
        <title>Topic identi cation and generalization</title>
        <p>For the 4 topics referring naturally to objects that we listed before (Interface,
Dependency, Input and Output), one instance can be described in more than
one level. An obvious need, and thus ontology use case, is to be able to capture
the relation between the corresponding objects at di erent levels. For instance,
it should be easy to link with each other the di erent levels of description of
an Input (e.g. a GSIM object at the de nition level and a LIM object at the
speci cation level).</p>
        <p>Di erent possibilities were considered to implement this feature, and we
settled on the use of a common identi er. In order to give exibility at
implementation time, we modeled the identi er as an OWL class, with subclasses for each
topic (InputIdenti er, etc.). Only a label property is de ned for the identi ers,
but other properties can be added in the future for more speci c identi cation.</p>
        <p>Regarding the ranges of the topic classes, table 1 shows that we could have
speci ed them very precisely. Nevertheless, we felt that it was clearer to provide
topics as close as possible for each level, so that we decided to extend the range of
some topics. Only business function topic is not extended to every level, since we
considered that the business function was only conceptual and did not have an
implementation. The goal of extending topics is not to alter CSPA speci cation,
but to make it clearer (every topic present at each level), and easier to use.
3.6</p>
      </sec>
      <sec id="sec-4-6">
        <title>Identi cation properties</title>
        <p>As we mentioned above, the Identi cation topic has no associated topic class: in
order to make theme as accessible as possible, the identi cation properties, label
and version, are directly linked to the objects they quali y. They are de ned as
datatype properties with range xsd:string and can be applied on
DescriptionLevel and PackageOrFunction (and thus all their subclasses). A label can also
be applied on Identi er, Dependencies, Interface, Input, Output classes. As a
consequence, for Dependencies, Interface, Input, Output classes, a label can be
given directly or via an Identi er.
3.7</p>
      </sec>
      <sec id="sec-4-7">
        <title>Additional components and features of the ontology</title>
        <p>We decribe in this section more classes and properties dedicated to modeling
the CSPA roles, as well as the speci c properties corresponding to the topics
introduced in 2.5.</p>
        <p>Roles and provenance We introduced the CSPA roles in sub-section 2.6. To
represent these roles and their action in the life-cycle of a statistical service, we
use the PROV9, ORG10 and FOAF11 ontologies. The whole ontologies could be
used to document provenance information, but only a few properties and classes
correspond to the CSPA initial speci cation.</p>
        <p>In the rest of the paper, the usual pre xes prov:, org: and foaf: will be used
for the corresponding ontologies. The cspa: pre x will be used for the CSPA
ontology.</p>
        <p>In order to incorporate PROV, we de ne a cspa:Provenance class (the name
will be precised in a future version) as a subclass of the prov:Entity class.
The property prov:wasGeneratedBy allows to document an organization as a</p>
        <sec id="sec-4-7-1">
          <title>9 Cf. https://www.w3.org/TR/prov-o/ 10 Cf. https://www.w3.org/TR/vocab-org/ 11 Cf. http://xmlns.com/foaf/spec/</title>
          <p>CSPA role. We also create a cspa:Organization sub-class of prov:Organization
and org:Organization (this class being itself a sub-class of foaf:Agent).</p>
          <p>We also create a cspa:organizationName property, which is an alias of foaf:name.</p>
          <p>Finally, we introduce 8 properties linking cspa:Provenance to each role, all
sub-properties of prov:wasGeneratedBy.</p>
          <p>
            For instance, one may describe a CSPA service documentation made by a
National statistical institute (NSI) in the following way (expressed in the Turtle
language[
            <xref ref-type="bibr" rid="ref9">9</xref>
            ]):
:something a cspa:DescriptionLevel;
cspa:comesFrom [ a cspa:Provenance;
cspa:builderOrganization [ a cspa:Organization;
cspa:organizationName "NSI" ] ] .
          </p>
          <p>
            Documentation The ontology de nes 5 properties for this topic:
{ cspa:installationGuide, which links to a foaf:Document
{ foaf:homepage, which links to a foaf:Document
{ cspa:methods, with string values
{ cspa:description, with string values
{ cspa:additionalInformation, with string values
BusinessFunction We de ne 3 speci c properties for this topic. cspa:outcomes
and cspa:restrictions are datatype properties with string values. The cspa:gsbpmSubProcess
property links to the SubProcess class in GSBPM ontology [
            <xref ref-type="bibr" rid="ref2">2</xref>
            ].
          </p>
          <p>
            It should be noted that the business function topic in CSPA is not exactly
the same as in GSIM [
            <xref ref-type="bibr" rid="ref10">10</xref>
            ], but we underline the proximity of the two concepts by
making the cspa:BusinessFunction class a subclass of a GSIM business function
class that will be de ned in a future work on a GSIM ontology.
          </p>
          <p>Dependency and Interface Since the CSPA speci cation is not very precise
on the information that should be stored in these topics, and because we do
not want to have too many properties, we chose in a rst approach to have a
very simple structure, where a dependency or an interface is described with only
two properties: a label and a description, both string datatype properties. In
consequence, there is no property speci c to Dependency and Interface, except
for identi ers (see above).</p>
          <p>We also introduce a direct link between Dependency and Interface. At de
nition and speci cation levels, dependencies should be broad enough not to depend
on a speci c interface. At implementation level however, a direct link between
dependency and interface is needed. For instance, if the service has several
interfaces including a web interface, then a dependency concerning compatible
browser only makes sense for the web interface.</p>
          <p>As a consequence, we extend the cspa:implementationDependsOn property
that was previously created to represent the a service implementation and a
dependency, so that its domain also covers the cspa:ImplementationInterface
class.
Input and Output It would be possible to use for the Input and Output topics
the same label/description construct that we used for Interface and Dependency,
but being able to connect services together is a key point of CSPA, so that is
not a recommended method.</p>
          <p>We saw that service de nition level, Input and Output instances should be
GSIM objects. Thus, we introduce a generic gsim:gsimObject class and create
two object properties, cspa:gsimInput and cspa:gsimOutput with this range,
de ned on the cspa:De nitionInput and cspa:De nitionOutput topic classes.</p>
          <p>A similar construct is used for the speci cation level with CSPA LIM objects.</p>
          <p>On a service implementation level, the way speci cation should be done is
open: for example, is possible to use GSIM Process Input or a GSIM Process
Output objects. At this level, it is possible to link an input or an output
directly to an Interface; some parameters may indeed only have sense for a speci c
interface.</p>
          <p>Cardinality restrictions More experience with CSPA is needed in order to
dene cardinality restrictions in the ontology, and we chose not to use owl:Restriction
to allow for any future implementation of them. However using a property more
than once would never produce non-sensical speci cation.
4</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion ad future work</title>
      <p>We presented in this paper the construction of an OWL ontology aiming to
model the core of the Common Statistical Production Architecture speci cation,
namely the di erent levels of service documentation and the organizational roles
involved in the conception, building and implementation of statistical services.
This work consisted in a detailed semantic analysis of CSPA, allowing to identify
the main concepts in order to translate them as OWL classes and properties. We
had to make some compromises and some additions to CSPA in order to build
a complete and consistent ontology, but we are con dent that this will enhance
the coherence between CSPA and its main pillars: GSBPM and GSIM.</p>
      <p>The work presented here is just a rst step; it will be prolonged in di erent
directions.</p>
      <p>First, since more CSPA services are presently being developed by di erent
organizations, we will shortly be able to test our model on more examples.
Currently, there are only a handful of service descriptions, so a serious evaluation
of the model is not possible, although we found it performed well on existing
cases (see example referenced above). With more service descriptions, we will be
able to evaluate our work more completely, which is likely to help us re ne the
ontology.</p>
      <p>Second, the results of our work will be presented to the CSPA sponsors,
speci cally the CSPA Implementation Group which is the high-level committee
in charge of maintaining CSPA. We believe that this input will be taken into
consideration for a future version of CSPA.</p>
      <p>
        Third, the current work is part of a broader interest in semantic business
process management (SBPM). The article by Hepp and others [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] points to the
leverage provided by a semantic approach of business process management. The
article by Bevilacqua and others [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] show an example of service composition
using service description made with OWL. Some additional work is needed in
order to incorporate in the CSPA ontology the outcomes of these works.
      </p>
      <p>Finally, we intend to operationalize the ontology by constructing an IT
system based on it. A hackathon session sponsored by the UNECE is already
scheduled with colleagues from di erent countries, and one objective is to develop a
graphical CSPA service editor that will produce representations conformant to
the ontology presented here.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Common</given-names>
            <surname>Statistical Production Architecture</surname>
          </string-name>
          , http://www1.unece.org/stat/ platform/display/CSPA/Common+Statistical+Production+Architecture
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Cotton</surname>
          </string-name>
          , Franck and Gillman, Daniel,
          <article-title>Modeling the Statistical Process with Linked Metadata</article-title>
          .
          <source>In: Proceedings of the 3rd International Workshop on Semantic Statistics</source>
          , http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>1551</volume>
          /article-06.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>W3C</given-names>
            <surname>Web Ontology Language</surname>
          </string-name>
          , https://www.w3.org/OWL/
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Implementing</surname>
            <given-names>CSPA</given-names>
          </string-name>
          Projects, http://www1.unece.org/stat/platform/display/ CSPA/Implementing+CSPA+Projects
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>5. Generic Statistical Information Model, http://www1.unece.org/stat/platform/ display/metis/Generic+Statistical+Information+Model</mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>The</given-names>
            <surname>Generic Statistical Business Process Model</surname>
          </string-name>
          , http://www1.unece.org/stat/ platform/display/metis/The+Generic+Statistical+Business+Process+Model
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>7. CSPA, Annex 1: Templates, http://www1.unece.org/stat/platform/display/ CSPA/Annex+1_+Templates</mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <article-title>Roles involved in the production of IT enabled</article-title>
          CSPA Services, http: //www1.unece.org/stat/platform/display/CSPA/Roles+involved+in+the+ production+of+IT+enabled+CSPA+Services
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Turtle (Terse RDF Triple Language</surname>
          </string-name>
          ), https://www.w3.org/TR/turtle/
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Clickable</surname>
            <given-names>GSIM</given-names>
          </string-name>
          : Business Function, http://www1.unece.org/stat/platform/ display/GSIMclick/Business+Function
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Martin Hepp Frank Leymann</surname>
          </string-name>
          , John Domingue, Alexander Wahler, and Dieter Fensell,
          <article-title>Semantic Business Process Management: A Vision Towards Using Semantic Web Services for Business Process Management, www</article-title>
          .heppnetz.de/ files/mhepp-et-al-SemanticBusinessProcessManagement.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>L.</given-names>
            <surname>Bevilacqua</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Furno</surname>
          </string-name>
          and
          <string-name>
            <given-names>V. S. di</given-names>
            <surname>Carlo</surname>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Zimeo</surname>
          </string-name>
          ,
          <article-title>A tool for automatic generation of WS-BPEL compositions from OWL-S described services</article-title>
          .
          <source>In: 2011 5th International Conference on Software, Knowledge Information, Industrial Management and Applications</source>
          (SKIMA), http://angelofurno.net/documents/ WS-BPEL_Composition_
          <fpage>SKIMA11</fpage>
          -
          <lpage>v1</lpage>
          .4.pdf.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>