<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic UBL-like documents for innovation</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>National Research Council</institution>
          ,
          <addr-line>IASI “Antonio Ruberti” Viale Manzoni 30, 00185 Roma</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>SRDC Software Research &amp; Development and Consultancy Ltd.</institution>
          <addr-line>Silikon Blok No:14, Teknokent ODTU, 06800 Ankara</addr-line>
          ,
          <country country="TR">Turkey</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Considering innovation as the result of only spontaneous activities is a simplistic vision, because working out inspiration and reaching up to innovation need awareness and knowledge about the application domain and its problems. In this paper, we address the issue of knowledge representation, access and sharing in an enterprise context, by proposing an ontology-based framework (DocOnto) for the semantic description of documents involved in innovation activities. The framework, which is built within the BIVEE European project, is characterized by a customizable approach inspired by the UBL/CCTS, which allows each enterprise to refine the DocOnto at best for its needs. Then UBL-like structures are semantically lifted and used for describing concrete documents. Such a semantic representation enables reasoning services like querying and retrieving of documents, understanding similarities among documents, assessing their status and quality, monitoring innovation activities. The framework is supported by the technological integration of the iSurf eDoCreator, for modelling UBL-like documents structures, and the Production and Innovation Knowledge Repository (PIKR), the semantic knowledge hub of the BIVEE platform.</p>
      </abstract>
      <kwd-group>
        <kwd>business innovation</kwd>
        <kwd>ontologies</kwd>
        <kwd>semantic description</kwd>
        <kwd>UBL</kwd>
        <kwd>CCTS</kwd>
        <kwd>document management</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        “Genius is one percent inspiration and ninety-nine percent perspiration”. This
quotation from Thomas. A. Edison reveals what is behind innovation successes.
Innovation is often identified as the result of spontaneous attitudes like creativity or
artistic flair, but bringing new ideas into the market in the form of innovative products
or services, also needs the adoption of methods and supporting means. In particular,
in the era of the information society, knowledge management has a primary role also
in relation to innovation activities. With respect to that, [
        <xref ref-type="bibr" rid="ref1 ref4">1,4</xref>
        ] consider innovation as a
practice and process that captures, acquires, manages and diffuses knowledge with the
aim to create new knowledge. Furthermore, knowledge enables creativity by
permitting knowledge associations and linkages that otherwise are difficult to be
discovered.
      </p>
      <p>In this paper, we outline a framework for designing semantics-based structures
(Document Ontology, DocOnto) to enable the semantic enrichment and management
of innovation-related documents, i.e. documental resources produced and consumed
during innovation initiatives (e.g., proposed ideas, feasibility studies, etc.). This
activity is conducted within the BIVEE1 European project, which is about the
development of an ICT infrastructure for supporting innovation activities in virtual
enterprise (VE) environments.</p>
      <p>
        Among related initiatives, we mention Dublin Core2, a vocabulary of fifteen
properties for description of documental resources, and SALT [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], which is for
describing the organization of a document in terms of sections and paragraphs. While
we intend to re-use part of the terms from Dublin Core, we look at documents
differently from SALT, since we are focused on the semantics instead of the
organization of the structure of a document.
      </p>
      <p>
        The proposed framework is based on the one hand, on the methodological
integration of the UBL/CCTS [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] approach, which is for modeling and customizing
documents structures, together with semantic representation methods. On the other
hand, the framework is supported by the technological integration between the iSurf
eDoCreator[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] UBL editor and the Production and Innovation Knowledge Repository
(PIKR), which is the semantic knowledge hub of the BIVEE platform. The objective
is the semantic lifting of innovation-related documents structures and content for
enabling interoperability and openness, as well as reasoning services such as querying
and retrieval of documents, reasoning over documents description, understanding
similarities among documents, assessing status and quality of documents and,
monitoring innovation activities.
      </p>
      <p>The paper is organized as follows. Section 2 presents the overall structure of the
DocOnto and the UBL-inspired approach for building and customizing the DocOnto
itself. Section 3 focuses on the technical aspects concerning the integration between
the PIKR and eDoCreator, as well as on the services provided by the PIKR for
reasoning over the semantic description of annotated documents. Conclusions and
future work end the paper.
2</p>
    </sec>
    <sec id="sec-2">
      <title>The DocOnto framework</title>
      <p>One of the main objectives of the BIVEE project is to support and facilitate
innovation activities in a VE environment. To this end, the Virtual Enterprise
Modeling Framework (VEMF) has been developed. According to the VEMF,
innovation-related activities happen within four waves: Creativity, Feasibility,
Prototyping and Engineering. Flowing through these four waves, many documents are
produced, used, consumed and evaluated. For instance, in the Creativity wave, given a
problem or issue, many ideas can be proposed to address it. Some of them will pass
1 Business Innovation and Virtual Enterprise Environment (No. FoF-ICT-2011.7.3-285746).
2 http://dublincore.org/documents/dces/.
the initial stage and will be further elaborated. Recording such information means
keeping track of reasons that guide decisions, and re-using knowledge to save time
and money in the future. We consider that ontology-based semantic technique can be
effective in addressing representation, sharing, access, and reasoning over documental
resources, especially in VE context where boundaries are larger and a reference
(ontology) is requested.</p>
      <p>
        For the definition of the proposed ontology-based innovation document framework
(DocOnto) we started from the results of an activity performed within the BIVEE
project: two end-user organizations were asked to see their innovation-related
activities through the four innovation waves and indicate the information they actually
produce and use. This brought the identification of sets of documents, one for each
end-user [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. These results have been taken as specifications and, starting from them,
a conceptualization of these documents has been performed for identifying valuable
InfoItems (building blocks, which correspond to small and meaningful elements),
InfoSets (recursive aggregation and association of InfoItems) and associations
between them as described below and reported in Table 1.
 Header groups meta-data InfoItems like the title of the document, the authors etc.
 Content groups InfoItems describing the essence of the document, i.e., its
semantics. The adoption of domain-focused dictionaries, thesauri or ontologies
increments the level of interoperability and enables reasoning mechanisms.
 Related Knowledge Resources section allows to establish relations between
      </p>
      <p>InfoSets such as 'prerequisite', 'feedback', 'partOf', 'relatedTo' etc.</p>
      <p>Title
Description
...</p>
      <p>Research Line
Technology
...</p>
      <p>Part of</p>
      <p>Header
Advanced HMI
System for the robot programming based on the 3d reconstruction of
the inspected components
...</p>
      <p>Content
3D vision, cloud point, artificial intelligence algorithm
HMI
...</p>
      <p>
        Related Knowledge Resources
doc:IP_AdvancedHMI
(ASBIE) and Aggregate Business Information Entity (ABIE). UBL [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] implements
CCTS and publishes XML based Business Document Definitions, Common BIEs and
Data Types such as an Invoice document or an Address BIE
      </p>
      <p>Data requirements change for different virtual enterprises in order to address the
needs of innovation activities. Hence, it is required to customize the DocOnto for each
virtual enterprise once the requirements have been set. UBL provides a
methodological way for the customization of already available documents and BIEs.
Since this methodology has already been implemented by eDoCreator tool, our
solution inherently supports customization of existing innovation related documents
and BIEs. According to the UBL standard, new information entities can be added to
meet the requirements of a specific business context, optional information entities can
be omitted, the meaning of information entities can be refined, new constraints can be
specified, new aggregations or documents can be combined or assembled or new
business rules can be added during a customization. If a new type of innovation
document is required, users can model its structure through customization facilities
offered by eDoCreator, which are conformance with customization guidelines of
UBL. Since we model documents through InfoItems and InfoSets, and follow the UBL
approach, our modelling directly maps to UBL terms when we leave out the
technologies of our framework. This mapping can be depicted as follows: BBIE
InfoItem, ABIE - InfoSet and ASBIE - Associations.
3</p>
    </sec>
    <sec id="sec-3">
      <title>TECHNICAL REALIZATION</title>
      <p>
        In this section we give an overview of the technical aspects related to the integration
of the PIKR [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], and the eDoCreator for supporting the implementation of the
DocOnto. We also outline the semantic services in charge of exploiting the semantic
description of the documents in terms of the DocOnto.
      </p>
      <p>About the integration between the eDoCreator and the PIKR, the former exports
XML Schema3 of modelled documents to a Mediator module. The Mediator performs
the semantic lifting by encoding documents structures into OWL/RDF4, the de-facto
standard for ontology and meta-data sharing. The result of the lifting is then
transmitted to the PIKR, which maintains it in a triple store.</p>
      <p>
        The knowledge representation framework discussed in the previous sections
enables the enactment of a number of reasoning facilities to support the management
of documents in innovation projects, in terms of the following services.
Search. This service provides keyword-based search functionalities. The user request
is expressed as an ontology-based feature vector describing the criteria for the
selection of the resources of interest. By applying semantic similarity techniques (the
SemSim metrics [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]) the degree of matching among the terms used to formulate the
request and the ones used to describe the available resources is computed, and a list of
ranked results, with respect to the Semsim similarity metrics, is returned. For instance,
3 http://www.w3.org/TR/xmlschema-0/
4 http://www.w3.org/TR/owl2-overview/
suppose that the user is interested in finding all the documents that have been
authored in the last two years and concerning the initial stages of the design of a piece
of furniture equipped with an electronic device. The corresponding request should be
formulated as follows:
{content:[Furniture, Electronic_Device]; type= Proposal,
creationWave=Creativity, issueYear&gt;2010}
The engine will retrieve semantically related resources, such as Proposed Idea or
Project Proposal documents about a Contour Chair with an embedded Media Player
(which are assumed to be defined in the domain ontology as kinds of piece of
furniture and electronic device, respectively).
      </p>
      <p>Query. This service enables us to retrieve pieces of knowledge which exhibit some
given properties. Queries are posed in terms of the vocabulary and semantic relations
provided by the PIKR ontologies, and the underlying reasoning engine returns a list of
answers that satisfy all the specified properties. These answers may consist of factual
knowledge (DocOnto instances), conceptual knowledge (ontological terms), or
references to concrete resources. We are currently developing a query language, based
on SELECT-WHERE paradigm along the line of the SPARQL5 standard. For
instance, to identify reusable best practices or technical solutions in a given domain,
we may want to retrieve all the protocols related to documents addressing the research
line 3D_Vision. This can be expressed as follows.</p>
      <p>Q(?p) : protocol(?p)
research_line(?doc,3D_Vision)</p>
      <p>AND
related(?p,?doc)</p>
      <p>AND
Compliance Checking. This service allows us for checking the compliance of the
factual knowledge, captured at a given time in the semantic description of the
documents, with respect to business policies and internal regulations. Compliance
requirements can be represented in the DocOnto as business rules, i.e., statements that
define or constrain the structure of the documents or the dependencies among them on
the basis of the sequencing of business operations. The compliance check verifies the
consistency among the assertions contained in the DocOnto instances and the axioms
defined in the Knowledge Resource Ontologies formalizing the business rules.
Examples of constraints are “Each Innovation Report needs to be composed by a
Project Proposal and a Market Analysis", or "A Monitoring Sheet cannot be produced
unless a Gantt Chart has been finalized before". The former rule can be formalized by
the following axiom:
if innovation_report(x) then  y,z. project_proposal(y)
and market_analysis(z) and partOf(x,y) and partOf(x,z)
5 http://www.w3.org/TR/rdf-sparql-query/
In this paper we outlined an ontology-based framework for semantic description of
innovation-related documents. We have elaborated on CCTS and UBL approaches
and identified a bunch of InfoSets corresponding to categories of information that are
produced, consumed and evaluated during innovation projects. Furthermore, we
identified relationships that can occur among InfoSets, and we started to identify
InfoItems, elementary components of the InfoSets. We intend to re-use available
vocabularies as much as possible to enable Linked Data approach in this document
management methodology.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>W.M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Levinthal</surname>
            ,
            <given-names>D.A.</given-names>
          </string-name>
          (
          <year>1990</year>
          ), “
          <article-title>Absorptive capacity: A new perspective on learning and innovation”</article-title>
          ,
          <source>Administrative Science Quarterly</source>
          , Vol.
          <volume>35</volume>
          ,
          <fpage>128</fpage>
          -
          <lpage>152</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Diamantini</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Potena</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Proietti</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Storti</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taglino</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>A semantic framework for knowledge management in virtual innovation factories</article-title>
          .
          <source>International Journal of Information System Modeling and Design</source>
          . To appear.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Formica</surname>
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Missikoff</surname>
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pourabbas</surname>
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taglino</surname>
            <given-names>F.</given-names>
          </string-name>
          (
          <year>2013</year>
          )
          <article-title>Semantic search for matching user requests with profiled enterprises</article-title>
          .
          <source>Computers in Industry</source>
          ,
          <volume>64</volume>
          :
          <fpage>191</fpage>
          -
          <lpage>202</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Gloet</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Terziovski</surname>
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2004</year>
          ),
          <article-title>“Exploring the Relationship between Knowledge Management Practices and Innovation Performances”</article-title>
          ,
          <source>Journal of Manufacturing Technology Management</source>
          , Vol.
          <volume>15</volume>
          No.
          <issue>5</issue>
          , pp.
          <fpage>402</fpage>
          -
          <lpage>409</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Sinaci</surname>
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Piersantelli</surname>
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cristalli</surname>
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gigante</surname>
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Laleci</surname>
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Basar</surname>
            <given-names>V.</given-names>
          </string-name>
          (
          <year>2012</year>
          ),
          <article-title>"A Document Centric Approach for User Requirements in BIVEE"</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          Vol.
          <volume>864</volume>
          Article 5
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Tuncer</surname>
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dogac</surname>
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Postaci</surname>
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gonul</surname>
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alpay</surname>
            <given-names>E.</given-names>
          </string-name>
          (
          <year>2009</year>
          ),
          <article-title>"iSURFeDoCreator: eBusiness Document Design</article-title>
          and
          <string-name>
            <surname>Customization Environment</surname>
          </string-name>
          "
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>OASIS UBL TC</surname>
          </string-name>
          (
          <year>2006</year>
          ),
          <article-title>"</article-title>
          <source>Universal Business Language v2.0". Retrieved March 3</source>
          , 2013 from http://docs.oasis-open.org/ubl/os-UBL-
          <volume>2</volume>
          .0/UBL-2.0.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Groza</surname>
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Handschuh</surname>
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2009</year>
          ),
          <article-title>"Salt Document Ontology (SDO)</article-title>
          .
          <source>Retrieved March 3</source>
          , 2013 from http://salt.semanticauthoring.org/ontologies/sdo#.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>