=Paper= {{Paper |id=Vol-1312/ldop2014_paper5 |storemode=property |title=Community-Driven Approach to Large-Scaled Ontology Development based on OAM Framework: a Case Study on Life Cycle Assessment |pdfUrl=https://ceur-ws.org/Vol-1312/ldop2014_paper5.pdf |volume=Vol-1312 |dblpUrl=https://dblp.org/rec/conf/jist/TakhomBISS14 }} ==Community-Driven Approach to Large-Scaled Ontology Development based on OAM Framework: a Case Study on Life Cycle Assessment== https://ceur-ws.org/Vol-1312/ldop2014_paper5.pdf
    Community-Driven Approach to Large-Scale Ontology
    Development based on OAM Framework: a Case Study
                 on Life Cycle Assessment

              Akkharawoot Takhom1, Prachya Boonkwan2, Mitsuru Ikeda3,
                 Boontawee Suntisrivaraporn1, and Thepchai Supnithi2

                1 School of Information, Computer and Communication Technology

    Sirindhorn International Institute of Technology (SIIT), Thammasat University, Thailand
                     akkharawoot.t@gmail.com, sun@siit.tu.ac.th
                          2 Language and Semantic Technology Laboratory

     National Electronics and Computer Technology Center (NECTEC), Pathumthani, Thailand
                {prachya.boonkwan, thepchai.supnithi}@nectec.or.th
                                 3 School of Knowledge Science

            Japan Advanced Institute of Science and Technology (JAIST), Nomi, Japan
                                      ikeda@jaist.ac.jp




        Abstract. This paper presents a community-driven development framework for
        large-scale ontology that offers intra- and inter-community communication, vot-
        ing and endorsement system, and version control. Our system design addresses
        three lacks in the traditional ontology development: (1) constructive communi-
        cation among the relevant stakeholders, (2) consensual endorsement and voting
        system for concept and structural augmentation, and (3) bookkeeping and ver-
        sion control. To cope with these shortcomings, we introduce three more tiers:
        user management with technical profiles, knowledge augmentation via a web-
        board, and community collaboration via the vote and endorsement system, on
        top of the existing Ontology-based Application Management Framework
        (OAM) [1]. When applying the design to the task of Life Cycle Assessment, we
        discover additional needs to partition a large ontology to modularize the users’
        responsibilities and to provide thread-based conversation for keeping track of
        the topics.
        Keywords: ontology development framework, community-driven approach,
        large-scale ontology, knowledge management, version control, Life Cycle As-
        sessment




1       Introduction

Life Cycle Assessment (LCA) is a kind of measures of environmental impact which is
used in environmental impact assessment (EIA). Offering opportunities to environ-
mental decontamination, it quantifies energy and materials that are consumed in the
process and released to the environment, and assesses the environmental impacts of
human activities and industries. The comprehensive knowledge of LCA is large-scale,
voluminous, and industry-specific. It has to be acquired from the published standard
documentations (ISO 14040 [2] , 14044 [3], and 14048 [4]), and numerous relevant
case studies, resulting in a very steep learning curve for the novice users. Develop-
ment and management of such knowledge are laborious and require a close collabora-
tion among experts. Therefore, knowledge transfer in the relevant stakeholder com-
munity becomes non-trivial in this task.
   For ease of knowledge transfer, parts of LCA knowledge have been elicited in
terms of ontologies such as CASCADE [5], LCAO [6], Semantic Oil [7], and O-LCA
[8, 9], and transcribed into several representations, such as Semantic Web [10] encod-
ed as OWL [11], Predicate Logic, and Description Logic [12]. The notion of ‘ontolo-
gy’ promotes common understanding among the experts, makes the knowledge scala-
ble and reusable, and improves the interoperability between the experts and the users
and across relevant applications. Despite precise elicitation, the development of a
large-scale ontology is still an immense challenge because it requires effective
knowledge management when the entire community of experts are involved.
   This issue can be alleviated by the notion of community-driven development ap-
proach. In this approach, it is assumed that every stakeholder is willing to participate
in the development and maintenance and contribute to the community with his exper-
tise. We focus on multidisciplinary knowledge integration and cross-checking among
domain experts and relevant stakeholders. A large-scale ontology is constructed and
maintained on a common platform on which the community of experts, knowledge
engineers, and users controls the development and maintenance processes, where each
decision making is done based on the community’s consensus. The platform provides
a means to communicate, transfer tacit knowledge, and discuss the changes of the
worked ontology with respect to immediate needs. Community-driven development
has been used in several ways, such as ontology matching [13, 14] and knowledge
curation [15]. There have been attempts to large-scale ontology development [16–20].
   The community-driven approach is suitable for the development of LCA ontology
because of the following reasons. First, the domain experts specialize in their particu-
lar subfields of LCA. Since these fields sometimes share common knowledge, cross-
checking becomes necessary in a large-scale development project. Second, the devel-
opment of the LCA ontology is operated by a group of experts in parallel. In practice,
they usually branch (or fork) the current version of the ontology to work on their own.
This causes problems in updating when the finished ontologies are to be merged back
to the main ontology; thus, the need of version control. Third and last, some parts of
the ontology have to be cross-checked by specialists from other relevant fields; for
example, some parts of the ontology regarding earth and water can also be validated
by geophysicists, chemists, and environmentalists.
   We propose the use of the Ontology-based Application Management Framework
(OAM) [1] as a community-driven development platform of the LCA Ontology. The
current framework offers the following facilities:
      1. an ontology management tool that allows the community to get involved in
          the evolution of the ontology,
     2. a sandbox toolkit with application templates for a semantic search engine and
          a recommender system, where no programming skills are required, and
     3. APIs and web service deployment for practical application development.
   However, it lacks collaborative features that enables constructive communication
among the stakeholders. The contribution of this paper is: we introduce to OAM the
notion of modified webboard, where any concept and structural augmentations are
proposed by experts and they have to be endorsed by the community. This allows the
system and the knowledge to grow along with the users’ expertise.
   In this paper, we will show that the OAM Framework does not fully support com-
munity-driven development of a large-scale ontology where constructive communica-
tion is needed (section 2). Then we will show that our modified webboard enables
such communication oriented by concept and structural augmentations and consensual
endorsement (section 3). We will also explain how to apply this technique to the field
of LCA (section 4).


2      Problem Statement

OAM Framework [1] is a software platform that aims to simplify the development
and maintenance of a semantic web and an ontology as well as to automate the im-
plementation of a semantic search and a web service. The architecture of OAM is
illustrated in Fig. 1.
    Ontology development via OAM entails three fundamental steps and three user
roles: domain experts, a knowledge engineer, and application developers. First, a
domain expert designs his own ontology according to the task of interest and export it
in the OWL format. On OAM, the knowledge engineer maps the ontology designed
by the experts to the database schema and the vocabulary and imports it into the sys-
tem. At this step, the knowledge engineer also maintains the current ontology accord-
ing to the experts’ requests via personal communication.




                   Fig. 1 System architecture of OAM Framework [1]


   Second, the domain experts design recommendation rules for the current ontology
in terms of Prolog-like first-order logic. The knowledge engineer then transcribes
these rules into JENA Language via the Recommendation Rule Management Module.
Finally, the knowledge users implement their own knowledge-enhanced applications
with the Application Configuration Module. At this stage they can deploy a semantic
search engine and a web service using the ontology developed by the domain experts.
   The key struggle in this paradigm is the ontology development entirely relies on
personal communication. This method is prone to the loss of communication; i.e. it is
very hard to keep track of conversations and consensus as time goes by. The history
of development evolution, or version control, plays a crucial role in community-based
development, especially for a large-scale ontology, in which a group of domain ex-
perts, knowledge engineers, and knowledge users are involved. These lacks necessi-
tate the use of a tractable communication means where conversations and consensus
are structurally organized for ease of bookkeeping, knowledge transcription, version-
ing, and deployment.
   We are proposing an extension of the OAM Framework that incorporates the no-
tion of thread-based webboard, version control, and status notification to solve the
aforementioned problems. These features allow the community with the three user
roles to co-create a large-scale ontology and maintain it by means of community en-
dorsement. By doing so, the system and the knowledge grow along with the users’
expertise.


3      System Design

In this paper, we design a community-driven ontology-based application management
framework (CD-OAM). We extend the canonical OAM Framework (shown as Data
Tier and Application Tier, all in dashed borders) with three more tiers: Collaboration
Tier, Knowledge Tier, and User Tier (all in solid borders). A system overview of CD-
OAM is illustrated in Fig. 2.




                                Fig. 2 System Overview
   First, the User Tier facilitates the admin to manage domain expert members, in-
cluding adding, deleting, and assigning to some of the communities. The users are
required to create a profile annotated with their expertises so as to classify them with
respect to their interests, technical backgrounds, and objectives. Our user policy is
merit-based [14]: the more a user participates in votings or change proposals, the
more merits he gets.
   Next, The Collaboration Tier, the heart of this framework, encourages the commu-
nity-driven development of a large-scale ontology. It contains three modules: com-
munity management, vote and endorsement system, and notification system. The
Community Management module allows the community’s administrator to manage
their members and their user roles. The Vote and Endorsement System prepares a
platform for voting and endorsement by the community when changes in the ontology
take place. Finally, the Notification System disseminates the voting results and
changes to the community.
   Finally, the Knowledge Tier is composed of four modules that facilitates both in-
tra- and inter-community communication in large-scale ontology development and
keep track of the evolution. We assume that a large-scale ontology can be separated
into parts and distributed to all involved communities. The Intra-community Ontology
Management Module [15] allows the community to manipulate the part of the ontolo-
gy the community is responsible for. The Inter-community Knowledge Augmentation
Module facilitates the integration of each part of the ontology. Should modification is
necessary, this can be voted and endorsed by all relevant communities. All proposed
changes must be endorsed by the community. The Webboard Module organizes a vote
for intra- and inter-community changes in the ontology and approves the consensus.
For example, an expert may propose to augment some concepts or the hierarchy in the
Webboard. If changes are endorsed and committed, the history is kept in the Version
Control Module.


4      Case Study: LCA

Ontology development for LCA is a very challenging task. Standardized by the ISO
into ISO 14040, 14044, and 14048, LCA is a family of best-practice procedures for
information sharing and guidelines for environmental impact evaluation. It minimizes
operations in the organization which affect the environment, to comply with laws,
regulations, and other environmentally oriented requirements and continual improve-
ment. Figure 3 shows the ISO guideline for LCA that consists of four main phases:
(1) setting the goal and scopes (2) listing up life cycle inventory from a given supply
chain (3) assessing the life cycle impacts and (4) data interpretation. The ontology for
LCA is usually very large because there are numerous concepts and a complicated
hierarchy and relations between them in the product’s life cycle.
                         Fig. 3 ISO standards guideline of LCA
   Due to its sheer scale, there are a number of stakeholders working side by side on
the ontology. We classify them into five categories with respect to their objectives:
        1. Domain experts: experts and specialists on LCA
        2. Domain beginners: beginners who just started in the field and are learning
              the knowledge
        3. Knowledge users: direct users of the knowledge via a recommender sys-
              tem
        4. System developers: software developers who make use of the knowledge
        5. Knowledge engineers
   Complying with the community-based approach, these stakeholders have different
roles in developing the LCA ontology and they all need a means to communicate to
each other. Any requests of change in the ontology have to be endorsed by the entire
community. Moreover, version control becomes necessary as the ontology grows by
the community. Our system design fit all these requirements for the community-based
LCA Ontology Development.
   Our system design has to be specialized as per the roles and interests of the users.
We envisage the following modifications in the following tiers when applied to the
field of LCA.
      User Tier: Each user should specify his objectives, role, interest, and tech-
          nical backgrounds regarding LCA. These users are then assigned to a differ-
          ent part of the ontology according to their request or technical backgrounds.
      Collaboration Tier: To modularize the user responsibilities, the community
          are able to partition (and repartition) the ontology via voting and endorse-
          ment.
      Knowledge Tier: There are at least two sources of knowledge in LCA: ISO
          standards and field data collection. Knowledge integration becomes a non-
          trivial issue because domain experts may have different perspectives on the
          received data. The webboard module has to offer a place for open discussion
          before leading to voting and endorsement. To keep track of the decision, a
          thread-based conversation is best suitable for this scenario.
5      Conclusion and Future Work

We have presented a design for a community-driven development framework for
large-scale ontology, where all relevant stakeholders can participate in the evolution
of the ontology. Three additional layers: user management, knowledge augmentation,
and community collaboration, respectively, are put on top of the OAM Framework to
facilitate communications among the stakeholders, voting and endorsement, and ver-
sion control. We also find that, in practice, ontology partitioning and thread-based
conversation are also needed for teamwork as found in Life Cycle Assessment.
   Our future work remains as follows. First, we will implement the system closely
following this design and conforming to User Experience (UX). Second, we will in-
corporate the notions of conflict detection and concept similarity into the knowledge
augmentation module. This will help knowledge engineers and the community prede-
termine holistic conflicts in the ontology before committing any changes. Third and
finally, we will develop a data modeling module [21] which automatically maps input
databases with a different data scheme to the ontology.


Acknowledgment

   This research is financially supported by Japan Advanced Institute of Science and
Technology (JAIST), Sirindhorn International Institute of Techcnology (SIIT), and
Thammasat University (TU). Materials and information provided by National Metal
and Materials Technology Center (MTEC, Life Cycle Assessment Laboratory, Thai-
land). The authors are grateful to Marut Buranarach for fruitful technical discussions.


References

1.      Buranarach, M., Thein, Y., Supnithi, T.: A Community-Driven Approach to
        Development of an Ontology-Based Application Management Framework. In:
        Takeda, H., Qu, Y., Mizoguchi, R., and Kitamura, Y. (eds.) Semantic
        Technology. pp. 306–312. Springer Berlin Heidelberg (2013).

2.      ISO 14040:2006 - Environmental management -- Life cycle assessment --
        Principles and framework. ISO, Geneva, Switzerland (2006).

3.      ISO 14044:2006 - Environmental management -- Life cycle assessment --
        Requirements and Guidelines, (2006).

4.      ISO/TS 14048:2002 - Environmental management -- Life cycle assessment --
        Data documentation format, (2002).

5.      Cappellaro, F., Masoni, P., Moreno, A., Scalbi, S.: CASCADE. In: Werner
        Pillmann, K.T. (ed.) The 16th Internationale Conference: informatics for
        environment protection. pp. 490–493. IGU/ISEP (2002).
6.    Brascher, M., Monteiro, F., Silva, A.: Life cycle assessment ontology. the 8th
      Conference of the International Society for Knowledge Organization. pp.
      169–177. Congreso ISKO-España, España, Spain (2007).

7.    Bertin, B., Scuturici, V., Risler, E., Pinon, J.: A semantic approach to life
      cycle assessment applied on energy environmental impact data management.
      Proceedings of the 2012 Joint EDBT/ICDT Workshops. pp. 87–94 (2012).

8.    Takhom, A., Suntisrivaraporn, B., Supnithi, T., Theeramunkong, T., Manabu,
      O.: Ontology-enhanced life cycle assessment: toward formalizing the standard
      guidelines. The International Conference on Information and Communication
      Technology for Embedded Systems (ICICTES 2013). , Samutsongkhram,
      Thailand, (2013).

9.    Takhom, A., Suntisrivaraporn, B., Supnithi, T.: Ontology-enhanced life cycle
      assessment: a case study of application in oil refinery. The Second Asian
      Conference on Information Systems, (ACIS 2013) (2013).

10.   Horrocks, I.: Ontologies and the semantic web. Commun. ACM. 51, 58–67
      (2008).

11.   Schreiber, G., Dean, M.: OWL Web Ontology Language Reference. (2004).

12.   Baader, F., Horrocks, I., Sattler, U.: Description Logics. Stud. Health
      Technol. Inform. 101, 137–41 (2004).

13.   Zhdanova, A. V: Towards Community-Driven Ontology Matching. Computer
      (Long. Beach. Calif). 1–2 (2005).

14.   Zhdanova, A. V, Shvaiko, P.: Community-driven Ontology Matching.
      Proceedings of the 3rd European Conference on The Semantic Web: Research
      and Applications. pp. 34–49. Springer-Verlag, Berlin, Heidelberg (2006).

15.   Groza, T., Tudorache, T., Dumontier, M.: Commentary: State of the Art and
      Open Challenges in Community-driven Knowledge Curation. J. Biomed.
      Informatics. 46, 1–4 (2013).

16.   Gendarmi, D., Lanubile, F.: Community-Driven Ontology Evolution Based
      on Folksonomies. Move to Meaningful Internet Syst. 2006 OTM 2006 Work.
      181–188 (2006).

17.   Hepp, M., Bachlechner, D., Siorpaes, K.: OntoWiki: Community-driven
      Ontology Engineering and Ontology Usage Based on Wikis. Proceedings of
      the 2006 International Symposium on Wikis. pp. 143–144. ACM, New York,
      NY, USA (2006).
18.   Siorpaes, K.: Lightweight community-driven ontology evolution. Lecture
      Notes in Computer Science (including subseries Lecture Notes in Artificial
      Intelligence and Lecture Notes in Bioinformatics). pp. 951–955 (2007).

19.   Aseeri, A., Wongthongtham, P.: Community-driven ontology evolution based
      on lightweight social networking in oil and gas domain. IEEE International
      Conference on Digital Ecosystems and Technologies. pp. 197–202 (2011).

20.   Maleewong, K., Anutariya, C., Wuwongse, V.: A Semantic Argumentation
      Approach to Collaborative Ontology Engineering. Proceedings of the 11th
      International Conference on Information Integration and Web-based
      Applications and Services. pp. 56–63. ACM, New York, NY, USA (2009).

21.   Bertin, B., Scuturici, V.-M., Pinon, J.-M., Emmanuel Risler: A semantic
      approach to life cycle assessment applied on energy environmental impact
      data management. Proceedings of the 2012 Joint EDBT/ICDT Workshops.
      pp. 87–94. ACM, New York, NY, USA (2012).