<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Extension of the M-Gov Ontology Mapping Framework for Increased Traceability</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Anuj Singh</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Christophe Debruyne</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rob Brennan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alan Meehan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Declan O'Sullivan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ADAPT Centre, School of Computer Science and Statistics, Trinity College Dublin</institution>
          ,
          <country country="IE">Ireland</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper describes an extension to the M-Gov framework that captures queryable metadata about matcher tools that have been utilized, the users involved, and the discussions of the users, during the generation of alignments. This increases the traceability in an alignment creation process and enables an evaluator to more deeply interpret and evaluate an alignment, e.g. for reuse or maintenance. This requires precise information about the alignments being encoded and the decisions undertaken during their creation. This information is not captured by state of the art approaches in a queryable format. The paper also describes an experiment that was undertaken to examine the effectiveness of our approach in enabling the traceability in the alignment creation process. In the experiment, stakeholders created an alignment between two different datasets. The results indicate that the users were 93% accurate while creating the alignment. The major traceability achievements demonstrated for the test groups were 1) level of participation of various users of a group during alignment creation; 2) most discussed correspondences by users of a group; and 3) accuracy of a group in creating alignment.</p>
      </abstract>
      <kwd-group>
        <kwd>Ontology Matching</kwd>
        <kwd>Ontology Alignment</kwd>
        <kwd>Mapping governance</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Ontology mapping is required to overcome the problem of semantic heterogeneity and
facilitate interoperability between ontology-based systems that share the same
concepts but have the different representation of those concepts [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Creation and
maintenance of ontology mapping is a difficult task in several aspects [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], one of the
aspects, which we focus on this paper is traceability in the alignment creation process.
      </p>
      <p>
        Alignments are built for a purpose like data integration or a link data mashup for a
specific group of stakeholders. Creation of an alignment is a non-trivial task, as it
requires these stakeholders to collaborate. In [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], we suggested an approach, which
allows stakeholders to collaborate for creating an alignment by using a Mapping
Governance framework. An initial implementation of the approach is also outlined in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ],
which we now term the M-Gov framework. The framework captures the metadata
during alignment creation, which enables the traceability in an alignment creation
process.
      </p>
      <p>
        Traceability in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] refers to “the ability to follow the life of a requirement in a
forward or backward direction”. Similarly, the traceability in an alignment creation
process will allow one to trace the following for a correspondence: decisions about a
correspondence; rationale for the decisions; and the stakeholders who were involved
in the decision making process. The approach we introduced in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] suggested
capturing metadata information about the matcher used, the contributors and their
discussions during an alignment creation process. Our intuition was that capturing such
information would increase traceability in the alignment creation process, as this will
not only allow one to formulate queries to look for existing alignments but also to
formulate questions such as “which stakeholder participated the most in alignment
creation” or “which correspondence was mostly discussed by stakeholders”.
      </p>
      <p>In this paper, we first describe how we have extended the M-Gov framework by
supporting stakeholders during the Match phase (Section 3). First, the Alignment API
4.8 is used to discover candidate correspondences between two different datasets.
Then stakeholders are allowed to discuss each identified correspondence displayed on
a web page using a grid table. The paper also describes (Section 4) the initial
evaluation that we have undertaken. Specifically, the research question under investigation
during our evaluation was to what extent captured metadata allows tracing of: the
most discussed correspondences by stakeholders, the level of participation of
stakeholders, and the decisions taken by a group of stakeholders for a correspondence?</p>
      <p>In summary, the contribution of this paper is as follows: Firstly by extending the
M-Gov framework to enable tracebility in an alingment creation process. Secondly,
we have provided a detailed description of the alignment creation process. Thirdly we
have provided evidence that metadata captured in the M-Gov framework enables
traceability in an alignment creation process.</p>
      <p>The paper is organized as follows: Section 2 provides an overview of the
background information; Section 3 outlines the match phase of the M-Gov framework;
Section 4 presents an evaluation of the experiment that was undertaken; Section 5
sheds some light on the related work; and conclusions are drawn in section 6.</p>
    </sec>
    <sec id="sec-2">
      <title>2 Background</title>
      <p>This section presents necessary background on collaborative ontology engineering,
community-driven ontology matching and an overview of the M-Gov framework.</p>
      <sec id="sec-2-1">
        <title>2.1 Collaborative ontology engineering</title>
        <p>
          Ontology engineering refers to the study of the activities related to the ontology
development, the ontology life cycle, and tools and technologies for building the
ontologies [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. In the situation of a collaborative ontology engineering, platforms and tools
are designed to help stakeholders to reach a consensus in an asynchronous manner. To
facilitate and practice consensus-building in a collaborative environment, the
community needs to control each activity, and be able to trace the process and results
achieved so far.
        </p>
        <p>
          In collaborative ontology-engineering, publishing the new version of an ontology
is different to a centralized situation, as there is a need to synchronize the editing. To
facilitate the editing, web-based or desktop based applications are used, and versions
of ontologies are traced with the help of distributed versioning software [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
        </p>
        <p>In contrast, our approach does not use distributed versioning software for
traceability during alignment creation. M-Gov itself keeps track of each activity that occurs in
an alignment creation process.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2 Community-driven ontology matching</title>
        <p>
          Community-driven ontology matching (CDOM) extends conventional ontology
matching by involving the community (end users, knowledge engineers, and
developers) in the creation, description, and reuse of mappings [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. The CDOM is described
as a manual task which is based on the following types of information: a) Users: the
information about the contributors in the matching process; b) Communities: the
information about the relationship among the agents; c) Tools: these tools match the
two different ontologies automatically.
        </p>
        <p>
          A prototype has been implemented and analyzed in [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], which supports the
community driven approach. It annotates the community-related information in the basic
ontology alignment format. The service has been available online since November
2004. The results show that the acquisition of shared ontology mappings among the
web communities is feasible. However, the approach does not annotate the other
useful information about the mappings such as “why this mapping seems to be
legitimate”, etc. This information can serve as the rationale behind a particular mapping.
        </p>
        <p>In contrast, M-Gov captures each activity that occurs during alignment creation.
The captured information could serve as the rationale for the creation of a mapping. It
also allows one to facilitate the discovery and reuse of existing alignments with the
help of queries and thus making the alignment creation process more traceable.</p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3 M-Gov Framework</title>
        <p>
          Governance refers to [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] “what decisions must be made to ensure effective
management and use of IT and who makes the decisions.” Data governance is required to
improve the data quality, which in result improves the maintenance of data [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. For
addressing the data quality issues, [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] suggested to use a holistic approach, which
focuses on the people, process, and technology.
        </p>
        <p>
          [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] uses an extension of PROV-O (metadata) to describe the ontology mapping
process, which captures the information of people (stakeholders), process (activities/
discussions), and technology (matcher) as suggested in [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]
        </p>
        <p>
          A project-centric perspective has been adopted by [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] to deal with the ontology
mapping process. The M-Gov framework is based on the project-centric perspective.
In the framework, a single ontology mapping project (process) is divided into six
phases as follows: 1) Stage: This phase constitutes the identification of the
stakeholders, setting up the scope of the project and enumerate the requirements. 2)
Characterize: It identifies and analyzes the ontologies for generating mappings between them.
As in [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], it is referred as “to analyze the addressed ontologies to identify difficulties
that may be involved for generating mappings.” 3) Reuse: It discovers whether any
existing alignment can be used for the new mappings. 4) Match: This phase uses the
information captured in the characterization phase. The selected ontologies and the
configured matchers are used to identify the potential correspondences, which need to
be evaluated for their fitness to form an alignment. 5) Align and Map: Manual
refinement of the candidate correspondences is needed to create an alignment. The rules
written based on the alignment is called as mapping. 6) Application: The
stakeholders identify the application, which will use the formed mappings. If either source or
target ontologies change over time, this will trigger the new interaction in the
community and lead to a new version of mapping.
        </p>
        <p>Adopting a project-centric perspective in ontology mapping process allows one to
capture the metadata of various aspects of the mapping process. Using the extension
of PROV-O as metadata model makes the ontology mapping process more traceable,
as it will not only allow one to formulate queries to reuse existing mappings but also
formulate questions about the activities happened during the mapping process.</p>
        <p>
          This paper is built on [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] by a) using an extension of PROV-O to capture each
activity in alignment creation process; b) using IBIS [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] for structuring the discussions;
c) extending the work done by [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] on M-Gov framework. The “stage” and
“characterize” phase of M-Gov was already implemented by [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ].
        </p>
        <p>This paper extends the initial M-Gov implementation; it implements the “match
phase” of M-Gov and evaluates the correspondences identified in match phase. The
next section presents the methodology adopted for ontology matching and evaluation
of correspondences.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3 Match Phase of M-Gov framework</title>
      <p>This section describes the requirements, design, and implementation of the match
phase newly developed for the M-Gov framework.</p>
      <sec id="sec-3-1">
        <title>3.1 Functional requirements</title>
        <p>
          The main objective of the Match Phase is to identify the potential correspondences
between two datasets automatically and capture the metadata produced during the
alignment creation [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], with the following functional requirements being derived. The
match phase should allow a user to configure the matcher by selecting a source
ontology, a target ontology, and a matching tool. A matching tool needs to be used to
identify the correspondences between the selected ontologies automatically. Identified
correspondences need to be displayed on a web page. Users should be allowed to
discuss every displayed correspondence with other users by presenting their opinion
about its fitness. Based on the discussion, users should be allowed to accept or reject a
correspondence. The configuration of matcher, identified correspondence, and
discussions of the users about the fitness of the correspondences, need to be stored as the
metadata. The metadata should be captured in a queryable format, as that will enable
the traceability in the alignment creation process.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2 Design</title>
        <p>To fulfill the functional requirements, there needed to be a number of aspects
designed. In this section, we present a quick overview of the design. The design was
focused on an initial baseline without sophisticated UI as our focus was on interaction
process and capturing of discussions. Future work will develop the UI. In addition, we
focused on an alignment problem where pre-processing is not necessary, as the
experimental focus was on traceability of the captured discussions. However, it would be
easy to add further steps and linked discussions in the M-Gov framework.</p>
        <p>
          A web based form was built to allow the users to configure the matcher by
selecting a source and target ontology, and a matcher tool. The matcher configuration was
stored in a database. Selected ontologies were matched using Alignment API 4.8. A
REST call was designed for communicating with the Alignment API. The Alignment
API returns the potential correspondences in alignment format (an XML format as
shown in Fig. 2.), which was used to capture the M-Gov metadata about the identified
correspondences. The captured metadata is again stored in the database. Furthermore,
an interface was designed to present the M-Gov metadata about the potential
correspondences for stakeholders to discuss. To provide context for discussions about the
correspondences, the values of object1 and object2 on the interface were linked to
their online Linked Data resources. The interface was also designed to show the
comments of all the stakeholders on a correspondence. Thus, allows the stakeholders
to see other perspectives about the fitness of a correspondence. The discussions of
stakeholders are structured by using the IBIS framework and the metadata model used
in the M-Gov framework is an extension of PROV-O, as suggested by [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. Fig. 1
shows the interaction between the elements of the design during the match phase of
the M-Gov framework.
        </p>
        <p>The capture of discussions was the major challenge faced while designing the
MGov match phase supports, as this will enable the traceability in an alignment creation
process. For this, we capture every statement given by each stakeholder during the
alignment creation. In M-Gov every statement is linked with its creator, and
correspondence’s ID on which the statement has been made. M-Gov also captures the
conclusion and the stakeholder’s ID who concluded that discussion. Table 2 describes
the M-Gov metadata used to track the discussion.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3 Implementation</title>
        <p>Description
Unique identifier attached to each discussion
Type of discussion: a conclusion or just an opinion
Stakeholder who made the statement
Content of statement
Type of statement, e.g.: supporting or objecting
Final statement while concluding the correspondence
Timestamp of the conclusion
Stakeholder who concluded the correspondence</p>
        <p>If the correspondence is accepted to rejected
A form has been built to allow a user to select a source and target ontology, and a
matcher tool. A user can select these parameters from a drop-down menu to configure
the matcher. The M-Gov uses these parameters to create the URL to invoke a REST
call to Alignment API. Fig. 2 describes the response from Alignment API, it shows an
example of a potential equivalence correspondence (line 5) between “HumanActor”
(line 3) and “HumanActorAge” (line 4) with a confidence of 0.93 (line 6).</p>
        <p>The M-Gov displays every potential correspondence on a webpage using grid
tables, which also contains a "state" column, whose default value is “inDiscussion”.
The M-Gov also attaches a “change decision” button to every displayed
correspondence, which is used to start a new discussion thread for that correspondence. If the
discussion thread is already created then this button will lead to the in progress
discussion for that correspondence. Once the users reach a consensus after discussion,
the M-Gov provides a “Conclude discussion” link, which allows a user to change the
state of the correspondence to either “Accepted” or “Rejected”. The M-Gov also
stores the discussions along with the user’s information under the “post” table in the
database.</p>
        <p>Fig. 3 represents the page by which stakeholders can add their arguments to
participate in a discussion about a correspondence. In our example of Fig. 2, this would
involve discussion of whether HumanActor and HumanActorAge are really
equivalent? Fig. 3 shows the overview of the correspondence and arguments about its
fitness. “reply” textbox can be used to add arguments, while a suitable reply type needs
to be selected from the dropdown “Reply Type”, whose values are “Supporting
example, objecting example, supporting justification, objecting justification, supporting
motivation and objecting motivation”.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4 Evaluation</title>
      <p>Motivation. The purpose of this experiment was to trace the discussions among the
stakeholders during the alignment creation process and identify the following: 1) level
of participation of various users of a group during alignment creation. 2) most
discussed correspondences by users of a group. 3) accuracy of a group in creating
alignment.</p>
      <p>In the experiment, we have used three types of correspondences: a) Correct
correspondences - those in which both objects point towards the same resource. b)
Incorrect correspondences - those in which both objects point towards completely different
resources. c) Ambiguous correspondences - those in which both objects point towards
different resources. But to understand the difference, a user needs to go through a
substantial amount of information, as the difference might not be clear from the label
of the entities.</p>
      <p>Hypothesis. In most cases, the discussion thread attached to an ambiguous
correspondence will be longer than correct and incorrect correspondences.</p>
      <p>Experiment method. We formed 4 groups, 3 groups contained 3 users while 1
group contained only 2 users. A separate instance of the framework was provided for
each group. Every user was located at a different workstation and was allocated
discrete credentials to log into the framework. We have only used instance level
correspondences in the experiment, since creating concept level correspondences requires
participants with a deeper understanding (who are harder to recruit). It was thus
decided to first investigate stakeholder collaboration tracing using instance level
correspondences, which could be performed by a wider range of participants. We created a
discrete set of 7 instance level equivalence correspondences for each group, the
complete list is available online1. Semantic mapping researchers validated the created
correspondences. These correspondences have been created manually and injected in
the framework for discussion, which covers three types of correspondences as
follows: a) Correct correspondence: These are created by taking an entity from OSi2
dataset as object1, while the object2 has been selected from DBpedia3, which points
to the exact same resource as referred by object1. For example, “County Roscommon
represented by OSi” and “County Roscommon represented by DBpedia”, b) Incorrect
correspondence: These are created by taking an entity from OSi dataset as object1,
while the object2 has been selected from DBpedia, which points to a completely
different resource than that referred by object1. For example, “County Roscommon
represented by OSi” and “County Clare represented by DBpedia”, c) Ambiguous
correspondence: These are created by taking an entity from OSi dataset as object1,
while the object2 has been selected from DBpedia, which points to the resource that
has a label similar to the resource referred by object1. To figure out the difference
between both objects, a user needs to examine the available information about both
the resources. For example, “County Tipperary represented by OSi” and “Tipperary
town represented by DBpedia”. Participants can discuss the correspondences within
the group only through the framework. For deciding upon a correspondence, if it was
acceptable or not, users needed to come to a consensus.</p>
      <p>
        Metrics. To trace the most discussed correspondences in a group, the word count
in the statements of the users will be used to calculate the length of the discussion.
The word count for a discussion in a group also depends on the active users in a
group. At the end of the experiment, users will be asked to evaluate the use of the
framework by providing answers to usability based questions of PSSUQ [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
      </p>
      <p>Datasets. A subset of entities in the OSi county dataset and in the DBpedia dataset
for counties of the Republic of Ireland has been used to create correspondences.</p>
      <p>User recruitment. The selected users were M.Sc. students of computer science at
Trinity College Dublin. For preparing the users for the experiment, we have given a
presentation, a video tutorial and a detailed version of user instructions to users about
how to use the M-Gov to curate the correspondences. All the documents related to the
experiment preparation are available online4.</p>
      <p>Data analysis. For each group, Fig. 4 describes the type of correspondence and
length of discussion involved in coming to the conclusion. Fig. 5 describes the
individual contribution of the users in each group. Group 1 had 3 users: user 9, 10, and
11. However, user 10 did not participate in the discussion properly. For group 1, the
1 https://github.com/anujsinghdm/Experiment/blob/master/AllCorrepondence.xlsx
2 http://data.geohive.ie/
3 http://wiki.dbpedia.org/
4 https://github.com/anujsinghdm/Experiment/tree/master/UserInstruction
longest discussion thread has been attached to C4, which is a correct correspondence
and it is also clear from Fig. 4 and 5, user 9 and 11 were mostly discussing the
nonambiguous correspondences, hence group 1 does not support the hypothesis.</p>
      <p>Group 2 had 2 users: user 7 and 8. However, the majority of the word count
represents the user 8. For group 2, the longest discussion threads have been attached to C1
and C4, where C1 is the correct correspondence, while the C4 is the ambiguous
correspondence. Users did not discuss much the 2nd ambiguous correspondence, only user
8 gave the statement, why it wants to reject the correspondence. Having the
discussions analyzed, we can say that user 7 did not participate in the discussions properly
and group 2 is also not in support of the hypothesis.</p>
      <p>Group 3 had 3 users: user 4, 5, and 6. For group 3, the longest discussion thread
has been attached to an ambiguous correspondence C4. Fig. 4 describes that group 3
discussed incorrect correspondences more. This might be the reason why the 2nd
ambiguous correspondence does not have a longer discussion, as the users perceived
it a correct correspondence. Group 3 concluded one more correspondence incorrectly,
but we believe that is just an operation error, as the attached discussion indicates that
they analyzed the correspondence correctly. Group 3 supports the hypothesis as the
longest discussion thread is attached to an ambiguous correspondence.</p>
      <p>Group 4 had 3 users: user 1, 2, and 3. However, most of the discussions have been
carried out by user 1. As it is clear from Fig. 4, ambiguous correspondences (C4 and
C7) have the longest discussion thread attached to them. Hence, group 4 supports the
hypothesis.</p>
      <p>
        Fig. 5. Individual contribution of each user in discussing the correspondences
Finally, participants were asked to complete a PSSUQ [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] questionnaire. The
information in Fig. 6 has been produced by taking the means and standard deviation of
the responses of each participant per questions. Then we checked below:
resultant = abs (value (response of a specific user for selected question) - means
(responses of all users for the selected question))
if the resultant is greater than the standard deviation (responses of all users for a
specific question), we marked false for that specific response. Finally, we counted all
the "True" values of a specific user for all the questions.
      </p>
      <p>Conclusions. The results indicate that except group 3, every other group was
accurate (compared to the gold standard) in creating alignment. Group 3 incorrectly
concluded 2 correspondences out of 7. The results also show that group 1 and 2 do not
support the hypothesis, however not every member of this group actively took part in
the discussion. Group 3 supports the hypothesis as for the 1st ambiguous
correspondence, the discussion thread is the longest. We believe that the users of this group
miscomprehended the information of 2nd ambiguous correspondence, hence the
correspondence did not get discussed in detail and concluded incorrectly. Group 4 clearly
supports the hypothesis as they discussed ambiguous correspondences the most.
Gathered data is unable to lead us to any conclusion about the hypothesis, as two
groups are supporting the hypothesis while the other two groups are not in support of
it. However, the results provide an evidence that the captured metadata by M-Gov
enabled the traceability in the alignment creation process. Additionaly, for the
technical contribution we tracked the following: a) level of participation of users, b) most
discussed correspondences and, c) the accuracy of groups in alignment creation. We
can also conclude from Fig. 6, user 9 and 4 are the outliers as most of their responses
do not comply with other users. The data from the PSSUQ suggests that 72% users
were satisfied by using M-Gov but enhancements in terms of UI/UX are required so
that tasks could be performed more efficiently.</p>
    </sec>
    <sec id="sec-5">
      <title>5 Related work</title>
      <p>A variety of approaches has been used to evaluate the methodologies/ frameworks
that support collaborative ontology engineering. We see two evaluation approaches
related to our work. This section focuses on these approaches.</p>
      <p>
        Domain experts are supported by [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] to engineer an ontology in a distributed
environment. In the start of the process, an initial version of an ontology needs to be
created by users then they can use it and locally adapt it for their own purpose. There
is no support to change the ontology shared by all the users, only control board
handles the changes to a shared ontology. The board deploys the feasible changes in the
next version. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] also describes a two stage experiment for a creating an ontology. In
the first stage, users argued for a change without any guidelines, while in second stage
they were given a subset of the arguments that had been found effective in stage one
of the experiment. The paper concluded that the creation of ontology proceeded faster
during the second stage. We could benefit from [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] in our future work by giving
some more restricted guidelines to the users for a discussion.
      </p>
      <p>
        The Ontology development framework proposed in [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] supports various users to
reach consensus through iterative evaluations. [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] describes a consensus based
experiment using [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. 7 users were involved in the experiment, which are of different
competency. The coordinator has created an initial version of an ontology. Iterative
evaluation is done by each user by an “initial ontology evaluation sheet” that helps to
evolve the ontology. They used Nominal Group Technique (NGT) for the evaluation.
In contrast, we support online discussion among users located at different locations.
Our approach also captures the discussions to enable the traceability in the alignment
creation process. [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] uses the degree of participation (dop), which is leveraged by the
facilitator to determine the quality of an ontology. In contrast, we have measured the
dop by word count in the statements of each user during the discussion. We have
noticed in our experiment that the groups in which the users were more active are
supporting the hypothesis formed for the experiment.
      </p>
    </sec>
    <sec id="sec-6">
      <title>6 Conclusion</title>
      <p>The paper presents an extension of M-Gov framework to match the two different
datasets automatically and capture the metadata produced during the alignment
creation. The paper also describes an experiment in which 11 stakeholders discussed the
potential correspondences to create an alignment. The aim was to trace the metadata
produced during the alignment creation.</p>
      <p>The research question presented in this paper is to what extent the captured
metadata allows us to trace the most discussed correspondences by users, the level of
participation of users, and the decisions undertaken by a group of users for a
correspondence to determine if it is acceptable or not, in an alignment creation process?
We also present the evaluation of M-Gov by users in creating alignment.</p>
      <p>An experiment was conducted to create an alignment between the locations in
DBpedia and OSi dataset. Based on the results, we are unable to conclude the
hypothesis, as two groups are supporting it while the other two groups are not in support of
the hypothesis. However, it provides an evidence that the captured metadata during
the alignment creation enables traceability. In addition to this, the technical
contribution of our work involves tracing the following: a) Group 1 and 2 discussed mostly
the non-ambiguous correspondences, as the discussion thread attached to
nonambiguous correspondences are the longest. Group 3 and 4 have the longest thread
attached to the ambiguous correspondences, so group 3 and 4 discussed mostly the
ambiguous correspondences. b) Not every participant in group 1 and 2 was actively
engaged in the discussion. c) 26 correspondences out of 28 were concluded correctly.
We would be able to do more detailed analysis if the participants would have been
more active in each group as we would have got richer experiment data.
Acknowledgements. This research has received funding from the ADAPT Centre for
Digital Content Technology, funded under the SFI Research Centres Programme
(Grant 13/RC/2106) and co-funded by the European Regional Development Fund.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Rocha</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <year>2003</year>
          .
          <article-title>Service-oriented ontology mapping system</article-title>
          .
          <source>In Semantic Integration Workshop</source>
          (SI-
          <year>2003</year>
          ) (p.
          <fpage>144</fpage>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Choi</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Song</surname>
          </string-name>
          , I.Y. and Han, H.,
          <year>2006</year>
          .
          <article-title>A survey on ontology mapping</article-title>
          .
          <source>ACM Sigmod Record</source>
          ,
          <volume>35</volume>
          (
          <issue>3</issue>
          ), pp.
          <fpage>34</fpage>
          -
          <lpage>41</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Gotel</surname>
            ,
            <given-names>O.C.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Finkelstein</surname>
            ,
            <given-names>C.W.</given-names>
          </string-name>
          ,
          <year>1994</year>
          .
          <article-title>An analysis of the requirements traceability problem</article-title>
          .
          <source>Proceedings of IEEE International Conference on Requirements Engineering</source>
          (pp.
          <fpage>94</fpage>
          -
          <lpage>101</lpage>
          ). IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Debruyne</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Walshe</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <article-title>and</article-title>
          <string-name>
            <surname>O'Sullivan</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <year>2015</year>
          .
          <article-title>Towards a project centric metadata model and lifecycle for ontology mapping governance</article-title>
          .
          <source>In Proceedings of the 17th International Conference on Information Integration and Web-based Applications &amp; Services</source>
          (p.
          <fpage>50</fpage>
          ). ACM.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Zhdanova</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Shvaiko</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <year>2006</year>
          .
          <article-title>Community-driven ontology matching</article-title>
          .
          <source>The Semantic Web: Research and Applications</source>
          , pp.
          <fpage>34</fpage>
          -
          <lpage>49</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Simperl</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Luczak-Rösch</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <year>2014</year>
          .
          <article-title>Collaborative ontology engineering: a survey</article-title>
          .
          <source>The Knowledge Engineering Review</source>
          ,
          <volume>29</volume>
          (
          <issue>1</issue>
          ), pp.
          <fpage>101</fpage>
          -
          <lpage>131</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Cheong</surname>
            ,
            <given-names>L.K.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Chang</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <year>2007</year>
          .
          <article-title>The need for data governance: a case study</article-title>
          .
          <source>ACIS 2007 Proceedings</source>
          , p.
          <fpage>100</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Friedman</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <year>2006</year>
          .
          <article-title>Key issues for data management and integration, 2006</article-title>
          . Gartner Research.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Khatri</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Carol</surname>
            <given-names>V.B.</given-names>
          </string-name>
          ,
          <year>2010</year>
          .
          <article-title>Designing data governance</article-title>
          .
          <source>Communications of the ACM</source>
          ,
          <volume>53</volume>
          (
          <issue>1</issue>
          ), pp.
          <fpage>148</fpage>
          -
          <lpage>152</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Thomas</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brennan</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <article-title>and</article-title>
          <string-name>
            <surname>O'Sullivan</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <year>2011</year>
          . MooM-
          <article-title>-a prototype framework for management of ontology mappings</article-title>
          .
          <source>IEEE International Conference on Advanced Information Networking and Applications</source>
          (pp.
          <fpage>548</fpage>
          -
          <lpage>555</lpage>
          ). IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Fruhling</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <year>2005</year>
          .
          <article-title>Assessing the reliability, validity and adaptability of PSSUQ</article-title>
          .
          <source>AMCIS 2005 Proceedings</source>
          , p.
          <fpage>378</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Kunz</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Rittel</surname>
            ,
            <given-names>H.W.</given-names>
          </string-name>
          ,
          <year>1972</year>
          .
          <article-title>Information science: on the structure of its problems</article-title>
          .
          <source>Information Storage and Retrieval</source>
          ,
          <volume>8</volume>
          (
          <issue>2</issue>
          ), pp.
          <fpage>95</fpage>
          -
          <lpage>98</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Pinto</surname>
            ,
            <given-names>H.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Staab</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Tempich</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <year>2004</year>
          ,
          <article-title>August</article-title>
          . DILIGENT:
          <article-title>Towards a fine-grained methodology for DIstributed, Loosely-controlled and evolvInG Engineering of oNTologies</article-title>
          .
          <source>In Proceedings of the 16th European Conference on Artificial Intelligence</source>
          (pp.
          <fpage>393</fpage>
          -
          <lpage>397</lpage>
          ). IOS Press.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Holsapple</surname>
            ,
            <given-names>C.W.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Joshi</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.D.</surname>
          </string-name>
          ,
          <year>2002</year>
          .
          <article-title>A collaborative approach to ontology design</article-title>
          .
          <source>Communications of the ACM</source>
          ,
          <volume>45</volume>
          (
          <issue>2</issue>
          ), pp.
          <fpage>42</fpage>
          -
          <lpage>47</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Karapiperis</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Apostolou</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <year>2006</year>
          .
          <article-title>Consensus building in collaborative ontology engineering processes</article-title>
          .
          <source>Journal of Universal Knowledge Management</source>
          ,
          <volume>1</volume>
          (
          <issue>3</issue>
          ), pp.
          <fpage>199</fpage>
          -
          <lpage>216</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Ehrig</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Staab</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <year>2004</year>
          .
          <article-title>QOM-quick ontology mapping</article-title>
          .
          <source>In Third International Semantic Web Conference</source>
          (pp.
          <fpage>683</fpage>
          -
          <lpage>697</lpage>
          ). Springer.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>