<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Joint Conference (March</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Volunteered multidimensional design to the test: The Farmland Biodiversity VGI4Bio project's experiment</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sandro Bimonte</string-name>
          <email>sandro.bimonte@irstea.fr</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lucile Sautot</string-name>
          <email>lucile.sautot@agroparistech.fr</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefano Rizzi</string-name>
          <email>stefano.rizzi@unibo.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Benoit Fontaine</string-name>
          <email>benoit.fontaine@mnhn.fr</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>DISI, Univ. of Bologna</institution>
          ,
          <addr-line>Italy, Bologna</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Irstea, TSCF</institution>
          ,
          <addr-line>Clermont Ferrand</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>MNHN, CESCO</institution>
          ,
          <addr-line>Paris</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>UMR TETIS, AgroParisTech</institution>
          ,
          <addr-line>CNRS, CIRAD, IRSTEA, Montpellier</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <volume>26</volume>
      <issue>2019</issue>
      <abstract>
        <p>Moving volunteers of VGI (Volunteer Geographic Information) from passive data producers to active data analysts in the context of Data Warehouses (DWs) and OLAP systems is an open issue. Indeed, volunteers have particular features that make existing DW design methodologies inadequate. In this paper, using a real case study concerning the farmland biodiversity, we test the methodology proposed in [5], which enables volunteers to design DW schemes. The experiments aim at answering two research questions: (i) How can volunteered design be streamlined with respect to the methodology described in [5]?; (ii) To what extent does the involvement of a large number of volunteers actually improve the cubes implemented? Our experiments confirm the adequacy of the methodology proposed in [5], but they also reveal some important limitations. Among them, we identify possible conflicts among volunteers in the first steps of the design process. To address this issue we propose a solution based on social software engineering tools, and in particular Wiki systems.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        Volunteer Geographic Information (VGI) has been defined as “the
mobilization of tools to create, assemble, and disseminate
geographic data provided by volunteers” [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. VGI has been
successfully applied in several contexts such as urban, architectural,
hazards, environmental, and trafic jam domains. In these
contexts, crowd-sourced data is produced by amateurs and/or
professional volunteers in order to collaboratively create useful datasets,
which are then handled by community experts that provide
analytic services to volunteers. However, it has already been proved
that, when either volunteers are not fully involved in the analytic
process or the services ofered to them do not fit their needs, it
becomes dificult to mobilize the volunteers to collect data [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ].
Therefore, to increase the possible application domains of
crowdsourced data, there is a need for extending the role of volunteers
from data producers to active data consumers, i.e., give them the
possibility to impact the design of the analytical process.
      </p>
      <p>
        Data warehouses (DWs) and OLAP are first-class citizens
within Business Intelligence technologies, and enable an
efective and friendly exploration and analysis of huge datasets [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ].
Warehoused data are stored in databases according to the
multidimensional paradigm, based on the concept of cube. A cube
is focused on a subject of analysis, called fact, quantitatively
described by a set of measures. Measures are analyzed according to
dimensions, which are composed of hierarchical levels. Values of
dimensions and levels are called members. Aggregation operators
are applied to compute measure values when facts are analyzed
at coarser levels. Finally, derived measures are calculated based on
other measures and/or dimension members, while an indicator
associates a measure with a specific aggregation operator.
      </p>
      <p>
        DW and OLAP are promising tools for the analysis of VGI data,
as shown in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. However, as mentioned above, to have volunteers
closely involved in the decisional process, it is not suficient to
give them an OLAP front-end to access cubes designed by others:
we should let them design their own cubes. This goal is not trivial,
since volunteers are usually non-skilled from the ICT and OLAP
points of view, and is the subject of our work.
      </p>
      <p>
        The design of multidimensional cubes has been fully
investigated in several papers [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]; the methodologies are
classiifed into data-driven (i.e., the cube schema is derived from the
data sources), requirement-driven (i.e., the cube schema is
derived from the users’ requirements), and mixed (i.e., data- and
requirement-driven approaches are combined). Query-driven
approaches are a subclass of requirement-driven approaches in
which the cube schema is derived from the analytical queries
and reports the users ask for. In all those classical approaches,
decision-makers are a few OLAP-skilled users, and they are
highly committed to the project. To best of our knowledge, only
in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] multidimensional design is investigated in the VGI
context. In that article, the authors present a new methodological
paradigm, which we will call volunteered design, that allows each
group of volunteers to define its cube schemes, then a
centralized approach is adopted where a DW expert solves the conflicts
associated to diferent definitions of the same cube by several
groups. The authors of [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] propose to involve volunteers with
diferent social and professional skills and from diferent
organizations in cube design, so that the analysis requirements obtained
better represent the whole volunteers community. However, the
methodology described in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] presents some limitations: (i)
conlficts among the volunteers can emerge also within a single group;
(ii) the prototyping phase may require multiple iterations, which
makes it very dificult for volunteers and DW experts to handle
them over time; and (iii) DW experts can deal with several
volunteers while prototyping diferent cubes at the same time, which
makes communicating with them quite dificult and expensive.
      </p>
      <p>
        In this paper we test volunteered design on a real case study
concerning the analysis of agricultural biodiversity in the context
of the French ANR project VGI4Bio1. In particular, we aim at
answering two main research questions:
1www.VGI4bio.fr
(1) How can volunteered design be streamlined with respect to
the methodology described in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]?
(2) To what extent does the involvement of a large number of
volunteers in requirements analysis actually improve the
cubes implemented from the point of view of the analyses
they support?
      </p>
      <p>The paper is organized in the following way: the
methodological framework of volunteered design is described in Section 2;
the proof of concept for the volunteered design methodology is
presented in Section 3 using the farmland biodiversity case study,
together with the lessons learned from the case study; Section
4 presents the collaborative extension of the the methodology;
Section 5 describes the related work; and Section 6 concludes the
paper and discusses the future work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>AN OVERVIEW OF VOLUNTEERED</title>
    </sec>
    <sec id="sec-3">
      <title>DESIGN</title>
      <p>
        In this section we recall the methodology proposed by [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] for
involving volunteers in cube design.
      </p>
      <p>
        The VGI context presents the following peculiarities:
• Non-skilled users. The volunteers are researchers in
ecology, farmers, naturalists, and managers. They are
nonskilled in the OLAP paradigm (i.e., they never used
database and DW/OLAP technologies), therefore, as stated in
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], they do not know the proper technical terminology
to express their analysis needs in terms of cube elements.
• Limited time. Most participants work on the project on
a volunteer basis, therefore they cannot spend too much
time in communicating with DW experts to express their
analysis needs.
• Large number of volunteers. While DW experts handle the
design of a cube, they also have to consider the cubes
designed by several other volunteers.
• Volunteer groups. Volunteers can be organized in groups
to define their requirements. Groups can be defined based
on their members’ organization and/or enterprise, social
afinities, etc.
      </p>
      <p>
        In this context, existing DW design methodologies are not
appropriate since they implicitly assume that decision-makers are
familiar with OLAP concepts, they are not numerous and fully
employed in the DW project, and that there is no conflict about
their analysis requirements. To fill this gap, [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] presents a new
design methodology, sketched in Figure 1 using a UML activity
diagram. In the first phase, using the ProtOLAP approach [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ],
volunteers and/or groups of volunteers separately communicate
to DW experts their analysis requirements. In the second phase,
committers solve conflicts between cube prototypes. The overall
process can be more specifically described as follows.
(1) Volunteers express their requirements in natural language,
mainly in terms of the indicators and dimensions they
need (Requirement Identification ). Remarkably, the
iterative and rapid process adopted allows also DW experts
not skilled in the specific application domain to
understand the requirements during the interviews with users.
In principle, other tools (e.g., brainstorming, workshops,
scenarios, case studies) might be associated to natural
language to make requirement elicitation more efective, but
unfortunately the adoption of most of these tools to
replace natural language is dificult in presence of unskilled
OLAP users.
(2) DW experts draw a draft conceptual schema with the
      </p>
      <p>
        ICSOLAP UML profile [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] ( Cube Design).
(3) The prototyped DW schema is validated by DW experts
against data sources using existing methodologies, e.g., [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]
(Validation against data sources). If some pieces of data
cannot be found on the data sources, then step (1) is executed
again to tune the requirements.
(4) The draft schema is automatically implemented in a
relational DBMS, deployed on an OLAP server, and filled with
synthetic dimension members and randomly-generated
measure values so as to create a cube prototype. The reason
why synthetic data are used at this stage is that volunteers
must be quickly enabled to “play” with the cubes to check
that they are satisfactory, while implementing a real ETL
process may take a lot of time.
(5) Volunteers explore the cube prototype using an OLAP
client (Cube Exploration). If they do not agree with the way
requirements were understood and implemented, another
iteration is done; otherwise the cube is delivered for the
next phase.
(6) A fusion algorithm is applied to all delivered cube
prototypes to merge them into one or more cubes (Fusion).
(7) For each cube, the committers vote for each cube element
to solve conflicts ( Voting).
(8) The cube whose elements present a common agreement
among committers is delivered and fed with real data
through an ETL process designed and implemented by
DW experts (Cube Release).
      </p>
      <p>This methodology has been specifically conceived to support
the VGI features previously described: (i) unskilled users, (ii)
limited time, (iii) large number of users, and (iv) users groups. In
the rest of the paper, we experiment it in a real-world project to
verify to what extent it actually supports these VGI features.
3</p>
    </sec>
    <sec id="sec-4">
      <title>THE FARMLAND BIODIVERSITY CASE</title>
    </sec>
    <sec id="sec-5">
      <title>STUDY</title>
      <p>In the context of the VGI4Bio project, we mobilize two VGI
databases, namely Faune-Aquitaine (Biolovision database, LPO
– Ligue pour la Protection des Oiseaux) and OAB (Observatoire
Agricole de la Biodiversité), to build OLAP applications to
analyze farmland biodiversity indicators. Faune-Aquitaine and OAB
have 7682 and 1500 volunteers, respectively, who produce data.
Among the possible users interested in analyzing these data, we
have identified a large number of volunteers belonging to diverse
categories: for instance, farmers who are interested in analyzing
biodiversity data in relation with their farming daily practices,
environmental NGOs needing to visualize biodiversity trends,
and French public and private organizations (Regional Direction
of Environment and Housing – DREAL, Chambre d’Agriculture,
etc.).</p>
      <p>
        An example of cube schema obtained from our data sources
is called abundance, and can be used to analyze the abundance
of individual species according to space, time, meteorological
conditions, and altitude, in order to understand the impact of
global changes on biodiversity (abundance increase or decrease,
changes in range or phenology). The abundance cube schema is
shown in Figure 2 using the ICSOLAP profile [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The cube has
seven dimensions; for four of them (namely, Time,
RespectOfMeteoParameters, Altitude, and Location), hierarchies are hidden
in the figure inside UML packages for better readability. The
cube also has one indicator that represents the sum of abundance
(Sum(abundance)), and one derived measure that represents a
spatial average of the abundance (abundance by location).
Following the ICSOLAP profile, the Sum(abondance) indicator is not
graphically connected to the fact, but it presents a tagged value
(i.e., aggregatedAtribute ) whose value is a measure of the cube.
This cube has been implemented by two DW experts based on the
analysis requirements provided by a group of three volunteers.
      </p>
      <p>For the experiments conducted over these two datasets we
have mobilized eight volunteers overall, five on the Faune-Aquitaine
dataset and three on the OAB one. The three volunteers of the
OAB dataset are organized into two groups, featuring one and
two volunteers, respectively. The volunteers are all non-OLAP
skilled users and they come from diferent organizations with
diferent professional profiles (ecologists, farmers, and
administration). Two DW experts are involved in the project, one
10years-experienced engineer and one young engineer, who have
been working half-day by week on the project for 7 months.</p>
      <p>The following subsections focuse on the lessons learned during
the most critical steps of the methodology (see Figure 1).
3.1</p>
    </sec>
    <sec id="sec-6">
      <title>Cube prototyping</title>
      <p>In order to assess the feasibility and the improvements brought
by automatic prototyping, we have measured the time taken
to implement the Faune-Aquitaine cubes with and without the
ProtOLAP tool. We have not taken into account the time for
designing the UML model since it is required in both cases. The
average time for manual implementation (without ProtOLAP) is
2 hours, while with ProtOLAP it is only 5 to 10 minutes. This
diference is due to the fact that in a manual implementation
these kinds of errors frequently appear:
• SQL errors: wrong insert and create statements;
• Mondrian errors: wrong definition of Mondrian XML tags;
• SQL to Mondrian mapping errors: wrong usage of SQL
tables and attributes in the Mondrian XML tags.</p>
      <p>Another relevant diference between manual and automatic
implementation is that the manual process is very tedious and
long. Indeed, feeding the dimensions and fact tables with data,
to ofer volunteers to “play” with the cube and really understand
the prototype, takes a lot of time.</p>
      <p>Overall, our experiments confirm the feasibility and the
beneifts of the usage of ProtOLAP in our volunteered design context.</p>
      <p>
        Learned lessons. From the experiments it appears that
ProtOLAP indeed makes prototyping more rapid, which allows
nonOLAP skilled volunteers with limited time to participate to
multidimensional design. However, two limitations have been identified
thanks to our case study:
• Business Dictionary. In the current implementation, the
DW experts are in charge of matching the diferent
terminologies used by diferent volunteers. This becomes
too complex when the number of cubes increases, and
the catalog of classes ofered by the CASE tool used in
ProtOLAP is not suficient. For instance, diferently from
all other volunteers, a volunteer gave the name "cortege"
to species groups. This means that this dimension level
was considered diferent all along the process, introducing
a misunderstanding, which in the end afected the quality
of the designed cube and the duration of the project. So, a
unique and non ambiguous naming strategy is needed when
dealing with a large number of volunteers. Therefore, the
implementation of Business Dictionary must be provided,
for example using the Semantic of Business Vocabulary
and Rules standard. A more flexible solution, that does not
oblige DW experts and committers to create a Business
Vocabulary, would be the adoption of a domain-specific
ontology [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ]; this will bring clear advantages only when
such ontology already exists, since designing and
implementing it from scratch would presumably take a lot of
time and resources.
• Derived measures. ICSOLAP, as well as all the other DW
conceptual models [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], supports an explicit definition of
dimensions, facts, and aggregations in a non ambiguous
way. However, in real projects, derived measures —such as
the abundance ratio per location— are fundamental.
Unfortunately, [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] does not support a conceptual representation
of derived measures. This represents an important
limitation since it is very dificult to express derived measures
for volunteers, and DW experts will spend too much time
to achieve a correct implementation in MDX and/or SQL.
This issue causes several misunderstandings that slow the
prototyping phase down, so scaling up to a large number of
volunteers might not be feasible. Some works try to
provide a high level representation of derived measures. For
instance, [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] integrates a formal description of complex
aggregations and derived measures into OLAP operators
to explain the formula computation to decision-makers,
but without automating their implementation. Only [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]
uses a formal languages such as OCL to express complex
aggregations and derived measures, and also proposes an
MDX implementation. Therefore, this approach could be
integrated in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] with some possible extension of OCL
statements to represent complex aggregations such as
median and standard deviation as proposed in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
3.2
      </p>
    </sec>
    <sec id="sec-7">
      <title>Cube exploration</title>
      <p>In our experiments, each volunteer needs 3 to 5 meetings to
obtain the final cube prototype, depending on the complexity of
the cube. For example, 3 meetings were needed for the
FauneAquitaine cubes, while more meetings were necessary for the
OAB cubes since they have more dimensions and indicators than
the Faune-Aquitaine ones. Each meeting took 2 hours on average.</p>
      <p>Learned lessons. The exploration step enables unskilled OLAP
users to validate cube elements in a natural way since they are
familiar with pivot tables. However, this step still presents some
problems:
• Manual feeding. Manually adding dimensional data in the
feeding step has two main drawbacks. First, the process is
tedious and long, which makes prototyping not so rapid.
Secondly, when data is fed by DW experts the exploration
step can be complicated for volunteers. Indeed, DW
experts may use dimensional data that do not fit with real
data or with the data used by volunteers. This may
generate misunderstanding errors, i.e., make volunteers think
that the implementation of dimensional elements is not
correct. This issue can negatively influence the ability of
volunteers to validate the cubes, and slows this step down.
Therefore, it could make this process unfeasible for a large
number of users.
• Desktop OLAP client. The OLAP client used in the
ProtOLAP methodology is JRUbik. Though JRubik is a
userfriendly desktop client, it must be configured by DW
experts:
– on the desktops of volunteers. This can be a complex
task since a new configuration is needed for each new
cube version, or it is simply impossible since DW
experts may have no access to the volunteers’ desktops
for security reasons.
– on the desktops of DW experts. This requires that
volunteers access JRubik via a screen-sharing tool, and that
DW experts are present during their exploration session.</p>
      <p>This option has been used in our case study.</p>
      <p>Overall, the OLAP client desktop solution appears to be a
technological barrier when several volunteers are involved
in the project.
• Conflicts within the same group of volunteers . Volunteers
of the same group can have diferent visions of the same
analysis need. For example, for the meteorological
dimension, one volunteer may want to see all the details about
rain, wind, temperature, etc. at the time when
biodiversity data were collected, while another one may believe
that just a Boolean attribute (stating whether the
meteorological conditions defined by the data collection protocol
were respected) will be enough. This issue remains open
in the methodology, where conflicts within the same
volunteers group are solved by them with exchanges that
may cause misunderstandings. Therefore, conflicts within
the same volunteers group negatively impact the quality
of the designed cube and slows the overall methodology
down.
3.3</p>
    </sec>
    <sec id="sec-8">
      <title>Cube delivery</title>
      <p>In this section, we study the impact of the involvement of several
volunteers on the definition of requirements that are
representative of the whole volunteers community. In particular, we take
into account all the 11 cubes defined by 5 Faune-Aquitaine dataset
volunteers.</p>
      <p>To begin with, we have analyzed the number of dimensional
elements, aggregations, and derived measures shared or not by
them as shown in Figure 3, where each bar corresponds to one
cube. Cubes have been designed in a temporal order; the first
cube designed is c1 and the last one is c11.</p>
      <p>The first cube ( c1) defined by the first volunteer (User 1)
contains only new elements, since User 1 expressed his needs first.
The second cube (c2) defined by the second volunteer (User 2)
contains four new elements and eight existing elements: User 2
has defined some new elements, but he has also used elements
deifned by User 1. In the same way, the following cubes share some
existing elements, and they add new ones. After the definition
of the first three cubes, the users just add a few new elements to
define new cubes.</p>
      <p>Figure 3 shows that, after the definition of the cube
prototypes, the volunteers need just a few more elements to define
new cubes, since thy share some elements. At the same time,
each new volunteer brings some new elements and new analysis
requirements.</p>
      <p>In Figure 4, we represent the number of defined elements
as a function of the number of defined cubes, and we propose
an extrapolation of these data based on a logarithmic function.
The extrapolation indicates that the number of needed elements
increases at a slower pace than the number of new defined cubes.
Therefore, according to these results, after the first prototypes,
the definition of new cubes does not require the definition of
many new elements.</p>
      <p>Learned lessons. In our case study, a few volunteers were
suficient to define the core elements of the cubes; involving
other volunteers improves the defined cubes but only in specific
details. In other words, the methodology could efectively be used
to discover analysis requirements representing the whole set of
users of the Faune-Aquitaine dataset. The conception of cubes
has thereby two interesting features:
• the elements commonly needed by the community of
volunteers are rapidly identified by the group;
• each volunteer adds original elements and points of view.
It is also important to note that the prototyping phase is more and
more rapid since volunteers share multidimensional elements,
which confirms the feasibility of this step.</p>
      <p>When the cube schemas produced respects the trend shown in
Figure 4, the methodology can be applied with a large number of
volunteers.
3.4</p>
    </sec>
    <sec id="sec-9">
      <title>Voting</title>
      <p>To test this phase, we used one cube and three committers with
diferent profiles. They belong to LPO, DREAL, and
AgroParisTech, and they are an ecologist, a manager, and an agronomy
engineer, respectively. The cube they use concerns the
FauneAquitaine dataset and contains 6 dimensions and 7 indicators; it
is the result of the fusion step of 11 cube prototypes.</p>
      <p>
        Conflict resolution took 2 hours, and led to the elimination of
3 indicators and one level. It relied on a video-conference system
and was supported by the GRUS system as described in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. At
the end of the meeting we asked the 3 committers their feedbacks
on the process. They appreciated the methodology and found
that it is suitable to deliver the final cubes. However, they pointed
out a limitation: the methodology should allow them to modify
the proposed cube by adding new elements.
      </p>
      <p>Learned lessons. Conflict solving is well-suited for
committers, the duration is convenient for a large number of cubes,
but some efort should be done to allow committers to actively
participate in the design phase.
4</p>
    </sec>
    <sec id="sec-10">
      <title>COLLABORATIVE CUBE DESIGN</title>
      <p>To overcome the previously described limitations, we have
extended the methodology as follows.
4.1</p>
    </sec>
    <sec id="sec-11">
      <title>Overview</title>
      <p>
        To provide an efective answer to the research query How can
volunteer cubes be more efectively and eficiently designed? , we
extended [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] by adding a collaborative step, Wiki Exchange, to
the ProtOLAP approach, aimed at streamlining volunteer-based
design (Figure 5). In particular, this new step follows cube
exploration and supports a collaborative validation of cubes; this is
achieved by integrating the usage of a Wiki-based system into
ProtOLAP.
      </p>
      <p>
        Indeed, as previously mentioned, Wiki systems can be
considered as powerful tools improving software engineering processes
[
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] by enabling the sharing of contents, a collaborative
development, easy team communication, debugging, etc. Wikis are
essentially content management systems, in the form of Web
pages, which contain information that may be easily updated by
their users using a simplified markup language.
      </p>
      <p>In our context, associating a Wiki with an OLAP system
enables (i) an easier communication between volunteers and DW
experts, (ii) a faster resolution of conflicts among the volunteers
of a group, and (iii) a simplified versioning of cubes during
prototyping as described in the following.</p>
      <p>The collaborative requirement validation step is implemented
using a Wiki system that is integrated to the automatic and
iterative implementation of the cube prototypes. Figure 6 shows,
using the notation of a UML class diagram, the data model we
have used to achieve this integration.</p>
      <p>Since the prototyping approach is iterative, a cube can be
associated to its previous versions. For example, the cube in
• the problem cannot be solved due to technical DW/OLAP
limits, or to data sources configuration;
• the problem can be solved, then a new version of the cube
is implemented and proposed to the volunteers.</p>
      <p>Solving a requirement problem may imply several explorations
(i.e., OLAP queries) over several cube versions. We name
requirement tests these OLAP queries. The collaborative step is then
implemented by a Wiki page with a discussion associated to each
OLAP query. A discussion can contain several comments. In this
way, the DW experts and the volunteers of a group can easily
communicate about the reasons of the problem and its possible
solutions. Wiki discussions are organized in namespaces (i.e.,
directories) in the Wiki system according to the requirement test
they refer to.</p>
      <p>An example is shown in Figure 8, where a volunteer defines an
OLAP query, then he states on the Wiki that the list of treatments
is not suficient and that he needs a coarser level describing the
type of treatment.</p>
      <p>At this time a requirement test about this problem is created
(Figure 9). The DW experts take into account the comment and
prototype a new cube with the new hierarchy. The volunteers
explore the new cube (Figure 10) and create a new discussion
to communicate to the DW experts that the hierarchy is now
correct.</p>
      <p>Note that, in Figure 9, the Wiki home page presents the list
of solved and ongoing requirement tests. For example, for the
requirement test “test5”, the Wiki page shows the two discussions
associated with the two previously described OLAP queries.</p>
      <p>Using a Wiki also provides another important benefit in the
VGI context: it enables an easier cooperation among the persons
involved in design. Indeed, volunteers and DW experts can
provide their comments on the Wiki at any time and location, which
is mandatory in such kind of projects where volunteers are
geographically distributed and their agendas are not dependent from
the DW project. This is possible since both the OLAP client and
the Wiki system are web-based technologies.</p>
      <p>Finally, both systems are simple and user-friendly, which
allows non-ICT and non-OLAP skilled volunteers to easily use
them.
4.2</p>
    </sec>
    <sec id="sec-12">
      <title>Implementation</title>
      <p>The target relational DBMSs used for data storage are Postgres
and Oracle, while the OLAP server is Mondrian. The OLAP client
used is JPivot, a web-based OLAP client ofering tabular and
graphical displays to visualize the results of OLAP queries. The
Wiki system has been implemented with DocuWiki2, deployed
on an Apache server. A template of the Wiki discussion page
is instantiated by means of a Java servlet each time an OLAP
query is chosen by a volunteer. The name of the discussion page
is generated using a unique ID build from the multidimensional
elements of the OLAP query. Those elements are retrieved by
a Java servlet that parses the XMLA3 response of the OLAP
2www.docuWiki.net
3XMLA is a de facto standard for OLAP, which allows querying and visualizing cubes
(docs.microsoft.com/en-us/bi-reference/xmla/xml-for-analysis-xmla-reference)
Server Mondrian. Indeed, JPivot uses the XMLA web service to
communicate with Mondrian.</p>
      <p>An automatic implementation must also provide a complex
configuration of the OLAP client and server inside the web server.
This is not a simple task since several configuration files must be
set up. This hand-made configuration is usually time-consuming
since trivial errors can easily appear, slowing prototyping down.
To overcome this limitation, we have developed a configuration
wizard that allows a free-error and instantaneous XMLA
configuration of Mondrian and JPivot, and Postgres JDBC connection.
The wizard only needs the paths to some directories and files.</p>
      <p>Finally, in order to overcome the problems related to manual
feeding, we have extended the feeding step with a tool that
allows volunteers to upload dimensional data using their own CSV
ifles. Volunteers upload the files, then for each hierarchy level
they choose the CSV column that contains the corresponding
members. In this way each level of the dimension is fed with data
that is familiar to the volunteers.
4.3</p>
    </sec>
    <sec id="sec-13">
      <title>Experiments and validation</title>
      <p>
        Validation concerns the requirements expressed about all the
different elements of the cubes, and is carried out during a meeting
in which the DW experts ask specific questions to the
decisionmakers. Note that, while the subsequent interactions between
decision-makers and DW experts will be mediated by the Wiki,
meetings are preferable for the first round of validation because
decision makers are non-skilled in multidimensional modeling,
so they typically need explanations and an expert guide for
validation. The types of cube elements are listed below with the
corresponding questions: :
• Dimensional elements: dimensions, hierarchies, and
levels.
– Is this dimension useful? For example, the temperature
dimension can be excluded from the model since it is
not relevant for the analysis of abundance.
– Does this hierarchy include all the necessary levels? For
example, is it not suficient to have the class of a
agricultural treatment without a level describing the active
material.
– Is this level needed? For example, the week level is not
useful for a description of biodiversity temporal trends.
• Data calculation
– Is this aggregation operator useful? In OLAP clients, the
measures stored in the DW are always associated with
aggregation operators in order to be visualized and
analyzed by decision makers. Therefore, since the cube
prototype is fed with data, the volunteers can validate
the aggregation operator used. For example, for
abundance, the MEDIANE operator is considered useful, but
MIN and MAX are not.
– Does this derived measure correctly implement the
deifned requirement? For example, the derived measure
ratio of abundance by location was not correctly
implemented in the two cube versions. It was calculated
as the ratio between total abundance and the number
of parcels, while volunteers wanted the ratio between
total abundance and the total surface of the parcels.
• Data: What constraints must cube data respect? Source
data can present some integrity constraints that must be
taken into account when they are loaded in the cubes [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
For example, volunteers do not want their cubes to include
butterfly abundance data recorded outside a predefined
itinerary.
• Nomenclature: Are the names used for aggregations,
derived measures, and dimensional elements well-defined? For
example, volunteers want to visualize in the OLAP client
the sum of abundance as SUM(abundance) and not as sum
abundance, since they find it more understandable.
      </p>
      <p>To assess the benefits ofered by the Wiki system, we have
used it with one group composed of two volunteers for the OAB
dataset. This group needed only three meetings to achieve the
ifnal prototype, diferently from the other volunteer group of the
OAB which needed six meetings. Therefore, it appears that the
usage of Wiki reduces the number of needed meetings. Note that,
in the general case, groups are composed by diferent persons
with diferent analysis needs, so it is quite dificult to properly
evaluate to what extent the results obtained by the diferent
groups are the same. However, we mention that in our case study
the cube schemes obtained by the two groups only difer in one
dimension and one indicator. Figure 11 shows the number of
discussions related to the elements of the cube that the group of
volunteers created. We observe that they concern all the
previously described requirements.</p>
      <p>Finally, we have asked the DW experts to classify each
discussion according to the cause of the error:
• “data” when the problem is related to the data fed in the
cube;
• “misunderstanding”, when the problem is that the DW
experts have not well understood the requirement expressed
by the volunteers; and
• “vague”, when the error is due to the fact that the
requirement has not been precisely defined by volunteers
themselves.</p>
      <p>From Figure 12 it appears that misunderstandings are the main
class of problems, which confirms the need for a prototype-based
methodology. A total of 13 errors were identified. We have also
found that 63% of the discussions are used to solve a problem on
a requirement without having to create a new cube version to be
explored by volunteers.</p>
      <p>Learned lessons. Experiments confirm that the easy
communication support provided by the Wiki really enhances the
exchange among DW experts and volunteers and avoids useless
meetings. However, in our case study volunteers had to be trained
in order to be able to use the Wiki system. Some volunteers did
not use the Wiki since they considered it too complicated and
they preferred the classical email exchange. This means that the
Wiki interface must be made even more friendly to be adopted
by the whole community of volunteers.</p>
    </sec>
    <sec id="sec-14">
      <title>RELATED WORK</title>
      <p>In this section we discuss related work concerning DW design
methodologies (Section 5.1), testing DW (Section 5.2), and finally
social tools for software engineering (Section 5.3).
5.1</p>
    </sec>
    <sec id="sec-15">
      <title>Data warehouse design</title>
      <p>
        Several methodologies have been developed in literature for the
design of DWs [
        <xref ref-type="bibr" rid="ref12 ref24">12, 24</xref>
        ]; they can be grouped into three classes:
• data-driven, which analyze the data sources schemata to
deduce numerical attributes that can be used as measures,
and associated tables that can represent dimensions (such
as [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]).
• requirement-driven, which derive a multidimensional schema
from the analysis requirements that are formalized using
ad-hoc or standard formalisms (e.g., [
        <xref ref-type="bibr" rid="ref22 ref8">8, 22</xref>
        ]). When
requirements are expressed as analytical queries (SQL or MDX)
and reports the users ask for, the term query-driven is used
[
        <xref ref-type="bibr" rid="ref25">25</xref>
        ].
• mixed, which combine data- and requirement-driven
approaches in that they validate the multidimensional schema
derived from data sources over the analysis requirements
(for example [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]).
      </p>
      <p>
        Requirement-driven methodologies are based on the
assumption that analytical requirements are well-defined by
decisionmakers; clearly, this gives a strong importance to the adoption of
an efective elicitation requirement method. Requirements
elicitation is the practice of collecting the requirements of a system
from users, customers, and other stakeholders. Requirements
elicitation is non-trivial. Requirements elicitation practices include
interviews, questionnaires, user observation, workshops,
brainstorming, use cases, role playing, and prototyping. Elicitation of
requirements for DWs uses a set of tools (scenario, prototype, etc.)
for helping DW experts to communicate with decision-makers
[
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. To the best of our knowledge they also suppose that all the
requirements expressed by decision-makers do not present any
conflicts. In this context, only [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] studies conflicts among the
requirements of diferent groups of decision-makers that have
been already elicited and validated. However, [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] does not
address conflicts that can emerge during the elicitation phase for a
group of volunteers.
      </p>
      <p>
        When decision-makers are non-skilled decision-makers, also
rapid prototyping proved to be an efective tool to support
requirement elicitation and design. Some works propose agile
methodologies for the development of DWs using prototypes
[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ][
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]; in particular, the usage of automatic prototype
implementation from conceptual multidimensional schemes has been
investigated.
5.2
      </p>
    </sec>
    <sec id="sec-16">
      <title>Data warehouse testing</title>
      <p>
        Classical DW development includes design phase, as described
in the previous section, and then a functional and non-functional
testing phase. Functional testing is devoted to finding out
problems of requirements implementations. Some approaches have
been developed to lead tests on DWs (e.g., [
        <xref ref-type="bibr" rid="ref11 ref20">11, 20</xref>
        ]); a survey is
proposed in [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. These methodologies are based on quantitative
metrics to test the multidimensional schemata, the nomenclature,
and the warehoused data [
        <xref ref-type="bibr" rid="ref10 ref26">10, 26</xref>
        ]. Besides these metrics, classical
testing techniques have been also used, such as
testing-in-thesmall and stress tests [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. However, to the best of our knowledge,
no work propose a test phase done by decision-makers by
using collaborative tools, diferently from other computer science
domains where social tools have been quite successfully
investigated, as described in the next subsection.
      </p>
    </sec>
    <sec id="sec-17">
      <title>Social tools for software engineering</title>
      <p>
        Social tools for software engineering (SSEs) are designed for
sharing ideas, knowledge, and artifacts among groups and their
members during software engineering processes. Two surveys
can be found in [
        <xref ref-type="bibr" rid="ref1 ref18">1, 18</xref>
        ]. According to [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], SSEs can be considered
as collaboration tools for web-based collaboration software. Web
2.0 fosters software engineering mainly using Wiki systems, for
example [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ] uses Wikis to enable collaborative programming.
Moreover, it has been proved that Wikis are particularly useful
in agile software development since they enable, among others,
a fast and easy content creation [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ].
      </p>
      <p>
        To best of our knowledge only [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] proposes to associate an
OLAP client to a Wiki system to allow decision-makers to share
their analytical queries. The main diference with our work is
that in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] the Wiki system is not used in the design phase but for
an already implemented DW, which does not require to handle
cube versioning.
6
      </p>
    </sec>
    <sec id="sec-18">
      <title>CONCLUSIONS AND FUTURE WORK</title>
      <p>
        Changing VGI volunteers from passive data producers into active
data analysts in the context of DWs and OLAP systems is an
open issue. Indeed, volunteers have some peculiarities that make
existing DW design methodologies inadequate: they are
nonskilled in ICT and OLAP, they can dedicate to the project only
a limited time, and they are a numerous. In this paper we used
a real case study concerning a farmland biodiversity project to
test the methodology proposed in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], which allows volunteers
to actively participate in the design of DW schemas.
      </p>
      <p>
        Our experiments confirmed the relevance of the methodology
proposed in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], but also revealed some limitations. In particular,
though this methodology allows volunteers to rapidly design
draft multidimensional schemas, there are conflicts among
volunteers in the first steps of design. To cope with this issue we
have proposed a solution based on social software engineering
tools, and in particular Wiki systems. Integrating a Wiki with
an OLAP system allows for managing requirements validation
over all the diferent versions of the elements of the cubes. All
the remaining limitations pointed out in this paper represent our
current work.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Navid</given-names>
            <surname>Ahmadi</surname>
          </string-name>
          , Mehdi Jazayeri, Francesco Lelli, and
          <string-name>
            <given-names>Sasa</given-names>
            <surname>Nesic</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>A survey of social software engineering</article-title>
          .
          <source>In Proc. ASE</source>
          . L'Aquila, Italy,
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Sandro</given-names>
            <surname>Bimonte</surname>
          </string-name>
          , Elodie Edoh Alove, Hassan Nazih, Myoung Ah Kang, and
          <string-name>
            <given-names>Stefano</given-names>
            <surname>Rizzi</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>ProtOLAP: rapid OLAP prototyping with on-demand data supply</article-title>
          .
          <source>In Proce. DOLAP</source>
          . San Francisco, USA,
          <fpage>61</fpage>
          -
          <lpage>66</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Sandro</given-names>
            <surname>Bimonte</surname>
          </string-name>
          , Omar Boucelma, Olivier Machabert, and
          <string-name>
            <given-names>Sana</given-names>
            <surname>Sellami</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>A new Spatial OLAP approach for the analysis of Volunteered Geographic Information</article-title>
          . Computers,
          <source>Environment and Urban Systems</source>
          <volume>48</volume>
          (
          <year>2014</year>
          ),
          <fpage>111</fpage>
          -
          <lpage>123</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Sandro</given-names>
            <surname>Bimonte</surname>
          </string-name>
          and Myhoung Ah Kang.
          <year>2013</year>
          .
          <article-title>WikOLAP Integration of Wiki and OLAP Systems</article-title>
          .
          <source>In Encyclopedia of Business Analytics and Optimization. 1-5.</source>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Sandro</given-names>
            <surname>Bimonte</surname>
          </string-name>
          , Amir Sakka, Lucile Sautot, Pascale Zarate, Guy Camilleri, and
          <string-name>
            <given-names>Aurelien</given-names>
            <surname>Besnard</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>A Volunteer Design Methodology of Data Warehouses</article-title>
          .
          <source>In Proc. ER</source>
          , to appear.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Bonifati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Cattaneo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ceri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Fuggetta</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Paraboschi</surname>
          </string-name>
          .
          <year>2001</year>
          .
          <article-title>Designing data marts for data warehouses</article-title>
          .
          <source>ACM Transactions on Software Engineering Methodologies</source>
          <volume>10</volume>
          ,
          <issue>4</issue>
          (
          <year>2001</year>
          ),
          <fpage>452</fpage>
          -
          <lpage>483</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Kamal</given-names>
            <surname>Boulil</surname>
          </string-name>
          , Sandro Bimonte, and
          <string-name>
            <given-names>François</given-names>
            <surname>Pinet</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Conceptual model for spatial data cubes: A UML profile and its automatic implementation</article-title>
          .
          <source>Computer Standards &amp; Interfaces</source>
          <volume>38</volume>
          (
          <year>2015</year>
          ),
          <fpage>113</fpage>
          -
          <lpage>132</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>R.</given-names>
            <surname>Bruckner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>List</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Schiefer</surname>
          </string-name>
          .
          <year>2001</year>
          .
          <article-title>Developing requirements for data warehouse systems with use cases</article-title>
          .
          <source>In Proc. Americas Conf. on Information Systems</source>
          .
          <volume>329</volume>
          -
          <fpage>335</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Jordi</given-names>
            <surname>Cabot</surname>
          </string-name>
          , Jose-Norberto Mazón, Jesús Pardillo, and
          <string-name>
            <given-names>Juan</given-names>
            <surname>Trujillo</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Specifying Aggregation Functions in Multidimensional Models with OCL</article-title>
          .
          <source>In Proc. ER</source>
          . Vancouver, Canada,
          <fpage>419</fpage>
          -
          <lpage>432</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>C.</given-names>
            <surname>Calero</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Piattini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Pascual</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Serrano</surname>
          </string-name>
          .
          <year>2001</year>
          .
          <article-title>Towards Data Warehouse Quality Metrics</article-title>
          .
          <source>In Proc. DMDW</source>
          . Interlaken, Switzerland,
          <volume>2</volume>
          .
          <fpage>1</fpage>
          -
          <lpage>2</lpage>
          .
          <fpage>10</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>R.</given-names>
            <surname>Cooper</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Arbuckle</surname>
          </string-name>
          .
          <year>2002</year>
          .
          <article-title>How to Thoroughly Test a Data Warehouse</article-title>
          .
          <source>In Proc. STAREAST</source>
          . Orlando.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A.</given-names>
            <surname>Cravero</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Sepúlveda</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Multidimensional Design Paradigms for Data Warehouses: A Systematic Mapping Study</article-title>
          .
          <source>Journal of Software Engineering and Applications</source>
          <volume>7</volume>
          (
          <year>2014</year>
          ),
          <fpage>53</fpage>
          -
          <lpage>61</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Claudia</surname>
            <given-names>Diamantini</given-names>
          </string-name>
          , Domenico Potena, and
          <string-name>
            <given-names>Emanuele</given-names>
            <surname>Storti</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Extended drill-down operator: Digging into the structure of performance indicators</article-title>
          .
          <source>Concurrency and Computation: Practice and Experience</source>
          <volume>28</volume>
          ,
          <issue>15</issue>
          (
          <year>2016</year>
          ),
          <fpage>3948</fpage>
          -
          <lpage>3968</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>Matteo</given-names>
            <surname>Golfarelli</surname>
          </string-name>
          and
          <string-name>
            <given-names>Stefano</given-names>
            <surname>Rizzi</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Data warehouse testing: A prototypebased methodology</article-title>
          .
          <source>Information &amp; Software Technology</source>
          <volume>53</volume>
          ,
          <issue>11</issue>
          (
          <year>2011</year>
          ),
          <fpage>1183</fpage>
          -
          <lpage>1198</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>B.</given-names>
            <surname>Hüsemann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lechtenbörger</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Vossen</surname>
          </string-name>
          .
          <year>2000</year>
          .
          <article-title>Conceptual Data Warehouse Design</article-title>
          .
          <source>In Proc. DMDW</source>
          . Stockholm, Sweden,
          <fpage>3</fpage>
          -
          <lpage>9</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Ralph</surname>
            <given-names>Kimball</given-names>
          </string-name>
          , Margy Ross, Joy Mundy, and
          <string-name>
            <given-names>Warren</given-names>
            <surname>Thornthwaite</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>The Kimball group reader: Relentlessly practical tools for data warehousing and business intelligence remastered collection</article-title>
          . John Wiley &amp; Sons.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>Panagiotis</given-names>
            <surname>Louridas</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>Using Wikis in Software Development</article-title>
          .
          <source>IEEE Software 23</source>
          ,
          <issue>2</issue>
          (
          <year>2006</year>
          ),
          <fpage>88</fpage>
          -
          <lpage>91</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Ioanna</surname>
            <given-names>Lykourentzou</given-names>
          </string-name>
          , Foteini Dagka, Katerina Papadaki, Giorgos Lepouras, and
          <string-name>
            <given-names>Costas</given-names>
            <surname>Vassilakis</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Wikis in enterprise settings: a survey</article-title>
          .
          <source>Enterprise IS 6</source>
          ,
          <issue>1</issue>
          (
          <year>2012</year>
          ),
          <fpage>1</fpage>
          -
          <lpage>53</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Alejandro</surname>
            <given-names>Maté</given-names>
          </string-name>
          , Juan Trujillo,
          <string-name>
            <given-names>and John</given-names>
            <surname>Mylopoulos</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Specification and derivation of key performance indicators for business analytics: A semantic approach</article-title>
          .
          <source>DKE</source>
          <volume>108</volume>
          (
          <year>2017</year>
          ),
          <fpage>30</fpage>
          -
          <lpage>49</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>A.</given-names>
            <surname>Mookerjea</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Malisetty</surname>
          </string-name>
          .
          <year>2008</year>
          . Data Warehouse/ETL Testing:
          <article-title>Best Practices</article-title>
          .
          <source>In Proc. Test (Test Excellence through Speed and Technology)</source>
          . New Delhi, India.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Maria</surname>
            <given-names>Paasivaara</given-names>
          </string-name>
          , Sandra Durasiewicz, and
          <string-name>
            <given-names>Casper</given-names>
            <surname>Lassenius</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Using Scrum in Distributed Agile Development: A Multiple Case Study</article-title>
          .
          <source>In Proc. ICGSE</source>
          . Limerick, Ireland,
          <fpage>195</fpage>
          -
          <lpage>204</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>N.</given-names>
            <surname>Prakash</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Gosain</surname>
          </string-name>
          .
          <year>2003</year>
          .
          <article-title>Requirements Driven Data Warehouse Development</article-title>
          .
          <source>In Proc. CAiSE.</source>
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>Naveen</given-names>
            <surname>Prakash</surname>
          </string-name>
          and
          <string-name>
            <given-names>Deepika</given-names>
            <surname>Prakash</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Data Warehouse Requirements Engineering: A Decision Based Approach</article-title>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>Oscar</given-names>
            <surname>Romero</surname>
          </string-name>
          and
          <string-name>
            <given-names>Alberto</given-names>
            <surname>Abelló</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>A survey of multidimensional modeling methodologies</article-title>
          .
          <source>International Journal of Data Warehousing and Mining</source>
          <volume>5</volume>
          ,
          <issue>2</issue>
          (
          <year>2009</year>
          ),
          <fpage>1</fpage>
          -
          <lpage>23</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>Oscar</given-names>
            <surname>Romero</surname>
          </string-name>
          and
          <string-name>
            <given-names>Alberto</given-names>
            <surname>Abelló</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Automatic validation of requirements to support multidimensional design</article-title>
          .
          <source>DKE 69</source>
          ,
          <issue>9</issue>
          (
          <year>2010</year>
          ),
          <fpage>917</fpage>
          -
          <lpage>942</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>M.</given-names>
            <surname>Serrano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Trujillo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Calero</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Piattini</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>Metrics for data warehouse conceptual models understandability</article-title>
          .
          <source>Information &amp; Software Technology 49</source>
          ,
          <issue>8</issue>
          (
          <year>2007</year>
          ),
          <fpage>851</fpage>
          -
          <lpage>870</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>Vijayan</given-names>
            <surname>Sugumaran</surname>
          </string-name>
          and
          <string-name>
            <given-names>Veda C.</given-names>
            <surname>Storey</surname>
          </string-name>
          .
          <year>2002</year>
          .
          <article-title>Ontologies for conceptual modeling: their creation, use, and management</article-title>
          .
          <source>DKE 42</source>
          ,
          <issue>3</issue>
          (
          <year>2002</year>
          ),
          <fpage>251</fpage>
          -
          <lpage>271</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>Daniel</given-names>
            <surname>Sui</surname>
          </string-name>
          , Sarah Elwood, and
          <string-name>
            <given-names>Michael</given-names>
            <surname>Goodchild</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Crowdsourcing geo knowledge: volunteered geo information (VGI) in theory and practice</article-title>
          . Springer Science &amp; Business Media.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>D.</given-names>
            <surname>Wright</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Underhill</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Keene</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Knight</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Understanding the motivations and satisfactions of volunteers to improve the efectiveness of citizen science programs</article-title>
          .
          <source>Society &amp; Natural Resources</source>
          <volume>28</volume>
          (
          <year>2015</year>
          ),
          <fpage>1013</fpage>
          -
          <lpage>1029</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <surname>Wenping</surname>
            <given-names>Xiao</given-names>
          </string-name>
          , Chang Yan Chi, and
          <string-name>
            <given-names>Min</given-names>
            <surname>Yang</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>On-line collaborative software development via wiki</article-title>
          .
          <source>In Proc. Int. Symp. on Wikis</source>
          . Montreal, Canada,
          <fpage>177</fpage>
          -
          <lpage>183</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>