<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Data Currency Quality Factors in Data Warehouse Design</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dimitri Theodoratos</string-name>
          <email>dth@dblab.ece.ntua.gr</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mokrane Bouzeghoub</string-name>
          <email>Mokrane.Bouzeghoub@prism.uvsq.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>INRIA</institution>
          ,
          <addr-line>Domaine de Voluceau - B.P. 105, Le Chesnay Cedex, FR-78153</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>NTUA, Dept. of Electrical and Computer Eng.</institution>
          ,
          <addr-line>Athens, GR-15773</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>A Data Warehouse (DW) is a large collection of data integrated from multiple distributed autonomous databases and other information sources. A DW can be seen as a set of materialized views defined over the remote source data. Until now research work on DW design is restricted to quantitatively selecting view sets for materialization. However, quality issues in the DW design are neglected. In this paper we suggest a novel statement of the DW design problem that takes into account quality factors. We design a DW system architecture that supports performance and data consistency quality goals. In this framework we present a high level approach that allows to check whether a view selection guaranteeing a data completeness quality goal also satisfies a data currency quality goal. This approach is based on an AND/OR dag representation for multiple queries and views. It also allows determining the minimal change propagation frequencies that satisfy the data currency quality goal along with the the optimal query evaluation and change propagation plans. Our results can be directly used for a quality driven design of a DW.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Research supported by the European Commission under the ESPRIT
Program LTR project “DWQ: Foundations of Data Warehouse Quality”
The copyright of this paper belongs to the paper’s authors. Permission to
copy without fee all or part of this material is granted provided that the
copies are not made or distributed for direct commercial advantage.</p>
    </sec>
    <sec id="sec-2">
      <title>Proceedings of the International Workshop on Design and</title>
    </sec>
    <sec id="sec-3">
      <title>Management of Data Warehouses (DMDW’99), Heidelberg,</title>
      <p>Germany, 14. - 15.6. 1999
(S. Gatziu, M. Jeusfeld, M. Staudt, Y. Vassiliou, eds.)
http://sunsite.informatik.rwth-aachen.de/Publications/CEUR-WS/Vol-19/
1</p>
      <sec id="sec-3-1">
        <title>Introduction</title>
        <p>
          A Data Warehouse (DW) is a large collection of data used
by companies for On-Line Analytical Processing (OLAP)
and Decisison Support System (DSS) applications [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ].
Because of the enormous quantity of information
available to companies nowadays, DWs often grow to be very
large. Data warehousing is also an approach for
integrating data from multiple, possibly very large, distributed,
autonomous, heterogeneous databases and other
information sources [
          <xref ref-type="bibr" rid="ref44">44</xref>
          ]: selected information from each source
is extracted in advance, cleaned, translated and filtered
as needed, merged with relevant information from other
sources and stored in a repository. OLAP in the DW
is decoupled as much as possible from On-Line
Transaction Processing (OLTP) supported by the remote source
databases. A DW can be abstractly seen as a set of
materialized views defined over source relations. We provide
below a brief overview of issues related to the design of a
DW.
        </p>
        <p>
          Query evaluation. OLAP and DSS applications make
heavy use of complex queries (usualy with
grouping/aggregation). These data intensive queries often
require sequential scans. Ensuring high query performance
is one of the most significant challenges when
implementing a DW. To this end, the queries addressed to the DW are
evaluated locally (using exclusively the materialized views)
without accessing the original information sources.
Therefore a complete rewriting [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ] of the queries over the views
materialized at the DW must be possible [
          <xref ref-type="bibr" rid="ref40 ref41">41, 40</xref>
          ].
View maintenance. When the source relations change, the
materialized views need to be brought up-to-date.
Typically, the DW is maintained separately from the
operational source databases. Recent applications of DWs
require data that are more current. In order to bring the
materialized views up-to-date, different update scenarios can
be envisaged. They can be classified according to the class
of queries and views considered, the type of changes, the
maintenance strategy, the maintenance timing policy, the
type of environment, and for a distributed environment the
type of information sources [
          <xref ref-type="bibr" rid="ref13 ref44 ref48">13, 44, 48</xref>
          ].
        </p>
        <p>
          Maintenance strategies. A maintenance strategy can be
an incremental one, or a recomputation of the views from
scratch. In an incremental strategy, the changes to the
views are computed using the changes to the source
relations [
          <xref ref-type="bibr" rid="ref12 ref2 ref27 ref28 ref38 ref6 ref9">2, 27, 12, 9, 28, 38, 6</xref>
          ]. Incremental maintenance can
be significantly cheaper and more space efficient than
recomputing the view from scratch, especially if the size of
the view is much larger than the size of the changes [
          <xref ref-type="bibr" rid="ref49 ref9">9, 49</xref>
          ].
Maintenance timing policy. The maintenance timing
policy can be either immediate or deferred. In an
immediate timing policy [
          <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
          ], a materialized view is maintained
within or immediately after the transaction that changes the
source relations. In a deferred timing policy [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], the
maintenance of the view is delayed. It can be done periodically
[
          <xref ref-type="bibr" rid="ref34">34</xref>
          ], at query time [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ], or on-demand by the user. It can
also be triggered by events at the sources (e.g. when the
net change to a source database exceeds a certain
threshold). Sometimes supporting multiple maintenance timing
policies may be appropriate [
          <xref ref-type="bibr" rid="ref33 ref6">6, 33</xref>
          ].
        </p>
        <p>
          Type of environment. Most of the research work on
incremental view maintenance assumes a centralized database
environment where a single system has control of the
materialized views, the source relations, and the changes
[
          <xref ref-type="bibr" rid="ref12 ref27 ref28 ref9">27, 12, 9, 28</xref>
          ]. Therefore, changes to the source relations
and to the materialized views can be combined within the
same transaction. DWs are typically distributed database
environments. Some approaches to incremental view
maintenance in distributed environments are based on
timestamping the changes to the source relations [
          <xref ref-type="bibr" rid="ref33 ref34">34, 33</xref>
          ].
However, the source databases can be autonomous, a global
clock cannot be assumed and therefore inconsistencies may
appear [
          <xref ref-type="bibr" rid="ref49">49</xref>
          ]. Approaches that keep materialized view data
loosely consistent with the remote sources are more
appropriate for DW environments [
          <xref ref-type="bibr" rid="ref18 ref49 ref50 ref51">49, 50, 18, 51</xref>
          ].
        </p>
        <p>
          Types of sources. Concerning the detection and handling
of the changes of the source data, the sources can be of
the following types [
          <xref ref-type="bibr" rid="ref44">44</xref>
          ]: cooperative sources (they
provide active database features with triggering [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ], and
allow the detection, filtering, storage, processing, and
propagation of changes to the DW to be programmed and
occur automatically), logged sources (they maintain a log and
allow changes of interest to be extracted by inspecting the
log), queryable sources (they can be queried periodically in
order to detect changes of interest), and snapshot sources
(they only allow snapshots of data to be taken periodically
and changes are extracted by comparing successive
snapshots [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]).
        </p>
        <p>
          Maintenance queries. When changes are reported by a
data source, it may be necessary to issue queries to the same
or other data sources in order to maintain the affected
materialized views. In the case of an incremental maintenance
strategy the queries involve also differentials (source
relation changes). We call theses queries maintenance queries.
Maintaining materialized views incurs a cost for computing
maintenance queries, a cost for transmitting data from the
sources to the DW and inversely (in a distributed DW
environment), and a cost for applying the computed changes to
the materialized views [
          <xref ref-type="bibr" rid="ref45">45</xref>
          ].
        </p>
        <p>
          Multiquery optimization on maintenance queries. The
changes taken into account for maintaining the
materialized views at the DW may affect more than one view. Then
multiple maintenance queries are issued against the source
relations for evaluation. These maintenance queries may
contain subexpressions that are identical, equivalent, or
more generally subexpressions such that one can be
computed from the other. We describe these subexpressions
by the generic term common subexpressions [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. The
techniques of multiple query optimization [
          <xref ref-type="bibr" rid="ref35 ref36">35, 36</xref>
          ] allow
these queries to be computed together by detecting
common subexpressions between maintenance queries:
nonoptimal local query evaluation plans are combined into an
optimal global plan which is more efficient to execute than
executing separately the optimal local evaluation plan of
each maintenance query.
        </p>
        <p>
          Using auxiliary views to reduce the view maintenance
cost. A global evaluation plan for maintenance queries
can be executed more efficiently if some intermediate
subqueries are kept materialized in the DW, or can be
computed from views that are kept materialized in the DW
[
          <xref ref-type="bibr" rid="ref30 ref42">30, 42</xref>
          ]. These materialized subqueries (views) are called
auxiliary views. It is worth noting that an optimal global
evaluation plan without auxiliary views can be completely
different than the optimal global evaluation plan when
materialized views are used. The existence of auxiliary
views can greatly reduce the cost of evaluating
maintenance queries. Indeed, the computation of the
corresponding subqueries is avoided or simplified. Further, since DWs
are typically distributed systems, access of the data sources
and expensive data transmissions are reduced. Obviously,
there is a cost associated with the process of maintaining
the auxiliary materialized views. But, if this cost is less
than the reduction to the maintenance cost of the initially
materialized views, it is worth keeping the auxiliary views
in the DW.
        </p>
        <p>
          Self-maintenability. By appropriately selecting auxiliary
views to materialize in the DW, it is possible to maintain
the initial materialized views and the auxiliary views
altogether, for any source relation change, without issuing
queries against the source relations. Such a set of views is
called self-maintainable [
          <xref ref-type="bibr" rid="ref11 ref18 ref29">11, 18, 29</xref>
          ].
        </p>
        <p>DW design. When designing a DW, a number of choices
are made first by the DW designer (e.g. the maintenance
timing policy, the maintenance strategy, or the DW system
architecture) dictated by physical parameters (e.g. the type
15-2
and the availability of data sources, the type of changes,
the data transmission rates over the network, the hardware
computational power etc.) and the requirements of the
knowledge workers that will use the DW.</p>
        <p>
          The next step in the DW design process concerns the
selection of views to materialize in the DW according to
use the DW is intended to (i.e. the queries the DW has
to answer). This is the DW view selection problem. The
view selection problem takes as input a set of queries or
views and aims at selecting a set of views to materialize in
the DW that minimizes the overall query evaluation cost,
or the overall view maintenance cost, or a combination of
both. A number of constraints may be additionally
provided as input for satisfaction (e.g. the materialized views
should fit in the space allocated for materialization or the
view maintenance cost should not exceed a certain limit)
[
          <xref ref-type="bibr" rid="ref15 ref17 ref32 ref41">32, 17, 41, 15</xref>
          ]. The DW design problem is complex. One
of the reasons of its complexity relies on the fact that
common subexpressions between the input queries need to be
detected and exploited.
        </p>
        <p>
          Quality factors in DW design. Until now research work
on DW design is restricted to quantitatively selecting view
sets for materialization. Quality issues in the DW design
are neglected. However, the design of a DW at the
logical and physical level is subject to a number of quality
factors. These quality factors determine quality goals that
have to be achieved through the design process [
          <xref ref-type="bibr" rid="ref20 ref22">20, 22</xref>
          ].
The present research work is done in the framework of the
European Foundations of Data Warehouse Quality (DWQ)
project. The goal of the DWQ project is to develop
semantic foundations that will allow the designers of DWs
to link their choice of deeper models, rich data structures
and rigorous implementation techniques to quality factors
in a systematic manner, thus improving the design, the
operation and the evolution of DWs [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ]. In this paper, we
deal mainly with the quality factors of data currency, data
consistency, data completeness, and query and view
maintenance performance in the design of a DW.
1.1
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>The problem</title>
      <p>
        When designing a DW, alternative view selections for
materialization are examined in order to find one that satisfies
the DW design goals. View selection algorithms for Data
Warehousing proceed in a similar manner [
        <xref ref-type="bibr" rid="ref15 ref17 ref41 ref47">17, 41, 15, 47</xref>
        ].
The framework. We consider that the approach adopted
for designing a DW aims at selecting a set of views for
materialization that satisfies the following quality goals: (a)
data completeness, (b) data currency, (c) query evaluation
and view maintenance performance, and (d) data
consistency. We suppose also that the sources are subject to a
source availability constraint. We explain these notions
below.
      </p>
      <p>The source availability constraint states that the change
propagation frequency from each source is restricted not to
exceed a maximal frequency. These maximal frequencies
are set by the administrators of the source databases and
express the availability of the data sources and the degree
of decoupling of OLTP at the operational data sources from
DW activities.</p>
      <p>The data completeness quality goal guarantees that the
data necessary for answering the input queries are present
at the DW. Therefore, it requires a complete rewriting of
the input queries over the materialized views</p>
      <p>The data currency quality goal upper bounds the time
elapsed between the time point the answer to a query is
returned to the user and the time point the most recent
changes to a source relation that are taken into account in
the computation of this answer are read (this time reflects
the currency of answer data). The data currency quality
goal is expressed by currency constraints associated with
every source relation in the definition of every input query.
The upper bound in a currency constraint (minimal
currency required) is set by the knowledge workers according
to their needs.</p>
      <p>The query evaluation and view maintenance
performance quality goal requires the minimization of a
combination of these costs.</p>
      <p>The data consistency quality goal ensures that at every
moment the state of the DW reflects a certain state of the
source relations. It is expressed by a number of properties
that the DW data must satisfy. These properties are
formally presented in Section 3.</p>
      <p>
        This formalization of the problem of designing a DW
using quality goals is novel. Further, it allows:
(a) stating currency constraints at the query level and not
at the materialized view level as is the case in other
approaches [
        <xref ref-type="bibr" rid="ref18 ref33">33, 18</xref>
        ]. Therefore, currency constraints can
be exploited by DW view selection algorithms where
the queries are the input, while the materialized views
are the output (and therefore are not available).
(b) stating different currency constraints for different
relations in the same query. This flexibility is necessary.
For instance, a query that combines share prices and
companies introduced in the stock market requires
different currency constraints for the data derived from
the source relation providing information on share
prices and for the data derived from the source
relation providing information on companies.
      </p>
      <p>Problem addressed. In the framework set up above, we
address the problem of checking whether for a view
selection that satisfies the data completeness quality goal, there
are change propagation frequencies that satisfy the given
data currency and source availability constraints.
Additionally, it is required:
(a) In case of a positive answer,
(a1) the change propagation frequencies that
guarantee the satisfaction of the currency and source
availability constraints, while minimizing the
15-3
view maintenance cost, and
(a2) the optimal way the queries are evaluated from
the materialized views (query evaluation plans),
and the optimal way the changes to the source
relations are propagated and applied to the affected
materialized views (change propagation plans).
(b) In case of a negative answer, the set of source relations
that cause the violation of a currency constraint.
We call this problem currency constraint satisfiability
problem.</p>
      <p>We provide a novel statement for the DW view
selection problem based on quality goals and source
availability constraints. In particular, the data currency
quality goal is expressed by a set of currency
constraints flexibly specified, on a per relation basis, at
the query level.</p>
      <p>We describe a DW system architecture over remote
autonomous sources that (a) supports the query
evaluation and view maintenance performance quality goal
(by evaluating queries exclusively from the
materialized views, and by exploiting self-maintainability
and auxiliary views), and (b) achieves the data
completeness quality goal (by appropriately materializing
views over the source relations at the DW), and the
data consistency quality goal (by the appropriate
selection of an update propagation process).</p>
      <p>We formally define the currency constraint
satisfiability problem using an AND/OR dag representation for
multiple queries and views. The multiquery AND/OR
dag representation allows to take into account
common subexpressions between queries and views and to
formally express query evaluation and change
propagation plans.</p>
      <p>In this framework, we present a high level approach
for solving the currency constraint satisfiability
problem. The approach proceeds by “pushing down”
currency constraints from the queries to the materialized
views along optimal query evaluation plans and by
“pushing up” source availability constraints from the
source relations to the materialized views along
optimal change propagation plans.</p>
      <p>Our approach allows also to determine the minimal
change propagation frequencies that satisfy the
constraints along with the optimal query evaluation and
change propagation plans.</p>
      <p>When the satisfaction of the data currency and source
availability constraints is not possible, we provide
the set of source relations that cause the violation of
source availability and currency constraints. This
information can guide the view selection algorithms in
providing alternative view selections that satisfy the
constraints, or can help the DW designer in finding
a solution to the view selection problem by
negotiating the relaxing of some currency constraints and/or
source availability constraints.</p>
      <p>The rest of the paper is organized as follows. Next
section reviews related work. Section 3 presents the
architecture of the data warehousing system, outlines change
propagation and defines data consistency. In Section 4 we
introduce multiquery AND/OR graphs, and provide initial
definitions. We also state formally the currency constraint
satisfability problem. The different steps of our approach
are presented in Section 4. In Section 5 we discuss
improving a selected view set in order to satisfy the constraints.
Finally, Section 6 contains concluding remarks and future
research directions.
2</p>
      <sec id="sec-4-1">
        <title>Related work</title>
        <p>
          Answering queries using views has been studied in many
papers, e.g. [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ]. In particular, this issue, in connection to
grouping/aggregation queries and views, has been studied
in [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] for set semantics, and in [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] for multiset semantics.
        </p>
        <p>
          Materialized view maintenance has been addressed in
recent years by a plethora of researchers. A number of
papers dealing with different aspects of materialized view
maintenance are cited in the introduction and in subsequent
sections. Different levels of data consistency of
materialized view maintenance processes in distributed
environments are discussed in [
          <xref ref-type="bibr" rid="ref18 ref49 ref50 ref51">49, 50, 18, 51</xref>
          ]. Data consistency
in this paper is defined similarly to that in [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ].
        </p>
        <p>
          View selection problems for Data Warehousing usually
follow the following pattern: select a set of views to
materialize in order to optimize the query evaluation cost, or
the view maintenance cost or a combination of both,
possibly in the presence of some constraints. In [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ] views
are seen as sets of pointer arrays and the goal is to
optimize the combined cost under a space constraint. [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] aims
at minimizing the query evaluation cost in the context of
aggregations and multidimensional analysis under a space
constraint. Given a materialized SQL view, [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ] presents
an exhaustive approach as well as heuristics for selecting
auxiliary views that minimize the total view maintenance
cost. Given an SPJ view, [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ] derives, using key and
referential integrity constraints, a set of auxiliary views, other
than the base relations, that eliminate the need to access
the base relations when maintaining both the initial and the
auxiliary views (i.e. that makes the views altogether
selfmaintainable). In [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] greedy algorithms are provided for
selecting views to materialize that minimize the query
evaluation cost under a space constraint. A solution for
selecting views that minimize the combined cost is given in [
          <xref ref-type="bibr" rid="ref47">47</xref>
          ].
15-4
V V ous state of is available for evaluating queries using .
V changes are applied to a materialized view , the
previQuery evaluation. The queries of the analysts are
evaluated locally without accessing the source databases. Thus,
this DW architecture satisfies the data completeness
quality goal which requires a complete rewriting of the input
queries over the materialized views. We consider that when
Therefore, the evaluation process is not delayed by the
application of the changes to the materialized views.
        </p>
        <p>We describe in this section the architecture of the DW
system on which our analysis is based. Among the goals of
this architecture is high query performance and low view
maintenance cost. Figure 1 illustrates the DW system
architecture. On the bottom of the diagram are shown the
remote source relations. The materialized views are kept at
the DW component. Some of these views may be defined
using other views. Knowledge workers and analysts,
depicted on the top of the diagram, address their queries to
the DW.</p>
        <p>Further, no currency constraints are considered, nor
examining alternative change propagation plans in order to
guarantee the “freshness” of the materialized views.
3</p>
      </sec>
      <sec id="sec-4-2">
        <title>DW system architecture and operation</title>
        <p>
          V lecting a materialized view from which is to be updated,
V adopted in [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ] where a materialized view is updated
V of the view . Further, a centralized environment based on
V V issued against and the currency of is unsatisfactory
V minimizing the average updating cost of per query by
set Q either at the end of a time period or when a query is
        </p>
        <p>None of the previous approaches requires the queries to
be answerable exclusively from the materialized views in a
non-trivial manner (that is without considering that the base
relations and the materialized views reside in the same site,
and without replicating all the base relation at the DW).</p>
        <p>
          This requirement is taken into account in [
          <xref ref-type="bibr" rid="ref41">41</xref>
          ] where the
problem of configuring a DW without space restrictions is
addressed for a class of select-join queries. This work is
extended in [
          <xref ref-type="bibr" rid="ref42">42</xref>
          ] in order to take into account space
restrictions, multiquery optimization over the maintenance
queries, and the use of auxiliary views when maintaining
other views. Another extension of [
          <xref ref-type="bibr" rid="ref41">41</xref>
          ] deals with the same
problem for a class of PSJ queries under space restrictions
[
          <xref ref-type="bibr" rid="ref40">40</xref>
          ]. The approach adopted in the last three papers is
tailored for the static DW design problem. An incremental
version of the DW design problem (dynamic DW design)
is addressed in [
          <xref ref-type="bibr" rid="ref43">43</xref>
          ].
        </p>
        <p>
          A variation of the DW design problem endeavoring to
select a set of views that minimizes the query evaluation
cost under a total maintenance cost constraint is adopted in
[
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. However, restricting the total view maintenance cost
does not provide any guaranty whatsoever for the currency
of the query answer data. Quite the contrary, reducing the
view maintenance cost may result in lower update
propagation frequencies, and therefore in stale query answer data.
        </p>
        <p>
          An analytical study of optimal refresh policies based
on parametrization of the average query response time and
the average cost for materialized view maintenance is
described in [
          <xref ref-type="bibr" rid="ref37">37</xref>
          ]. Yet, this analysis concern a single view
defined over relations of the same source in a centralized
environment without communications costs. Further, no
currency constraints are introduced.
        </p>
        <p>A periodical or at query time (hybrid) timing policy is
with respect to Q. An algorithm is provided that aims at
and the time period t. Currency constraints are associated
with the queries but they are different than those introduced
here since they refer to the currency of views from which
the query is answered, they do not characterize each
relation in the query, and they are used to trigger the updating
timestamping is assumed, while all the views are defined
over a single source relation.</p>
        <p>
          [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] characterizes the “freshness” of materialized views
in a mediator using time upper bounds required for the
different steps of the view maintenance process (e.g.
communication delay, maintenance query processing delay etc.).
        </p>
        <p>That mediator system architecture is different than ours
since queries can be answered also from the source
relations, an immediate maintenance timing policy is adopted,
and source relation changes are buffered at the mediator.</p>
        <p>Simple and auxiliary materialized views. The
materialized views that are used in the optimal query evaluation
plans of the queries are called simple views. The rest of the
materialized views are used for reducing the view
maintenance cost of the materialized views, and are called
auxiliary views. Simple views too can be used in the same
manner, yet they have to appear in the optimal query evaluation
plan of a query.</p>
        <p>
          Self-maintainability. For each source relation Ri there
is at the DW a materialized view Vi obtained by
applying selections and projections on Ri as much as possible
such that all the views in the DW can be completely
rewritten over the Vi’s. These views are called source relation
images. Select-project views are self-maintainable [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
        </p>
        <p>Other views defined over these views may also be
materialized at the DW. The role of the source relation images
is to guarantee the self-maintainability of all the
materialized views. A source relation image can be either simple or
auxiliary view.</p>
        <p>Type of sources and changes. We assume that the source
database systems are autonomous and can cooperate with
the DW in the following sense: each source database
system is able to keep in a buffer the tuples inserted to a source
relation and the tuples deleted from it (modifications are
modeled by deletions followed by insertions). A source is
aware of the view definition of the corresponding source
relation image. It can also filter the changes, compute the
net changes to be applied to the source relation image, and
15-5
Queries/Answers
...
...</p>
        <p>...
...</p>
        <p>DW
...
...</p>
        <p>...
...</p>
        <p>...</p>
        <p>...</p>
        <p>Auxiary</p>
        <p>Views
Source relation image
net changes
periodically transmit the changes to the DW.</p>
        <p>...
...</p>
        <p>...
...
...</p>
        <p>
          ...
Change filtering. Using the view definition of the
corresponding source relation image, a source filters out changes
that are irrelevant to the source relation images and projects
only relevant attributes. A source relation change is
irrelevant if it has no effect on the state of the source relation
image, independently on the source relation state [
          <xref ref-type="bibr" rid="ref1 ref25">1, 25</xref>
          ].
        </p>
        <p>Since the source relation images are select-project views,
selection conditions involve only attributes of the source
relation. Therefore, detecting irrelevant changes can be
performed efficiently by the sources. Alternatively, source
relation changes can be transmitted to and gathered in buffers
at the DW. This alternative does not significantly change
our approach, yet changes cannot be filtered by the data
sources. Therefore, the communication cost is increased
by the transmission of irrelevant changes and of useless
attribute values of tuples from the remote sources to the DW.</p>
        <p>V ing have been computed. Parts of the update transactions
V When the net changes of a source relation image
V applied to a materialized view until all the changes for
arrive at the DW, they are propagated to the
materialized views that are affected by these changes (that is the
views that include the source relation in their view
definition), according to a change propagation plan. Net changes
from different sources are transmitted to the DW
asynchronously. We assume that different changes from the
same source relation are received by the DW in the order
they are transmitted. Further, different changes are
propagated to the affected views in the order they are received
by the DW. The affected materialized views are maintained
incrementally. During the propagation of the changes from
a source relation to the materialized views, changes are not
all the other materialized views that are directly defined
usthat propagate changes from the source relation images to
the materialized views can be executed concurrently. Note
that employing recomputation of the materialized views
from scratch instead of an incremental view maintenance
image, using the source relation image view definition and
the changes to the source relation, and (c) it transmits the
source relation image changes to the DW.</p>
        <p>There are two advantages when the computation of the
source relation images is performed by the sources: (a)
processing at the DW is saved, and (b) the transmission
cost is reduced. Determining the change propagation
frequency for each source relation is an objective of this
paper. Each source relation frequency is upper bounded by
a given maximal frequency for that source as indicated by
the corresponding source availability constraint. The
maximal frequencies are determined by the time the source data
base can devote to supporting the maintenance of the DW.</p>
        <p>Net changes. In order to avoid wasteful insertions and
deletions (and data transmissions) when incrementally
maintaining a materialized view from other materialized
views in the DW (and a source relation image from the
source relations), we consider the changes actually inserted
to or deleted from each view (source relation). These
changes are called net changes. For example, if a tuple
is inserted and then deleted, it is not represented at all in
the net changes. In the following ‘changes’ refer to ‘net
changes’.</p>
        <p>Change propagation. Changes are propagated to the DW
views periodically. At the end of the change propagation
period for a source relation, the source database system
performs three tasks: (a) it reads the changes in the
corresponding buffer and flashes the content of the buffer, (b)
it computes the changes, to be applied to the source relation
15-6</p>
        <p>strategy does not affect our approach.
t reflects some anterior state of the source relations
(b) The state of each materialized view at the DW at time
(not necessarily the state of the source relations at a
single time since the changes from multiple
consecutive update transactions to the same source relation are
transmitted together to the DW, and change
transmissions from different sources is asynchronous).</p>
        <p>
          Data consistency. The DW system we presented above is
consistent. We define a DW system to be consistent if its
data satisfy the following properties [
          <xref ref-type="bibr" rid="ref18 ref51">18, 51</xref>
          ].
(a) If a number of changes is applied to the source
relations, and this activity has ceased, the state of the
materialized views at the DW eventually reflects the final
state of the source relations.
        </p>
        <p>R the state of at times t01 and t02, where t01 t02.</p>
        <p>(c) The states of a materialized view, defined using some</p>
        <p>source relation R, at times t1 and t2, t1 t2, reflects
We introduce in this section multiquery AND/OR dags. We
then use this notion to formally describe change
propagation to multiple views, to introduce time cost functions, and
finally, to state the currency constraint satisfability
problem.</p>
        <p>
          We assume relational queries and views possibly with
grouping/aggregation operations that have multiset (bag)
semantics. A multiset algebra allows incrementally
maintaining views that have the SQL multiset semantics [
          <xref ref-type="bibr" rid="ref28 ref9">9, 28</xref>
          ].
        </p>
        <p>
          Duplicate retention (or at least a replication count) is
essential if select-project views are to be self-maintainable
[
          <xref ref-type="bibr" rid="ref11 ref2">2, 11</xref>
          ]. We think that it is necessary to include
grouping/aggregation queries since they are extensively used in
        </p>
        <p>Data Warehousing applications.</p>
        <p>G E V, a multiexpression AND/OR dag for is an AND/OR
G E nodes of are view nodes labeled by expressions in (but
E not all the expressions in label necessarily root nodes).
e and represents alternative ways of evaluating (alternative
G the expressions in E. is not necessarily a rooted dag (that
G The sink nodes in are labeled exactly by the views in V.
e equivalent rewritings of over V), while the sink nodes are
e nodes of Ge are view nodes. The root node is labeled by
E Given a set of expressions defined over a set of views
in AND nodes and OR nodes. AND nodes are called
operation nodes and are labeled by operations while OR nodes
are called view nodes and are labeled by views. In the
following we may identify nodes with their labels. An
operation node has one or two outgoing edges to view nodes and
one incoming edge from a view node. Its meaning as an
AND node is that its parent view can be obtained by
applying the labeling operation to all its child views. A view
node has one or more outgoing edges (if any) to operation
nodes and one or more incoming edges (if any) from an
operation node. Its meaning as an OR node is that the labeling
view can be computed by applying any of the child
operations to their respective child views. The root node and sink
labeled by the views in V.
dag resulting by merging the expression AND/OR dags for
is it does not necessarily have a single root). All the root
In addition, view nodes in a (multi)expression AND/OR
dag can be marked. Marked nodes represent views
materialized at the DW.</p>
        <p>We can now define query and multiquery AND/OR dags.
4</p>
      </sec>
      <sec id="sec-4-3">
        <title>Multiquery AND/OR</title>
        <p>problem statement</p>
        <p>
          dags and formal
Alternative ways for evaluating a relational expression can
be compactly represented by an AND/OR dag [
          <xref ref-type="bibr" rid="ref14 ref32">32, 14</xref>
          ]. A
particular representation of AND/OR dags distinguishes
between AND nodes and OR nodes [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ] and has been
developed initially for performing cost-based query
optimization [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. We use here this representation for multiple
queries, extended with marked nodes to account for views
materialized at the DW [
          <xref ref-type="bibr" rid="ref43">43</xref>
          ].
        </p>
        <p>We start by defining expression and multiexpression
AND/OR dags.</p>
        <p>Q1
and Q2 be the SQL query:
R image. Note that a source relation node may coincide
R case is replicated at the DW.</p>
        <p>R or projections over in a query represented in G). In this
marked nodes are exactly the materialized views of the DW.</p>
        <p>
          In these multiquery AND/OR dags, all the paths from a
query node to a source relation contain a source relation
with its image node (for instance if there are no selections
Q AND/OR dag for a set of queries can be constructed by
Q rules is a query AND/OR dag for [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. A multiquery
Q query AND/OR dag for a query can be constructed by
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Construction of multiquery AND/OR dags. A multi</title>
      <p>applying transformation rules to an initial query dag. The
initial query dag represents the expression defining Q. The
transformation rules add new operation and view nodes and
link these nodes with existing nodes by adding new edges.</p>
      <p>
        The dag resulting by the application of the transformation
merging equivalent view nodes of the query AND/OR dags
for the queries in Q. [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ] presents transformation rules for
multiquery AND/OR dags using a different representation
scheme.
      </p>
      <p>Expression AND/OR dags is a general formalism that
captures many of the notions mentioned previously. In
particular they can represent views and materialized views
(source relation images, simple and auxiliary views),
complete rewritings of the queries over the materialized views
and/or the source relations, and common subexpressions
between (maintenance) queries, between views, and
between (maintenance) queries and views. Different types of
expression AND/OR dags will be used below to represent
query evaluation plans and change propagation plans.</p>
      <p>
        V5
are represented. Note that as shown in [
        <xref ref-type="bibr" rid="ref46">46</xref>
        ] in this case the
grouping/aggregation operator can be pushed past the join.
      </p>
      <p>Marked nodes (materialized views) are depicted by filled
black circles. Source relation R1 and R2 are depicted by
rectangles. Their images are the views V1 and V2
respectively. Remark that all the paths from query node Q1 or Q2
to source relation node R1 or R2 contain a source relation
image.
V6</p>
      <p>V2
G Consider a multiquery AND/OR dag for a set of queries</p>
      <p>Q. A query evaluation plan over materialized views is
represented by a query evaluation dag defined as follows.</p>
      <p>R V to are also affected by the changes to . We describe
the views that are affected by these changes. Recall that the
type of multiquery AND/OR dags we consider here implies
that the materialized views that are affected by the changes
this change propagation process below.</p>
    </sec>
    <sec id="sec-6">
      <title>4.4 Incremental view maintenance using change propagation dags</title>
      <p>
        The changes to the materialized views that are affected by
the changes to a source relation image can be computed
using the maintenance expressions provided in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] for a
multiset algebra and in [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ] for grouping/aggregation operators
under multiset semantics. We call these expressions
maintenance queries. Maintenance queries involve in general
the pre-update state of the source relation images (that is
the state prior to the application of the changes), the
preupdate state of the materialized view, and the changes to
the source relation image.
      </p>
      <p>The changes to the source relation images (which are
select-project views) are computed using maintenance
queries, exclusively from the changes to the source
relations (select-project views are self-maintainable when
multiset semantics are adopted).</p>
      <p>If the maintenance queries for different materialized
views that are affected by the changes to a source relation
Example 4.2 Consider the multiquery AND/OR of
example 4.1. Figure 4 shows two query evaluation dags for the
queries Q1 and Q2 respectively.</p>
      <p>A change propagation plan is represented by a change
propagation dag defined below.</p>
      <p>Clearly a change propagation dag is a connected graph.</p>
      <p>
        Example 4.3 Consider the multiquery AND/OR dag of
example 4.1. Figure 3 shows two different change
propagation dags for the source relation image V1. Figure 5 shows
two change propagation dags for the source relation image
V2. As this example makes clear, there can be more than
one change propagation dags for a source relation image in
a given multiquery AND/OR dag.
U needed in .
by joining its child nodes and thus its pre-update state is not
needed for the computation of its own changes [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
Therefore the computation of the pre-update state of V6 is not
V update state (if needed) of the parent view nodes of are
V not needed [
        <xref ref-type="bibr" rid="ref28 ref9">9, 28</xref>
        ]. For instance, if is a self-maintainable
V update state are needed: ’s changes can be computed
exR image of a source relation by considering a change
propV state of Vis is not needed. As another example, if is
obV computed. Thus the pre-update state of remains
availV The changes to a (marked or non-marked) view node
V on Vis, neither the pre-update state of any Vis nor ’s
preU agation dag for the image of R, and by proceeding in
V may need the pre-update state of for the computation of
V needed neither for the computation of the changes to nor
V The (pre-update state) of a non-marked view node
V and the pre-update state of are also needed for this
comV node(s) Vi. This computation is not necessary if is not
V clusively from the changes to Vis. If is marked
(materiV by any of its parent view nodes. A parent view node of
V pre-update state of and/or the pre-update state of Vis are
image have a common subexpression, we can use the
techniques of multiquery optimization to compute this
subexpresion only once. Further if a subexpression (that does not
include changes) of a maintenance query is a view already
materialized in the DW (other than a source relation
image), we can use this view as an auxiliary view in the
computation of the maintenance query. Therefore, the
computation of this subexpression from the source relation images
is avoided.
      </p>
      <p>
        We maintain the views affected by the changes to the
a bottom-up fashion. This way, we can take advantage of
the views materialized in the DW that are not affected by
the changes, while common subexpressions between
maintenance queries are computed only once.
that is affected by the changes to the source relation image
are computed from the changes to its child view node(s)
Vi. (A view node is affected by the changes to the source
relation image if it is an ancestor of the source relation
image sink node.) In general, the pre-update state of each Vi,
putation (V ’s pre-update state can of course be computed
from the pre-update state of Vis). In particular cases, the
view (with respect to the changes to Vis) the pre-update
tained by a selection or a projection or an additive union
alized), the computed changes are applied to it. However,
these changes are not applied until the changes and the
preable where needed.
is computed from the (pre-update state of its) child view
its own changes or (in case it is a non-marked view)
because its own pre-update state is needed by one of its
parents. By a top-down scan of the change propagation dag,
prior to the propagation of the changes, the non-marked
view nodes that need not be computed can be detected [
        <xref ref-type="bibr" rid="ref39">39</xref>
        ].
15-10
Q tion dag for Q. The minimal time tQ needed to compute
Q different time costs. The query evaluation dag for
yieldDifferent query evaluation dags for the same query yield
ing the minimal time cost is called optimal query
evaluais:
fQi tQi
R update propagation dag for the image of
source relation
operation node
view node
      </p>
      <p>R query defined using</p>
      <p>time needed to compute the parent view node of O</p>
      <p>FR
formula:
fRi tRi</p>
      <p>We can now define source availability constraints
Before defining currency constraints, we provide a
definition on answer data currency.
15-11
5.1</p>
    </sec>
    <sec id="sec-7">
      <title>Pushing currency constraints down to the simple views</title>
      <sec id="sec-7-1">
        <title>A solution to the problem</title>
        <p>We now present a solution to the currency constraint
satisfiability problem. Our approach proceeds in three steps.</p>
        <p>In the first step the optimal query evaluation dags and the
minimal query evaluation cost is determined, and the
currency constraints are “pushed down” from the queries to
the simple views. In the second step, the optimal change
propagation dags are determined and the source availability
constraints are “pushed up” from the source relations to the
simple views. The third step checks constraint satisfability,
and computes the minimal change propagation frequencies
and the minimal view maintenance cost.</p>
        <p>R from in the answer of Q.</p>
        <p>Q is returned to the user. Suppose that the most recent
R changes to a source relation that are propagated and
ap= define tRQ t1 t2. tRQ expresses the currency of the data
Q in the evaluation of are read at the source at time t2. We</p>
        <p>Definition 4.7 Let t1 be the time point (commit point of
the corresponding transaction) when the answer of a query
plied to the affected views and are taken in consideration
R Q in the answer of are “older” (less current). Currency
R</p>
        <p>Note that a higher value for tQ means that the data from
constraints are defined as follows.</p>
        <p>R imal currency required for the data from in the answer</p>
        <p>Definition 4.8 Let TRQ be a time value expressing the
minof Q. A currency constraint is an inequality of the form
tRQ TRQ.
We can now state formally the currency constraint
satisfiability problem.</p>
        <p>Q 2 Q R For every and every in the definition of Q,
fR FR.
a currency constraint tRQ TRQ.</p>
        <p>Time cost functions tO, tUO, tRO, tVR, tRR, and ttRr.</p>
        <p>G dags in for every query in Q. Then, the optimal one for
G (a) determining the simple views in (that is the
materialE ing the minimal query evaluation cost using formula (1).
In this step we first detect the alternative query evaluation
each query is chosen among them. This procedure allows:
ized views that occur in the optimal dags), and (b)
computRecall that, by definition, the only marked nodes occurring
in a query evaluation dag are sink nodes.</p>
        <p>G Example 5.1 Consider the multiquery AND/OR dag of
example 4.1. One can easily see that there is only one query
evaluation dag for Q1, and two query evaluation dags for
Q2 in G. Assuming, for the needs of this example, that
there are not indexes on the materialized views, performing
a natural join of V1 and V2 and then a selection is more time
consuming than performing a single natural join of V4 and
V2. Therefore, the optimal query evaluation dags for Q1
and Q2 are those depicted in Figure 4. The marked view
nodes V7, V3 and V4 are the simple views in G.</p>
        <p>M cost , and the optimal change propagation dags
P - The minimal query evaluation cost , and the
op</p>
        <p>Output:</p>
        <p>A decision on whether the constraints can be satisfied.</p>
        <p>If the constraints can be satisfied:
- The minimal change propagation frequencies</p>
        <p>that guarantee the satisfaction of the constraints
- The corresponding minimal view maintenance
for the images of the source relations in R.</p>
        <p>timal query evaluation dags for the queries in Q.</p>
        <p>If the constraints can not be satisfied:
- The source relations that cause the violation of</p>
        <p>the constraints.</p>
        <p>V7
V1</p>
        <p>V1</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Pushing source availability constraints up to the simple views</title>
      <p>In this step we first detect the alternative change
propagation dags for each source relation image in G. Then, the
optimal one for each source relation image is chosen among
them.
we are sure that the new state of each of these views is
available for querying only when the propagation of the changes
using this propagation dag has finished.</p>
      <p>V2
V Consider a simple view defined using source relation R.</p>
      <p>The source availability constraints are pushed from the
source relations up to the simple views along optimal
change propagation dags as follows:</p>
      <p>We define
R V from source relation in the simple view as set by the
LVR represents the maximal currency allowed for the data
source availability constraint associated with R. Note that
we do not fix a specific order for applying the changes to
the simple views in a change propagation dag. Therefore,
Example 5.2 Consider the multiquery AND/OR dag GV
shown in Figure 2. GV represents two change propagation
dags for each of the source relation images V1 (shown in
Figure 3) and V2 (shown in Figure 5). Assuming that there
are no indexes, it is reasonable to consider that
computing the changes to view node V7 from the changes to V1
and the pre-update state of V3 is less time consuming than
computing the changes to V6 from the changes to V1 and
the pre-update state of V2 and then the changes to V7 from
the changes to V6 and the pre-update state of V7. Note that
V3 has no more tuples than V2, and the computation of V6 is
not needed as shown in example 4.5. Therefore the change
propagation dag of Figure 3(a) is the optimal change
propagation dag for V1. By a similar argument, the change
propagation dag of Figure 5(b) is the optimal change propagation
dag for V2.
M min =
V3
V3
V such a view from the DW reduces the cost of
propagatV 0 adding the materialized view as an auxiliary view the
V contain . This reduction may be sufficient for satisfying a
V 0 definition of (besides source relation images). Then, by
V involving source relations in the definition of 0.
V = 1 V V 0 C1 (R1) 0, where is a complex view, is
materialized at the DW. Further, suppose that there are no
materialized views involving the source relations used in the
time cost of the optimal change propagation plan for R1 is
reduced. This addition may be sufficient for satisfying the
constraints that were previously violated. Note, however,
that this change may cause the violation of other constraints</p>
      <p>
        The selected view set can also be improved in order
to satisfy the constraints by removing redundant auxiliary
views. Consider for instance the case where in each optimal
change propagation dag, a specific auxiliary view is either
a root node, or does not appear at all in it. Such a view is
redundant: it is not used for answering the queries (since it is
not a simple view), and is not useful for reducing the view
maintenance cost of other materialized views (since it is a
root node in the optimal plans where it appears). Removing
ing changes along the optimal change propagation dags that
constraints that was previously violated. In [
        <xref ref-type="bibr" rid="ref39">39</xref>
        ] we provide
an approach for detecting redundant views in a DW. Even
when the constraints are satisfied, by removing redundant
views, the change propagation frequencies (and therefore
the view maintenance cost) can be further reduced.
      </p>
      <p>If the time cost of the change propagation dags cannot
be further reduced, the DW designer can opt for hardware
improvements (e.g. faster links between the sources that
violate the constraints and the DW). As a final solution, he
can negotiate with the source administrators and the
analysts the relaxing of the source availability and/or the
currency constraints.</p>
      <p>
        Our approach can be combined with a view selection
algorithm. In [
        <xref ref-type="bibr" rid="ref40 ref41 ref42">41, 42, 40</xref>
        ] such algorithms are provided
that generate alternative view sets satisfying the data
completeness quality goal and choose the one that minimizes
a combination of the query evaluation and view
maintenance cost. Using our results, the generated view sets can
be checked, instead, for satisfaction of the quality goals.
      </p>
      <p>This suggests for a quality driven DW design. Note finally
that the detection of violated constraints and of useless
auxiliary views can guide the devise of heuristics. The later are
necessary for pruning the search space of alternative view
selections which can be very large.
7</p>
      <sec id="sec-8-1">
        <title>Conclusion and possible extensions</title>
        <p>Up to now research work on DW design issues is restricted
to quantitatively selecting view sets for materialization in
a DW. However the design of a DW is subject to a
number of quality factors. In this paper we have presented a
novel statement for the DW view selection problem based
on the satisfaction of performance, data consistency, data
completeness, and data currency quality goals. The data
currency quality goal is specified by a number of detailed
currency constraints at the query level while availability
constraints at the data source level are also taken into
account. We have described a DW system architecture that
supports the performance quality goal and satisfies the data
consistency and completeness quality goals. In this
framework we have addressed the problem of checking whether
a view selection satisfies the given constraints. We have
presented an approach for solving this problem that uses
an AND/OR dag representation for multiple queries and
views. Our approach allows also determining the change
propagation frequencies that minimizes the view
maintenance time cost and computes the optimal change
propagation and query evaluation plans. More importantly, it can
help the DW designer in the improvement of the selected
view set, and can cooperate with DW design algorithms to
generate a view set that satisfies the quality goals.</p>
        <p>Future work includes the integration of our approach
with DW view selection algorithms and the experimental
validation of an automatic quality-oriented DW design
process.
15-14
15-15</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Blakeley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Coburn</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P. A.</given-names>
            <surname>Larson</surname>
          </string-name>
          .
          <article-title>Updating derived relations: detecting irrelevant and autonomously computable updates</article-title>
          .
          <source>ACM Transactions on Database Systems</source>
          ,
          <volume>14</volume>
          (
          <issue>3</issue>
          ):
          <fpage>369</fpage>
          -
          <lpage>400</lpage>
          ,
          <year>1989</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Blakeley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Larson</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F. W.</given-names>
            <surname>Tompa</surname>
          </string-name>
          .
          <article-title>Efficiently updating materialized views</article-title>
          .
          <source>In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data</source>
          , pages
          <fpage>61</fpage>
          -
          <lpage>71</lpage>
          ,
          <year>1986</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Ceri</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Widom</surname>
          </string-name>
          .
          <article-title>Deriving production rules for incremental view maintenance</article-title>
          .
          <source>In Proc. of the 20th Intl. Conf. on Very Large Data Bases</source>
          , pages
          <fpage>577</fpage>
          -
          <lpage>589</lpage>
          ,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Chaudhuri</surname>
          </string-name>
          and
          <string-name>
            <given-names>U.</given-names>
            <surname>Dayal</surname>
          </string-name>
          .
          <article-title>An Overview of Data Warehousing and OLAP Technology</article-title>
          .
          <source>SIGMOD Record</source>
          ,
          <volume>26</volume>
          (
          <issue>1</issue>
          ):
          <fpage>65</fpage>
          -
          <lpage>74</lpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>L.</given-names>
            <surname>Colby</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Griffin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Libkin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. S.</given-names>
            <surname>Mumick</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Trickey</surname>
          </string-name>
          .
          <article-title>Algorithms for Deferred View Maintenance</article-title>
          .
          <source>In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data</source>
          , pages
          <fpage>469</fpage>
          -
          <lpage>480</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L.</given-names>
            <surname>Colby</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kawagushi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lieuwen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. S.</given-names>
            <surname>Mumick</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Ross. Supporting Multiple View Maintenance</surname>
          </string-name>
          <article-title>Policies</article-title>
          .
          <source>In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data</source>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Dar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. V.</given-names>
            <surname>Jagadish</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Levy</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          .
          <article-title>Answering SQL Queries with Aggregation using Views</article-title>
          .
          <source>In Proc. of the 22nd Intl. Conf. on Very Large Data Bases</source>
          , pages
          <fpage>318</fpage>
          -
          <lpage>329</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>G.</given-names>
            <surname>Graefe</surname>
          </string-name>
          and
          <string-name>
            <given-names>W. J.</given-names>
            <surname>McKenna</surname>
          </string-name>
          .
          <article-title>The Volcano Optimizer Generator: Extensibility and Efficient Search</article-title>
          .
          <source>In Proc. of the 9th Intl. Conf. on Data Engineering</source>
          , pages
          <fpage>209</fpage>
          -
          <lpage>217</lpage>
          ,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T.</given-names>
            <surname>Griffin</surname>
          </string-name>
          and
          <string-name>
            <given-names>L.</given-names>
            <surname>Libkin</surname>
          </string-name>
          .
          <article-title>Incremental maintenance of views with duplicates</article-title>
          .
          <source>In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data</source>
          , pages
          <fpage>328</fpage>
          -
          <lpage>339</lpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Harinarayan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Quass</surname>
          </string-name>
          .
          <article-title>Aggregate-Query Processing in Data Warehousing Environments</article-title>
          .
          <source>In Proc. of the 21st Intl. Conf. on Very Large Data Bases</source>
          , pages
          <fpage>358</fpage>
          -
          <lpage>369</lpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Jagadish</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and I. S.</given-names>
            <surname>Mumick</surname>
          </string-name>
          .
          <article-title>Data Integration using Self-Maintainable Views</article-title>
          .
          <source>In Proc of the 5th EDBT Conf.</source>
          , pages
          <fpage>140</fpage>
          -
          <lpage>144</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Mumick</surname>
          </string-name>
          , and
          <string-name>
            <given-names>V.</given-names>
            <surname>Subrahmanian</surname>
          </string-name>
          .
          <article-title>Maintaining views incrementally</article-title>
          .
          <source>In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data</source>
          , pages
          <fpage>157</fpage>
          -
          <lpage>166</lpage>
          ,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A.</given-names>
            <surname>Gupta</surname>
          </string-name>
          and
          <string-name>
            <given-names>I. S.</given-names>
            <surname>Mumick</surname>
          </string-name>
          .
          <article-title>Maintenance of materialized views: Problems, techniques and applications</article-title>
          .
          <source>Data Engineering</source>
          ,
          <volume>18</volume>
          (
          <issue>2</issue>
          ):
          <fpage>3</fpage>
          -
          <lpage>18</lpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>H.</given-names>
            <surname>Gupta</surname>
          </string-name>
          .
          <article-title>Selection of Views to Materialize in a Data Warehouse</article-title>
          .
          <source>In Proc. of the 6th Intl. Conf. on Database Theory</source>
          , pages
          <fpage>98</fpage>
          -
          <lpage>112</lpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>H.</given-names>
            <surname>Gupta</surname>
          </string-name>
          and
          <string-name>
            <given-names>I. S.</given-names>
            <surname>Mumick</surname>
          </string-name>
          .
          <article-title>Selection of Views to Materialize Under a Maintenance Cost Constraint</article-title>
          .
          <source>In Proc. of the 7th Intl. Conf. on Database Theory</source>
          , pages
          <fpage>453</fpage>
          -
          <lpage>470</lpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>E. Hanson.</surname>
          </string-name>
          <article-title>A Performance Analysis of View Materialization Strategy</article-title>
          .
          <source>In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data</source>
          , pages
          <fpage>440</fpage>
          -
          <lpage>453</lpage>
          ,
          <year>1987</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>V.</given-names>
            <surname>Harinarayan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rajaraman</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J. D.</given-names>
            <surname>Ullman</surname>
          </string-name>
          .
          <article-title>Implementing Data Cubes Efficiently</article-title>
          .
          <source>In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data</source>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>R.</given-names>
            <surname>Hull</surname>
          </string-name>
          and
          <string-name>
            <given-names>G.</given-names>
            <surname>Zhou</surname>
          </string-name>
          .
          <article-title>A Framework for Supporting Data Integration Using the Materialized and Virtual Approaches</article-title>
          .
          <source>In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data</source>
          , pages
          <fpage>481</fpage>
          -
          <lpage>492</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>M.</given-names>
            <surname>Jarke</surname>
          </string-name>
          .
          <article-title>Common Subexpression Isolation in Multiple Query Optimization</article-title>
          . In Kim, Reiner, and Batory, editors,
          <source>Query Processing in DB Systems</source>
          , pages
          <fpage>191</fpage>
          -
          <lpage>205</lpage>
          . SpringerVerlag,
          <year>1984</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>M.</given-names>
            <surname>Jarke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jeusfeld</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Quix</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Vassiliades</surname>
          </string-name>
          .
          <article-title>Architecture and Quality in Data Warehouses</article-title>
          .
          <source>In Proc. of the 10th Intl. Conf. on Advanced Information Systems Engineering</source>
          , pages
          <fpage>93</fpage>
          -
          <lpage>113</lpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>M.</given-names>
            <surname>Jarke</surname>
          </string-name>
          and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Vassiliou. Data Warehouse</surname>
          </string-name>
          <article-title>Quality: A Review of the DWQ Project</article-title>
          .
          <source>In Proc. of the 2nd Intl. Conf. on Information Quality Cambridge</source>
          , Mass., pages
          <fpage>98</fpage>
          -
          <lpage>112</lpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>M.</given-names>
            <surname>Jeusfeld</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Quix</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Jarke</surname>
          </string-name>
          .
          <article-title>Design and Analysis of Quality Information for Data Warehouses</article-title>
          .
          <source>In Proc. of the 17th Intl. Conf. on Conceptual Modeling</source>
          , Springer LNCS 1507,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>W. J.</given-names>
            <surname>Labio</surname>
          </string-name>
          and
          <string-name>
            <given-names>H.</given-names>
            <surname>Garcia-Molina. Efficient Snapshot</surname>
          </string-name>
          <article-title>Differential Algorithms for Data Warehousing</article-title>
          .
          <source>In Proc. of the 22nd Intl. Conf. on Very Large Data Bases</source>
          , pages
          <fpage>63</fpage>
          -
          <lpage>74</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>A.</given-names>
            <surname>Levy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. O.</given-names>
            <surname>Mendelson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sagiv</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          .
          <article-title>Answering Queries using Views</article-title>
          .
          <source>In Proc. of the ACM Symp. on Principles of Database Systems</source>
          , pages
          <fpage>95</fpage>
          -
          <lpage>104</lpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Levy</surname>
          </string-name>
          and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sagiv</surname>
          </string-name>
          .
          <article-title>Queries Independent of Updates</article-title>
          .
          <source>In Proc. of the 19th Intl. Conf. on Very Large Data Bases</source>
          , pages
          <fpage>171</fpage>
          -
          <lpage>181</lpage>
          ,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>F.</given-names>
            <surname>Llirbat</surname>
          </string-name>
          , E. Simon, and
          <string-name>
            <given-names>D.</given-names>
            <surname>Tombroff</surname>
          </string-name>
          .
          <article-title>Using Versions in Update Transactions</article-title>
          .
          <source>In Proc. of the 23rd Intl. Conf. on Very Large Data Bases</source>
          , pages
          <fpage>96</fpage>
          -
          <lpage>105</lpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>X.</given-names>
            <surname>Qian</surname>
          </string-name>
          and
          <string-name>
            <given-names>G.</given-names>
            <surname>Wiederhold</surname>
          </string-name>
          .
          <article-title>Incremental Recomputation of Active Relational Expressions</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          ,
          <volume>3</volume>
          (
          <issue>3</issue>
          ):
          <fpage>439</fpage>
          -
          <lpage>450</lpage>
          ,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>D.</given-names>
            <surname>Quass</surname>
          </string-name>
          .
          <article-title>Maintenance Expressions for Views with Aggregation</article-title>
          .
          <source>In Workshop on Materialized Views: Techniques and Applications</source>
          , pages
          <fpage>110</fpage>
          -
          <lpage>118</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>D.</given-names>
            <surname>Quass</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. S.</given-names>
            <surname>Mumick</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Widom</surname>
          </string-name>
          .
          <article-title>Making Views Self Maintainable for Data Warehousing</article-title>
          .
          <source>In Proc. of the 4th Intl. Conf. on Parallel and Distributed Information Systems</source>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>K. A.</given-names>
            <surname>Ross</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Sudarshan</surname>
          </string-name>
          .
          <article-title>Materialized View Maintenance and Integrity Constraint Checking: Trading Space for Time</article-title>
          .
          <source>In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data</source>
          , pages
          <fpage>447</fpage>
          -
          <lpage>458</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>N.</given-names>
            <surname>Roussopoulos</surname>
          </string-name>
          .
          <article-title>The Logical Access Path Schema of a Database</article-title>
          .
          <source>IEEE Transactions on Software Engineering</source>
          ,
          <volume>8</volume>
          (
          <issue>2</issue>
          ):
          <fpage>563</fpage>
          -
          <lpage>573</lpage>
          ,
          <year>1982</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>N.</given-names>
            <surname>Roussopoulos</surname>
          </string-name>
          .
          <article-title>View Indexing in Relational Databases</article-title>
          .
          <source>ACM Transactions on Database Systems</source>
          ,
          <volume>7</volume>
          (
          <issue>2</issue>
          ):
          <fpage>258</fpage>
          -
          <lpage>290</lpage>
          ,
          <year>1982</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>A.</given-names>
            <surname>Segev</surname>
          </string-name>
          and
          <string-name>
            <given-names>W.</given-names>
            <surname>Fang</surname>
          </string-name>
          .
          <article-title>Currency-based updates to distributed materialized views</article-title>
          .
          <source>In Proc. of the 6th Intl. Conf. on Data Engineering</source>
          , pages
          <fpage>512</fpage>
          -
          <lpage>520</lpage>
          ,
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>A.</given-names>
            <surname>Segev</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Park</surname>
          </string-name>
          .
          <article-title>Updating distributed materialized views</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          ,
          <volume>1</volume>
          (
          <issue>2</issue>
          ):
          <fpage>173</fpage>
          -
          <lpage>184</lpage>
          ,
          <year>1989</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>T. K.</given-names>
            <surname>Sellis. Multiple Query</surname>
          </string-name>
          <article-title>Optimization</article-title>
          .
          <source>ACM Transactions on Database Systems</source>
          ,
          <volume>13</volume>
          (
          <issue>1</issue>
          ):
          <fpage>23</fpage>
          -
          <lpage>52</lpage>
          ,
          <year>1988</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>K.</given-names>
            <surname>Shim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. K.</given-names>
            <surname>Sellis</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Nau</surname>
          </string-name>
          .
          <article-title>Improvements on a Heuristic Algorithm for Multiple-Query Optimization</article-title>
          .
          <source>Data &amp; Knowledge Engineering</source>
          ,
          <volume>12</volume>
          :
          <fpage>197</fpage>
          -
          <lpage>222</lpage>
          ,
          <year>1994</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>J.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Rotem</surname>
          </string-name>
          .
          <article-title>Analytical Modeling of Materialized View Maintenance</article-title>
          .
          <source>In Proc. of the 5th ACM Symp. on Principles of Database Systems</source>
          , pages
          <fpage>126</fpage>
          -
          <lpage>134</lpage>
          ,
          <year>1988</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>M.</given-names>
            <surname>Staudt</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Jarke</surname>
          </string-name>
          .
          <article-title>Incremental Maintenance of Externally Materialized Views</article-title>
          .
          <source>In Proc. of the 22nd Intl. Conf. on Very Large Data Bases</source>
          , pages
          <fpage>75</fpage>
          -
          <lpage>86</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>D.</given-names>
            <surname>Theodoratos</surname>
          </string-name>
          .
          <article-title>Detecting Redundnacy in Data Warehouse Design</article-title>
          .
          <source>Technical Report, Knowledge and Data Base Systems Laboratory</source>
          , Electrical and Computer Engineering Dept., National Technical University of Athens, pages
          <fpage>1</fpage>
          -
          <lpage>20</lpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>D.</given-names>
            <surname>Theodoratos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ligoudistianos</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Sellis</surname>
          </string-name>
          .
          <article-title>Designing the Global Data Warehouse with SPJ Views</article-title>
          . To appear
          <source>in Proc. of the 11th Intl. Conf. on Advanced Information Systems Engineering</source>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [41]
          <string-name>
            <given-names>D.</given-names>
            <surname>Theodoratos</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Sellis</surname>
          </string-name>
          .
          <article-title>Data Warehouse Configuration</article-title>
          .
          <source>In Proc. of the 23rd Intl. Conf. on Very Large Data Bases</source>
          , pages
          <fpage>126</fpage>
          -
          <lpage>135</lpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          [42]
          <string-name>
            <given-names>D.</given-names>
            <surname>Theodoratos</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Sellis</surname>
          </string-name>
          .
          <article-title>Data Warehouse Schema and Instance Design</article-title>
          .
          <source>In Proc. of the 17th Intl. Conf. on Conceptual Modeling</source>
          , Springer LNCS 1507, pages
          <fpage>363</fpage>
          -
          <lpage>376</lpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [43]
          <string-name>
            <given-names>D.</given-names>
            <surname>Theodoratos</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Sellis</surname>
          </string-name>
          .
          <article-title>Dynamic Data Warehouse Design</article-title>
          . To appear
          <source>in Proc. of the 1st Intl. Conf. on Data Warehousing and Knowledge Discovery</source>
          , Springer-Verlag, LNCS,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          [44]
          <string-name>
            <given-names>J.</given-names>
            <surname>Widom</surname>
          </string-name>
          . Research Problems in Data Warehousing.
          <source>In Proc. of the 4th Intl. Conf. on Information and Knowledge Management</source>
          , pages
          <fpage>25</fpage>
          -
          <lpage>30</lpage>
          , Nov.
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          [45]
          <string-name>
            <given-names>J.</given-names>
            <surname>Wiener</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Labio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhuge</surname>
          </string-name>
          , H. GarciaMollina, and
          <string-name>
            <given-names>J.</given-names>
            <surname>Widom</surname>
          </string-name>
          .
          <article-title>A System Prototype for Warehouse View Maintenance</article-title>
          .
          <source>In Workshop on Materialized Views: Techniques and Applications</source>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          [46]
          <string-name>
            <given-names>W.</given-names>
            <surname>Yan</surname>
          </string-name>
          and P.- A˚. Larson. Performing Group-
          <article-title>By before Join</article-title>
          .
          <source>In Proc. of the 10th Intl. Conf. on Data Engineering</source>
          , pages
          <fpage>89</fpage>
          -
          <lpage>100</lpage>
          ,
          <year>1994</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          [47]
          <string-name>
            <given-names>J.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Karlapalem</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Q.</given-names>
            <surname>Li</surname>
          </string-name>
          .
          <article-title>Algorithms for Materialized View Design in Data Warehousing Environment</article-title>
          .
          <source>In Proc. of the 23rd Intl. Conf. on Very Large Data Bases</source>
          , pages
          <fpage>136</fpage>
          -
          <lpage>145</lpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          [48]
          <string-name>
            <given-names>G.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Hull</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>King</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and J.-C.</given-names>
            <surname>Franchitti</surname>
          </string-name>
          .
          <article-title>Supporting Data Integration and Warehousing Using H20</article-title>
          .
          <source>Data Engineering</source>
          ,
          <volume>18</volume>
          (
          <issue>2</issue>
          ):
          <fpage>29</fpage>
          -
          <lpage>40</lpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          [49]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhuge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Garcia-Molina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hammer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Widom</surname>
          </string-name>
          .
          <article-title>View Maintenance in a Warehousing Environment</article-title>
          .
          <source>In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data</source>
          , pages
          <fpage>316</fpage>
          -
          <lpage>327</lpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref50">
        <mixed-citation>
          [50]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhuge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Garcia-Molina</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>J.</given-names>
            <surname>Wiener</surname>
          </string-name>
          .
          <article-title>The Strobe Algorithms for Multi-Source Warehouse Consistency</article-title>
          .
          <source>In Proc. of the 4th Intl. Conf. on Parallel and Distributed Information Systems</source>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref51">
        <mixed-citation>
          [51]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhuge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wiener</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Garcia-Molina</surname>
          </string-name>
          .
          <article-title>Multiple View Consistency for Data Warehousing</article-title>
          .
          <source>In Proc. of the 13th Intl. Conf. on Data Engineering</source>
          , pages
          <fpage>289</fpage>
          -
          <lpage>300</lpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>