<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>PhD Workshop, August</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Symmetric and Asymmetric Aggregate Function in Massively Parallel Computing</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>ZHANG Chao Supervised by  Farouk Toumani</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Emmanuel GANGLER</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>LIMOS</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Universit e´ Clermont Auvergne</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Aubie` re</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>France</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>zhangch</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>ftoumanig@isima.fr</string-name>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <volume>28</volume>
      <issue>2017</issue>
      <fpage>2</fpage>
      <lpage>5</lpage>
      <abstract>
        <p>Applications of aggregation for information summary have great meanings in various elds. In big data era, processing aggregate function in parallel is drawing researchers' attention. The aim of our work is to propose a generic framework enabling to map an arbitrary aggregation into a generic algorithm and identify when it can be e ciently executed on modern large-scale data-processing systems. We describe our preliminary results regarding classes of symmetric and asymmetric aggregation that can be mapped, in a systematic way, into e cient MapReduce-style algorithms.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>
        The ability to summarize information is drawing
increasing attention for information analysis [
        <xref ref-type="bibr" rid="ref11 ref6">11, 6</xref>
        ].
Simultaneously, under the progress of data explosive growth
processing aggregate function has to experience a transition to
massively distributed and parallel platforms, e.g. Hadoop
MapReduce, Spark, Flink etc. Therefore aggregation
function requires a decomposition approach in order to execute
in parallel due to its inherent property of taking several
values as input and generating a single value based on certain
criteria. Decomposable aggregation function can be
processed in a way that computing partial aggregation and then
merging them at last to obtain nal results.
      </p>
      <p>
        Decomposition of aggregation function is a long-standing
research problem due to its bene ts in various elds. In
distributed computing platforms, decomposability of
aggregate function can push aggregation before shu e phase [
        <xref ref-type="bibr" rid="ref17 ref3">17,
3</xref>
        ]. This is usually called initial reduce, with which the
size of data transmission on a network can be substantially
reduced. For wireless sensor network, the need to reduce
data transmission is more necessary because of limitation of
power supply [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. In online analytical processing (OLAP),
decomposability of aggregate function enables aggregation
across multi-dimensions, such that aggregate queries can be
executed on pre-computation results instead of base data to
accelerate query answering [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. An important point of query
optimization in relational databases is to reduce table size
for join [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], and decomposable aggregation brings interests
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        When an arbitrary aggregation function is decomposable,
how to decompose it and when a decomposition is 'e cient'
is a hard nut to crack. Previous works identify interesting
properties for decomposing aggregation. A very relevant
classi cation of aggregation functions, introduced in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], is
based on the size of sub-aggregation (i.e., partial
aggregation). This classi cation distinguishes between distributive
and algebraic aggregation having sub-aggregation with xed
sizes, and holistic functions where there is no constant bound
on the storage size of sub-aggregation. Some algebraic
properties, such as associativity and commutativity, are
identied as su cient conditions for decomposing aggregation [
        <xref ref-type="bibr" rid="ref17 ref3">17,
3</xref>
        ]. Compared to these works, our work provides a generic
framework to identify the decomposability of any symmetric
aggregation and generate generic algorithms to process it in
parallel. Moreover, all but few researches in the literature
consider symmetric functions. Asymmetric aggregation is
inherently non-commutative functions and this makes their
processing in parallel and distributed environment far from
being easy. In [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], a symbolic parallel engine (SYMPLE) is
proposed in order to automatically parallelize User De ned
Aggregations (UDAs) that are not necessarily commutative.
Although interesting, the proposed framework lacks
guarantees for e ciency and accuracy in the sense that it is up to
users to encode a function as SYMPLE UDA. Moreover,
symbolic execution may have path explosion problem.
      </p>
      <p>
        My research focuses on designing generic framework that
enables to map symmetric and asymmetric aggregation
functions into e cient massively parallel algorithms. To achieve
this goal, we rstly identify a computation model, and an
associated cost model to design and evaluate parallel
algorithms. We consider MapReduce-style (M R) framework and
use the M RC [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] cost model to de ne 'e cient' M R
algorithms. We rest on the notion of well-formed aggregation [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]
as a canonical form to write symmetric aggregation and
provide a simple and systematic way to map well-formed
aggregation function into an MR algorithm, noted by M R( ).
Moreover, we provide reducible properties to identify when
the generated M R( ) is e cient (when M R( ) is an M RC
algorithm). Then we extend our framework to a class of
asymmetric aggregation function, position-based
aggregation, and propose extractable property to have generic M RC
algorithms. Our main results are Theorem 1 and Theorem
2, of which proofs are provided in an extended report[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
2.
      </p>
    </sec>
    <sec id="sec-2">
      <title>M RC ALGORITHM</title>
      <p>
        Several research works concentrate on the complexity of
parallel algorithms. M U D[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] algorithm was proposed to
transform symmetric streaming algorithms to parallel
algorithms with nice bounds in terms of communication and
space complexity, but without any bound on time
complexity. This disquali es M U D as a possible candidate cost
model to be used in our context. M RC[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] is another
popular model that has been used to evaluate whether a
MapeReduce algorithm is e cient. The constraints enforced by
M RC w.r.t. total input data size can be summarized as
following: sublinear number of total computing nodes,
sublinear space for any mapper or reducer, polynomial time for
any mapper or reducer, and logarithm round number. We
illustrate these constraints besides round number in a
simpli ed MapReduce owchart in gure 1 where &gt; 0.
      </p>
      <p>Hence, the M RC model considers necessary parameters
for parallel computing, communication time, computation
space and computing time, and makes more realistic
assumptions. A MapReduce algorithm satisfying these
constraints is considered as an e cient parallel algorithm and
will be called hereafter an M RC algorithm.</p>
    </sec>
    <sec id="sec-3">
      <title>SYMMETRIC AGGREGATION WITH M RC</title>
      <p>
        Let I be a doamin, an n-ary aggregation is a function[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]:
In ! I. is symmetric or commutative[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] if (X) =
( (X)) for any X 2 I and any permutation , where
(X) = (x (1); :::; x (n)). Symmetric aggregation result does
not depend on the order of input data, therefore input is
considered as a multiset. In this section, we de ne a generic
framework to map symmetric aggregation into an M RC
algorithm.
3.1
      </p>
    </sec>
    <sec id="sec-4">
      <title>A Generic Form for Symmetric Aggregation</title>
      <p>
        To de ne our generic aggregation framework, we rest on
the notion of well-formed aggregation [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. A symmetric
aggregation de ned on a multiset X = fd1; : : : ; dng can be
written in well-formed aggregation as following:
(X) = T (F (d1) : : :
      </p>
      <p>F (dn));
where F is translating function(tuple at a time), is a
commutative and associative binary operation, and T is
terminating function. For instance, average can be easily
transformed into well-formed aggregation: F (d) = (d; 1); (d; k)
d
(d0; k0) = (d + d0; k + k0) and T ((d; n)) = . In fact, any
n
symmetric aggregation can be rewritten into well-formed
aggregation with a exible choice of , e.g = [.</p>
      <p>Well-formed aggregation provides a generic plan for
processing aggregate function in distributed architecture based
on the associative and commutative property of :
processing F and at mapper, and T at reducer. Table
1 depicts the corresponding generic MapReduce(MR)
algorithm(the case of one key and trivially extending to any
number of keys), noted by M R( ), where mapper input is
a submultiset Xi of X and mapper output is oi, and P is
the concatenation of .</p>
      <p>However, the obtained M R( ) are not necessarily an e
cient MapReduce algorithm. We identify when M R( ) is a
M RC algorithm using reducibility property.</p>
      <p>De nition 1. A symmetric aggregation function de ned
on domain I is reducible if the well-formed aggregation (F;
; T ) of satis es
8di; dj 2 I : jF (di)</p>
      <p>F (dj)j= O(1):</p>
      <p>With this reducible property, we provide a theorem
identifying when M R( ) of a symmetric aggregation is a M RC
algorithm.</p>
      <p>Theorem 1. Let be a symmetric well-formed
aggregation and M R( ) be the generic algorithm for , then M R( )
is an MRC algorithm if and only if is reducible.
3.2</p>
    </sec>
    <sec id="sec-5">
      <title>Deriving MRC Algorithm from Algebraic</title>
    </sec>
    <sec id="sec-6">
      <title>Properties</title>
      <p>In this section, we investigate several symmetric
aggregation properties satisfying Theorem 1. If an aggregation
is in one of the following classes, then has an M RC( )
algorithm illustrated in table 1.</p>
      <p>
        An aggregate function is associative [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] if for
multiset X = X1 [ X2, (X) = ( (X1); (X2)) : Associative
and symmetric aggregation function can be transformed
in well-formed aggregation (F; ; T ) as following,
F = ; ;
      </p>
      <p>= ; T = id
where id denotes identity function. is reducible because
it is an aggregation. Therefore M R( ) of associative and
symmetric aggregation is an M RC algorithm.</p>
      <p>
        An aggregation is distributive [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] if there exists a
combining function C such that (X; Y ) = C( (X); (Y )).
Distributive and symmetric aggregation can be rewritten in
well-formed aggregation (F; ; T ) as following,
      </p>
      <p>F = ;</p>
      <p>= C; T = id:
Similarly, is reducible and corresponding M R( ) is an
M RC algorithm.</p>
      <p>
        Another kind of aggregate function having the same
behavior as symmetric and distributive aggregation is
commutative semigroup aggregate function [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. An
aggregation is in this class if there exists a commutative
semigroup (H; ), such that (X) = Nxi2X (xi). The
corresponding well-formed aggregation (F; ; T ) is illustrated as
following,
      </p>
      <p>F = ;
= ; T = id:
(1)
(2)
(3)
It is clearly that is reducible and M R( ) is an M RC
algorithm.</p>
      <p>
        A more general property than commutative semi-group
aggregation is symmetric and preassociative aggregate
function. An aggregation is preassociative [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] if it
satises (Y ) = (Y 0) =) (XY Z) = (XY 0Z):
According to [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], some symmetric and preassociative(unarily
quasi-range-idempotent and continuous) aggregation
functions can be constructed as (X) = Pin=1 '(xi) ; n 1;
where and ' are continuous and strictly monotonic
function. For instance, (X) = Pin=1 2 xi, where = id and
'(xi) = 2 xi. The well-formed aggregation (F; ; T ) for this
kind of preassociative aggregation is illustrated as following
F = ';
= +; T = :
(4)
The corresponding M R( ) is also an M RC algorithm.
      </p>
      <p>
        An aggregate function is barycentrically associative [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]
if it satis es (XY Z) = (X (Y )jY jZ), where jY j denotes
the number of elements contained in multiset Y and (Y )jY j
denotes jY j occurrences of (Y ). A well-known class of
symmetric and barycentrically associative aggregation is
quasiarithmetic mean : (X) = f 1 n1 Pin=1 f (xi) ; n 1;
where f is an unary function and f 1 is a quasi-inverse of f .
With di erent choices of f , can be di erent kinds of mean
functions, e.g arithmetic mean, quadratic mean, harmonic
mean etc. It is trivial to rewrite this kind of aggregation into
well-formed aggregation (F; ; T ) and the M R( ) is also an
M RC algorithm,
      </p>
      <p>F = (f; 1);
= (+; +); T = f 1(</p>
      <p>Pn
i=1 f (xi) ):
n
(5)</p>
    </sec>
    <sec id="sec-7">
      <title>ASYMMETRIC AGGREGATION</title>
      <p>
        Many commonly used aggregation function is
symmetric(commutative) such that the order of input data can be
ignored, while asymmetric aggregation considers the order.
Two common asymmetric cases could be weighted
aggregation and cumulative aggregation, where aggregated result
will be changed if data order is changed, e.g. WMA(weighted
moving average) and EMA(exponential moving average)[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ],
which are used to highlight trends.
4.1
      </p>
    </sec>
    <sec id="sec-8">
      <title>A Generic Form for Asymmetric Aggregation</title>
      <p>In contrast to symmetric aggregation, asymmetric
function is impossible to rewrite into well-formed aggregation,
because translating function F is a tuple at a time function
and is commutative and hence both of them are insensitive
to the order. For this reason, we propose an extended form
based on well-formed aggregation which is more suitable for
asymmetric aggregation.</p>
      <p>De nition 2. An asymmetric aggregation de ned on
an ordered sequence X is an asymmetric well-formed
aggregation if can be rewritten as following,
(X) = T (F o(X; x1) :::</p>
      <p>F o(X; xn));
(6)
where F o is order-in uenced translating function, is a
commutative and associative binary operation, and T is
terminating function.</p>
      <p>
        For instance, (X) = P z)i 1xi[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] with a
constant z can be rewritten asxiF2Xo((X1 ; xi) = (1 z)i 1xi; =
+; T = id; where i is the position of xi in the sequence X.
      </p>
      <p>Asymmetric well-formed aggregation can rewrite any
asymmetric aggregation , and with the associative property of
, also has a generic MR algorithm M R( ): processing
F o and at mapper, and T at reducer. Similar to the
behavior of symmetric well-formed aggregation, reducible
property is needed to ensure M RC constraints. The
reducible property for asymmetric well-formed aggregation is
8xi; xi+1 2 X : jF o(X; xi)</p>
      <p>F o(X; xi+1)j= O(1):
However, in order to have a correct generic M RC
algorithm for asymmetric aggregation, reducible property is not
enough, because asymmetric function considers data order
such that operations for combining mapper outputs are more
than . We illustrate this problem and identify properties
to have correct MRC algorithm for a class of asymmetric
well-formed aggregation in the following.</p>
      <p>We deal with a kind of asymmetric aggregation called
position-based aggregation, for which F o is F o(X; xi) =
h(i) f (xi), where h() and f () are unary functions, and
is a binary operation. The corresponding asymmetric
well-formed framework is (X) = T (P ;xi2X h(i) f (xi)),
where P is the concatenation of .</p>
      <p>Let X be an ordered sequence X = S1 ::: Sm, where Sl is
a subsequence of X, l 2 f1; :::; mg and is the concatenation
of subsequence, and i be the holistic position of xi in X and
j be the relative position of xj in subsequence Sl. Then
P F o(X; xi) of on any subsequence Sl is</p>
      <p>X
;xi2Sl</p>
      <p>
        X
;xj2Sl
F o(X; xi) =
h(j + k)
f (xi);
where j + k (j + k = i) is the holistic position of the jth
element xj in Sl. In order to process in parallel on these
subsequences, the rst requirement is to have l, which means
in distributed and parallel computing data set is split into
ordered chunks and chunk indexes can be stored. It can be
trivially implemented in Hadoop[
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. Secondly, k is needed,
the number of elements before Sl. Sequential distributing
subsequence count values then starting aggregation is costly
due to too many times of data transferring on network. If k
can be extracted out of P ;xj2Sl h(j +k) f (xi), then can
be processed without distributing counts because operations
relating to count can be pushed to reducer. We identify
conditions to extract k which we call extractable property.
      </p>
      <p>Lemma 1. Given an ordered sequence X, a position-based
asymmetric well-formed aggregation de ned in (F o; ; T )
and F o(X; xi) = h(i) f (xi) for any xi 2 X, where h()
and f () are unary functions, is extractable if there exists
a binary operation making h() satisfy h(i + k) = h(i)
h(k + c) with a constant c, and ; and satisfy one of
the following conditions,
and</p>
      <p>are same,
and</p>
      <p>are same and they are distributive over ,
is distributive over</p>
      <p>which is same as .</p>
      <p>The behavior of h() is similar to group homomorphism
however they are not exactly same, and our intention is to
extract k instead of preserving exact operations.</p>
      <p>Theorem 2. Let be a position-based well-formed
aggregation and M R( ) be the generic algorithm for , then
M R( ) is an MRC algorithm if is reducible and extractable.</p>
      <p>
        Extractable property of position-based aggregation
allows previous subsequences count value 'k' to be extracted
out of mapper operation, then can be correctly processed
by P F o or (P f (xi); P h(i)) at mapper phase. To
combine mapper outputs, more than and T are needed
and speci c combining operation depends on the three
different extractable conditions (provided in our extended
report[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]).
      </p>
      <p>For instance, given an input sequence X = (x1; :::; xn),
then EM A(X) = PPin=in1=(11(1 a)ai)i1 1xi ; where a is a constant
between 0 and 1. We give below the asymmetric well-formed
aggregation of EM A, where h(i) = (1 a)i 1,</p>
      <p>F o : F o(X; xi) = h(i) xi; h(i) ;
: h(i) xi; h(i)</p>
      <p>h(i + 1) xi+1; h(i + 1)
= h(i) xi + h(i + 1) xi+1; h(i) + h(i + 1) ;</p>
      <p>n n
T : T (X h(i) xi; X h(i)) =
i=1 i=1</p>
      <p>Pn
i=1 h(i) xi
Pn
i=1 h(i)
:</p>
      <p>It is clearly that EMA is a position-based aggregation, and
EMA is reducible because is a pair of addition. Moreover
h() satis es h(i + k) = h(i) h(k + 1), and the
corresponding three binary operations = , = , = +
satisfy the second extractable condition. Therefore EMA has a
MRC algorithm(the generic MRC algorithm for the second
extractable condition) illustrated as following, where we
assume input sequence X = S1 ::: Sm and mapper input is
Sl; l 2 f1; :::; mg, and count(S0) = 0;
mapper: OMl0 = Pxj2Sl h(j) xj, OMl00 = Pxj2Sl h(j),
OMl000 = count(Sl) ,
reducer:</p>
      <p>Pm</p>
      <p>l=1 OMl0 (1
Pm
l=1 OM 00 (1
l
a)Plj=10 OMj000
a)Plj=10 OMj000
.</p>
    </sec>
    <sec id="sec-9">
      <title>CONCLUSION AND FUTURE WORK</title>
      <p>In this work, we studied how to map aggregation
functions, in a systematic way, into generic M RC algorithms
and we identi ed properties that enable to e ciently execute
symmetric and asymmetric aggregations using
MapReducestyle platforms. For symmetric aggregation, we proposed
the reducible property within well-formed aggregation
framework to satisfy space and time complexity of M RC. Several
algebraic properties of symmetric aggregation leading to a
generic M RC algorithm have been identi ed. Moreover, we
extended the notion of well-formed aggregation to
asymmetric aggregation and showed how it can be exploited to
deal with position-based asymmetric aggregation. Through
identifying the problem for parallelizing it, we proposed
extractable property and merged it with the reducible
property of asymmetric well-formed aggregation to have M RC
algorithms.</p>
      <p>Our future work will be devoted to the implementation
and experimentation. We will study the extension of our
framework to mainstream parallel computing platforms (e.g.
Apache Spark). Moreover, we also plan to extend our
framework to cover additional classes of asymmetric aggregations.
Finally, we plan to investigate how to generalize our
approach to nested aggregation functions (i.e., functions
dened as a complex composition of aggregation functions).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>[1] Moving average</article-title>
          . https://en.wikipedia.org/wiki/Moving_average.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <article-title>[2] Symmetric and asymmetric aggregate function in massively parallel computing(extened version)</article-title>
          . https://hal-clermont-univ.archives-ouvertes. fr/hal-01533675.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z. S.</given-names>
            <surname>McDirmid</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Moscibroda</surname>
          </string-name>
          .
          <article-title>Automating distributed partial aggregation</article-title>
          .
          <source>In SOCC'14</source>
          , pages
          <fpage>1</fpage>
          {
          <fpage>12</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Cohen</surname>
          </string-name>
          .
          <article-title>User-de ned aggregate functions: bridging theory and practice</article-title>
          .
          <source>In SIGMOD'06</source>
          , pages
          <fpage>49</fpage>
          {
          <fpage>60</fpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S.</given-names>
            <surname>COHEN</surname>
          </string-name>
          ,
          <string-name>
            <surname>W.NUTT</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>SAGIV</surname>
          </string-name>
          .
          <article-title>Rewriting queries with arbitrary aggregation functions using views</article-title>
          .
          <source>ACM TODS</source>
          ,
          <volume>31</volume>
          (
          <issue>2</issue>
          ):
          <volume>672</volume>
          {
          <fpage>715</fpage>
          ,
          <year>June 2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Cuzzocrea</surname>
          </string-name>
          .
          <article-title>Aggregation and multidimensional analysis of big data for large-scale scienti c applications: models, issues, analytics, and beyond</article-title>
          .
          <source>In SSDBM'15</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Feldman</surname>
          </string-name>
          , S.muthukrishnan, A. Sidiropoulos,
          <string-name>
            <given-names>C.</given-names>
            <surname>Stein</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Z.</given-names>
            <surname>Svitkina</surname>
          </string-name>
          .
          <article-title>On distributing symmetric streaming computations</article-title>
          .
          <source>ACM TALG</source>
          ,
          <volume>6</volume>
          (
          <issue>4</issue>
          ),
          <year>August 2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Franklin</surname>
          </string-name>
          .
          <article-title>An overview of data warehousing and olap technology</article-title>
          .
          <source>ACM SIGMOD Record</source>
          ,
          <volume>26</volume>
          (
          <issue>1</issue>
          ):
          <volume>65</volume>
          {
          <fpage>74</fpage>
          ,
          <string-name>
            <surname>March</surname>
          </string-name>
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Grabisch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-L.</given-names>
            <surname>Marichal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mesiar</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E.</given-names>
            <surname>Pap</surname>
          </string-name>
          .
          <source>Aggregation function: Means. Information Sciences</source>
          ,
          <volume>181</volume>
          (
          <issue>1</issue>
          ):1{
          <fpage>22</fpage>
          ,
          <year>January 2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>H.</given-names>
            <surname>Garcia-Molina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.D.</given-names>
            <surname>Ullman</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Widom</surname>
          </string-name>
          . Database System Implementation. Prentice-Hall, New Jersey,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>J.</given-names>
            <surname>Gray</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bosworth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Layman</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Pirahesh</surname>
          </string-name>
          .
          <article-title>Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals</article-title>
          .
          <source>Data Mining and Knowledge Discovery</source>
          ,
          <volume>1</volume>
          (
          <issue>1</issue>
          ):
          <volume>29</volume>
          {
          <fpage>53</fpage>
          ,
          <string-name>
            <surname>Janaury</surname>
          </string-name>
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>H.</given-names>
            <surname>Karlo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Suri</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Vassilvitskii</surname>
          </string-name>
          .
          <article-title>A model of computation for mapreduce</article-title>
          .
          <source>In SODA'10</source>
          , pages
          <fpage>938</fpage>
          {
          <fpage>948</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Jean-Luc</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Bruno</surname>
          </string-name>
          .
          <article-title>Preassociative aggregation functions</article-title>
          .
          <source>Fuzzy Sets and Systems</source>
          ,
          <volume>268</volume>
          :
          <fpage>15</fpage>
          {
          <fpage>26</fpage>
          ,
          <year>June 2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>M.</given-names>
            <surname>Jean-Luc</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Bruno</surname>
          </string-name>
          .
          <article-title>Strongly barycentrically associative and preassociative functions</article-title>
          .
          <source>Fuzzy Sets and Systems</source>
          ,
          <volume>437</volume>
          (
          <issue>1</issue>
          ):
          <volume>181</volume>
          {
          <fpage>193</fpage>
          , May
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>S.</given-names>
            <surname>Madden</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.J.</given-names>
            <surname>Franklin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.M.</given-names>
            <surname>Hellerstein</surname>
          </string-name>
          , and
          <string-name>
            <given-names>W.</given-names>
            <surname>Hong</surname>
          </string-name>
          .
          <article-title>Tag: a tiny aggregation service for ad-hoc sensor networks</article-title>
          .
          <source>In OSDI'02</source>
          , pages
          <fpage>131</fpage>
          {
          <fpage>146</fpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>V.</given-names>
            <surname>Raychev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Musuvathi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Mytkowicz</surname>
          </string-name>
          .
          <article-title>Parallelizing user-de ned aggregaions using symbolic execution</article-title>
          .
          <source>In SOSP'15</source>
          , pages
          <fpage>153</fpage>
          {
          <fpage>167</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Isard</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Gunda</surname>
          </string-name>
          .
          <article-title>Distributed aggregation for data-parallel computing: Interfaces and implementations</article-title>
          .
          <source>In SOSP'09</source>
          , pages
          <fpage>247</fpage>
          {
          <fpage>260</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>