<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>On the CALM Principle for BSP Computation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Matteo Interlandi</string-name>
          <email>minterlandi@cs.ucla.edu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Letizia Tanca</string-name>
          <email>letizia.tanca@polimi.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Politecnico di Milano</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of California</institution>
          ,
          <addr-line>Los Angeles</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In recent times, considerable emphasis has been given to two apparently disjoint research topics: data-parallel and eventually consistent, distributed systems. In this paper we propose a study on an eventually consistent, data-parallel computational model, the keystone of which is provided by the recent finding that a class of programs exists that can be computed in an eventually consistent, coordination-free way: monotonic programs. This principle is called CALM and has been proven by Ameloot et al. for distributed, asynchronous settings. We advocate that CALM should be employed as a basic theoretical tool also for data-parallel systems, wherein computation usually proceeds synchronously in rounds and where communication is assumed to be reliable. We deem this problem relevant and interesting, especially regarding parallel workflow optimization, and make the case that CALM does not hold in general for data-parallel systems if the techniques developed by Ameloot et al. are directly used. In this paper we sketch how, using novel techniques, the satisfiability of the if direction of the CALM principle can still be obtained, although just for a subclass of monotonic queries.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Recent research has explored ways to exploit different levels of consistency in order to
improve the performance of distributed systems w.r.t. specific tasks and network
configurations, while maintaining correctness [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. A topic strictly related to consistency
is coordination, usually informally interpreted as a mechanism to accomplish a
distributed agreement on some system property [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Indeed, coordination can be used to
enforce consistency when, in the natural execution of a system, this is not guaranteed
in general. In this paper we sketch some theoretical problems springing from the use of
eventually consistent, coordination-free computation over synchronous systems with
reliable communication (rsync). Informally, such systems have the following properties:
(i) a global clock is defined and accessible by every node; (ii) the relative difference
between the clock values of any two nodes is bounded; and (iii) the results
emitted by a node arrive at their destination within a certain bounded physical time (the
so-called bounded delay guarantee).
      </p>
      <p>
        Rsync is a common setting in modern data-parallel frameworks - such as
MapReduce - in which computation is usually performed in rounds, where each task is blocked
and cannot start the new round until a synchronization barrier is reached, i.e., every
other task has completed its local computation. In this work we consider
synchronization (barrier) and coordination as two different, although related entities: the former is
a mechanism enforcing the rsync model, the latter a property of executions.
Identifying under what circumstances eventually consistent, coordination-free computation can
be employed over rsync systems would enable us to “stretch” the declarativeness of
parallel programs, freeing execution plans of the restriction to follow predefined
(synchronous) patterns. In fact, all recent high-level data-parallel languages suffer from this
limitation: for instance, both Hive [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] and Pig [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] sacrifice pipelining in order to fit
query plans into MapReduce workflows. Our aim is then to understand when a
synchronous “blocking” computation is actually required by the program semantics – and
therefore must be strictly enforced by the system – and when, instead, a pipelined
execution can be performed as optimization. For batch parallel processing, the benefits
of understanding where the former can be replaced by the latter are considerable [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]:
thanks to the fact that data is processed as soon as it is produced, online computation is
possible, i.e., the final result can be refined during the execution; as a consequence, new
data can be incrementally added to the input, making continuous computation possible.
Overall, pipelining is highly desirable in the Big Data context, where full
materialization is often problematic.
      </p>
      <p>
        Recently, a class of programs that can be computed in an eventually consistent,
coordination-free way has been identified: monotonic programs [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]; this principle is
called CALM (Consistency and Logical Monotonicity) and has been proven in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
While CALM was originally proposed to simplify the specification of distributed
(asynchronous) data management systems, in this paper we advocate that CALM should be
employed as a basic theoretical tool also for the declarative specification of data-parallel
(synchronous) systems. As a matter of fact, CALM permits us to link a property of the
execution (coordination-freedom) with a class of programs (monotonic queries). But to
what extent can CALM be applied to data-parallel systems? Surprisingly enough,
the demonstration of the CALM principle in rsync systems is not trivial and, with the
communication model and the notion of coordination as defined in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], the CALM
principle does not hold in general in rsync settings (cf. Example 3). Thus, in order
to extend CALM over data-parallel synchronous computation, in this paper we sketch
a new generic parallel computation model leveraging previous works on synchronous
Datalog [
        <xref ref-type="bibr" rid="ref10 ref12">10, 12</xref>
        ] and transducer networks [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], and grounding rsync computation on the
well-known Bulk Synchronous Parallel (BSP) model [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] equipped with content-based
addressing. With BSP, computation proceeds as a series of global rounds, each
composed of three phases: (i) a computation phase, in which nodes perform local
computations in parallel; (ii) a communication phase, where data are exchanged among the nodes;
and (iii) the synchronization barrier. Exploiting this new type of transducer network,
we will then show that the CALM principle is satisfied for synchronous and reliable
systems under a new definition of coordination-freedom, although, surprisingly enough,
just for a subclass of monotonic queries, i.e., the chained monotonic queries (cf.
Definition 5.7). When defining coordination-freedom we will take advantage of recent results
describing how knowledge can be acquired in synchronous systems [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ].
Organization: The paper is organized as follows: Section 2 introduces some
preliminary notation. Section 3 defines our model of a synchronous and reliable parallel system,
and shows that the CALM principle is not satisfied for systems of this type. Section
3.2 proposes a new computational model based on hashing, while Section 4 introduces
the new definition of coordination. Finally, Section 5 discusses CALM under the new
setting. The paper ends with some concluding remarks. We refer the reader to [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] for
proofs and more detailed discussions.
      </p>
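      <p>To make the three-phase structure concrete, the following minimal Python sketch (ours, not taken from the paper; names such as bsp_run and local_step are our own) runs a network of nodes in BSP style: in every round each node first computes on its local state and on the messages received in the previous round, then the emitted messages are reliably delivered, and the barrier is simply the end of the loop iteration.</p>
      <preformat>
# Minimal BSP-style round loop (illustrative sketch, not the paper's formal model).
def bsp_run(nodes, local_step, rounds):
    """nodes: dict node_id -> local state; local_step(state, inbox) -> (state, outbox),
    where outbox is a list of (destination_node_id, message) pairs."""
    inboxes = {i: [] for i in nodes}
    for _ in range(rounds):
        outboxes = {}
        # (i) computation phase: every node runs on its state and received messages
        for i, state in nodes.items():
            nodes[i], outboxes[i] = local_step(state, inboxes[i])
        # (ii) communication phase: reliable delivery of every emitted message
        inboxes = {i: [] for i in nodes}
        for i, out in outboxes.items():
            for dest, msg in out:
                inboxes[dest].append(msg)
        # (iii) synchronization barrier: no node starts the next round before this point
    return nodes
      </preformat>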
    </sec>
    <sec id="sec-2">
      <title>Relational Transducers</title>
      <p>
        In this paper we expect the reader to be familiar with the basic notions of database
theory and relational transducers (and transducer networks). In this section we use some examples to set
forth our notation, which is close to that of [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>We employ a transducer (resp. a transducer network) as an abstraction modeling the
behavior of a single computing node (resp. a network of computing nodes): this abstract
computational model permits us to make our results as general as possible without
having to rely on a particular framework, since transducers and transducer networks
can be easily imposed over any modern data-parallel system. We consider each node
to be equipped with an immutable database and a memory used to store useful data
between any two consecutive computation steps. In addition, a node can produce an
output for the user and can also communicate some data to other nodes (the concept of
data communication in a transducer network will become clearer in Section 3). Finally,
an internal time and some system data are kept, mainly for configuration purposes. Every
node executes a program that operates on (input) instances of the database, the memory
and the communication channel, and produces new instances that are either saved in
memory, or directly output to the user, or addressed to other nodes.</p>
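      <p>The following toy Python sketch (ours, not a formal definition from the paper) conveys the shape of one such computation step: the node reads its immutable database, its memory and the received facts, and produces a new memory instance, user output and facts addressed to other nodes.</p>
      <preformat>
# Shape of one transducer transition (illustrative sketch of the abstraction).
def transducer_step(db, mem, rcv):
    new_mem = set(mem) | set(rcv)   # e.g., remember every fact received so far
    out = set(db)                   # e.g., an identity query over the local database
    snd = set()                     # facts addressed to other nodes (none in this toy)
    return new_mem, out, snd
      </preformat>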
      <p>Example 1. A first example of relational transducer is the following UCQ-transducer
T, with the schema given below, which computes the ternary relation Q as the join of two binary
relations R and T:</p>
      <p>Schema: db = {R(2), T(2)}; mem = ∅; com = ∅; out = {Q(3)}</p>
      <p>Program: Qout(u, v, w) ← R(u, v), T(v, w).
Let I be an initial instance over which we want to compute the join. Then, let us define
Idb = I as an instance over the database schema db. A transition I → J for T is
such that I = Idb ∪ Isys, Ircv and Jsnd are empty (no communication query exists), and
J = I ∪ Iout ∪ Isys, where Iout is the result of the query Qout, i.e., the join between
R and T. Note that the subscript in Qout means that this is an output query, that is, it
specifies the final result of the whole computation.</p>
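      <p>A direct reading of this output query in Python (our own sketch; relations are represented as sets of tuples) is the following join evaluation:</p>
      <preformat>
# Evaluate Qout(u, v, w) :- R(u, v), T(v, w) over one local instance.
def q_out(R, T):
    return {(u, v, w) for (u, v) in R for (v2, w) in T if v2 == v}

# Example: q_out({(1, 2)}, {(2, 3)}) yields {(1, 2, 3)}.
      </preformat>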
    </sec>
    <sec id="sec-3">
      <title>Computation in rsync</title>
      <p>
        In order to allow query evaluation in parallel settings, we will sketch a novel transducer
network [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], where computation is synchronous, and communication is reliable. This
permits us to define how a set of relational transducers can be assembled to obtain an
abstract computational model for distributed data-parallel systems. To be consistent
with [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], we will assume broadcasting as the addressing model.
      </p>
      <p>Example 2. Assume we want to compute a distributed version of the join of Example 1.
We can implement it using a broadcasting synchronous transducer network which emits
one of the two relations, say T , and then joins R with the received facts over T . Note
that the sent facts will be used only starting from the next round, and the program
will therefore employ two rounds to compute the distributed join. UCQ is again expressive
enough. The transducer network can be written as follows, where Ssnd denotes a
communication query and this time the schema com is non-empty because communication
is needed:</p>
      <sec id="sec-3-1">
        <title>Program: Ssnd(u; v)</title>
        <p>Schema: db = fR(2); T (2)g; com = fS(2)g; out = fQ(3)g</p>
        <p>T (u; v):
Qout(u; v; w)</p>
        <p>R(u; v); S(u; w):</p>
      </sec>
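      <p>A minimal simulation of this two-round broadcasting behaviour (our own sketch, assuming an arbitrary horizontal partitioning of R and T across the nodes) is the following: in round one every node broadcasts its T facts over S, and in round two it joins its local R partition with the received S facts.</p>
      <preformat>
# Two-round broadcast join (illustrative sketch of the behaviour of Example 2).
def distributed_join(partitions):
    """partitions: list of (R_part, T_part) pairs, one per node."""
    # Round 1: every node broadcasts its local T facts over the relation S.
    received_S = [set() for _ in partitions]
    for _, T_part in partitions:
        for inbox in received_S:          # broadcasting: every node receives every fact
            inbox |= set(T_part)
    # Round 2: every node joins its local R partition with the received S facts.
    result = set()
    for n, (R_part, _) in enumerate(partitions):
        result |= {(u, v, w) for (u, v) in R_part
                   for (u2, w) in received_S[n] if u2 == u}
    return result
      </preformat>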
      <sec id="sec-3-2">
        <title>Synchronous specifications have the required expressive power</title>
        <p>Lemma 1 Let L be a language containing UCQ and contained in DATALOG¬. Every
query expressible in L can be distributively computed in 2 rounds by a broadcasting
L-transducer network.</p>
        <p>
          The above lemma permits us to draw the following conclusion: under the rsync
semantics, monotonic and non-monotonic queries behave in the same way: two rounds
are needed in both cases. This is due to the fact that, contrary to what happens in the
asynchronous case of [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], starting from the second round we are guaranteed – by the
reliability of the communication and the synchronous assumption – that every node will
compute the query over every emitted instance. Conversely, in the asynchronous case,
as a result of the non-determinism of the communication, we are never guaranteed,
without coordination, that every sent fact will actually be received.
        </p>
        <sec id="sec-3-2-1">
          <title>The CALM Conjecture</title>
          <p>
            The CALM conjecture [
            <xref ref-type="bibr" rid="ref9">9</xref>
            ] specifies that a well-defined class of programs can be
distributively computed in an eventually consistent, coordination-free way: monotonic
programs. CALM has been proven in this (revisited) form for asynchronous systems [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ]:
Conjecture 1 A query can be distributively computed by a coordination-free
transducer network if and only if it is monotonic.
          </p>
          <p>
            The concept of coordination suggests that all the nodes in a network must exchange
information and wait until an agreement is reached about a common property of interest.
Following this intuition, Ameloot et al. established that a specification is
coordination-free if communication is not strictly necessary to obtain a consistent final result.
Surprisingly enough, under this definition of coordination-freedom, CALM does not hold
in rsync settings under the broadcasting communication model:
Example 3. Let Qout be the “emptiness” query of [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ]: given a nullary database relation
R(0) and a nullary output relation T (0), Qout outputs true (i.e., a nullary fact over T )
iff IR is empty. The query is non-monotonic: if IR is initially empty, then T is
produced, but if just one fact is added to R, T is not derived, i.e., IT must be empty. A
FO-transducer network N can be easily generated to distributively compute Qout: first
every node emits R if its local partition is not empty, and then each node locally
evaluates the emptiness of R. Since the whole initial instance is installed on every node when
R is checked for emptiness, T is true only if R is actually empty on the initial instance.
The complete specification follows.
          </p>
          <p>Schema: db = {R(0)}; mem = {Ready(0)}; com = {S(0)}; out = {T(0)}
Program: Ssnd() ← R().
         Readyins() ← ¬Ready().
         Tout() ← ¬S(), Ready().</p>
          <p>
One can show [
            <xref ref-type="bibr" rid="ref11">11</xref>
            ] that, if communication is switched off, the above transducer is still
able to obtain the correct result if, for example, I is installed on every node. That is,
a partitioning exists, making communication not strictly necessary to reach the proper
result. Note that the same query requires coordination in asynchronous settings: since
emitted facts are non-deterministically received, the only way to compute the correct
result is that nodes coordinate to understand if the input instance is globally empty.
          </p>
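          <p>The following sketch (ours, purely illustrative) simulates this two-round behaviour on a broadcasting rsync network: in round one every node broadcasts S() if its local partition of R is non-empty, and in round two it derives T() only if no S fact has been received.</p>
          <preformat>
# Two-round emptiness check of Example 3 (illustrative sketch).
def emptiness_query(r_partitions):
    """r_partitions[i] is True iff node i holds at least one R fact."""
    # Round 1: broadcast S() whenever the local partition of R is non-empty.
    s_received = any(r_partitions)   # reliable broadcast: all nodes see S(), or none does
    # Round 2: every node is Ready and derives T() iff no S() was received.
    return [not s_received for _ in r_partitions]

# With every partition empty, all nodes output T(); with any R fact present, none does.
          </preformat>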
          <p>
            The result we have is indeed interesting although expected: when we move from the
general asynchronous model to the more restrictive rsync setting, we no longer have a
complete understanding of which queries can be computed without coordination, and
which ones, instead, do require coordination. It turns out that both the communication
model and the definition of coordination proposed in [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ] are not strong enough to work
in general for synchronous systems. As the reader may have realized, this is due to the
fact that, in broadcasting synchronous systems, coordination – as defined by Ameloot
et al. – is already “baked” into the model. In the next sections we will see that our
definition of coordination-freedom guarantees eventually consistent computation for those
queries that do not rely on broadcasting in order to progress. That is, the
discriminating condition for eventual consistency is not monotonicity, but the fact that it is not
necessary to send a fact to all the nodes composing a network.
          </p>
        </sec>
        <sec id="sec-3-2-2">
          <title>Hashing Transducer Networks</title>
          <p>
Broadcasting specifications are not really convenient from a practical perspective.
Following other parallel programming models such as MapReduce, in this section we
introduce hashing transducer networks, i.e., synchronous networks of
relational transducers equipped with a content-based communication model founded on
hashing. Under this new model, the node to which an emitted fact must be addressed is
derived using a hash function applied to a subset of its terms called keys.
Example 4. Consider the hashed version of Example 2, where every tuple
emitted over S and U is hashed on the first term (this is specified by the schema definitions
S(1,2) and U(1,2), where the pair (1, 2) means that the related relation has arity 2 and
the first term is the key-term). In this way we are assured that, for each pair of joining
tuples, at least one node exists containing the pair. This is because S and U are joined over
their key-terms, and hence the joining tuples are addressed to the same node.</p>
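            <p>A sketch of this content-based addressing (our own illustration, with hash-based routing on the key term; the assignment of R facts to U and of T facts to S is our own assumption) is the following: each emitted fact is sent only to the node identified by hashing its key term, instead of being broadcast.</p>
            <preformat>
# Hash-partitioned join (illustrative sketch of content-based addressing).
def hashed_join(partitions, num_nodes):
    """partitions: list of (R_part, T_part) pairs, one per node."""
    inbox_U = [set() for _ in range(num_nodes)]   # facts received over U
    inbox_S = [set() for _ in range(num_nodes)]   # facts received over S
    for R_part, T_part in partitions:
        for (u, v) in R_part:
            inbox_U[hash(u) % num_nodes].add((u, v))   # route on the key term
        for (u, w) in T_part:
            inbox_S[hash(u) % num_nodes].add((u, w))
    # Joining tuples share their key term, so they always meet on the same node.
    result = set()
    for n in range(num_nodes):
        result |= {(u, v, w) for (u, v) in inbox_U[n]
                   for (u2, w) in inbox_S[n] if u2 == u}
    return result
            </preformat>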
            <p>
We have seen in Section 3.1 that, for rsync systems, a particular notion of
coordination-freedom is needed. In fact we have shown that, under such a model, certain
non-monotonic queries – Example 3 – requiring coordination under the asynchronous model
can be computed in a coordination-free way. The key point is that, as observed in [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ],
in asynchronous systems coordination-freedom is directly related to
communication-freedom under ideal partitioning. That is, if the partitioning is correct, no
communication is required to correctly compute a coordination-free query because (i) no data
must be sent (the partition is correct), and (ii) no “control message” is required to
obtain a consistent result (the query is coordination-free). However, due to its synchronous
nature, in rsync settings non-monotonic queries can in general be computed without
resorting to coordination because coordination is already “baked” into the rsync model:
each node is synchronized with every other one, hence “control messages” are somehow
implicitly assumed. In this section we introduce a novel knowledge-oriented
perspective linking coordination with the way in which explicit and implicit information flows
in the network. Under this perspective, we will see that coordination is needed if, to
maintain consistency, a node must have some form of information exchange with all
the other nodes.</p>
            <p>
Achieving coordination in asynchronous systems is a costly task. A necessary condition
for coordination in such systems is the existence of primitives that enforce some
control over the ordering of events. In a seminal paper [
            <xref ref-type="bibr" rid="ref13">13</xref>
            ], Lamport proposed a
synchronization algorithm based on the relation of potential causality (→) over asynchronous
events. According to Lamport, given two events e, e′, we have that e → e′ if e
happens before e′ and e might have caused e′. From a high-level perspective, the
potential causality relation models how information flows among processes, and therefore
can be employed as a tool to reason on the patterns which cause coordination in
asynchronous systems. A question now arises: what is the counterpart of the potential
causality relation for synchronous systems? Synchronous potential causality (syncausality
for short) has been recently proposed [
            <xref ref-type="bibr" rid="ref5">5</xref>
            ] to generalize Lamport's potential causality to
synchronous systems. Using syncausality we are able to model how information flows
among nodes with the passing of time. Consider a parallel execution trace – called a
run – and two points in this execution (i, t), (j, t′) for (possibly not distinct) nodes
i, j, identifying the local state of i and j at time t and t′ respectively. We say that (j, t′)
causally depends on (i, t) if either i = j and t ≤ t′ – i.e., a local state depends on the
previous one – or a tuple has been emitted by node i at time t, addressed to node j, with
t &lt; t′. We refer to these two types of dependencies as direct. (Note that a point in a
synchronous system is what Lamport defines as an event in an asynchronous system.)
</p>
          <p>Definition 4.1. Given a run, we say that two points (i, t), (j, t′) are related by a
direct potential causality relation →, if one of the following is true:
1. t′ = t + 1 and i = j;
2. t′ ≥ t + 1 and node i sent a tuple at time t addressed to j;
3. there is a point (k, t″) s.t. (i, t) → (k, t″) and (k, t″) → (j, t′).
Note that direct dependencies define precisely Lamport's happen-before relation – and
hence we maintain the same signature →.</p>
          <p>
Differently from asynchronous systems, however, a point on node j
can occasionally indirectly depend on another point on node i even if no fact addressed
to j is actually sent by i. This is because j can still draw some conclusions simply
as a consequence of the bounded delay guarantee of synchronous systems. That is,
each node can use the common knowledge that every sent tuple is received at most
after a certain bounded delay to reason about the state of the system. The bounded
delay guarantee can be modelled as an imaginary NULL fact, like in [
            <xref ref-type="bibr" rid="ref14">14</xref>
            ]. Under this
perspective, indirect dependencies appear the same as the direct ones, although, instead
of a flow generated by “informative” facts, with the indirect relationship we model the
flow of “non-informative” NULL facts.
          </p>
          <p>Definition 4.2. Given a run, we say that two points (i, t), (j, t′) are related by
an indirect potential causality relation ⇢, if i ≠ j, t′ ≥ t + 1 and a NULL^i_R fact
addressed to node j has been (virtually) sent by node i at round t.</p>
          <p>An interesting fact about the bounded delay guarantee is that it can be employed to
specify when negation can be safely applied to a predicate. In general, negation can be
applied to a literal R(u) when the content of R is sealed for what concerns the current
round. In local settings, such a condition holds for a predicate at round t′ if
its content has been completely generated at round t, with t′ &gt; t. In distributed settings,
if R is a communication relation, being in a new round t′ is not enough,
in general, for establishing that its content is sealed. This is because tuples can still
be floating, and therefore, until we are assured that every tuple has been delivered, the
above condition does not hold. The result is that negation cannot be applied safely. We
can reason in the same way also for every other negative literal depending on R. We will
then model the fact that the content of a communication relation R is stable because
of the bounded delay guarantee by having every node i emit a fact NULL^i_R at round t,
for every communication relation R, which will be delivered at node j exactly by the
next round. We then have that the content of R is stable once j has received a NULL^i_R
fact from every node i contained in the set N of nodes composing the network. The
sealing of a communication relation at a certain round is then ascertained only when
|N| NULL_R facts have been counted. Recall that the NULL^i_R facts need not necessarily
be physically sent. This in particular is true under our rsync model, where the
strike of a new round automatically seals all the communication relations. Example 5
shows one situation in which this applies.</p>
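          <p>As a small operational reading of this counting argument (our own sketch, not the paper's formalization), node j may apply negation on a communication relation R at the current round once it has counted a NULL_R fact, possibly virtual, from every node of the network:</p>
          <preformat>
# Sealing test for a communication relation R (illustrative sketch): R is sealed at
# node j once a NULL_R fact has been received (or implied) from every node in N.
def is_sealed(null_senders, all_nodes):
    """null_senders: ids of the nodes from which a NULL_R fact was counted."""
    return set(all_nodes).issubset(null_senders)   # |N| NULL_R facts counted
          </preformat>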
          <p>Example 5. Consider the hashing version of the program of Example 3. Let I be an
initial instance. At round t + 1 we have that the relation S is stable, and hence negation
can be applied. Note that if R is empty in the initial instance, no fact is sent. Despite
this, every node can still conclude at round t + 1 that the content of S is stable. In this
situation we clearly have an indirect potential causality relation.</p>
          <p>We are now able to introduce the definition of syncausality: a generalization of
Lamport’s happen-before relation which considers not only the direct information flow,
but also the flow generated by indirect dependencies.</p>
          <p>Definition 4.3. Given a run, the syncausality relation ⇝ is the smallest relation s.t.:
1. if (i, t) → (j, t′), then (i, t) ⇝ (j, t′);
2. if (i, t) ⇢ (j, t′), then (i, t) ⇝ (j, t′); and
3. if (i, t) ⇝ (j, t′) and (j, t′) ⇝ (k, t″), then (i, t) ⇝ (k, t″).</p>
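          <p>Operationally (our own sketch), the syncausality relation can be computed as the transitive closure of the direct and indirect edges between points (node, time) of a run:</p>
          <preformat>
# Syncausality as reachability over direct and indirect edges between points of a run
# (illustrative sketch only; a point is a (node, time) pair).
def syncausality(direct_edges, indirect_edges):
    reach = set(direct_edges) | set(indirect_edges)   # rules 1 and 2
    changed = True
    while changed:                                    # rule 3: transitive closure
        changed = False
        for (a, b) in list(reach):
            for (c, d) in list(reach):
                if b == c and (a, d) not in reach:
                    reach.add((a, d))
                    changed = True
    return reach
          </preformat>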
        </sec>
        <sec id="sec-3-2-3">
          <title>From Syncausality to Coordination</title>
          <p>We next propose the predicate-level syncausality relationship, modeling causal relations
at the predicate level. That is, instead of considering how (direct and indirect)
information flows between nodes, we introduce a more fine-grained relationship modelling the
flows between predicates and nodes.</p>
          <p>Definition 4.4. Given a run, we say that two points (i, t), (j, t′) are linked by a
relation of predicate-level syncausality ⇝R, if any of the following holds:
1. i = j, t′ = t + 1 and a tuple over R ∈ mem ∪ out has been derived by a query
in Qins ∪ Qout at time t′;
2. R ∈ com and node i sends a tuple over R at time t addressed to node j, with
t′ ≥ t + 1;
3. R ∈ com and node i (virtually) sends a NULL^i_R fact at time t addressed to node
j, with t′ ≥ t + 1;
4. there is a point (k, t″) s.t. (i, t) ⇝R (k, t″) and (k, t″) ⇝R (j, t′).
We are now able to specify a condition for achieving coordination. Informally, we have
that coordination exists when all the nodes of a network reach a common agreement
that some event happened. But the only way to reach such an agreement is that a (direct
or indirect) information flow exists between the node in which the event actually
occurs and every other node. This is a sufficient and necessary condition because of the
reliability and bounded-delay guarantee of rsync systems. Formalizing this intuition by
means of the (predicate-level) syncausality relationship, we obtain the following definition.</p>
          <p>
Definition 4.5. Let N be a set of nodes. We say that a synchronous relational
transducer network manifests the coordination pattern if, for all possible initial instances
I ∈ inst(db), whichever run we select, a point (i, t) and a communication
relation R exist so that ∀j ∈ N there is a predicate-level syncausality relation such that
(i, t) ⇝R (j, t′).
We call node i the coordination master. A pattern with a similar role has been named
broom in [
            <xref ref-type="bibr" rid="ref6">6</xref>
            ].
          </p>
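          <p>Combining this with the previous sketch, checking whether a run manifests the coordination pattern amounts to asking whether some point reaches, through predicate-level syncausality over some communication relation, a point on every node of the network (again an illustration of ours, with hypothetical input structures):</p>
          <preformat>
# Coordination-pattern test (illustrative sketch): does some point (i, t) reach, via
# predicate-level syncausality edges of some communication relation R, a point on
# every node of the network?
def manifests_coordination_pattern(edges_by_relation, nodes):
    """edges_by_relation: dict mapping R to a set of ((i, t), (j, t2)) pairs."""
    for R, edges in edges_by_relation.items():
        closure = syncausality(edges, set())         # transitive closure, as above
        points = {p for edge in closure for p in edge}
        for (i, t) in points:
            reached = {j for (src, (j, tj)) in closure if src == (i, t)}
            reached.add(i)                           # a node trivially reaches itself
            if set(nodes).issubset(reached):
                return True                          # (i, t) is a coordination master
    return False
          </preformat>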
          <p>
Remark: The reader can now appreciate to what extent coordination was already
“baked” inside the broadcasting synchronous specifications of Section 3. Note that
broadcasting, in rsync, brings coordination. This is not true in asynchronous systems.
Intuitively, the coordination master is where the event occurs. If a broadcast of an
(informative or non-informative) fact occurs, then such an event will become common
knowledge [
            <xref ref-type="bibr" rid="ref8">8</xref>
            ] among the nodes. On the contrary, if broadcasting does not occur, common
knowledge cannot be obtained and therefore, if the correct final outcome is still reached,
this is obtained without coordination. That is, if at least one non-trivial configuration
exists s.t. the coordination pattern does not manifest itself, we have coordination-freedom.
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>CALM in rsync Systems</title>
      <p>The original version of the CALM principle is not satisfiable in rsync systems because
a class of monotonic queries exists (the unchained queries, introduced next) which
is not coordination-free. Informally, a query is chained if every relation is connected
through a join-path with every other relation composing the same query.
Definition 5.6. Let body(qR) be a conjunction of literals defining the body of a query
qR. We say that two different positive literal occurrences Ri(ui), Rj(uj) ∈ body(qR)
are chained in qR if either:
– ui ∩ uj ≠ ∅; or
– a third relation Rk ∈ qR different from Ri, Rj exists such that Ri is chained with
Rk, and Rk is chained with Rj.</p>
      <p>Definition 5.7. A query Qout is said to be chained if, for every rule qR ∈ Qout, each
relation occurrence Ri ∈ body(qR) is chained with every other relation occurrence
Rj ∈ body(qR).</p>
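      <p>As an operational reading of Definitions 5.6 and 5.7 (our own sketch), a rule body is chained exactly when the graph whose vertices are its positive literal occurrences, with an edge between two occurrences sharing a variable, is connected; nullary occurrences are never chained.</p>
      <preformat>
# Chained-ness test for one rule body (illustrative sketch). Each literal occurrence
# is given as the tuple of its variables; nullary relations are never chained.
def is_chained(body):
    if not body:
        return True
    if any(len(vars_) == 0 for vars_ in body):
        return False                     # nullary relations are not chained (Remark)
    # Traverse the graph whose edges connect occurrences that share a variable.
    todo, seen = [0], {0}
    while todo:
        i = todo.pop()
        for j, vars_j in enumerate(body):
            if j not in seen and set(body[i]).intersection(vars_j):
                seen.add(j)
                todo.append(j)
    return len(seen) == len(body)

# Example 6: is_chained([("u", "v"), ("x",)]) is False, since T's variable is not shared.
      </preformat>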
      <sec id="sec-4-1">
        <title>Remark: Nullary relations are not chained by definition.</title>
        <p>Example 6. Assume two relations R(2) and T (1), and the following query Qout
returning the full R-instance if T is nonempty.</p>
        <p>Q(u; v)</p>
        <p>R(u; v); T ( ):
The query is clearly monotonic. Let T be the following broadcasting UCQ-transducer
program computing Qout.</p>
      </sec>
      <sec id="sec-4-2">
        <title>Program: Ssnd(u; v)</title>
      <p>Assume now we want to make the above transducer a hashing one. We have that,
whichever key we choose, the resulting specification might no longer be consistent. Indeed,
consider an initial instance I and a set of keys spanning all the terms of S and U.
Assume I is such that adom(IR) ⊈ adom(IT), and a network composed of a large number
of nodes. In this situation, it may happen that a non-empty set of facts over R is hashed
to a certain node i, while no fact over T is hashed to i. This is because a constant may
exist in adom(IR) that is not in adom(IT) and for which the hashing function returns a
node i not returned by hashing any constant in adom(IT). Hence no tuple emitted to i
will ever appear in the output, although such tuples do appear in Qout(I). Thus this transducer
is not eventually consistent.</p>
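      <p>A tiny runnable illustration of the problem (ours, with made-up routing): hashing R facts on their first term and T facts on their only term, a node may hold R facts while receiving no evidence that T is non-empty, and those facts silently disappear from the result.</p>
      <preformat>
# Illustration of the inconsistency of Example 6 under hashing (our sketch).
def hashed_example6(R, T, num_nodes):
    r_at = {n: set() for n in range(num_nodes)}
    t_seen = {n: False for n in range(num_nodes)}
    for (u, v) in R:
        r_at[hash(u) % num_nodes].add((u, v))       # route R on its first term
    for (x,) in T:
        t_seen[hash(x) % num_nodes] = True          # route T on its only term
    # Each node outputs its local R facts only if it received some T fact: R facts
    # hashed to a node with no T evidence are dropped, although they belong to Qout(I).
    return {fact for n in range(num_nodes) for fact in r_at[n] if t_seen[n]}
      </preformat>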
        <p>
          From the above example we can intuitively see that, for rsync, a final consistent
result can be obtained without coordination only for queries that are chained and
monotonic. That is, the following restricted version of the CALM conjecture holds for rsync
systems:
Theorem 1 A query can be computed in parallel by a coordination-free transducer
network if it is chained and monotonic [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
        </p>
      <p>We leave for future work the investigation of whether every monotone and chained
query is also coordination-free.</p>
        <p>
          Remark: For the readers familiar with the works [
          <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
], our result states that, under the
rsync model, a query is computable in a coordination-free way if it is monotonic and
distributes over components.
        </p>
    </sec>
    <sec id="sec-5">
      <title>Conclusions</title>
      <p>In this paper the CALM principle is analyzed under synchronous and reliable settings.
By exploiting CALM, in fact, we would be able to break the synchronous cage of
modern parallel computation models and provide pipelined, coordination-free executions
when the program logic allows it. In order to reach our goal, we have introduced a
new abstract model emulating BSP computation, and a novel interpretation of
coordination with sound logical foundations in distributed knowledge reasoning. By exploiting
such techniques, we have shown that the if direction of the CALM principle indeed
holds also in rsync settings, but just for the subclass of monotonic queries defined as
chained.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Abiteboul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Hull</surname>
          </string-name>
          , and
          <string-name>
            <given-names>V.</given-names>
            <surname>Vianu</surname>
          </string-name>
          .
          <source>Foundations of Databases. Addison-Wesley</source>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>T. J.</given-names>
            <surname>Ameloot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Ketsman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Neven</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Zinn</surname>
          </string-name>
          .
          <article-title>Weaker forms of monotonicity for declarative networking: a more fine-grained answer to the calm-conjecture</article-title>
          .
          <source>In PODS</source>
          , pages
          <fpage>64</fpage>
          -
          <lpage>75</lpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T. J.</given-names>
            <surname>Ameloot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Ketsman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Neven</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Zinn</surname>
          </string-name>
          .
          <article-title>Datalog queries distributing over components</article-title>
          .
          <source>In ICDT. ACM</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>T. J.</given-names>
            <surname>Ameloot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Neven</surname>
          </string-name>
          , and
          <string-name>
            <surname>J. Van Den Bussche.</surname>
          </string-name>
          <article-title>Relational transducers for declarative networking</article-title>
          .
          <source>J. ACM</source>
          ,
          <volume>60</volume>
          (
          <issue>2</issue>
          ):
          <volume>15</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>15</lpage>
          :
          <fpage>38</fpage>
          , May
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>I.</given-names>
            <surname>Ben-Zvi</surname>
          </string-name>
          and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Moses</surname>
          </string-name>
          .
          <article-title>Beyond lamport's happened-before: On the role of time bounds in synchronous systems</article-title>
          .
          <source>In N. A. Lynch and A. A</source>
          . Shvartsman, editors,
          <source>DISC</source>
          , volume
          <volume>6343</volume>
          of Lecture Notes in Computer Science, pages
          <fpage>421</fpage>
          -
          <lpage>436</lpage>
          . Springer,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>I.</given-names>
            <surname>Ben-Zvi</surname>
          </string-name>
          and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Moses</surname>
          </string-name>
          .
          <article-title>On interactive knowledge with bounded communication</article-title>
          .
          <source>Journal of Applied Non-Classical Logics</source>
          ,
          <volume>21</volume>
          (
          <issue>3-4</issue>
          ):
          <fpage>323</fpage>
          -
          <lpage>354</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>T.</given-names>
            <surname>Condie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Conway</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Alvaro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Hellerstein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Elmeleegy</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Sears</surname>
          </string-name>
          .
          <article-title>Mapreduce online</article-title>
          .
          <source>In Proceedings of the 7th USENIX conference on Networked systems design and implementation</source>
          ,
          <source>NSDI'10</source>
          , pages
          <fpage>21</fpage>
          -
          <lpage>21</lpage>
          , Berkeley, CA, USA,
          <year>2010</year>
          . USENIX Association.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>R.</given-names>
            <surname>Fagin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Y.</given-names>
            <surname>Halpern</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Moses</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. Y.</given-names>
            <surname>Vardi</surname>
          </string-name>
          .
          <article-title>Reasoning About Knowledge</article-title>
          . MIT Press, Cambridge, MA, USA,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Hellerstein</surname>
          </string-name>
          .
          <article-title>The declarative imperative: experiences and conjectures in distributed logic</article-title>
          .
          <source>SIGMOD Rec</source>
          .,
          <volume>39</volume>
          :
          <fpage>5</fpage>
          -
          <lpage>19</lpage>
          ,
          <year>September 2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M.</given-names>
            <surname>Interlandi</surname>
          </string-name>
          .
          <article-title>Reasoning about knowledge in distributed systems using datalog</article-title>
          .
          <source>In Datalog</source>
          , pages
          <fpage>99</fpage>
          -
          <lpage>110</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Interlandi</surname>
          </string-name>
          and
          <string-name>
            <given-names>L.</given-names>
            <surname>Tanca</surname>
          </string-name>
          .
          <article-title>On the calm principle for bulk synchronous parallel computation</article-title>
          .
          <source>arXiv:1405</source>
          .
          <fpage>7264</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Interlandi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Tanca</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Bergamaschi</surname>
          </string-name>
          .
          <article-title>Datalog in time and space, synchronously</article-title>
          . In L. Bravo and M. Lenzerini, editors,
          <source>AMW</source>
          , volume
          <volume>1087</volume>
          <source>of CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>L.</given-names>
            <surname>Lamport</surname>
          </string-name>
          . Time, clocks, and
          <article-title>the ordering of events in a distributed system</article-title>
          .
          <source>Commun. ACM</source>
          ,
          <volume>21</volume>
          (
          <issue>7</issue>
          ):
          <fpage>558</fpage>
          -
          <lpage>565</lpage>
          ,
          <year>July 1978</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>L.</given-names>
            <surname>Lamport</surname>
          </string-name>
          .
          <article-title>Using time instead of timeout for fault-tolerant distributed systems</article-title>
          .
          <source>ACM Trans. Program. Lang. Syst.</source>
          ,
          <volume>6</volume>
          (
          <issue>2</issue>
          ):
          <fpage>254</fpage>
          -
          <lpage>280</lpage>
          , Apr.
          <year>1984</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>C.</given-names>
            <surname>Olston</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Reed</surname>
          </string-name>
          , U. Srivastava,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Tomkins</surname>
          </string-name>
          .
          <article-title>Pig latin: a not-so-foreign language for data processing</article-title>
          .
          <source>In Proceedings of the 2008 ACM SIGMOD international conference on Management of data, SIGMOD '08</source>
          , pages
          <fpage>1099</fpage>
          -
          <lpage>1110</lpage>
          , New York, NY, USA,
          <year>2008</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>A.</given-names>
            <surname>Thusoo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. S.</given-names>
            <surname>Sarma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Jain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Shao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Chakka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Anthony</surname>
          </string-name>
          , H. Liu,
          <string-name>
            <given-names>P.</given-names>
            <surname>Wyckoff</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Murthy</surname>
          </string-name>
          .
          <article-title>Hive: A warehousing solution over a map-reduce framework</article-title>
          .
          <source>Proc. VLDB Endow</source>
          .,
          <volume>2</volume>
          (
          <issue>2</issue>
          ):
          <fpage>1626</fpage>
          -
          <lpage>1629</lpage>
          , Aug.
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>L. G.</given-names>
            <surname>Valiant</surname>
          </string-name>
          .
          <article-title>A bridging model for parallel computation</article-title>
          .
          <source>Commun. ACM</source>
          ,
          <volume>33</volume>
          (
          <issue>8</issue>
          ):
          <fpage>103</fpage>
          -
          <lpage>111</lpage>
          , Aug.
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>W.</given-names>
            <surname>Vogels</surname>
          </string-name>
          .
          <article-title>Eventually consistent</article-title>
          .
          <source>Commun. ACM</source>
          ,
          <volume>52</volume>
          (
          <issue>1</issue>
          ):
          <fpage>40</fpage>
          -
          <lpage>44</lpage>
          , Jan.
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>