<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <article-meta>
      <title-group>
        <article-title>Ephemeral Per-query Engines for Serverless Analytics</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Michael Wawrzoniak</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rodrigo Bruno</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ana Klimovic</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gustavo Alonso</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>INESC-ID/Técnico, U. Lisboa</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Systems Group, Computer Science Department</institution>
          ,
          <addr-line>ETH Zürich</addr-line>
          ,
          <country country="CH">Switzerland</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>1</volume>
      <issue>2023</issue>
      <fpage>0000</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>We challenge the common assumption that queries are submitted to a pre-configured, already running engine and put forward the idea of dynamically instantiating a chosen data processing engine upon query submission by leveraging Function-as-a-Service (FaaS) platforms. We demonstrate the idea by running unmodified data processing engines (we use Apache Drill as an initial example) on real-world serverless FaaS platforms and show that such engines can be instantiated on demand when a query arrives. We aim to eventually support a wide range of queries and workloads. Wide access to such functionality would be a game changer in data processing. First, it would enable pay-per-query models supporting sporadic, interactive data analysis on arbitrary engines. Second, it would significantly increase the flexibility for data processing by enabling the possibility of dynamically choosing the actual engine, its configuration, and the resource allocation on a per-query basis. Logically, this amounts to dynamically attaching a query engine to the query rather than sending the query to a pre-configured and already deployed engine. In this paper we elaborate on this vision, outline the design of the MetaQ prototype that we are building to explore the idea, demonstrate that it is realistic through initial experiments, and discuss its many exciting practical implications.</p>
      </abstract>
      <kwd-group>
        <kwd>Serverless</kwd>
        <kwd>Data Analytics</kwd>
        <kwd>Functions-as-a-Service</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Operating a long-running query engine has several limitations. First, it generates costs even if it is idle. Second, most distributed query engines lack elasticity, which leads to deployments being over-provisioned to cope with potential peak loads [<xref ref-type="bibr" rid="ref1 ref2 ref3">1, 2, 3</xref>]. And third, as workload diversity increases, each query might benefit from a different configuration and/or engine deployment (e.g., involving accelerators, caches, parallelism level, etc.), resulting in the engine often running in a less than optimal setting for most queries [<xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>].</p>
      <p>In this paper we explore an ambitious and radically new design: one in which we take advantage of serverless computing to provide ephemeral per-query engines (EPQE), i.e., query engines dynamically instantiated for each query and discarded upon completion. The ultimate goal is to be able to select the optimal engine and configuration on a per-query basis, to eliminate the inefficiencies of using all-purpose configurations and resource overprovisioning.</p>
      <p>In the EPQE paradigm, given a query, a query engine is instantiated (potentially selected from a variety of engines) in the best possible configuration and deployment for the query, the query is executed by the engine, and, upon completion, the engine is shut down (unless there is a reason to keep it running, such as a similar query arriving while the engine is active). This eliminates the need for dynamic elasticity in the engine: every query gets an engine deployed on just the resources it needs (e.g., nodes, memory, bandwidth, CPUs). It also simplifies engine deployment (since the engine can be instantiated specifically for the query at hand, e.g., maximizing data source locality) and removes the need for auto-tuning [<xref ref-type="bibr" rid="ref5">5</xref>] of long-running engines (the engine settings need to be optimized only for the given query, which allows for more specialized and efficient solutions [<xref ref-type="bibr" rid="ref6">6</xref>]). The approach also eliminates the problem of idle resources: if there is no query, there is no engine running. Finally, another crucial aspect of the idea is the possibility of selecting among different data processing engines on a per-query basis. This opens up the opportunity to use different engines depending on factors such as data types (e.g., relational, semi-structured, graphs), file formats (e.g., Arrow, Parquet, CSV, JSON), expected performance (e.g., based on previous profiling), feature set (e.g., availability of required statistical functions), or suitability to the overall task (e.g., when the query is a step in an ML pipeline). The idea resembles unikernel operating systems [<xref ref-type="bibr" rid="ref7">7</xref>] where, for each application, a specialized operating system is constructed (e.g., from a library operating system [<xref ref-type="bibr" rid="ref8">8</xref>]) and instantiated, already optimized for the application.</p>
      <p>The vision of EPQE is enabled by the emergence of Function as a Service (FaaS). In serverless computing, users deploy and invoke fine-grain functions on-demand [9, 10]. There are three main characteristics of serverless that can help in realizing the EPQE idea. First, thanks to lightweight VM system infrastructure [<xref ref-type="bibr" rid="ref11 ref12">11, 12</xref>], functions can be instantiated quickly. For example, in AWS Lambda [<xref ref-type="bibr" rid="ref13">13</xref>], function cold start initialization latency is ∼200ms. Such fast resource instantiation times allow starting a new engine for a query without contributing significantly to the overall execution time. Second, individual functions can be deployed with different CPU and memory configurations. Furthermore, thousands of functions can be instantiated in parallel. Such a level of resource availability and configurability allows us to right-size and right-configure engines at per-query granularity. Finally, FaaS platforms provide fine-grained resource accounting (e.g., AWS Lambda users pay at microsecond granularity), aligning the costs of the EPQEs to the work done; this accounting can also play a role in deciding which engine to instantiate.</p>
      <p>However, despite their advantages, today's FaaS serverless platforms are not adequate for general data processing [<xref ref-type="bibr" rid="ref14 ref15">14, 15</xref>], since running queries often requires features that are missing, such as caching, support for direct communication among functions, and persistent state. This is the result of a conscious choice by providers, who bundle functions with a very restricted programming model based on network-isolated, event-triggered modules composable into larger systems through workflow-based orchestration services [<xref ref-type="bibr" rid="ref16 ref17">16, 17</xref>]. To overcome this mismatch, a significant research effort is underway. One approach involves redesigning serverless platforms from scratch and developing a completely new FaaS platform to be run on VMs (e.g., the Anna key-value store [<xref ref-type="bibr" rid="ref18">18</xref>]). Another approach relies on commercial serverless FaaS offerings and tries to overcome some of the platform shortcomings from a data processing [<xref ref-type="bibr" rid="ref19 ref20">19, 20</xref>] or ML [<xref ref-type="bibr" rid="ref21 ref22">21, 22</xref>] perspective. These systems propose, among other things, complex ways to reduce the overhead of communicating through cloud storage and clever optimizations to minimize the amount of data exchanged, and suggest algorithms to reduce the impact of start-up times as the number of functions needed grows. In addition, there are efforts to leverage commercial serverless FaaS offerings to provide caching and storage services to data center applications running outside of the serverless functions [23, 24]. Unlike these existing efforts, which build custom experimental FaaS query engines to circumvent the limitations of serverless platforms, our approach is to leverage existing serverless infrastructure to run unmodified state-of-the-art data processing systems. By including existing unmodified engines, we will be able to take advantage of their wide variety, feature completeness, and the years of effort put into their development and optimization. However, all of the real-world FaaS platforms that we are aware of do not provide execution environments that support running off-the-shelf distributed query engines.</p>
      <p>Our approach is to leverage an evolution of the Boxer [25] system, which aims to overcome FaaS limitations (e.g., by enabling inter-function networking) to provide an execution environment on top of existing commercial FaaS platforms (such as AWS Lambda) that matches the requirements of unmodified off-the-shelf query engines. To explore the feasibility of the EPQE concept, in this paper we investigate (1) whether it is already possible to run existing query engines on a commercial serverless system (AWS Lambda); (2) whether the resulting performance is acceptable, since existing distributed query engines were not originally designed to operate on top of serverless functions; and (3) whether selecting engines on a per-query basis would bring an advantage. We build a prototype system, MetaQ, as a way to realize the EPQE model and conduct a feasibility study.</p>
      <p>Our focus is on distributed data processing platforms, such as Apache Spark or Apache Drill, instead of traditional database engines, such as PostgreSQL or MySQL. We do not analyze the cost tradeoffs of using AWS Lambda for data analytics; previous works [<xref ref-type="bibr" rid="ref19 ref20">19, 20</xref>] established that serverless can reduce costs for bursty query workloads. In particular, steady, similar, high-throughput workloads are better served by long-running systems utilizing more cost-effective infrastructure than AWS Lambda (e.g., AWS EC2 virtual machines).</p>
      <p>We report the result of using an unmodified version of Apache Drill [26] in a distributed configuration over serverless FaaS, and its performance running the TPC-H benchmark. This initial experiment shows that the EPQE approach is feasible and that, for all but one query, executing the query with the ephemeral approach is faster than the time it takes to simply instantiate a system with a matching configuration over AWS Fargate [27]/Elastic Container Service (ECS) [28] (without even starting to run any queries). We study the start-up time of a query processing engine in this context to examine its practical feasibility. Finally, we also discuss preliminary results indicating that some queries run faster in one engine (Apache Drill) than in others (Apache Spark [29]), and vice-versa, providing initial evidence that the per-query engine selection approach can bring important advantages.</p>
    </sec>
    <sec id="sec-1-1">
      <title>2. MetaQ Prototype</title>
      <p>We first outline the design of the MetaQ prototype, a proof-of-concept design of the EPQE paradigm.</p>
      <p>MetaQ has three main components: the session manager (SM), the platform provider (PP), and the meta-system optimizer (MO). The session manager oversees end-to-end query execution and its resources, including handling communication with the client. The platform provider orchestrates the required resources and configures the environment required for the query engine execution. The meta-system optimizer is used to determine the complete specification of the resources, the query engine to be used, its configuration, and possibly engine-specific query rewriting. In cases when users provide the complete specification, the meta-system optimizer can be bypassed, since the execution is fully specified.</p>
      <p>[Figure 1: Overview of query execution in MetaQ, showing the session manager (SM), the meta-system optimizer (MO), and the platform provider (PP) connecting FaaS compute to cloud storage. No compute resources are instantiated before query execution.]</p>
      <p>Figure 1 illustrates query execution in MetaQ. To execute a query, a user (step ①) starts MetaQ and specifies the query and (optionally) the specifications of the query engine and resources to use for the query execution. MetaQ launches as a serverless FaaS function that can be instantiated on demand via a request to an API proxy service of a cloud provider (such as AWS API Gateway [30]).</p>
      <p>MetaQ begins by instantiating the session manager (SM) for the given query. If the user-supplied specifications of the query engine or resources are not complete or are left underspecified, then (step ②) the meta-system optimizer (MO) is used to choose all of the missing specifications. The specification has three elements: (a) the initial resource allocation (e.g., where, how, and how many configured AWS Lambda functions should be started); (b) the query engine to use (such as Apache Drill, Apache Spark, Trino [31], etc.); and (c) the configuration of the query engine instantiation, including auxiliary systems such as Zookeeper [32] (e.g., mapping of engine executors onto the resources, configuring engine settings, required storage plugins, etc.).</p>
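      <p>To make the shape of such a specification concrete, the following is a hypothetical sketch of a complete specification covering elements (a)-(c), written here as a Python dictionary. The field names are invented for illustration and do not correspond to a documented MetaQ input format; the resource numbers mirror the configuration used in the feasibility study of Section 3.</p>
      <preformat>
# Hypothetical, illustrative MetaQ query specification covering
# (a) resources, (b) engine, and (c) engine configuration.
# Field names are invented for this sketch, not a documented format.
query_spec = {
    "query": "SELECT ... FROM lineitem ...",    # the user query (SQL)
    "resources": {                              # (a) initial resource allocation
        "platform": "aws_lambda",
        "functions": 10,
        "vcpus_per_function": 6,
        "memory_mb_per_function": 10240,
        "architecture": "x86_64",
    },
    "engine": {                                 # (b) query engine to instantiate
        "name": "apache-drill",
        "image": "catalog/drill-faas:latest",   # hypothetical image from the catalog
    },
    "configuration": {                          # (c) engine and auxiliary systems
        "roles": {"head": 1, "worker": 8, "zookeeper": 1},
        "storage_plugins": [{"type": "s3", "bucket": "tpch-sf10"}],
        "engine_settings": {},                  # stock settings, as in Section 3
    },
}
      </preformat>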
      <p>Once the complete specification is determined, it is used to instruct the platform provider (PP) (step ③) to instantiate and configure the specified resources and then start the configured query engine processes (and any auxiliary systems). The platform provider (step ④), using the specification of the initial resource allocation (a), requests the resources from the underlying platform, such as networked FaaS functions, configures their networking, and assigns the necessary names, roles, and ids to function instances. The query engine specification (b) determines which function (or container) images are instantiated from the available catalog. Finally, before the platform provider starts the query engine, the specification of the query engine configuration (c) is used to populate the necessary configuration files and environment variables for the query engine.</p>
      <p>Once the engine is started and ready to process queries, the session manager (step ⑤) submits the user query and awaits the results from the execution engine. When the query execution completes, the session manager retrieves the results (step ⑥) and returns them to the user (step ⑦). All of the resources are then released, and the system scales back to zero.</p>
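      <p>A minimal sketch of this control flow follows; all of the component interfaces (optimizer, platform provider, deployment handle) are hypothetical placeholders, since the paper does not fix the SM/MO/PP APIs.</p>
      <preformat>
# Illustrative end-to-end flow of a MetaQ session manager (steps 1-7).
# The objects and methods are hypothetical placeholders.

def run_query(request, optimizer, platform_provider):
    spec = request.spec                               # step 1: query + (partial) specification
    if not spec.is_complete():
        spec = optimizer.complete(spec)               # step 2: MO fills in missing elements
    deployment = platform_provider.instantiate(spec)  # steps 3-4: allocate, configure, start
    try:
        deployment.wait_until_ready()                 # engine and auxiliary systems come up
        result = deployment.submit(spec.query)        # step 5: submit the user query
        return result                                 # steps 6-7: retrieve and return results
    finally:
        deployment.release()                          # all resources released; scale to zero
      </preformat>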
      <p>We assume that the persistent data is stored in standard formats (such as Parquet, ORC, Avro, or CSV) and is available through cloud storage services compatible with the common query engines (such as S3 or EBS). We restrict the set of distributed query engines considered to ones that can be used in such networked shared-disk configurations.</p>
      <p>Our current prototype of MetaQ uses AWS Lambda FaaS functions. To run off-the-shelf query engines despite the restricted function execution environment, we utilize Boxer to provide the required but missing functionality. Boxer is a system that runs standard datacenter applications in FaaS environments, providing the expected network-of-hosts execution model. Boxer runs in every function, alongside the application processes, and establishes an ephemeral network between the participating functions. Boxer executes unmodified application processes (query engines and any auxiliary systems) in a FaaS environment while transparently exposing function-to-function networking via the standard POSIX interfaces (stream sockets, etc.). To facilitate configuring the unmodified distributed query engines in FaaS, Boxer is used to assign roles to functions, provide name resolution and host membership, and coordinate query engine process execution. The collection of these Boxer features provides an execution environment in AWS Lambda FaaS that closely matches what is expected by distributed query engines.</p>
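      <p>For illustration only: under this network-of-hosts model, an unmodified engine process inside a function can open ordinary TCP connections to its peers, exactly as it would on a cluster of VMs. The peer hostname below is hypothetical and assumes Boxer-provided name resolution; 31010 is Apache Drill's default user-server port.</p>
      <preformat>
# Sketch of the POSIX stream-socket model that off-the-shelf engines
# expect; the hostname is hypothetical, resolved by Boxer inside the functions.
import socket

with socket.create_connection(("drill-worker-3", 31010)) as conn:
    conn.sendall(b"...")          # engine-level protocol bytes, unchanged
    reply = conn.recv(4096)       # function-to-function traffic over Boxer's network
      </preformat>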
        <sec id="sec-1-3-1">
          <title>We first outline the design of the MetaQ prototype, a</title>
          <p>proof of concept design of the EPQE paradigm.</p>
          <p>MetaQ has three main components: session manager
(SM), platform provider (PP), and meta-system optimizer
(MO). The session manager oversees end-to-end query
execution and its resources, including handling
communication with the client. The platform provider orchestrates
the required resources and configures the environment
required for the query engine execution. The meta-system
optimizer is used to determine the complete specification
of the resources, the query engine to be used, its
conifguration, and possibly engine-specific query rewriting.</p>
          <p>In cases when users specify the complete specification,
the meta-system optimizer can be bypassed since the
execution is fully specified.</p>
          <p>Figure 1 illustrates query execution in MetaQ. To
execute a query, a user (step ○ 1 ) starts MetaQ and
speciifes the query and (optionally) the specifications of the
query engine and resources to use for the query
execution. MetaQ launches as a serverless FaaS function
that can be instantiated on demand via a request to an
API proxy service of a cloud provider (such as AWS API
Gateway [30]).</p>
          <p>MetaQ begins by instantiating the session manager</p>
          <p>Startup
Query Executution
Fargate/ECS/EC2 min. billable
Fargate/ECS Init
0.42 0.48
0
1
3
5
6
7
8</p>
          <p>9 12
TPC-H Query
13
14
16
17
18
20
establishes an ephemeral network between the partici- reduce costs for bursty query workloads. For this study,
pating functions. Boxer executes unmodified application we chose to use a variant of Boxer as our MetaQ platform
processes (query engines and any auxiliary systems) in a provider (PP) component, which allowed us to instantiate
FaaS environment while transparently exposing function- networked systems using AWS Lambda. For this initial
to-function networking via the standard POSIX interfaces validation, we assumed that along with every considered
(stream sockets etc.). To facilitate configuring the unmod- query, the user specifies the complete system
specifiified distributed query engines in FaaS, Boxer is used to cation (resource allocations, query engine specification,
assign roles to functions, provide name resolution, host and configuration). This bypasses the meta-engine
optimembership, and coordinate query engine process execu- mizer(MO), which we plan to explore in the next stages
tion. The collection of these Boxer features provides an of our research.
execution environment in AWS Lambda FaaS that closely We experiment with per-query instantiations of Apache
matches what is expected by distributed query engines. Drill, a general-purpose distributed SQL engine inspired</p>
          <p>Although we show how MetaQ can run in FaaS envi- by Google Dremel [33]. We used the TPC-H benchmark
ronments, its design is not tied to them. For example, to simulate the user queries to be evaluated using MetaQ.
MetaQ’s components (SM,MO,PP) could execute locally Using the benchmark tools, we populated S3 cloud
storon the user’s computer, and then could provide (a sub- age with data set at scale factor 10, resulting in 12 GBytes
set of) standard client protocols that many distributed of data and with the largest relation with almost 60
milquery engines often expose (such as PostgreSQL stan- lion tuples. Each TPC-H query evaluation request was
dard wire protocol or JDBC). Independently, there could accompanied by the complete query system specification
be diferent platform providers (PP) giving access to dif- specifying (a) resources for 10 AWS Lambda functions
ferent types of resources for query execution, from the with 6 vCPUs, x86_64 architecture, and 10GB of
memuser’s local resources (useful for smaller workloads) to ory each, (b) Apache Drill as the query engine (the only
serverless container services such as AWS Fargate or engine option in our experiment), and (c) stock
configfuture serverless platforms that may provide access to uration options for Apache Drill worker nodes, a head
heterogeneous hardware accelerators. node, and a single Apache Zookeeper node (required by
Apache Drill).</p>
          <p>
            The experiment emulates a session manager (SM) that
3. Feasibility study uses the Boxer system as the platform provided (PP) to
instantiate resources on AWS Lambda and to start Apache
3.1. Methodology Drill nodes (and Zookeeper). The experimental session
To validate the real-world feasibility of the EPQE paradigm, manager then waits for the query system to be available,
we experiment with some of the basic components of the and then submits the query and waits for the results,
MetaQ prototype design. We focus our analysis on the and returns on completion. In this study, to factor out
technical feasibility of MetaQ rather than analyzing its the efects of function caching, we ensure that only cold
cost tradeofs, [
            <xref ref-type="bibr" rid="ref19 ref20">19, 20</xref>
            ] have shown that serverless can functions are used for each query.
      <sec id="sec-1-2-2">
        <title>3.2. End-to-end Query Latency</title>
        <p>[Figure 2: Median end-to-end times per TPC-H query, split into startup and query execution, with the AWS Fargate/ECS initialization time and minimum billable duration shown for comparison.]</p>
        <p>Figure 2 shows the median end-to-end query execution times. Without optimizing the Drill configuration, the observed median end-to-end query execution times were between 30.42s and 65.13s. (Not all of the TPC-H queries were able to run on Drill with the current Boxer variant, due to its limit of fewer than 1024 file descriptors available to Drill, while for some queries Drill required more.) For comparison, if we chose an alternative platform provider (PP) based on a serverless container service such as AWS Fargate (using the AWS Elastic Container Service (ECS) or the AWS Elastic Kubernetes Service (EKS) [34]), we expect the execution times to be significantly higher. Such container services are not optimized for startup times, and their implementations rely on EC2 for on-demand resource allocation. We observed that the median time to just instantiate a comparable (serverless) container (8 GBytes of memory, with a 1024 MByte image size) using AWS Fargate/ECS is 54.9s (dashed line in Figure 2). This means that by the time the ECS container only begins to start the query engine, all but one of the queries executed by MetaQ are already finished and the resources are already released. Furthermore, the minimum billable duration for AWS Fargate/ECS is 1 minute, while AWS Lambda billing is at 1ms granularity with no set minimum.</p>
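        <p>The billing-granularity difference alone matters for short-lived, per-query deployments. As a back-of-the-envelope illustration (billable durations only; per-unit prices are omitted):</p>
        <preformat>
# Compare billable durations for a short-lived per-query deployment,
# using the billing rules quoted above: Fargate/ECS bills a minimum of
# 1 minute, while Lambda bills at 1 ms granularity with no minimum.

def billable_s_fargate(run_s):
    return max(run_s, 60.0)           # 1-minute minimum billable duration

def billable_s_lambda(run_s):
    return round(run_s, 3)            # 1 ms granularity, no minimum

for run_s in (30.42, 65.13):          # the fastest and slowest medians above
    print(run_s, billable_s_fargate(run_s), billable_s_lambda(run_s))
        </preformat>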
        <p>Takeaway 1: MetaQ improves performance and reduces resource usage by instantiating per-query data processing engines on FaaS infrastructure, compared to containers or virtual machines.</p>
        <p>Figure 2 shows that a significant fraction of the query execution is consumed by the startup time. The median time for the system to become ready to start executing a query is 19.67s, and (in terms of median values) that consumes between 30% (for query 9) and 67% (for query 6) of the total execution times for the queries we tested. There are many techniques that can be used to reduce this time (we have not optimized it in this experiment at all), from configuring the system to avoid starting unnecessary components to snapshotting JVM state [35, 36, 37]. Fortunately, because faster startup times are desirable for other use-cases of FaaS platforms as well, AWS Lambda recently started to offer the ability to fully snapshot the initial function state to avoid this issue [38]. We have not yet explored this feature, so the current results with FaaS should be treated as a conservative upper bound, since there are further optimizations that we can enable, such as restoring from snapshots.</p>
        <p>Takeaway 2: MetaQ does not interfere with potential optimizations that cloud providers could introduce to FaaS. Its performance will only improve with these optimizations, giving it an even bigger advantage over current solutions.</p>
      </sec>
      <sec id="sec-1-2-3">
        <title>3.3. Engine Startup Time</title>
        <p>We also examined the variance of the query execution times and the startup times. The error bars in Figure 2 show the maximum and minimum times for each query execution time relative to the median of the startup time (the variance due to the startup time is factored out). We observe a noticeable, but acceptable, variance in the query execution time, with the majority of the queries having median execution times within 10% of the slowest and fastest executions. The highest observed dispersion was for Query 20, with the slowest observed execution being 18% slower than the median.</p>
        <p>[Figure 3: Empirical CDF of the observed system startup times of all instantiations in the experiment: the time from resource instantiation to the time when the 8-worker-node Apache Drill system is ready to start executing the query.]</p>
        <p>However, when we inspect the distribution of all of the startup times during the experiment, shown in Figure 3, we observe significantly higher variance. (Note that in this experiment the startup time is independent of the executed query, since the client always specifies the same complete specification with each query request, so we do not factor the startup times by the query executed.) The startup times ranged widely, from 17.78s to 47.02s. In particular, the measurements form two groups: of the total of 70 measurements, the top 5 times (grey area in Figure 3) were above 43s, while all the remaining runs needed less than 27s to start executing the query. Our initial investigation into the source of this variance indicates that its main contributor is the time for the Apache Drill workers to become available after their processes are started. Since the variance does not persist into the query execution times, it suggests that these stragglers are not due to their function execution context being (permanently) resource constrained. It is possible that these functions had to fetch base images from a deeper storage hierarchy as the worker processes were loading blocks of data during startup. This suggests that meta-system optimizer (MO) strategies should consider instantiating additional workers to compensate for this straggler phenomenon; once enough workers are available, MetaQ could then terminate the unnecessary stragglers. A similar technique is already performed internally by the Boxer platform provider: depending on its configuration, Boxer already instantiates additional functions, proceeds with the requested number of functions that became available first, and immediately terminates the rest of the slower, unnecessary functions. Notice that such straggler-discarding techniques are feasible on FaaS because of the fine granularity of accounting and the absence of a minimum billing time.</p>
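        <p>A sketch of such a straggler-discarding start-up strategy follows. It is a hypothetical policy in the spirit of what Boxer already does internally, not its actual implementation: launch more workers than needed, proceed with the first ones that become ready, and terminate the rest as they appear.</p>
        <preformat>
# Hypothetical straggler mitigation: over-provision worker functions and
# keep only the first `needed` that become ready. `launch_worker()` is a
# placeholder that blocks until one worker is ready and returns a handle
# with a terminate() method.
import concurrent.futures

def start_workers(launch_worker, needed, extra=2):
    pool = concurrent.futures.ThreadPoolExecutor(max_workers=needed + extra)
    futures = [pool.submit(launch_worker) for _ in range(needed + extra)]
    first_ready = []
    for fut in concurrent.futures.as_completed(futures):
        first_ready.append(fut)
        if len(first_ready) == needed:
            break                     # enough workers; do not wait for stragglers
    for fut in futures:
        if fut not in first_ready:
            # discard each straggler as soon as it comes up; fine-grained
            # billing means it stops costing money immediately
            fut.add_done_callback(lambda f: f.result().terminate())
    pool.shutdown(wait=False)
    return [f.result() for f in first_ready]
        </preformat>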
        <p>Takeaway 3: Although limited in scope, the experiment demonstrates the real-world feasibility of the per-query engine paradigm. This very simple experiment leaves many possibilities for future improvements, but it already highlights the potential of our vision and motivates the further exploration of the design space and future work on the MetaQ prototype.</p>
      </sec>
      <sec id="sec-1-2-4">
        <title>3.4. Selecting Query Engines</title>
        <p>A key aspect of EPQE is the possibility of choosing a different engine for each query. Although further investigation is necessary, our preliminary comparison of the query execution times of Apache Drill and Apache Spark indicates that there likely will be a performance gain from choosing different engines at per-query granularity. We measured the query execution times of TPC-H at scale factor 30 for Apache Drill and Apache Spark using 8 AWS Lambda worker nodes. Ignoring the startup times and based only on the relative query execution times, we observe that for 14 of the queries (1, 4, 5, 6, 7, 9, 10, 11, 12, 13, 15, 16, 17, 22) Drill noticeably outperformed Spark, for 3 queries (14, 19, 20) Spark outperformed Drill, for 2 queries (8, 18) performance was similar, while 3 queries were completed by only one of the two engines (2 and 21 by Spark only, 3 by Drill only). These initial results suggest that, indeed, the notion of instantiating a different engine depending on the query can be beneficial. This opens up very interesting research questions in terms of how an optimizer could decide which system to use.</p>
      </sec>
    </sec>
    <sec id="sec-1-3">
      <title>4. Use Cases</title>
      <p>In this section we explore use cases that could either be implemented on top of the MetaQ prototype or would require additional work on several aspects of the system and further research.</p>
      <sec id="sec-1-3-1">
        <title>4.1. More Efficient Data Analytics</title>
        <p>There is a growing amount of work exploring how to best use commercial serverless platforms for data analytics. Lambada [<xref ref-type="bibr" rid="ref19">19</xref>] and Starling [<xref ref-type="bibr" rid="ref20">20</xref>] both offer a data-analytics platform on top of serverless. Others have explored the benefits and pitfalls of running ML training and inference on FaaS [<xref ref-type="bibr" rid="ref21 ref22">21, 22</xref>]. In all these cases, a major limitation is that serverless functions are stateless and exchange data through remote storage services (e.g., S3). Hence, for each query or task deployed on FaaS, a significant portion of time is spent reading/writing data from/to storage. Complex queries that require shuffling data become even more of a problem by requiring multiple rounds of access to storage servers, thereby further increasing the overhead. A lot of prior work focuses on how to mitigate the data-passing limitations of FaaS infrastructure by constructing custom experimental systems.</p>
        <p>A first contribution, and potentially the first application, of the idea behind MetaQ is that it aims to run existing platforms without having to wait until a suitable new data processing or ML engine is developed matching the characteristics of serverless. Our approach enables running complex data processing tasks at a large scale using existing mature systems, using a variety of engines tailored to the query and data at hand, and deploying at the scale needed, while still maintaining all the advantages of serverless.</p>
      </sec>
      <sec id="sec-1-3-2">
        <title>4.2. Dynamically Extensible Engines</title>
        <p>Data processing engines, such as traditional relational databases or many SQL-centric distributed platforms, are limited along two dimensions. One is in terms of deployment, as only one configuration is available at any time. This leads to overprovisioning to make sure the system can cope with any possible workload. The other is in terms of functionality. Very often, data is processed in these engines and then needs to be moved to other systems for further processing (e.g., ML training, statistical analysis, visualization). MetaQ can be used as an extension of existing engines to address these two problems. In the same way that we show one can launch a complete data processing system on serverless when a query arrives, an existing engine running on a VM could do the same to obtain additional capacity when necessary. For example, the basic mechanism presented here can be used to have Apache Drill launch additional ephemeral engines when the long-running system is not able to cope with the additional load. Recently, a similar approach has been explored by modifying an existing system: Pixels-Turbo [39] is an extension of the Pixels [40] query engine that can instantiate query engines in AWS Lambda functions to add elasticity to a system instantiated on long-running VMs. In the case of missing functionality for some tasks, the transition to another system can be done by triggering the corresponding system once the data processing engine finishes. This eliminates the need to have both systems running all the time and helps to automate the process, rather than copying the data and transferring it manually to the other system (and then copying the results back).</p>
        <p>Complementary to these ideas is the notion of deploying a minimalist system (i.e., one requiring much fewer resources) on permanent infrastructure using VMs, and then using the mechanisms of MetaQ to launch a more complete version of the system (or one tailored exactly to the task at hand) when queries arrive that require the more advanced functionality.</p>
      </sec>
      <sec id="sec-1-3-3">
        <title>4.3. User-owned Data Analytics Stack</title>
        <p>Cloud providers offer a set of Query-as-a-Service platforms, such as AWS Athena [41], which provide a simplified interface for large-scale analytics and charge users per byte read. However, users may still prefer to run their queries on a data analytics stack that they fully control (e.g., to optimize parameters and hardware configurations for their workloads). MetaQ enables users to run their own data analytics stack while still benefiting from simple abstractions and a convenient pay-per-query cost model, as resources can be acquired and released on demand in response to load. As Palkar and Zaharia point out, users may also prefer to run their own analytics engines and web services, rather than relying on out-of-the-box cloud solutions, for privacy reasons [42].</p>
        <p>This is especially true when queries involve UDFs, as these are more difficult to securely isolate in shared infrastructure deployments. By operating their own data analytics stack, users get to control how the system is configured and monitor how they are billed for the work performed for a particular task.</p>
      </sec>
      <sec id="sec-1-3-4">
        <title>4.4. Data Lakes</title>
        <p>Data Lakes refer to collections of heterogeneous data that need to be processed in a variety of different ways. The problem with this notion is that the processing is also highly heterogeneous, and it is the user who is responsible for handling it. Lakehouses are a new iteration of the concept that incorporates data processing as a first-class citizen and provides support for different engines, languages, etc., while automating as much as possible the task of matching data to engines and tools [43].</p>
        <p>MetaQ is well-suited to Lakehouses, as it enables dynamically selecting the engine and processing tools on the fly, on the basis of, e.g., data types, data sizes, type of query, user requirements, or cost. Furthermore, the per-query engine vision enables an intriguing possibility: sharing auxiliary data structures across engines (indexes, partitions, zone maps, etc.), as well as creating a general infrastructure that is engine agnostic (e.g., a main-memory caching layer that avoids having to retrieve data from slow storage every time, or a results cache). Such infrastructure exists, but it is typically system specific. MetaQ opens up the possibility of treating these aspects as orthogonal to the actual engine. In the extreme, all common modules of query engines could become serverless components dynamically added to an engine as it is instantiated with the query-specific functionality.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>5. Research Opportunities</title>
      <sec id="sec-2-1">
        <title>The idea of EPQE behind MetaQ opens a number of interesting research directions which we now highlight.</title>
        <p>5.1. The Meta-Engine</p>
      </sec>
      <sec id="sec-2-2">
        <title>EPQE unlock a number of opportunities when it comes</title>
        <p>EPQE unlocks a number of opportunities when it comes to selecting the most appropriate engine for each query. This can be done in a very simple manner by, for instance, asking the user to specify which engine to use. However, we are interested in automating the selection process by building an end-to-end query system that handles this. In a scenario where users write queries in an engine-agnostic syntax (for example, in a declarative language such as SQL), MetaQ's meta-system optimizer could inspect the query and determine which engine is the most efficient given the data types, the nature of the data (static or streaming), the type of operations required, etc. This leads to cross-engine optimizations, such as picking the engine that is fastest at a given operation provided by several engines. The main research question is how to derive meta-system optimizer policies. One possible approach is to extend the domain of automatic configuration systems [<xref ref-type="bibr" rid="ref5">5</xref>] with the additional tasks of choosing not just the configuration parameters for a query engine, but also the query engine itself and the resource allocation, based on the query considered, eventually realizing the vision of vertically integrated per-query optimization.</p>
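        <p>As a strawman, a first meta-system optimizer policy could be a simple rule- and profile-based dispatcher. The rules and the profile data below are invented for illustration (loosely echoing the Drill/Spark observations of Section 3.4) and are not a proposed design:</p>
        <preformat>
# Strawman engine-selection policy for the meta-system optimizer.
# Rules and profile numbers are illustrative only.

PROFILE = {  # median runtimes (s) from hypothetical prior profiling
    ("drill", "tpch-q1"): 9.2,
    ("spark", "tpch-q1"): 14.8,
}

def choose_engine(query_class, is_streaming=False, in_ml_pipeline=False):
    if is_streaming:
        return "flink"   # rule: streaming input needs a streaming engine
    if in_ml_pipeline:
        return "spark"   # rule: the query is a step in an ML pipeline
    candidates = {e: t for (e, q), t in PROFILE.items() if q == query_class}
    if candidates:
        return min(candidates, key=candidates.get)  # fastest on profile data
    return "drill"       # default general-purpose SQL engine
        </preformat>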
      </sec>
      <sec id="sec-2-3">
        <title>With a new deployment being launched and shut down</title>
        <p>With a new deployment being launched and shut down per query, it is now possible to optimize, for every query, the deployment in which the engine will run. Such a deployment configuration could determine the amount of resources used, such as the CPU and/or memory budget.</p>
        <p>Such a configuration could be inferred by analyzing the query and the data inputs to estimate the amount of data that would be processed and, therefore, the amount of compute and memory necessary to finish the query within a particular time frame. From another perspective, it is now possible to dynamically find tradeoffs between execution time and price for each query. This tradeoff could also be exposed to users as a way to prioritize interactive queries over batch workloads. One illustrative sizing heuristic is sketched below.</p>
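        <p>Purely as an illustration, such a sizing heuristic could take the following shape; every constant below is invented:</p>
        <preformat>
# Illustrative per-query sizing heuristic; all constants are invented.

def size_deployment(est_input_bytes, target_seconds):
    per_worker_scan_rate = 200e6      # assumed bytes/s scanned per worker
    workers = max(1, round(est_input_bytes / (per_worker_scan_rate * target_seconds)))
    return {
        "workers": workers,
        "memory_mb_per_worker": 10240,  # cf. the configuration in Section 3.1
    }

# e.g., 12 GB of input with a 10 s target suggests about 6 workers
print(size_deployment(12e9, 10.0))
        </preformat>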
      </sec>
      <sec id="sec-2-3">
        <title>5.3. Query Scheduling and Caching</title>
        <p>Beyond automatically sizing and optimizing per-query deployments, it is also possible to schedule query execution on nodes that have some locally cached data or that are close to storage nodes. For example, if a workload requires two queries to be executed, the second query could be scheduled for execution on the same physical node(s) used to execute the previous one. To keep data local, caching approaches such as the Faa$t cache [44] can also be used to keep the output of queries.</p>
      </sec>
      <sec id="sec-2-4">
        <title>To implement inter-function communication, MetaQ pro</title>
        <p>totype uses Boxer as its platform provider. Boxer (and
therefore MetaQ) do not require any cloud provider
intervention and can be deployed today in AWS Lambda.
However, Boxer is not yet feature complete in terms of
interfaces, networking support, reliability, and
integration within larger systems. That is something that we
are working on at the moment so as to have a more
solid basis for the system. Similarly, Boxer was initially
built for AWS Lambda. We are in the process of
studying how to port Boxer to other commercial serverless
oferings. Doing so would open yet another wave of
exciting opportunities, like triggering serverless jobs across
heterogeneous clouds using the networking capabilities
available in Boxer.
5.5. Generalizing to Other Engines
Our experiments are only a first step towards the
perquery engine vision. We plan to test this paradigm and
our MetaQ prototype on a wider range of data processing
engines and platforms on top of the existing prototype
to make sure it can indeed be used as a general-purpose
distributed computing platform equivalent to what can
be done on a VM. Systems that we are in the process
of testing include Apache Spark, Trino, Databend [45],
Flink [46], Clickhouse [47]. Having them running on the
same serverless platform will also ofer a great
opportunity to study the engine designs that are most suitable
for serverless, providing very valuable information on
the road toward serverless native engines.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>6. Conclusion</title>
      <sec id="sec-3-1">
        <title>Distributed data processing engines often require to have a fixed underlying infrastructure to run in the form of pre-allocated VMs, Virtual Private Networks, and other</title>
        <p>[38] Improving startup performance with Lambda Snap- house: A new generation of open platforms that
Start, 2023. URL: https://docs.aws.amazon.com/ unify data warehousing and advanced analytics, in:
lambda/latest/dg/snapstart.html, (accessed: 2023- 11th Conference on Innovative Data Systems
Re03-01). search, CIDR 2021, Virtual Event, January 11-15,
[39] H. Bian, T. Sha, A. Ailamaki, Using cloud functions 2021, Online Proceedings, www.cidrdb.org, 2021.</p>
        <p>as accelerator for elastic data analytics 1 (2023). [44] F. Romero, G. I. Chaudhry, I. n. Goiri, P. Gopa, P.
Ba[40] H. Bian, A. Ailamaki, Pixels: An eficient column tum, N. J. Yadwadkar, R. Fonseca, C. Kozyrakis,
store for cloud data lakes, in: 2022 IEEE 38th Inter- R. Bianchini, Faa$t: A transparent auto-scaling
national Conference on Data Engineering (ICDE), cache for serverless applications, Association for
2022, pp. 3078–3090. Computing Machinery, New York, NY, USA, 2021.
[41] Amazon Athena, 2020. URL: http://docs.aws. [45] Databend, 2023. URL: https://databend.rs/,
(acamazon.com/athena/, (accessed: 2020-08-17). cessed: 2023-06-20).
[42] S. Palkar, M. Zaharia, Diy hosting for online privacy, [46] Apache Flink, 2023. URL: https://flink.apache.org/,
in: Proceedings of the 16th ACM Workshop on Hot (accessed: 2023-03-01).</p>
        <p>Topics in Networks, HotNets-XVI, 2017. [47] ClickHouse, 2023. URL: https://clickhouse.com/,
(ac[43] M. Zaharia, A. Ghodsi, R. Xin, M. Armbrust, Lake- cessed: 2023-06-20).</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Vuppalapati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Miron</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Agarwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Truong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Motivala</surname>
          </string-name>
          , T. Cruanes,
          <article-title>Building an elastic query engine on disaggregated storage</article-title>
          ,
          <source>in: Proceedings of the 17th USENIX Conference on Networked Systems Design and Implementation</source>
          , NSDI '20, USENIX Association, USA,
          <year>2020</year>
          , pp.
          <fpage>449</fpage>
          -
          <lpage>462</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Tan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Babu</surname>
          </string-name>
          ,
          <article-title>Tempo: Robust and self-tuning resource management in multi-tenant parallel databases</article-title>
          ,
          <source>Proc. VLDB Endow</source>
          .
          <volume>9</volume>
          (
          <year>2016</year>
          )
          <fpage>720</fpage>
          -
          <lpage>731</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Das</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. R.</given-names>
            <surname>Narasayya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. C.</given-names>
            <surname>König</surname>
          </string-name>
          ,
          <article-title>Automated demand-driven resource scaling in relational database-as-a-service</article-title>
          ,
          <source>in: Proceedings of the 2016 International Conference on Management of Data, SIGMOD '16</source>
          ,
          <year>2016</year>
          , p.
          <fpage>1923</fpage>
          -
          <lpage>1934</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Augusta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Idreos</surname>
          </string-name>
          , Jafar:
          <article-title>Near-data processing for databases</article-title>
          ,
          <source>in: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, SIGMOD '15</source>
          ,
          <year>2015</year>
          , p.
          <fpage>2069</fpage>
          -
          <lpage>2070</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D. V.</given-names>
            <surname>Aken</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Brillard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Fiorino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Billian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pavlo</surname>
          </string-name>
          ,
          <article-title>An inquiry into machine learning-based automatic configuration tuning services on real-world database management systems</article-title>
          ,
          <source>Proc. VLDB Endow</source>
          .
          <volume>14</volume>
          (
          <year>2021</year>
          )
          <fpage>1241</fpage>
          -
          <lpage>1253</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhu</surname>
          </string-name>
          , J. Liu,
          <string-name>
            <given-names>M.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Bao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Ma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Song</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <article-title>Bestconfig: Tapping the performance potential of systems via automatic configuration tuning</article-title>
          , Association for Computing Machinery, New York, NY, USA,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A.</given-names>
            <surname>Madhavapeddy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mortier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Rotsos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Scott</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Gazagnaire</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hand</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Crowcroft</surname>
          </string-name>
          ,
          <article-title>Unikernels: Library operating systems for the cloud</article-title>
          , Association for Computing Machinery, New York, NY, USA,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>[8] S. Kuenzer, V.-A. Bădoiu, H. Lefeuvre, S. Santhanam, A. Jung, G. Gain, C. Soldani, C. Lupu, Ș. Teodorescu, C. Răducanu, C. Banu, L. Mathy, R. Deaconescu, C. Raiciu, F. Huici, Unikraft: Fast, specialized unikernels the easy way, Association for Computing Machinery, New York, NY, USA, 2021.</mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>[22] Y. Wu, T. T. A. Dinh, G. Hu, M. Zhang, Y. M. Chee, B. C. Ooi, Serverless data science - are we there yet? A case study of model serving, in: SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12-17, 2022, 2022.</mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>[9] J. Schleier-Smith, V. Sreekanti, A. Khandelwal, J. Carreira, N. J. Yadwadkar, R. A. Popa, J. E. Gonzalez, I. Stoica, D. A. Patterson, What serverless computing is and should become: The next phase of cloud computing, Commun. ACM 64 (2021) 76-84.</mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>[23] A. Wang, J. Zhang, X. Ma, A. Anwar, L. Rupprecht, D. Skourtis, V. Tarasov, F. Yan, Y. Cheng, InfiniCache: Exploiting ephemeral serverless functions to build a cost-effective memory cache, in: USENIX FAST, 2020.</mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>[10] P. Castro, V. Ishakian, V. Muthusamy, A. Slominski, The rise of serverless computing, Commun. ACM 62 (2019) 44-54.</mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>[24] J. Zhang, A. Wang, X. Ma, B. Carver, N. J. Newman, A. Anwar, L. Rupprecht, V. Tarasov, D. Skourtis, F. Yan, Y. Cheng, InfiniStore: Elastic serverless cloud storage 16 (2023).</mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Agache</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Brooker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Iordache</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . Liguori,
          <source>cloud storage 16</source>
          (
          <year>2023</year>
          ). R. Neugebauer,
          <string-name>
            <given-names>P.</given-names>
            <surname>Piwonka</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.-M. Popa</surname>
            , Firecracker: [25]
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Wawrzoniak</surname>
            ,
            <given-names>I. Müller</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Bruno</surname>
          </string-name>
          , G. Alonso,
          <article-title>Lightweight virtualization for serverless applica- Boxer: Data analytics on network-enabled servertions</article-title>
          ,
          <source>in: NSDI</source>
          ,
          <year>2020</year>
          . less platforms,
          <source>in: CIDR</source>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>L.</given-names>
            <surname>Ao</surname>
          </string-name>
          , G. Porter,
          <string-name>
            <given-names>G. M.</given-names>
            <surname>Voelker</surname>
          </string-name>
          , Faasnap: Faas made [26]
          <string-name>
            <given-names>Apache</given-names>
            <surname>Drill</surname>
          </string-name>
          ,
          <year>2022</year>
          . URL: https://drill.apache.org/,
          <article-title>fast using snapshot-based vms, Association for (accessed: 2022-</article-title>
          <volume>10</volume>
          -20). Computing Machinery, New York, NY, USA,
          <year>2022</year>
          . [27]
          <string-name>
            <given-names>AWS</given-names>
            <surname>Fargate -</surname>
          </string-name>
          <article-title>Serverless compute for containers,</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>AWS</given-names>
            <surname>Lambda</surname>
          </string-name>
          ,
          <year>2020</year>
          . URL: https://aws.amazon.com/ 2023-03-
          <fpage>01</fpage>
          . URL: https://aws.amazon.com/fargate/, lambda, (accessed:
          <fpage>2020</fpage>
          -08-17). (accessed:
          <fpage>2023</fpage>
          -03-01).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>J. M. Hellerstein</surname>
            ,
            <given-names>J. M.</given-names>
          </string-name>
          <string-name>
            <surname>Faleiro</surname>
          </string-name>
          , J. Gonzalez, [
          <volume>28</volume>
          ]
          <string-name>
            <given-names>Amazon</given-names>
            <surname>Elastic Container Service (Amazon</surname>
          </string-name>
          <string-name>
            <given-names>ECS</given-names>
            ), J.
            <surname>Schleier-Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Sreekanti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tumanov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <year>2023</year>
          . URL: https://aws.amazon.com/ecs/,
          <article-title>(accessed: Serverless computing: One step forward</article-title>
          ,
          <source>two steps</source>
          <year>2023</year>
          -
          <volume>03</volume>
          -01). back, in: CIDR,
          <year>2019</year>
          . [29]
          <string-name>
            <given-names>M.</given-names>
            <surname>Zaharia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. S.</given-names>
            <surname>Xin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Wendell</surname>
          </string-name>
          ,
          <string-name>
            <surname>T. Das</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          Arm-
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>L.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , T. Ristenpart, M. Swift, brust,
          <string-name>
            <given-names>A.</given-names>
            <surname>Dave</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Meng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Rosen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Venkataraman</surname>
          </string-name>
          ,
          <article-title>Peeking behind the curtains of serverless platforms, M. J</article-title>
          .
          <string-name>
            <surname>Franklin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Ghodsi</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Gonzalez</surname>
          </string-name>
          , S. Shenker, in
          <source>: Proceedings of the 2018 USENIX Conference I. Stoica</source>
          ,
          <article-title>Apache spark: A unified engine for big on Usenix Annual Technical Conference</article-title>
          ,
          <source>USENIX data processing 59</source>
          (
          <year>2016</year>
          ).
          <source>ATC '18</source>
          ,
          <year>2018</year>
          . [30]
          <string-name>
            <surname>Amazon</surname>
            <given-names>API Gateway</given-names>
          </string-name>
          ,
          <year>2023</year>
          . URL: https://aws.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>AWS</given-names>
            <surname>Step Functions</surname>
          </string-name>
          ,
          <year>2023</year>
          . URL: https://aws. amazon.com/api-gateway/, (accessed:
          <fpage>2023</fpage>
          -03-01). amazon.com/step-functions/, (accessed:
          <fpage>2023</fpage>
          -03- [31]
          <string-name>
            <surname>Trino</surname>
          </string-name>
          ,
          <year>2023</year>
          . URL: https://trino.io/, (accessed:
          <fpage>2023</fpage>
          -
          <lpage>01</lpage>
          ).
          <fpage>06</fpage>
          -
          <lpage>20</lpage>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>Azure</given-names>
            <surname>Durable Functions</surname>
          </string-name>
          ,
          <year>2023</year>
          . URL: [32]
          <string-name>
            <given-names>P.</given-names>
            <surname>Hunt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Konar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. P.</given-names>
            <surname>Junqueira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Reed</surname>
          </string-name>
          , https://learn.microsoft.com/en-us/azure/ Zookeeper:
          <article-title>Wait-free coordination for internetazure-functions/durable/, (accessed: 2023-03- scale systems</article-title>
          ,
          <source>USENIX ATC'10</source>
          ,
          <year>2010</year>
          .
          <volume>01</volume>
          ). [33]
          <string-name>
            <given-names>S.</given-names>
            <surname>Melnik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gubarev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Long</surname>
          </string-name>
          , G. Romer, S. Shiv-
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>V.</given-names>
            <surname>Sreekanti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X. C.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Schleier-Smith</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. E.</surname>
          </string-name>
          akumar, M. Tolton, T. Vassilakis, Dremel: InteracGonzalez,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Hellerstein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tumanov</surname>
          </string-name>
          ,
          <article-title>Cloud- tive analysis of web-scale datasets, 2010. burst: Stateful functions-as-a-service</article-title>
          ,
          <source>Proc. VLDB [34] Amazon Elastic Kubernetes Service (EKS)</source>
          ,
          <year>2023</year>
          . Endow.
          <volume>13</volume>
          (
          <year>2020</year>
          )
          <fpage>2438</fpage>
          -
          <lpage>2452</lpage>
          . URL: https://aws.amazon.com/eks/, (accessed:
          <fpage>2023</fpage>
          -
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>I. Müller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Marroquín</surname>
          </string-name>
          , G. Alonso, Lambada: Inter-
          <volume>03</volume>
          -01).
          <article-title>active data analytics on cold data using serverless</article-title>
          [35]
          <string-name>
            <given-names>W.</given-names>
            <surname>Shin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.-H.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Min</surname>
          </string-name>
          ,
          <article-title>Fireworks: A fast, cloud infrastructure</article-title>
          ,
          <source>in: SIGMOD</source>
          ,
          <year>2020</year>
          . eficient, and
          <article-title>safe serverless framework using vm-</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>M.</given-names>
            <surname>Perron</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. Castro</given-names>
            <surname>Fernandez</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.</surname>
          </string-name>
          <article-title>DeWitt, S. Mad- level post-jit snapshot, Association for Computing den, Starling: A scalable query engine on cloud Machinery</article-title>
          , New York, NY, USA,
          <year>2022</year>
          . functions, in: SIGMOD,
          <year>2020</year>
          . [36]
          <string-name>
            <given-names>D.</given-names>
            <surname>Du</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Xia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Zang</surname>
          </string-name>
          , G. Yan,
          <string-name>
            <given-names>C.</given-names>
            <surname>Qin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>J.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Alonso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Chen</surname>
          </string-name>
          , Catalyzer:
          <article-title>Sub-millisecond startup for A</article-title>
          .
          <string-name>
            <surname>Klimovic</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Singla</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>C. Zhang,</given-names>
          </string-name>
          <article-title>Towards serverless computing with initialization-less bootdemystifying serverless machine learning training, ing</article-title>
          , in: ASPLOS,
          <year>2020</year>
          . in: Proceedings of the 2021 International Confer- [37]
          <string-name>
            <given-names>J.</given-names>
            <surname>Cadden</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Unger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Awad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Dong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Krieger</surname>
          </string-name>
          ,
          <source>ence on Management of Data</source>
          ,
          <year>2021</year>
          , p.
          <fpage>857</fpage>
          -
          <lpage>871</lpage>
          . J.
          <string-name>
            <surname>Appavoo</surname>
          </string-name>
          , Seuss:
          <article-title>Skip redundant paths to make serverless fast</article-title>
          , in: EuroSys,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>