-

Development of a Parallel DBMS on the Basis of PostgreSQL

c Constantin Pan

0 1

M.Sc. advisor: Mikhail Zymbler

0 1 0 Proceedings of the Spring Researcher's Colloquium on Database and Information Systems , Moscow, Russia, 2011 1 South Ural State University

2 6

The paper describes the architecture and the design of PargreSQL parallel database management system (DBMS) for distributed memory multiprocessors. PargreSQL is based upon PostgreSQL open-source DBMS and exploits partitioned parallelism.

Daemon

Currently open-source PostgreSQL DBMS [ 12 ] is a reliable alternative for commercial DBMSes. There are many both practical database applications based upon PostgreSQL and research projects devoted to extension and improvement of PostgreSQL.

One of the directions mentioned above is to adapt PostgreSQL for parallel query processing. In this paper we describe the architecture and design of PargreSQL parallel DBMS for analytical data processing on distributed multiprocessors. PargreSQL represents PostgreSQL with embedded partitioned parallelism.

The paper is organized as follows. Section 2 briefly discusses related work. Section 3 gives a description of the PostgreSQL DBMS architecture. Section 4 introduces design principles and architecture of PargreSQL DBMS. The results of experiments on the current partial implementation are shown in section 5. Section 6 contains concluding remarks and directions for future work.

Related work

The research on extension and improvement of PostgreSQL DBMS includes the following.

In [ 10 ] native XML type support in PostgreSQL is discussed. Adding data types to provide support of HL7 medical information exchange standard in PostgreSQL is described in [ 4 ]. The authors of [ 3 ] propose an imagehandling extension to PostgreSQL. In [ 8 ] an approach to integration of PostgreSQL with the Semantic Web is presented.

There are papers investigating adoption of PostgreSQL for parallel query processing as well. In [ 6 ] authors introduce their work on extending PostgreSQL

This paper is supported by the Russian Foundation for Basic Research (grant No. 09-07-00241-a). to support distributed query processing. Several limitations in PostgreSQL’s query engine and corresponding query execution techniques to improve performance of distributed query processing are presented. ParGRES [ 9 ] is an open-source database cluster middleware for high performance OLAP query processing. ParGRES exploits intra-query parallelism on PC clusters and uses adaptive virtual partitioning of the database. GParGRES [ 5 ] exploits database replication and inter- and intra-query parallelism to efficiently support OLAP queries in a grid. The approach has two levels of query splitting: gridlevel splitting, implemented by GParGRES, and nodelevel splitting, implemented by ParGRES.

In [ 1 ] building a hybrid between MapReduce and parallel database is explored. The authors created a prototype named HadoopDB on the basis of Hadoop and PostgreSQL, that is as efficient as parallel DBMS, but as scalable, fault tolerant and flexible as MapReduce systems. PostgreSQL is used as the database layer and Hadoop as the communication layer.

Our contribution is embedding partitioned parallelism [ 2 ] into PostgreSQL. We use methods for parallel query processing, proposed in [ 11 ] and [ 7 ]. 3

PostgreSQL Architecture

PostgreSQL is based on the client-server model. A session involves three processes into interaction: a frontend, a backend and a daemon (see fig. 1).

connects 1

1 -executor 1

The daemon handles incoming connections from frontends and creates a backend for each one. Each backend executes queries received from the related frontend. The activity diagram of a PostgreSQL session is shown in fig. 2.

There are following steps of query processing in PostgreSQL: parse, rewrite, plan/optimize, and execute.

Respective PostgreSQL subsystems are depicted in fig. 3. Parser checks the syntax of the query string and builds a parse tree. Rewriter processes the tree according accept fork connect send query recv result [more queries] else exec query send result

libpq to the rules specified by the user (e.g. view definitions). Planner creates an optimal execution plan for this query tree. Executor takes the execution plan and processes it recursively from the root. Storage provides functions to store and retrieve tuples and metadata. libpq implements frontend-backend interaction protocol and consists of two parts: the frontend (libpq-fe) and the backend (libpq-be). The former is deployed on the client side and serves as an API for the end-user application. The latter is deployed on the server side and serves as an API for libpq-fe, as shown in fig. 4. 4

PargreSQL Architecture

PargreSQL utilizes the idea of partitioned parallelism [ 7 ] as shown in fig. 5. This form of parallelism supposes partitioning relations among the disks of the multiprocessor system.

The way the partitioning is done is defined by a fragmentation function, which for each tuple of the relation S0 P9 S9 ⋮ g n i g re ⋮ M Partitioning function calculates the number of the processor node which this tuple should be placed at. A query is executed in parallel on all processor nodes as a set of parallel agents. Each agent processes its own fragment and generates a partial query result. The partial results are merged into the resulting relation.

The architecture of PargreSQL, in contrast with PostgreSQL, assumes that a client connects to two or more servers (see fig. 6).

connects n k par_Frontend

The interaction sequence is shown in fig. 7. As opposed to PostgreSQL there are many daemons running in PargreSQL. A frontend connects to each of them, sends the same query to many backends, and receives the result relation.

2.1: create() d1 : Daemon

b1 : par_Backend 3.1: sendquery() 5.1: sendresult() 4.n: exchange() 1.1: connect() 1.n: connect() f : par_Frontend 3.n: sendquery() 2.n: create() dn : Daemon bn : par_Backend 5.n: sendresult() 4.1: exchange()

Parallel query processing in PargreSQL is done in more steps: parse, rewrite, plan/optimize, parallelize, execute, and balance. During the query execution each agent processes its own part of the relation independently so, to obtain the correct result, transfers of tuples are required. Parallelization stages creation of a parallel plan by inserting special exchange operators into the corresponding places of the plan. Balance provides loadbalancing of the server nodes.

PargreSQL subsystems are depicted in fig. 8. PostgreSQL is one of them. PargreSQL development involves changes in Storage, Executor and Planner subsystems of PostgreSQL.

The changes in the old code are needed to integrate it with the new subsystems. par Storage is responsible for storing partitioning metadata of relations. par Exchange encapsulates the exchange operator implementation. Exchange operator is meant to compute the distribution function for each tuple of the relation, send “alien” tuples to the other nodes, and receive “own” tuples in response.

There are however some new subsystems which do not require any changes in the old code: par libpq-fe and par Compat. par libpq-fe is a wrapper around libpq-fe, it is needed to propagate queries from an application to many servers. par Compat makes this propagation transparent to the application. app

MPS subsystem (Message Passing System) is used by Scatter and Gather to transmit tuples. Its interface is like MPI reduced to three methods: ISend, IRecv, and Test. They are actually implemented on top of MPI.

Figs. 14, 15, 16, and 17 show algorithms for next() method of four exchange subnodes. Exchange operator [ 7, 11 ] serves to exchange tuples between parallel agents. It is inserted into execution plans by Parallelizer subsystem. The operator’s architecture is presented in fig. 12.

Split is meant to calculate fragmentation function for each tuple and choose whether to keep it on the processor node or send it to other processor node.

even := not even [even]

[odd] right.next left.next tuple left.next right.next tuple

Experimental Evaluation

At the moment we have implemented par libpq and par Exchange subsystems of PargreSQL. The implementation has been tested on the following query: select * from tab where tab.col % 10000 = 0

The query has been run against table tab consisting of 108 tuples. The speedup relative to PostgreSQL is shown in fig. 18.

6 5 p4 u d e e 3 p S 2 1

Linear

Actual 2 3 4 5

Nodes

[1]

Azza

Abouzeid , Kamil Bajda-Pawlikowski, Daniel J. Abadi, Alexander Rasin, and Avi Silberschatz. HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads . PVLDB , 2 ( 1 ): 922 - 933 , 2009 .

[2] David J. DeWitt and

Jim

Gray . Parallel Database Systems: The Future of High Performance Database Systems. Commun. ACM , 35 ( 6 ): 85 - 98 , 1992 .

[3]

Denise

Guliato , Ernani V. de Melo, Rangaraj M. Rangayyan , and Robson

Soares. POSTGRESQL-IE: An Image-handling Extension for PostgreSQL . J. Digital Imaging , 22 ( 2 ): 149 - 165 , 2009 .

[4]

Yeb

Havinga , Willem Dijkstra, and Ander de Keijzer. Adding HL7 version 3 data types to PostgreSQL . CoRR, abs/1003.3370, 2010 .

[5]

Nelson

Kotowski , Alexandre A. B. Lima , Esther Pacitti, Patrick Valduriez, and Marta Mattoso . Parallel query processing for OLAP in grids . Concurrency and Computation: Practice and Experience , 20 ( 17 ): 2039 - 2048 , 2008 .

[6]

Rubao

Lee and

Minghong

Zhou . Extending PostgreSQL to Support Distributed/Heterogeneous Query Processing . In Kotagiri Ramamohanarao,

Radha Krishna , Mukesh K. Mohania , and Ekawit Nantajeewarawat, editors, DASFAA , volume 4443 of Lecture Notes in Computer Science, pages 1086 - 1097 . Springer, 2007 .

[7] Andrey

Lepikhov and Leonid B.

Sokolinsky . Query processing in a DBMS for cluster systems . Programming and Computer Software , 36 ( 4 ): 205 - 215 , 2010 .

[8] Dmitry

Levshin and A. S.

Markov . Algorithms for integrating PostgreSQL with the semantic web . Programming and Computer Software , 35 ( 3 ): 136 - 144 , 2009 .

[9]

Melissa

Paes , Alexandre A. B. Lima , Patrick Valduriez, and Marta Mattoso . High-Performance Query Processing of a Real-World OLAP Database with ParGRES . In Jose´ M. Laginha M. Palma , Patrick Amestoy, Michel J . Dayde´,

Marta

Mattoso , and Joa˜o Correia Lopes, editors, VECPAR , volume 5336 of Lecture Notes in Computer Science, pages 188 - 200 . Springer, 2008 .

[10]

Nikolay

Samokhvalov . XML Support in PostgreSQL . In Sergei D. Kuznetsov, Andrey Fomichev, Boris Novikov, and Dmitry Shaporenkov, editors, SYRCoDIS , volume 256 of CEUR Workshop Proceedings. CEUR-WS.org , 2007 .

[11] Leonid

Sokolinsky . Organization of Parallel Query Processing in Multiprocessor Database Machines with Hierarchical Architecture . Programming and Computer Software , 27 ( 6 ): 297 - 308 , 2001 .

[12]

Michael

Stonebraker and

Greg

Kemnitz . The POSTGRES next generation database management system . Commun. ACM , 34 : 78 - 92 , October 1991 .