=Paper=
{{Paper
|id=Vol-1295/paper15
|storemode=property
|title=Managing Massive Business Process Models and Instances with Process Space
|pdfUrl=https://ceur-ws.org/Vol-1295/paper15.pdf
|volume=Vol-1295
|dblpUrl=https://dblp.org/rec/conf/bpm/WangLW014
}}
==Managing Massive Business Process Models and Instances with Process Space==
Managing Massive Business Process Models and Instances with Process Space Shuhao Wang, Cheng Lv, Lijie Wen, and Jianmin Wang School of Software, Tsinghua University, Beijing 100084, P.R. China shudiwsh2009@gmail.com,lvcheng1031@qq.com, wenlj@tsinghua.edu.cn,jimwang@tsinghua.edu.cn Abstract. BPM techniques are becoming more widely used, and there are more and more business process models and instances emerging. In this demonstration, we show how to manage large scale of process models and instances with Process Space. Creating, importing, storing, indexing and querying of models and instances will be exhibited. Since online tools for managing massive process instances are very rare, we focus on showing our useful tool of exploring process instances. 1 Introduction With the technology of business process management being more widely used, there are more and more business process models and instances accumulated in enterprises. For example, there are more than 8,000 models in CMCC, a Chinese state-owned telecommunication company. On the other hand, in recent years, process data (especially event log) is increasing sharply and becoming a typical type of big data. Behavioral analysis on big process data is in urgent need. For example, 90,000 construction machineries in Sany (officially Sany Heavy Industry Co., Ltd.), a Chinese multinational heavy machinery manufacturing company, generated more than 60 billion working status records in 2012. The size of Wal- Mart’s RFID data recorded every three days is equivalent to the entire collections of The Library of Congress, and even common RFID applications generate log data of more than 1 Gigabyte every day. Based on these models and instances, enterprises can get a clear view of their processes’ running states, which plays an important role in making deci- sions. Therefore, managing a large number of models and instances efficiently is challenging and can bring huge values for enterprises. Our contributions can be summarized as follows. Firstly, an online process model and instance management tool is fully designed and implemented for big process data, including modeling, importing, storing, analysis and querying. Sec- ondly, it’s an open platform which supports user-defined modules and functions. Copyright c 2014 for this paper by its authors. Copying permitted for private and academic purposes. 2 Shuhao Wang, Cheng Lv, Lijie Wen, and Jianmin Wang The remainder of this paper is structured as follows. In Section 2 we study related work, before we introduce the implementation of our work in Section 3. The management of process instances is presented in Section 4. In Section 5, we show our demonstration and conclude the paper. 2 Related Work BeehiveZ [2] BeehiveZ1 is a business process model management system devel- oped by Tsinghua University. It focuses on the kernel algorithms for model query, index, generation, simulation, similarity measure and the evaluation of process mining algorithms, etc. Four types of business process model indexes and queries mentioned in [7], including: (1) exact query based on structure, (2) similarity query based on structure, (3) exact query based on behavior, (4) similarity query based on behavior, are all supported by BeehiveZ. Nearly all the functions in BeehiveZ have been integrated and extended into our new online tool Process Space. Oryx Oryx2 is a web-based editor for modeling business processes in various lan- guages like BPMN, EPC and Petri net. It is an open platform for developments regarding process modeling. Oryx is a project of the University of Potsdam. Some source codes of Oryx are modified and imported into Process Space. Disco Disco3 is a stand-alone applications designed and developed by Fluxicon for process mining and analysis on event logs. In the design of process execution analysis module of Process Space, we get a lot of inspiration from chart display methods in Disco. Compared to Disco, Process Space is an open web application which can handle large scale event logs and supporting third-party plugins. Business process model query There have already been several research pro- totypes on business process model query. BPMN-Q [3] is a graph-based query language. WISE [5] is a workflow information search engine, where workflow models are represented hierarchically. VisTrails [4] allows users to query work- flows by example. BP-QL [1] is based on an abstraction of the BPEL standard for distributed environment and supports query by example. Yan [8] uses feature- based similarity estimation to improve the efficiency of similarity search. 3 Implementation of Process Space Process Space is a Brower/Server application implemented in Java. As shown in Figure 1, it includes four major modules, which are process model analyzer, process monitor, process instance analyzer and process data repository. For space limitation, we omit the detail description. 1 https://code.google.com/p/beehivez/ 2 https://code.google.com/p/oryx-editor/ 3 http://fluxicon.com/disco/ Managing Massive Business Process Models and Instances with Process Space 3 8KVXKYK TZGZOUT 6XUIKYY3UJKR'TGR_`KX 8KVUXZ 3UJKR)UT\KXYOUT 3UJKR,XGMSKTZGZOUT 9OSORGXOZ_ 3KGY[XK GTJ 8KZXOK\GR 3UJKR*OLLKXKTZOGZOUT (634 +6) 6KZXO4KZ 869: IRUTKJKZKIZOUT RGHKR YZX[IZ[XK HKNG\OUX INGTMKUVKXGZOUT 16/ 3UJKR3KXMK 3UJKR9ZGZOYZOIY 3UJKR