=Paper= {{Paper |id=Vol-3041/483-487-paper-89 |storemode=property |title=Simulation of Data Processing for the BM@N Experiment of the NICA Complex |pdfUrl=https://ceur-ws.org/Vol-3041/483-487-paper-89.pdf |volume=Vol-3041 |authors=Daria Priakhina,Vladimir Korenkov,Konstantin Gertsenberger,Vladimir Trofimov }} ==Simulation of Data Processing for the BM@N Experiment of the NICA Complex== https://ceur-ws.org/Vol-3041/483-487-paper-89.pdf
Proceedings of the 9th International Conference "Distributed Computing and Grid Technologies in Science and
                           Education" (GRID'2021), Dubna, Russia, July 5-9, 2021



     SIMULATION OF DATA PROCESSING FOR THE BM@N
           EXPERIMENT OF THE NICA COMPLEX
         D. Priakhina1,2,3 a, V. Korenkov1,2,3, K. Gertsenberger1, V. Trofimov1
1
    Joint Institute for Nuclear Research, Russia, Moscow region, Dubna, 141980, 6 Joliot-Curie
     2
         Dubna State University, Russia, Moscow region, Dubna, 141980, 19 Universitetskaya
3
    Plekhanov Russian University of Economics, Russia, Moscow, 115093, 36 Stremyanny per.

                                       E-mail: a pryahinad@jinr.ru


The paper considers the application of a software complex for data processing simulation in
computing systems. The BM@N experiment data storage and processing system of the NICA complex
is used as a simulated infrastructure. Simulation is performed in order to obtain recommendations on
the organization of experimental data processing during a session with available allocated resources.
The paper presents the results of simulating the processing of data that will be received during the
BM@N experiment session, with several scenarios for distributing job flows across existing
processing centers. In addition, some recommendations for organizing experimental data processing
are proposed. The status of the work and future plans for the development of the software complex are
formulated.

Keywords: simulation, data center, BM@N experiment, NICA complex



                    Daria Priakhina, Vladimir Korenkov, Konstantin Gertsenberger, Vladimir Trofimov



                                                              Copyright © 2021 for this paper by its authors.
                     Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).




                                                    483
Proceedings of the 9th International Conference "Distributed Computing and Grid Technologies in Science and
                           Education" (GRID'2021), Dubna, Russia, July 5-9, 2021



1. Introduction
         A software complex for simulating the processing of data coming from the experimental
facility of the NICA complex is being developed at the Meshcheryakov Laboratory of Information
Technologies of the Joint Institute for Nuclear Research [1]. The software complex allows finding out
how the data storage and processing system will work with the available computing power, as well as
calculating the load on computing farms and communication links with the specified parameters of
data flows and job flows. Unlike the previously developed simulation program [2,3] the software
complex consists of a database, a module for setting the simulated structure and equipment
configurations, a stable core for the simulation of data transmission and processing, a module for
presenting results in the form of graphs. The simulation core is implemented on top of an approach
based on the representation of information processes as byte streams [4].
        Currently, work on the data processing simulation of the BM@N experiment is in progress
[5]. The simulation of the facility session is performed for various hardware configurations, data flow
and job flow parameters, scenarios for running data processing jobs [6]. The simulation results
presented in this article made it possible to draw conclusions that are the base of some formulated
recommendations for organizing the process of experimental data processing during the BM@N
experiment session with available allocated resources.
        Requirements for the improvement of the software complex are formed in the process of its
using. The main aim of the improvement is its further application for data processing simulation in any
data storage and processing centers.
2. Simulation input parameters
        Before starting to simulate data processing, it is necessary to determine the structure of the
data storage and processing center, the configurations of various equipment, as well as the
characteristics of data flows and job flows. All these parameters are input data for simulation.
2.1 Distributed computing system for storing and processing data
       The simulated distributed data storage and processing system with the equipment parameters
is shown in Figure 1.




                   Figure 1. Scheme of the simulated distributed computing system

         The system consists of two centers and includes several levels of data storage. The data of
particle collision events with a fixed target (raw data), which are selected by the trigger (Trigger
BM@N), is first written to the Data reception buffer, then it arrives at the Intermediate data storage
level, after which the full volume of raw data is sent to the local buffers (EOS LHEP and EOS LIT) of
the respective data processing centers. All storage devices have a limited volume, and data


                                                   484
Proceedings of the 9th International Conference "Distributed Computing and Grid Technologies in Science and
                           Education" (GRID'2021), Dubna, Russia, July 5-9, 2021



transmission channels have a limited bandwidth. Data processing jobs are executed on the compute
nodes of the above processing centers (NCX LHEP, T2 LIT and Supercomputer). Data processing jobs
arrive at the compute nodes and are performed if there are free slots, i.e. processor cores. If all slots
are occupied, the job goes to the queue and waits until some slot is free. Each job needs a certain
amount of data to execute. Data is transferred from the storage to the compute node before the job
starts.
2.2 BM@N Run workload and job stream
        A 30-day session of the BM@N experiment is considered in the article. It is assumed that the
session will last continuously, except for minute breaks between runs. Each run lasts 2 minutes.
Experimental data is written to files during each run. The recording is carried out continuously up to
the maximum size of each file (35 GB). The speed of data selection by the BM@N trigger is 10 000
events per second. The average size of a single event is about 0.2 MB. The maximum planned volume
of experimental data per session, taking into account the described characteristics, will be
approximately 350 TB.
         Raw binary data, which is received from the installation, is subject to processing, namely,
digitization (the final format is digit) and conversion to reconstruction data (DST format), after which
physical analysis of experimental data is performed. In addition to processing experimental data, it is
planned to process simulation data (sim), which will also be converted into reconstruction data (DST)
for subsequent physical analysis.
        There are several classes of jobs in accordance with the described process of data processing.
The characteristics of classes are described in Table 1. Job flows for the simulation of data processing
were formed on the basis of the data from the table.
                                                                    Table 1. Characteristics of job classes

№       Class         Avg. event       Average      Number of      Avg. job       Average      Number
                      processing      amount of      events in     execution     amount of     of jobs
                     time on one     input (GB)       the file      time (s)      output
                    processor (ms)                    (1 file =                    (GB)
                                                       1 job)
1    RawToDigit          675             35           175 000       118 125            1        15 552
2    DigitToDst          290              1           175 000        50 750            1        15 552
3    GenToSim             60             0.6          175 000        10 500            8          300
4    SimToDst             30              8           175 000         5 250            1          300
5    DstToAna             10              1           175 000         1 750           0.1        1 000


3. Simulation
         In the example considered below, simulation is performed in order to provide
recommendations on the organization of experimental data processing during the session with
available allocated resources [fig. 1]. First of all, there were simulated three scenarios, where the data
processing jobs described in Table 1 are distributed in different ways across the compute nodes
[tab.2].
                                  Table 2. Distribution of data processing jobs (in %) by compute nodes

                           Scenario 1                    Scenario 2                    Scenario 3
№       Class       NCX      T2     Super-         NCX     T2     Super-        NCX      T2     Super-
                   LHEP     LIT computer          LHEP    LIT computer         LHEP     LIT computer
1    RawToDigit      50      15       35            80     20        -            -      10       90
2    DigitToDst      50      15       35            80     20        -            -      10       90
3    GenToSim         -      20       80             -      -       100         100       -        -
4    SimToDst         -      20       80             -      -       100         100       -        -
5    DstToAna         -      20       80            70     10       20           80      20        -


                                                   485
Proceedings of the 9th International Conference "Distributed Computing and Grid Technologies in Science and
                           Education" (GRID'2021), Dubna, Russia, July 5-9, 2021



         The results of simulating three scenarios showed that one of the computing resources could be
fully allocated for processing simulated data up to reconstruction data (GenToSim and SimToDst
jobs). As for the processing of experimental data, at best only 20% of raw data can be processed to
digit data, and less than 10% of jobs on processing data to reconstruction data will have time to
complete before the end of the session, which lasts 720 hours. Figure 2 shows graphs reflecting the
number of completed experimental data processing jobs (RawToDigit and DigitToDst jobs) on
computing resources in accordance with Scenario 2. Figure 2 does not show graphs of completing
DstToAna jobs since the number of completed jobs is too small and is approximately 1%. This is due
to the full load of all computing resources.




                      (a)                                                        (b)
  Figure 2. Completed RawToDigit and DigitToDst jobs on the LHEP farm (a) and T2 LIT farm (b)

        Surely, the result is unsatisfactory. Such a small amount of experimental data can be processed
by the end of the session. After the end of the session, one will have to wait a few more months before
the end of processing all the data acquired from the experiment. It may be noticed that a large amount
of time is required to fully process a single data file for RawToDigit and DigitToDst jobs [tab. 1]. In
this regard, it was decided to simulate a scenario similar to Scenario 2, in which express file
processing will be added. Express processing will be run simultaneously with full processing. Express
processing consists in processing 1% of the file in RawToDigit jobs to obtain preliminary results.
        Figure 3 shows the results of simulating the scenario with express processing. The graphs
show the number of completed jobs of the full (RawToDigit jobs) and express (RawToDigit Express
jobs) processing of experimental data. The graphs illustrate that express data processing is completed
much faster. It will allow one to obtain preliminary results every hour throughout the experiment, and
to conclude that the measurements obtained are correct.




                      (a)                                                        (b)
                    Figure 3. Free slots on the LHEP farm (a) and T2 LIT farm (b)




                                                   486
Proceedings of the 9th International Conference "Distributed Computing and Grid Technologies in Science and
                           Education" (GRID'2021), Dubna, Russia, July 5-9, 2021



4. Conclusion
         A software complex to simulate the processing of data, which comes from the experimental
facility of the NICA complex, is being developed at MLIT JINR. The presented simulation results of
the data processing of the BM@N experiment showed that with the described parameters of
equipment, data flows and jobs flows, no more than 20% of raw data could be processed by the end of
the session, lasting 30 days. It is also proposed to perform express data processing simultaneously with
full processing, which will allow obtaining preliminary results every hour throughout the experiment.
It is possible to conclude that the measurements obtained are correct.
        Requirements for the improvement of the software complex are formed in the process of its
using. The main aim of the improvement is its further application for data processing simulation in any
data storage and processing centers. Thus, at the next stage, it is planned to develop a module for
launching jobs similar to the pilot, as well as to conduct computational experiments taking into
account the fact that the equipment does not have absolute reliability, i.e. to calculate the probability
of equipment failure and recovery time.


5. Acknowledgement
        This work supported by JINR grant for young scientists № 21-602-02.
        The authors express their gratitude to Doctor of Physical and Mathematical Sciences,
Professor, main researcher of the Meshcheryakov Laboratory of Information Technologies, Gennady
Ososkov for his assistance in the work, valuable advice and fruitful discussions.


References
[1] Kekelidze V., Kovalenko A., Lednicky R., Matveev V., Meshkov I., Sorin A. and Trubnikov G.
Status of the NICA project at JINR // EPJ Web Conf. 2017. V. 138. P. 01027
[2] Korenkov V.V., Nechaevskiy A.V., Ososkov G.A., Pryahina D.I., Trofomov V.V., Uzhinskiy
A.V. Simulation concept of NICA-MPD-SPD Tier0-Tier1 computing facilities // Physics of Particles
and Nuclei Letters, 2016, 13(5), pp. 693–699
[3] Nechaevskiy A., Ososkov G., Pryahina D., Trofimov V., Li WD. Simulation approach for
improving the computing network topology and performance of the China IHEP Data Center // 23rd
international conference on computing in high energy and nuclear physics (CHEP 2018), EPJ Web of
Conferences, 2019, 214(08018), DOI: 10.1051/epjconf/201921408018
[4] Korenkov V., Nechaevskiy A., Ososkov G., Priakhina D. and Trofimov V. A Probabilistic
Approach to the Simulation of Data Processing Centers // EPJ Web Conf. 2020. V. 226. P. 03012
[5] Kapishin M. Studies of baryonic matter at the BM@N experiment (JINR) // Nuclear Physics A.
2019. V. 982. P. 967
[6] Priakhina D.I., Trofimov V.V., Ososkov G.A., Gertsenberger K.V. Data center simulation for the
BM@N experiment of the NICA project // AIP Conference Proceedings 2377, 040007 (2021),
https://doi.org/10.1063/5.0063338




                                                   487