<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>October</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>DISTRIBUTED DATA PROCESSING OF THE COMPASS EXPERIMENT</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>A.Sh. Petrosyan</string-name>
          <email>artem.petrosyan@jinr.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>D.M. Malevanniy</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Artem Petrosyan, Daniil Malevanniy</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Joint Institute for Nuclear Research</institution>
          ,
          <addr-line>6 Joliot-Curie st., 141980, Dubna</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Plekhanov Russian University of Economics</institution>
          ,
          <addr-line>36 Stremyanny per., 117997, Moscow</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Saint Petersburg State University, 7/9 University emb.</institution>
          ,
          <addr-line>199034, Saint Petersburg</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <volume>4</volume>
      <issue>2019</issue>
      <fpage>94</fpage>
      <lpage>98</lpage>
      <abstract>
        <p>The implementation of COMPASS data processing in the distributed environment started in 2015. Since the summer of 2017, the data processing system has been working in a production mode, distributing jobs to two traditional Grid sites: CERN and JINR. There are two storage elements, both at CERN: EOS disk storage for short-term storage and Castor tape storage for long-term storage. Processing management services, including the MySQL server, PanDA servers, the APF/Harvester server, a monitoring server, and a production management server, are deployed in the JINR Cloud Service. Thus, the system, which manages distributed data processing of the experiment, is also distributed. The production management system is based on the principles of a service-oriented architecture. Each service of the system is maximally isolated from the others, executed independently and usually performs only one function, for example: sends jobs, checks their statuses, archives results, and so on. During the last year, the system was replenished by a task archiving mechanism, FTS and Harvester services, and a Monte-Carlo processing chain. New HPC machines were also integrated. This article highlights the status, statistics, workflow, data management, infrastructure overview, and future plans.</p>
      </abstract>
      <kwd-group>
        <kwd>distributed computing</kwd>
        <kwd>workflow management system</kwd>
        <kwd>Grid</kwd>
        <kwd>HPC</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. System overview</title>
      <p>
        A concept of the data processing system in the Grid environment for COMPASS [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] via the
PanDA workload management system was presented in 2015 [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The first system prototype was
prepared in 2016. In 2017, to provide a maximum level of automation for task and job processing, a
dedicated workflow management system was developed [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Since August 2017, the system has been
working in a production mode. During this period, the system was transformed several times. The
main reason for the transformation with an upgrade was the need to provide better reliability. Several
data processing chains were implemented: real data reconstruction, event filtering [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. A Monte-Carlo
chain was implemented in 2019. It covers MC simulation and reconstruction tasks. In addition, every
time the appearance of a new HPC machine, leading to an increase in the load on the system
components, has triggered an upgrade of the system. Two Grid sites are involved in data processing on
an ongoing basis: CERN and JINR. Volunteer HPC processing sites: BlueWaters (2017-2018),
Stampede 2 (2019), Frontera (2019-present time). The current system architecture and workflow are
presented in Figure 1.
      </p>
      <p>During the last two years, more than 9 million jobs, grouped into more than 400 tasks, were
processed by the system. The average wall time of a job in the system is 7.5 hours. Thus, in total,
approximately 7.5 CPU years were consumed by jobs managed by the system. The processing rate can
reach 100K jobs per day (Fig. 2).</p>
      <p>The system covers all steps of data pre- and post-processing, including archiving to the CERN
tape storage: Castor. During two years of operation, more than 700TB of final data were written to
Castor. In 2018, support of CERN FTS was implemented to enable asynchronous file transfers
between EOS and Castor and reduce the files migration time.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Processing on Stampede 2 and Frontera HPCs</title>
      <p>
        Processing on a supercomputer has many significant differences in comparison with working
on a regular Grid site and involves a high degree of detail. Running a large number of parallel tasks
leads to a big load on the file system. In addition, usually such machines have very strict user policies.
COMPASS has a recent experience of using large HPC. During 2018, a prototype as a proof of the
concept that the COMPASS production system could run jobs on an HPC machine was developed on
BlueWaters HPC of the University of Illinois at Urbana-Champaign [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. In 2019, an integration of
Stampede 2 and Frontera, HPCs of the Texas Advanced Computing Centre [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], was performed.
Frontera is one of the most powerful supercomputers in the world; it was at number 5 in the Top 500
in June 2019, Stampede 2 had the number 19 position.
      </p>
      <p>Unlike the scheme used on BlueWaters, with a Multi-Job Pilot as a job management service, a
new service named Harvester (Fig. 3) was used as a job management service on the interactive node
on HPCs at TACC.</p>
      <p>
        Harvester is a resource-facing service between a workload management system and a
collection of pilots [
        <xref ref-type="bibr" rid="ref7 ref8">7-8</xref>
        ]. It is a lightweight stateless service running on a VO box or an edge node of
HPC centres to provide a uniform view for various resources. Harvester was developed, taking into
account the ten years’ experience of operating the Auto Pilot Factory; it provides much greater
configuration flexibility and reliability, as well as monitoring. Harvester was designed so that it could
be used to send pilots both to regular Grid sites and HPCs and have an extendable architecture.
Everything needed for the COMPASS jobs code, such as local MySQL database execution, payload
management, errors handling and stage-out, was transferred from the Multi-Job Pilot and added as
Harvester plug-ins (Fig. 4).
      </p>
      <p>A new Harvester-based setup was prepared and tested on Stampede 2, while data processing is
carried out on Frontera. Since CPUs on Frontera computing nodes are much more efficient than on
Stampede and BlueWaters, much attention has been paid to optimizing data processing in order to
reduce the rate of I/O operations. The server side of the setup was also updated to enable processing
via Harvester: the latest PanDA server version was installed. Moreover, anticipating a high load on the
PanDA server from HPC, an additional PanDA server dedicated exclusively to HPC processing was
installed. Thus, there are two PanDA servers in the production setup at present: for Grid and HPC.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Summary</title>
      <p>The production system continues to provide a reliable platform for handling all types of tasks
and is the main tool for distributed processing of data gathered by the physics facility.</p>
      <p>
        In a relatively small experiment as COMPASS, i.e. in the situation with limited resources, it is
highly important to rely on central services with proven characteristics, even if they are redundant at
first sight. Such services, initially developed for the ATLAS experiment [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], have demonstrated the
expected level of scalability and reliability and allowed to use computing resources of different types,
including modern HPC facilities.
      </p>
      <p>We see that software systems, initially designed for the needs of one collaboration, after a
decade of intensive operation and improvement, have turned into software products that can be used in
other experiments, as well as outside the field of high-energy physics for organizing distributed
computing.</p>
      <p>
        The development and utilization of the COMPASS distributed processing management system
has made it possible to formulate requirements for the components of the Multifunctional
Informational and Computing Complex (MICC) [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] in the Laboratory of Information Technologies at
the Joint Institute for Nuclear Research on the eve of the start of work on the construction of
experimental data processing systems on the Nuclotron-based Ion Collider Facility NICA [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. There
are already ongoing efforts of FTS service deployment, development of unified authentication and
authorization services, establishment of local certification authority, etc.
Workshop
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>P.</given-names>
            <surname>Abbon</surname>
          </string-name>
          et al,
          <source>The COMPASS experiment at CERN</source>
          ,
          <source>Nuclear Instruments and Methods in Physics Research Section A: Accelerators</source>
          , Spectrometers, Detectors and
          <string-name>
            <given-names>Associated</given-names>
            <surname>Equipment</surname>
          </string-name>
          , Vol.
          <volume>577</volume>
          , pp.
          <fpage>455</fpage>
          -
          <lpage>518</lpage>
          ,
          <year>2007</year>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Sh. Petrosyan</surname>
          </string-name>
          ,
          <article-title>PanDA for COMPASS at JINR</article-title>
          ,
          <source>Physics of Particles and Nuclei Letters</source>
          , Vol.
          <volume>13</volume>
          , No.
          <issue>5</issue>
          , pp.
          <fpage>708</fpage>
          -
          <lpage>710</lpage>
          ,
          <year>2016</year>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Petrosyan</surname>
          </string-name>
          ,
          <source>COMPASS Grid Production System, CEUR Workshop Proceedings</source>
          , Vol.
          <year>2023</year>
          , pp.
          <fpage>234</fpage>
          -
          <lpage>238</lpage>
          ,
          <year>2017</year>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Petrosyan</surname>
          </string-name>
          ,
          <article-title>COMPASS Production System Overview, EPJ Web Conf</article-title>
          ., Vol.
          <volume>214</volume>
          ,
          <year>2019</year>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Sh. Petrosyan</surname>
          </string-name>
          ,
          <source>COMPASS production system: processing on HPC, CEUR Proceedings</source>
          , Vol.
          <volume>2268</volume>
          , pp.
          <fpage>139</fpage>
          -
          <lpage>144</lpage>
          ,
          <year>2018</year>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Texas</given-names>
            <surname>Advanced Computing Centre</surname>
          </string-name>
          , the University of Texas at Austin, available at https://www.tacc.utexas.
          <source>edu/ (accessed 31.10</source>
          .
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A.</given-names>
            <surname>Anisenkov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Drizhuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Guan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lassnig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Nilsson</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.</surname>
          </string-name>
          <article-title>Oleynik on behalf of the ATLAS Collaboration, Global heterogeneous resource harvesting: the next-generation PanDA Pilot for ATLAS</article-title>
          ,
          <source>Journal of Physics Conference Series</source>
          , Vol.
          <volume>1085</volume>
          , No.
          <volume>032031</volume>
          ,
          <year>2018</year>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <article-title>[8] Harvester web home</article-title>
          , available at https://github.com/HSF/harvester (accessed
          <volume>31</volume>
          .10.
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <article-title>[9] ATLAS collaboration web home</article-title>
          , available at https://atlas.cern
          <source>/ (accessed 31.10</source>
          .
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <article-title>MICC web home</article-title>
          , available at https://micc.jinr.
          <source>ru/ (accessed 31.10</source>
          .
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <article-title>NICA web home</article-title>
          , available at http://nica.jinr.
          <source>ru/ (accessed 31.10</source>
          .
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>