<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <article-id pub-id-type="doi">10.1142/9789814525220_0008</article-id>
      <title-group>
        <article-title>HIGH PERFORMANCE COMPUTING SYSTEM IN THE FRAMEWORK OF THE HIGGS BOSON STUDIES AT ATLAS</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>N. L. Belyaev</string-name>
          <email>Nikita.Belyaev@cern.ch</email>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>A. A. Klimentov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>R. V. Konoplich</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>D. V. Krasnopevtsev</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>K. A. Prokofiev</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>V. E. Velikhov</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>on behalf of the ATLAS Collaboration</string-name>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Brookhaven National Laboratory BNL</institution>
          ,
          <addr-line>Upton, Suffolk County, New York</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Physics and Institute for Advanced Study, the Hong Kong University of Science and Technology</institution>
          ,
          <addr-line>Clear Water Bay, Kowloon, Hong Kong</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Manhattan College</institution>
          ,
          <addr-line>4513 Manhattan College Parkway, Riverdale, New York, NY 10471</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>National Research Center “Kurchatov Institute”</institution>
          ,
          <addr-line>1 Akademika Kurchatova pl., Moscow, 123182</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>National Research Nuclear University MEPhI (Moscow Engineering Physics Institute)</institution>
          ,
          <addr-line>31 Kashirskoe highway, Moscow, 115409</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>New York University</institution>
          ,
          <addr-line>4 Washington Place, New York, NY 10003</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff6">
          <label>6</label>
          <institution>2017 Nikita L. Belyaev, Alexei A. Klimentov, Rostislav V. Konoplich, Dimitrii V. Krasnopevtsev, Kirill A. Prokofiev</institution>
          ,
          <addr-line>Vasily E. Velikhov</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <fpage>23</fpage>
      <lpage>29</lpage>
      <abstract>
        <p>Higgs boson physics is one of the most important and promising fields of study in modern high energy physics. To perform precision measurements of the Higgs boson properties, the use of fast and efficient instruments of Monte Carlo event simulation is required. Due to the increasing amount of data and to the growing complexity of the simulation software tools, the computing resources currently available for Monte Carlo simulation on the Large Hadron Collider (LHC) Grid are not sufficient. One of the possibilities to address this shortfall of computing resources is the usage of institutes' computer clusters, commercial computing resources and supercomputers. In this paper, a brief description of the Higgs boson physics, Monte Carlo generation and event simulation techniques are presented. A description of modern high performance computing systems and tests of their performance are also discussed. These studies have been performed on the Worldwide LHC Computing Grid and Kurchatov Institute Data Processing Center, including Tier-1 WLCG sites and the OLCF Titan supercomputer. Monte Carlo simulated events produced with the Titan supercomputer were used in the Higgs boson analysis and the results have been published by the ATLAS collaboration.</p>
      </abstract>
      <kwd-group>
        <kwd>high performance computing</kwd>
        <kwd>ATLAS</kwd>
        <kwd>Higgs</kwd>
        <kwd>CERN</kwd>
        <kwd>supercomputers</kwd>
        <kwd>data</kwd>
        <kwd>LHC</kwd>
        <kwd>physics</kwd>
        <kwd>reconstruction</kwd>
        <kwd>GRID</kwd>
        <kwd>cloud computing</kwd>
        <kwd>WLCG</kwd>
        <kwd>Tier-1</kwd>
        <kwd>Titan</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. High performance computing in high energy physics</title>
      <p>The observation of a new particle compatible with the Standard Model (SM) Higgs boson by
the ATLAS and the CMS experiments [1, 2] has been an important step towards the validation of our
understanding of nature. This observation is based on the information from the Large Hadron
Collider (LHC). Experimental data was obtained during the first LHC data taking period (Run 1) in
2011-2012. Other impressive discoveries based on the LHC data include the pentaquark observation
[3] and the observation of CP violation in rare baryon decays [4], both by the LHCb experiment.</p>
      <p>To perform more detailed studies in these areas and to probe the new physics beyond the
Standard Model (BSM), theoretical predictions for the physics processes under consideration must be
obtained. The most convenient way to obtain such predictions is to use Monte Carlo (MC)-generated
events. MC simulations are based on concrete theoretical models, which describe the processes under
consideration.</p>
      <p>MC simulations usually require a large amount of computing resources. Generally, there are
two reasons for that. First, modern particle physics theories become more and more complex: they
are based on non-trivial mathematics and calculations required by these theories are
resourceintensive. Second, as far as measurements in particle physics are performed by the particle detectors,
the detector effects must be also taken into account. The ATLAS detector [5] consists of millions of
structural elements (more than 150M sensors). The response of each of them must be modelled and
taken into account. For the physics analyses, it is necessary to perform so-called Full Simulation of
the detector, which takes into account all the detector effects [6].</p>
      <p>The resource intensity of MC tasks can be illustrated in terms of statistics. Figure 1 shows the
fraction of CPU time consumption of all jobs submitted to Worldwide LHC Computing Grid
(WLCG) by all its users during the first 35 weeks of 2017. The jobs were categorized according to
their type: Data processing, Validation, Testing, MC Simulation, MC Simulation Full, MC
Simulation Fast, MC Event Generation, MC Reconstruction, Group Production and Others. This plot
shows that MC-related jobs take more than 85% of all CPU time provided by the WLCG.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Higgs boson physics</title>
      <p>Higgs boson physics is one of the most promising fields of study in modern particle physics.
However, this topic is not only promising, but also complex. The most intensive channel of the Higgs
boson production at the LHC in high energy proton-proton collisions is the gluon-fusion production
(ggF). The Feynman diagram of such a process at the leading order (LO, first-order terms of
Lagrangian in the Perturbation theory of Quantum Chromodynamics [8]) is shown in Figure 2.</p>
      <p>When the center-of-mass energy of two protons is about 10 teraelectronvolts (TeV) or higher,
more than 90% of all Higgs bosons are produced by the ggF mechanism. The process shown in
Figure 2 is relatively simple, because there are no outcoming particles except the Higgs boson and
there are no hadron jets (so-called 0-jet final state).</p>
      <p>Considering second-order terms of the Lagrangian (Next-to-leading order corrections, NLO),
the situation becomes more complex. Figure 3 presents some of the Feynman diagrams of ggF Higgs
boson productions in the NLO.</p>
      <p>
        The complexity of NLO calculations is directly correlated with the number of jets. In NLO
case, 0-jet category consists of 29 Feynman diagrams. The generation time of 100k events at CERN
on LXPLUS machines [
        <xref ref-type="bibr" rid="ref2">10</xref>
        ] is about 10 minutes. The 1-jet category consists of 1050 diagrams and the
calculation time increases up to about 2 hours. The 2-jet category consists of 21510 diagrams and the
calculation time increases even more, up to about 24 hours. For the ggF Higgs boson production
NLO corrections can contribute up to 45% to the cross sections and thus must be taken into account.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Role of High Performance Computers’ in HEP</title>
      <p>
        High Performance Computers (HPC) are presently the most valuable instruments for
CPUintensive tasks, such as Higgs boson production simulation with NLO corrections. One of the most
impressive realizations of supercomputer-based HPC system for science tasks, including high energy
physics, is the OLCF Titan, launched in 2012 by Oak Ridge National Laboratory, the performance of
which is higher than the computing power of all the ATLAS resources in WLCG [
        <xref ref-type="bibr" rid="ref2">10</xref>
        ]. Some
technical characteristics of the Titan supercomputer are listed below [
        <xref ref-type="bibr" rid="ref3">11</xref>
        ]:
• 27 PFLOPS (Peak theoretical performance);
• Cray XK-7 18,688 compute nodes with GPUs;
• 299,008 CPU cores AMD Opteron 6274 @2.2 GHz (16 cores per node);
• 32 GB RAM per node;
• NVidia TESLA K20x GPU per node;
• 32 PB disk storage (center-wide Luster file system);
• More than 1TB/s aggregate FS throughput;
• 29 PB HPSS tape archive;
      </p>
      <p>
        The Titan supercomputer was used to perform the simulation of the Higgs boson production
associated with two hadron jets at NLO level with consequent decay of the Higgs boson to four
leptons:  →  → 4ℓ . Some technical parameters of this simulation are listed in Table 1,
namely: nFiles is the number of files in the input dataset; nEventsPerInputFile is the number of
events per input file and nEventsPerJob is the number of events per job. Expected CPU time is
estimated by internal GRID benchmarks based on previous information about the occupancy of the
particular GRID resource [
        <xref ref-type="bibr" rid="ref4">12</xref>
        ].
      </p>
      <p>It is important to note, that while with CERN LXPLUS machines the generation event rate is
about 2.5k events/hour, with the Titan it is about 650k/hour.</p>
      <p>nEventsPerInputFile
2000</p>
      <p>
        The datasets produced with the Titan supercomputer were subsequently used by the ATLAS
collaboration for physics analysis, with the results of this study presented in the public note [
        <xref ref-type="bibr" rid="ref6">14</xref>
        ]. The
main purpose was to probe the separation power of BDT discriminants to distinguish ggf and vector
boson fusion (VBF) signals. As an example of obtained results, Figure 4 shows the distribution of the
Boosted Decision Tree (BDT) discriminant for a possible detector layout for the ATLAS HL-LHC
upgrade. The separation between the red and blue distributions visualises the ggf-VBF separation.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Cloud computing at Tier-1 clusters</title>
      <p>The resource-intensive tasks can be handled not only by supercomputers, but also by WLCG
computing facilities and cloud computing. WLCG is made up of four layers, or "tiers"; 0, 1, 2 and 3.
Each tier provides a specific set of services:
●
●
●
●</p>
      <p>Tier 0 corresponds to the CERN Data Centre, which is located in Geneva, Switzerland and also
at the Wigner Research Centre for Physics in Budapest, Hungary. The two sites are connected
by two dedicated 100 Gbit/s data links. All data from the LHC passes through the central
CERN hub, but CERN provides less than 20% of the total computer capacity. Tier 0 is
responsible for the safe-keeping of the raw data (first copy), first pass reconstruction,
distribution of raw data and reconstruction output to the Tier 1s, and reprocessing of data
during LHC down-times.</p>
      <p>Tier 1 corresponds to thirteen large computer centres with sufficient storage capacity and with
round-the-clock support for the Grid. They are responsible for the safe-keeping of a
proportional share of raw and reconstructed data, large-scale reprocessing and safe-keeping of
corresponding output, distribution of data to Tier 2s and safe-keeping of a share of simulated
data produced at these Tier 2s.</p>
      <p>Tier 2 are typically universities and other scientific institutes, which can store sufficient data
and provide adequate computing power for specific analysis tasks. They handle analysis
requirements and proportional share of simulated event production and reconstruction. There
are currently around 160 Tier 2 sites covering most of the globe.</p>
      <p>Tier 3 corresponds to computing resources, which can consist of local clusters in a University
Department or even just an individual PC. There is no formal engagement between WLCG and
Tier 3 resources.</p>
      <p>
        The most effective systems here are Tier-1 clusters, which are represented by well-organized
and powerful computing systems of participating national organizations and countries. In order to test
the performance of National Research Center “Kurchatov Institute” Tier-1 cluster
(ANALY_RRCKI-T1) and compare it with other WLCG sites, 50k of 2-jet ggF Higgs boson production events were
generated with aMC_NLO Monte Carlo generator [
        <xref ref-type="bibr" rid="ref7">15</xref>
        ]. The results of this simulation are indicated in
Table 2.
      </p>
      <p>The results of performance tests can vary depending on the condition of each individual
machine, its occupancy and some other parameters. However, the performance of considered GRID
sites are close to each other. Similar expected times mean that clusters offer similar compute power
as estimated by their specifications and certain benchmarks, while similar total times means that the
ratio of actual to expected performance is similar across all clusters.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>High performance computing systems are now an integral part of the high energy physics
landscape. One of the main physics fields where HPC has a major impact are the Higgs boson
studies. Increasing precision of measurements in particle physics leads to a complexity of subsidiary
calculations and HPC is a great resource to work with. It is also important to notice that some
analyses require even more precision. They have to use next-to-next-to-leading order corrections
(NNLO). Thus, the complexity of calculations will continue to increase in the near future.</p>
      <p>In this paper, a brief overview of the high performance computing systems was presented. An
overview of the Titan supercomputer was provided, and its capabilities for the Higgs boson physics
was demonstrated. Performance of NRC-KI Computing cluster was studied and comparison with
other ATLAS GRID sites was shown. This comparison demonstrates that NRC-KI Computing
cluster is capable of handling tasks in high energy physics computing on par with other ATLAS
GRID sites.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgement</title>
      <p>We wish to thank all our colleagues who contributed to Monte Carlo simulation activities and
to PanDA software development and operations. This work was funded in part by the Russian
Ministry of Science and Education under Contract No. 14.Z50.31.0024 and the U. S. Department of
Energy, Office of Science, High Energy Physics and Advanced Scientific Computing under
Contracts No. DE-AC02-98CH10886 and DE-AC02- 06CH11357. The work is also has been carried
out using computing resources of the federal collective usage center Complex for Simulation and
Data Processing for Mega-science Facilities at NRC “Kurchatov Institute”, http://ckp.nrcki.ru/. The
work of R. Konoplich is partially supported by the US National Science Foundation under Grant
No.PHY-1402964. The work of K. Prokofiev is partially supported by a grant from the Research
Grant Council of the Hong Kong Special Administrative Region, China (Project Nos.
CUHK4/CRF/13G). This research used resources of the Oak Ridge Leadership Computing Facility at
the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S.
Department of Energy under Contract No. DE- AC05-00OR22725.
[1] Aad G. et al. [ATLAS Collaboration]. ATLAS Collaboration Observation of a new particle in the
search for the Standard Model Higgs boson with the ATLAS detector at the LHC // Phys. Lett. B
2012. V. 716 P. 1
[2] Chatrchyan S. et al. [CMS Collaboration]. Observation of a new boson at a mass of 125 GeV
with the CMS experiment at the LHC // Phys. Lett. B 2012. V. 716 P. 30
[3] Aaij R. et al. [LHCb Collaboration]. Observation of J/ψp Resonances Consistent with
Pentaquark States in Λ0b→J/ψK−pDecays // Phys. Rev. Lett. 2015. V. 115 P. 072001
[4] Aaij R. et al. [LHCb Collaboration]. Measurement of matter-antimatter differences in beauty
baryon decays // Nature Physics 2017. V. 13 P. 391
[5] Aad G. et al. [ATLAS Collaboration]. The ATLAS Experiment at the CERN Large Hadron
Collider // JINST 2008. V. 3 P. S08003
[6] Aad G. et al. [ATLAS Collaboration]. The ATLAS Simulation Infrastructure // Eur. Phys. J. C
2010. V. 70 P. 823</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Deutschmann</surname>
            <given-names>N.</given-names>
          </string-name>
          et. al.
          <article-title>Gluon-fusion Higgs production in the Standard Model Effective Field Theory // CERN-</article-title>
          <string-name>
            <surname>TH-</surname>
          </string-name>
          2017-165 / NIKHEF-2017-035
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Bird</surname>
            <given-names>I.</given-names>
          </string-name>
          et. al.
          <source>Update of the Computing Models of the WLCG and the LHC Experiments // CERN-LHCC-2014-014 / LCG-TDR-002</source>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>[11] URL: https://www.olcf.ornl.gov/computing-resources/titan-cray-xk7/</mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Barner</surname>
            <given-names>S.</given-names>
          </string-name>
          et al.
          <source>Hardware and Software: Verification</source>
          and Testing // 6th International Haifa Verification Conference,
          <string-name>
            <surname>HVC</surname>
          </string-name>
          <year>2010</year>
          , Haifa, Israel, October 4-
          <issue>7</issue>
          ,
          <fpage>2010</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Michelotto</surname>
            <given-names>M.</given-names>
          </string-name>
          <article-title>A comparison of HEP code with SPEC benchmark on multicore worker nodes //</article-title>
          <source>J. Phys.: Conf. Ser</source>
          . 2010 V. 219 P.
          <fpage>052009</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Aad</surname>
            <given-names>G.</given-names>
          </string-name>
          et al. [
          <article-title>ATLAS Collaboration]</article-title>
          .
          <article-title>ATLAS Collaboration Prospective results for vectorboson fusion-mediated Higgs-boson searches in the four lepton final state at the High Luminosity Large Hadron Collider // ATL-</article-title>
          <string-name>
            <surname>PHYS-PUB-</surname>
          </string-name>
          2016-008
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Alwall</surname>
            <given-names>J.</given-names>
          </string-name>
          et al.
          <source>MadGraph</source>
          <volume>5</volume>
          : Going Beyond // JHEP 2011 V. 06 P.
          <fpage>128</fpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>