<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>October</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>TRACKING FOR BM@N GEM DETECTOR ON THE BASIS OF GRAPH NEURAL NETWORK</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>E. Shchavelev</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>P. Goncharov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>G. Ososkov</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>D. Baranov</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dubna State University</institution>
          ,
          <addr-line>Universitetskaya 19, Dubna, Moscow Region, 141982</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Egor Shchavelev</institution>
          ,
          <addr-line>Pavel Goncharov, Gennady Ososkov, Dmitriy Baranov</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Joint Institute for Nuclear Research</institution>
          ,
          <addr-line>6 Joliot-Curie street, Dubna, Moscow region</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Saint Petersburg State University</institution>
          ,
          <addr-line>7/9 Universitetskaya Emb., 199034, Saint Petersburg</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <volume>4</volume>
      <issue>2019</issue>
      <fpage>280</fpage>
      <lpage>284</lpage>
      <abstract>
        <p>Particle tracking is a very important part of modern high energy physics experiments. While the data stream from such experiments is increasing day by day, classic tracking methods lack the ability to fit these amounts of data. In order to solve this problem, new effective machine learning algorithms are actively developed in the HEP.TrkX project for Large Hadron Collider detector and for the GEM detector of the BM@N experiment. This work is a logical continuation of the research presented on the XXIII International Scientific Conference of Young Scientists and Specialists (AYSS-2019), where we have already introduced a new application for tracking with the Graph Neural Network based on the Minimum Branching Tree preprocessing. However, that approach has had some problems, including the overall inaccuracy with the segments purification of the preprocessed event graph. In this work, we overcome many of such problems and introduce a revised approach with improved tracking accuracy. Promising results of the improved GNN tracking are given.</p>
      </abstract>
      <kwd-group>
        <kwd>tracking</kwd>
        <kwd>deep learning</kwd>
        <kwd>graph neural networks</kwd>
        <kwd>line graph</kwd>
        <kwd>digraph</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Modern high energy and nuclear physics (HENP) requires precise and effective reconstruction
of events occurring during its experiments. One of the most important parts of event reconstruction is
tracking of the particle trajectories (tracks). This process called particle tracking or just tracking, and it
demands swiftest and most accurate algorithms to operate on, as in the modern experiments beams of
heavy ions are collided on the very high speeds, and particle collisions produce enormous amounts of
traces of the secondary particles in the sensitive elements of tracking detectors. These traces are to be
converted then into so-called ‘hits’, which is usually points in some coordinate space, and one of the
main parts of the event reconstruction process is to search for groups of those hits each connected by
affiliation to the same track.</p>
      <p>
        As we are working on the BM@N experiment of the NICA megaproject [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], we obtain hits
information from the microstrip gaseous chamber GEM detector which has the one main drawback:
with any N actual particles hits, we also get  2 −  spurious hits. Although this problem has
successful classical solutions, as the Kalman filter method [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], it meets difficulties at achieving
satisfying speed on the events with the heavy ions in the recent BM@N experiments due to its
sequential nature and problems with parallelization.
      </p>
      <p>
        At the same time, machine learning approaches have the capability of modeling such complex
environments and can be effectively run in parallel. There are some existing machine learning
approaches for tracking [
        <xref ref-type="bibr" rid="ref6">8</xref>
        ], but a majority of them are sequential, they operate over the chain of hits
and do not process the whole event at a time and can miss important informational patterns. One of the
possible options to represent an event is to consider it as a graph. This approach is not completely new,
there is a successful approach based on the Hopfield Network [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], but it will not be able to handle
properly current amounts of data.
      </p>
      <p>
        Recently the idea of graph representation in machine learning found great success with the
Geometric Deep Learning methods known as Graph Neural Networks (GNN) [4], and the recent
HEP.TrkX approach [
        <xref ref-type="bibr" rid="ref4">5</xref>
        ] based on GNN showed the huge potential for the data from the LHC
pixelbased detector. We were inspired of this approach, so in our previous study, we tried to adopt the same
approach for the data from the GEM detector of the BM@N experiment [6]. However, our approach
was not able to achieve similar success compared to the LHC one. In this work, we propose some
improvements which lead us to more precise and robust results.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Previous study</title>
      <p>The key idea of our approach is an event-as-a-graph representation when registered hits are
nodes of the graph and edges of the graph fully connect the nodes between the adjacent layers of the
detector. One can see the full graph of a particular event simulated for the GEM BM@N in figure 1.</p>
      <p>
        GNN consumes the nodes’ features, namely, 3-D coordinates (X, Y, Z) of the hit and their
connectedness of each other and the adjacency matrix of nodes (hits). The forward pass of the model
consists of iterating data through the 2 subnetworks, so-called ‘Edge’ and ‘Node’ networks. The Edge
network selects nodes’ features and the Node network aggregates neighbor node features. The full
details and explanation of the model one can find at [
        <xref ref-type="bibr" rid="ref4">5</xref>
        ].
      </p>
      <p>During the integration of this approach for the GEM data we recognized the biggest pitfall of
the initial approach – while it works on the data without the fake hits perfectly, it is highly afflicted by
the great fake count, which is a common problem of the microstrip-based detectors like GEM. This
approach cannot handle huge noise rate and for that purpose, we had to introduce a special filtering
process containing two main steps:
● Truncating the segments based on their statistical behavior (i.e. cutting all segments which
angle exceeds some constant). This step brings substantial influence on the overall noise rate,
reducing the noise factor by half.
● Utilizing the Minimum Branching Tree. It brings a huge impact on the overall fake segment
rate. We discovered that MBT-based filtering reduces the fake-to-real ratio from 1:120 to 1:10
(for one true segment we observe 10 fake segments). But this algorithm has one big
disadvantage – it discards about 18% of true segments.</p>
      <p>After applying the filtering described above, GNN showed promising results achieving 77%
recall and 96% precision on the preprocessed data. This result has inspired us towards the continuation
of our studies, so we needed to improve the filtering process and increase the overall network
accuracy.</p>
    </sec>
    <sec id="sec-3">
      <title>3. New event representation</title>
      <sec id="sec-3-1">
        <title>3.1 New network feature idea</title>
        <p>The common straightforward method for improving neural network model accuracy is to feed
the network with more features. As it was mentioned above, the GNN model uses a hits 3-D
coordinate position as a feature. Looking at the physical interpretation of particle track one can notice
that track in the GEM detector is usually a smooth curve that has a concavity, which is not varying
much, being approximately constant. In other words, the track curve has about the same value for its
first-order derivative (and always the same sign). The information of the curve monotonicity and
concavity can be obtained as the gradient of particle track function and its Laplacian, correspondingly.
We would like to use such characteristics as features and feed it to the network. But the network
consumes only the hits positional information as nodes features. The information about the segment
direction (‘monotonicity’) can be obtained only if we use it as an edge feature (because segment
direction depends on the two nodes positions, which cannot be encoded into a single node).
Unfortunately, current GNN architecture utilizes the graph adjacency matrix as a binary mask for the
network message passing mechanism so it is considerably hard to encode such information to the
current event-as-a-graph representation. Therefore, we found a new type of graph representation that is
more in line with our intentions.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2 Line graph</title>
        <p>
          Line graph [
          <xref ref-type="bibr" rid="ref5">7</xref>
          ] of a graph  is another graph  ( ) that represents the adjacencies between
edges of G. Given a graph  , its line graph  ( ) is a graph such that:
● each node of  ( ) represents an edge of  ; and
● two nodes of  ( ) are adjacent if and only if their corresponding edges share a common
endpoint in  .
        </p>
        <p>Roughly speaking, the line graph inverts the nodes and edges of the original graph. We also can use
the directed version of line graph, called ‘line digraph’. If  is a directed graph, its directed line graph
or line digraph has one vertex for each edge of  . Two vertices representing directed edges from  to
 and from  to  in  are connected by an edge from  to  in the line digraph when  =  . That
is, each edge in the line digraph  ( ) represents a length-two directed path in G. The biggest
advantage of such transformation is the fact that edge features from the initial graph turn into node
features of the line graph which becomes really convenient to use within the GNN model.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3 Applying line digraph approach</title>
        <p>Consider the initial event-as-a-graph conception we are transforming the initial graph  to the
line digraph  ( ). As an initial graph fully-connects adjacent layers nodes we can apply the same
approach described above: each (graph) path of two consecutive segments will be the edge of  ( );
each segment will be the node of the new line digraph  ( ). In this situation, each node of  ( )
gathers two 3-D coordinate positions from the two initial event hits. One can see the resulting line
digraph representing the same event as in figure 1 is depicted in figure 2 where almost all tracks
become more straighten in such representation because they are in fact first-order derivatives of initial
tracks.</p>
        <p>Despite the fact that this representation brought new features ( and  values) to the
network model, it still suffers from a big amount of fake segments. But now filtering fake segments is
a much simpler procedure: the segment of a line digraph now, in fact, represents hits from three
consecutive layers, so we can compute the second-order derivative of a curve which passes through
these hits. So, the current filtering process is the following:
● construct line digraph;
● compute edges weight; edge weight computed as  = ( 2 ,  2 ), where  2 is computed as
 2 =   1 −   0 = ( 2 −  1) − ( 1 −  0) =  2 − 2 ∗  1 −  0.  2,  1, and  0 are three
consecutive hits’  coordinates of a track curve.
● filter out all segments with weight value more than a constant.</p>
        <p>This filtering process achieves 100% purity (purity is the count of true segments left after
filtering divided by the count of initial true segments) and reduces overall fake segment rate up to 10
times (fake segment rate is the count of fake segments divided by the count of true segments). It is
worth to note that constructing line digraph, finding and filtering segments can be computed with up to
 ( ) operations with usage up to  ( 2) memory, where  is a number of segments in the graph. We
also vectorized this algorithm and achieved the resulting speed about ~7 events/sec; which is, in fact,
satisfying for the current study but an important thing to improve in the future.</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4 Graph Neural Network training and results</title>
        <p>We used the same network architecture and structure described in [6] except feeding it with
segments produced by the filtered line digraph. During the network training, we observed that optimal
feature configuration is a vector of the following five line digraph node features:
(  0,   1,   0,   1,  ). With this configuration, our model achieves 85% precision and 96% recall on
the validation dataset containing 4k events.</p>
        <p>
          Moreover, we also have measured more strict metrics introduced in [
          <xref ref-type="bibr" rid="ref6">8</xref>
          ]: it achieves 94% hit
and 87% track accuracy (hit accuracy is a percentage of true hits found by network out of all true hits
in a single event; track accuracy is a percentage of full tracks without gaps found by network out of all
tracks in a single event). Although overall results are totally satisfying, we discovered the main
shortcoming of the line digraph approach – our algorithm runs out of memory when handling events
with a huge number of hits (more than 3000 hits in a single event). On the other side, the total
percentage of such events in the whole dataset is about 1-2% out of all events. We will investigate and
resolve memory issues in future studies.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>In this paper, we have introduced a novel approach for the effective graph representation of
the event in GEM detector – line digraph. It facilitated both filtering and feature extraction processes.
GNN being trained on line digraph achieves significant results even with the strict metrics reaching 94
% hit accuracy and 87% track accuracy and handles tracks of any length. The main drawback of the
line digraphs is high memory consumption (because it is quadratic from the total hit count in the
event). Now we are going to speedup the filtering process; embed line graph architecture into the
network model itself; overcome memory issues; expand the approach to solve tracking in a collider
environment.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Acknowledgement References</title>
      <p>The reported study was funded by RFBR, project number 18-02-40101.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kapishin</surname>
          </string-name>
          ,
          <article-title>The fixed target experiment for studies of baryonic matter at the Nuclotron (BM@N) // The European Physical Journal A</article-title>
          . -
          <year>2016</year>
          . - V.
          <year>52</year>
          . - No. 8. - pp.
          <fpage>213</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R.</given-names>
            <surname>Frühwirth</surname>
          </string-name>
          ,
          <article-title>Application of Kalman filtering to track</article-title>
          and vertex fitting //Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and
          <string-name>
            <given-names>Associated</given-names>
            <surname>Equipment</surname>
          </string-name>
          .
          <article-title>-</article-title>
          <year>1987</year>
          . - Vol.
          <volume>262</volume>
          . - No.
          <fpage>2</fpage>
          -
          <lpage>3</lpage>
          . - pp.
          <fpage>444</fpage>
          -
          <lpage>450</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>G.</given-names>
            <surname>Ososkov</surname>
          </string-name>
          et al,
          <article-title>Neural Network Applications for Efficiency Improving of Geometric Reconstruction of Events Detected in the EXCHARM Experiment at the JINR</article-title>
          ,
          <source>Proc. of VI Intern. Workshop on Software Engineering, Artificial Intelligence and Expert Systems</source>
          , (Greece),
          <year>1999</year>
          [4]
          <string-name>
            <surname>M. M. Bronstein</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Bruna</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>LeCun</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Szlam</surname>
            , and
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Vandergheynst</surname>
          </string-name>
          , “
          <article-title>Geometric deep learning: going beyond euclidean data</article-title>
          ,”
          <source>CoRR abs/1611</source>
          .08097 (
          <year>2016</year>
          ), arXiv:
          <fpage>1611</fpage>
          .
          <fpage>08097</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S.</given-names>
            <surname>Farrell</surname>
          </string-name>
          et al., “
          <article-title>Novel deep learning methods for track reconstruction</article-title>
          ,
          <source>” in 4th International Workshop Connecting The Dots</source>
          <year>2018</year>
          (
          <article-title>CTD2018</article-title>
          ) Seattle, Washington, USA, March
          <volume>20</volume>
          -22,
          <year>2018</year>
          [6]
          <string-name>
            <given-names>E.</given-names>
            <surname>Shchavelev</surname>
          </string-name>
          et al.,
          <article-title>Graph neural network application to the particle track reconstruction for data from the GEM detector /</article-title>
          / AIP Conference Proceedings,
          <year>2019</year>
          . -
          <fpage>Т</fpage>
          .
          <year>2163</year>
          . -
          <fpage>№</fpage>
          . 1. -
          <fpage>С</fpage>
          .
          <year>040003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>H.</given-names>
            <surname>Frank</surname>
          </string-name>
          , R. Norman, “
          <article-title>Some properties of line digraphs</article-title>
          ”
          <source>in Rendiconti del Circolo Matematico di Palermo</source>
          ,
          <year>1960</year>
          , V. 9, pp
          <fpage>161</fpage>
          -
          <lpage>168</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>P.</given-names>
            <surname>Goncharov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Ososkov</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.</surname>
          </string-name>
          <article-title>Baranov Particle track reconstruction with the TrackNETv2 /</article-title>
          /AIP Conference Proceedings. - AIP Publishing,
          <year>2019</year>
          . -
          <fpage>Т</fpage>
          .
          <year>2163</year>
          . -
          <fpage>№</fpage>
          . 1. -
          <fpage>С</fpage>
          .
          <year>040003</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>