<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Geophysical Data Aggregation Center IPE RAS</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Stanislav D. Ivanov</string-name>
          <email>f0ma@ifz.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Igor M. Aleshin</string-name>
          <email>ima@ifz.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vladimir N. Koryagin</string-name>
          <email>vlad@ifz.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fedor V. Perederin</string-name>
          <email>crash@ifz.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kirill I. Kholodkov</string-name>
          <email>keir@ifz.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Schmidt Institute of Physics, of the Earth RAS</institution>
          ,
          <addr-line>Moscow</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Today mostly data collection centers work with certain types of data. And there is no systematic approach to data storage and publication for users, who don't interwork with such centers. We propose a system of hardware, software and organizational procedures at Schmidt Institute of Physics of the Earth RAS towards to reduce the impact of this problem. There we created a centralized operator-friendly data management system that takes care of configuring all parts of data flow chain from instrument configuration to data publication.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>the data. This raises a concern within the scientific community. for data publication, taxonomy and policy.
Relatively low demand for such data makes establishment of problem-oriented data centers impractical. The
consequences are: complicated and troublesome access or even complete data loss[Vin14].</p>
      <p>Geophysical data can be split into two categories based on its definitiveness: preliminary and definitive.
Preliminary data can undergo modifications and definitive cannot. Another way to break down into categories
is considering how fast the data is published. This way we have archive data that is published after the
research or fieldwork is finished, current data that is published during the research process, and real-time data
that is published immediately after it is extracted from the instrument. Often archive data is definitive and
current and real-time data is preliminary.</p>
      <p>We propose the development of system of hardware, software and organizational procedures that would
enable the unification of data collection and storage irrelevantly of how fast the data is collected, whether it’s
preliminary or definitive. A scientific institution is a good place to mount such an effort so we advanced this
initiative at Schmidt Institute of Physics of the Earth RAS[Iva18]. Several engineering actions and
organizational steps should be made. Among these actions are datacenter maintenance, publication, backup,
access control, some common archive-related actions like archive compilation, modifications, additions. In
this paper we’d focus on these actions and leave organizational steps for future publication.</p>
      <p>IPE RAS already has the required infrastructure for deployment of such centre. It is based on virtual
infrastructure back with data storage system that is connected to dedicated autonomous system with fast peer
connections to major telecommunication organization. Thus, the backing infrastructure is considered
faulttolerant.</p>
      <p>Data sources often use different protocols for access. Some use file-based protocols: FTP, HTTP, SFTP
and even encode files in SMTP. Others expose relational and nonrelational (e.g. InfluxDB) database
endpoints, or message queuing protocols (MQTT, Apache Kafka), or specialized streaming protocols
(Seedlink). Select protocols implement some sort of access control and encryption.</p>
      <p>When data is being manipulated manually the selection of protocol and access control is solely a user’s
decision. By contrast, automated real-time data collection services must implement best suitable protocol and
means of access control, preferably on low network stack level, e.g. VPN.</p>
      <p>For instance, for time-series data we utilize miniSEED format, mainly because it has lots of tools for
collection, transfer, storage and processing. This format is used for collecting data from instruments in
experimental observatory “Biryulevo” (Moscow oblast); structure health monitoring system [Ale19] and
tiltmeters [Ale18a] situated at IPE RAS main building (Moscow); geomagnetic observatory “Klimovskaya”
(Arkhangelsk oblast). In order to transfer miniSEED no additional encoding is required saving resources and
lowering latency. Observers that are connected to retransmission node get new data in 5 seconds at the latest.
Currently all mentioned observatories are equipped with data collection system based on single board
computers (RaspberryPI)[Аle18b]. This solution is proven universal and economic solution for a wide range
of data collection tasks.</p>
      <p>Additionally, the centre perform collection of retransmitted data from heliophysical space vehicle
“Elektro-L N2” (GOMS-3). The data is initially acquired at Fedorov Institute of Applied Physics with
autonomous data collection system - a satellite receiver system and then gets retransmitted with Seedlink
protocol to the IPE RAS Data Aggregation Center. This data includes particle counter data, x-ray irradiation
meters and galactic ray event counters.</p>
      <p>The center also performs archiving tasks. Ad exemplum the data collected from “Biryulevo” observatory
for periods of 2002-2016 and 2011-2017 is available for access in miniSEED format. The center provides
coverage plots for achieved data (Figure 1).</p>
      <p>If properly set, the amount of stored data does not pose any serious technological and organizational
challenge. But the increase of numbers of stored data channels does. To facilitate the management the center
shall provide means of assisted configuration of both center-side and instrument-side. Here, at IPE RAS, we
are gradually erecting a centralized operator-friendly data site management system that takes care of
configuring all parts of data flow chain starting with instrument configuration. The systems relies on
relational DBMS and Ansible deployment and maintenance system. The information inside this database is
used to create automation scripts for Ansible and configuration files that drive the entire system. The system
is capable of both initial configuration and reconfiguration tasks.</p>
      <p>Visualization greatly improves the monitoring and accessibility of available data. We utilize specialized
database management system InfluxDB acting as intermediate database and feeding data to Grafana (Figure
2) visualization framework. This software stack is used to display real-time data, however, currently, these
two components lack centralized automated configuration which is a planned feature update.</p>
      <p>The centre also implements way to monitor both data sources and internal components as well by using
Shinken open-source monitoring software. The implementation watches over hardware and software vital
indicators and provides extensive information should any fault arise. Currently system does not check the
values of the incoming data, only its presence. Data sanity check is also a planned feature update.</p>
      <p>Another activity of the center is real-time data acquisition from field expeditions. In 2018 we’ve evaluated
the portable high-frequency GNSS data collection system [Per18]. The data acquired from Javad Alpha2
GNSS receiver at 10Hz was collected with portable system and transferred with cellular data connection to
the center in realtime. The portable system is also based on single board Raspberry Pi computer. The
transmission was performed from a moving road vehicle on Moscow-Arkhangelsk (Fig. 3) and
MoscowKandalaksha routes.
[Iva18]</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [Vin14]
          <string-name>
            <surname>Vines T. H</surname>
          </string-name>
          . et al.
          <article-title>The availability of research data declines rapidly with article age //Current biology</article-title>
          .
          <source>- 2014</source>
          . -
          <fpage>Т</fpage>
          .
          <year>24</year>
          . -
          <fpage>№</fpage>
          . 1. -
          <fpage>С</fpage>
          .
          <fpage>94</fpage>
          -
          <lpage>97</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [Ale19]
          <string-name>
            <surname>Ivanov S.D.</surname>
          </string-name>
          ,
          <string-name>
            <surname>Aleshin</surname>
            <given-names>I.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kholodkov</surname>
            <given-names>K.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Perederin</surname>
            <given-names>F.V.</given-names>
          </string-name>
          <article-title>Sistema upravleniya tsentrom agregatsii dannykh IFZ RAN // Nauchnaya konferentsiya molodykh uchenykh i aspirantov IFZ RAN</article-title>
          ,
          <volume>23</volume>
          -
          <fpage>24</fpage>
          aprelya
          <year>2018</year>
          .
          <article-title>Tezisy dokladov</article-title>
          . Moscow,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Aleshin</surname>
            <given-names>I. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ivanov</surname>
            <given-names>S. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kholodkov</surname>
            <given-names>K. I.</given-names>
          </string-name>
          et al.
          <article-title>Remote real-time structure health monitoring with mini-smik // Seismic Instruments</article-title>
          .
          <article-title>-</article-title>
          <year>2019</year>
          . - Vol.
          <volume>55</volume>
          , no. 5. - P.
          <fpage>589</fpage>
          -
          <lpage>595</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>[Ale18a] Aleshin</surname>
            <given-names>I. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ivanov</surname>
            <given-names>S. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koryagin</surname>
            <given-names>V. N.</given-names>
          </string-name>
          et al.
          <article-title>Online publication of tiltmeter data based on the seedlink protocol</article-title>
          // Seismic Instruments.
          <article-title>-</article-title>
          <year>2018</year>
          . - Vol.
          <volume>54</volume>
          , no. 3. - P.
          <fpage>254</fpage>
          -
          <lpage>259</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>[Ale18b] Aleshin</surname>
            <given-names>I.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Getmanov</surname>
            <given-names>V.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grudnev</surname>
            <given-names>A.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dobrovol'skii M.N</surname>
          </string-name>
          .,
          <string-name>
            <surname>Kholodkov</surname>
            <given-names>K.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koryagin</surname>
            <given-names>V.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krasnoperov</surname>
            <given-names>R.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kudin</surname>
            <given-names>D.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Solov'</surname>
            ev
            <given-names>A.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ivanov S</surname>
          </string-name>
          .D.
          <article-title>Kompaktnoe energoeffektivnoe ustroistvo sbora i operativnoi peredachi geomagnitnykh dannykh // II Vserossiiskaya nauchnoprakticheskaya konferentsiya «Nauchnoe priborostroenie - sovremennoe sostoyanie i perspektivy razvitiya» 4-7 iyunya 2018 g</article-title>
          .Kazan, Russia
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>[Per18] Perederin</surname>
            <given-names>F. V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aleshin</surname>
            <given-names>I. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ivanov</surname>
            <given-names>S. D.</given-names>
          </string-name>
          <article-title>i dr. Portativnyi kompleks registratsii signalov GNSS s vysokoi chastotoi oprosa: polevye ispytaniya i perspektivy primeneniya // Nauka i tekhnologicheskie razrabotki</article-title>
          .
          <source>- 2018</source>
          . - T.
          <volume>97</volume>
          , №
          <fpage>4</fpage>
          . - S.
          <fpage>28</fpage>
          -
          <lpage>40</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>