<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Athens, Greece
* Corresponding author.
$ hiroki_u@nii.ac.jp (H. Uematsu); takeda@nii.ac.jp (H. Takeda)</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Earthquake Ontology and LOD</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Hiroki Uematsu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hideaki Takeda</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>National Institute of Informatics</institution>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>The Graduate University for Advanced Studies</institution>
          ,
          <addr-line>SOKENDAI</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0003</lpage>
      <abstract>
        <p>In this paper, we constructed an earthquake ontology and Linked Open Data for seismology. Earthquake ontology defines data on seismic waveforms such as seismic intensity and occurred time, data on observation stations where seismic waveforms were observed, and classes and properties such as the size and depth of the hypocenter of the observed waveforms. By using the earthquake ontology, it is possible to assign URIs to "earthquakes" that cannot be observed unlike seismic sources and observed waveforms, making it possible to use the necessary earthquake data based on observation information. We developed the seismic dataset not only the world's seismic data for machine learning represented by STEAD but also data publicly available in Japan on a limited basis, such as JMA and NIED prevention, as Linked Open Data using earthquake ontology.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Ontology</kwd>
        <kwd>Linked Open Data</kwd>
        <kwd>Earthquake</kwd>
        <kwd>Seismology</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Japan is one of the most earthquake-prone countries in the world. Around the Japanese
archipelago, four plates collide with each other, and more than 100,000 earthquakes occur per
year, averaging more than 300 earthquakes per day, including those that are not felt.</p>
      <p>
        Seismic motion is observed as waveform data of acceleration and is used in various research
such as calculation of seismic intensity, determination of hypocenter, emergency earthquake
warning, and predicted seismic intensity. In recent years, it has been used as training data for
research using machine learning, such as predicting the seismic intensity at a specific observation
station, whether the observed waveform is a seismic waveform, and identifying the
P-wave/Swave of an earthquake. Since machine learning requires a large amount of high-quality training
data, seismic observation networks are useful. However, one of the networks K-NET[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] which
was established by the National Research Institute for Earth Science and Disaster Resilience
(NIED) waveform data acquisition site does not have an API, users need to specify the date and
time, hypocenter, observation station, etc., and download the waveform data. In order to search
for waveform data independently observed by researchers and observation networks of the
Japan Meteorological Agency (JMA) and local governments, it is possible to create a database
that aggregates waveform data. Although, since the waveform data cannot be republished and
there is no URI that uniquely points to the waveform data, researchers will have their own
databases, making it dificult to create a reusable open waveform database.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Earthquake Observation</title>
      <p>In general, the word "earthquake" refers to events such as tremors felt by people on their own,
but in reality, it refers to the rapid displacement of the bedrock due to the pushing and pulling of
the underground bedrock. Shaking occurs as a result of bedrock displacement and is recognized
by us at the ground surface. Because earthquakes occur underground, it is dificult to actually
observe them. Therefore, information on the waveforms observed at each observation station is
important, such as the hypocenter estimation and calculating the seismic intensity.</p>
      <p>
        Observations of seismic activity are conducted in many countries. The International
Federation of Digital Seismograph Networks (FDSN)[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] has 2196 registered seismograph networks
with 24-bit resolution with data recorded in continuous time series at a sampling rate of at least
20 samples/second are registered. STEAD[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] registers approximately 1.2 million time series of
seismic waveforms observed by seismometers, covering more than 19,000 hours of datasets.
      </p>
      <p>
        In Japan, seismic waveforms observed by observation networks such as K-NET and
Kiknet, which are based on data from observation stations established by NIED, JMA, and local
governments, can be obtained. However, although the acquired data can be used for analysis
and other purposes, it cannot be redistributed, and only some of the JMA’s observation station
and data are registered in the FDSN. Other earthquake data is available in the Earthquake
Monthly Report (Catalog Edition)[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Although the observed waveforms themselves cannot be
obtained, they can be considered to contain metadata on the observed seismic motions.
      </p>
      <p>However, there is no list of which stations observe which earthquakes, although multiple
stations must observe the same earthquake to be selected when searching by the station. In
addition, when searching from the hypocenter, it is not known whether the observation stations
observe the earthquake that occurred at that hypocenter or not without searching the data and
making a list. Furthermore, since the observation stations are diferent from each other, it is
dificult to retrieve the seismograms of earthquakes that occurred at the same hypocenter from
multiple observation stations because the IDs are not assigned to each earthquake.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Method</title>
      <p>To solve the problem in section 2, we aim to make the observed waveforms publicly available
and searchable in the form of Linked Data, which links data together. First, we organized the
vocabulary related to earthquakes. The JMA’s Earthquake Monthly Report (Catalog Edition) does
not provide data on observed waveforms, but it does provide metadata on observed earthquake
ground motions. The data in the Earthquake Monthly Report (Catalog Edition) include source
data, measured data, first motion mechanism solution data, CMT solution data, seismic intensity
data, tsunami data, etc. In this paper, the seismic intensity data file was first selected as the
target. The seismic intensity data contains a record called the hypocenter record, which contains
information on the hypocenter, and information on the observation points where the earthquake
motion that occurred at the hypocenter was observed. Therefore, we created an ontology as an
earthquake vocabulary based on the data contained in the Earthquake Monthly Report (Catalog
Edition) of JMA. The seismic intensity data contains a record called the hypocenter record,
which contains information on the hypocenter, and information on the observation stations
where the earthquake motion that occurred at the hypocenter was observed.</p>
      <p>Since the earthquake itself cannot be observed, it is important to show the relationship
between the waveform information actually observed at the observation station and the hypocenter
and magnitude estimated from the observed waveform as the semantics of the earthquake.
Figure 1 shows the hypocenter and observed waveforms graphically.</p>
      <p>
        The earthquake ontology was constructed based on the hypocenter, seismic motion, observed
waveforms, and observation station that constitute an earthquake. The seismic motion and
the observation station that observe the waveforms at the ground surface were described
using the SOSA (Sensor, Observation, Sample, and Actuator) ontology of the SSN (Semantic
Sensor Network)[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The earthquake ontology observation station class inherits from the
SOSA:Sensor class, and seismicMotion and observedWave are set to the properties observed
from the observation station. In the earthquake ontology, the hypocenter is identified from
the seismic waveforms observed by the stations, and the data set summarizing these three
relationships is intended to be captured as an earthquake.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Earthquake LOD</title>
      <p>We converted the available data from the JMA’s Earthquake Monthly Report (Catalog Edition)
and FDSN earthquake events into Linked Data based on Earthquake Ontology. Since FDSN
includes observation networks registered with ISC and STEAD, data outside Japan are using
FDSN. Networks in FDSN are United States National Seismic Network, Hawaiian Volcano
Observatory Network, Montana Regional Seismic Network, Southern California Seismic Network,
Nevada Seismic Network, Pacific Northwest Seismic Network - University of Washington, USGS
Northern California Seismic Network, Alaska Geophysical Network, Oklahoma Seismic Network,
University of Utah Regional Seismic Network, Raspberry Shake, Alaska Volcano Observatory, Texas
Seismological Network, Puerto Rico Seismic Network &amp; Puerto Rico Strong Motion Program, US
Geological Survey Networks, Lamont-Doherty Cooperative Seismographic Network, Geological
Survey Networks, National Tsunami Warning Center Alaska Seismic Network.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>In this paper, we organized earthquake data registered in the Earthquake Monthly Report from
JMA and data from FDSN published. We also created an ontology by organizing vocabulary
related to earthquakes and published it in the form of Linked Open Data by assigning URIs to
earthquakes based on information on observed waveforms and hypocenters.</p>
      <p>In the future, we aim to create an infrastructure for multiple observation networks by
converting the latest data published in the JMA’s seismic intensity database, data in NIED’s
seismic observation network, and seismic data in FDSN into LOD. Furthermore, we will promote
the availability of an earthquake catalog format that can be used as earthquake data and learning
data, and LOD conversion of data observed by our own observation network. Benchmarking
based on the same dataset is important for source determination, calculation of predicted seismic
intensity, and training data for machine learning, but it is believed that datasets for reproduction
are not distributed due to the fact that Japanese data cannot be redistributed and IDs are not
assigned. By using the earthquake ontology created in this paper to describe the datasets used
in earthquake research in LOD, it is expected that the availability of datasets for reconstruction
will be improved.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>N. R. I. for Earth</given-names>
            <surname>Science</surname>
          </string-name>
          ,
          <string-name>
            <surname>D. R.</surname>
          </string-name>
          (NIED),
          <article-title>Nied k-net, kik-net,national research institute for earth science</article-title>
          and disaster resilience,
          <year>2019</year>
          . URL: https://doi.org/10.17598/NIED.0004.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>G.</given-names>
            <surname>Suarez</surname>
          </string-name>
          , T. van
          <string-name>
            <surname>Eck</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Giardini</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Ahern</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Butler</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Tsuboi</surname>
          </string-name>
          ,
          <article-title>The international federation of digital seismograph networks (fdsn): An integrated system of seismological observatories</article-title>
          ,
          <source>IEEE Systems Journal</source>
          <volume>2</volume>
          (
          <year>2008</year>
          )
          <fpage>431</fpage>
          -
          <lpage>438</lpage>
          . doi:
          <volume>10</volume>
          .1109/JSYST.
          <year>2008</year>
          .
          <volume>2003294</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Mousavi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. C.</given-names>
            <surname>Beroza</surname>
          </string-name>
          ,
          <article-title>Stanford earthquake dataset (stead): A global data set of seismic signals for ai</article-title>
          ,
          <source>IEEE Access</source>
          (
          <year>2019</year>
          ). doi:doi:10.1109/ACCESS.
          <year>2019</year>
          .
          <volume>2947848</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>J. M. A.</surname>
          </string-name>
          (JMA),
          <source>Earthquake monthly report (catalog edition)</source>
          ,
          <year>2023</year>
          . URL: https://www.data. jma.go.jp/eqev/data/bulletin/.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Compton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Barnaghi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bermudez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>García-Castro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Corcho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Cox</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Graybeal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hauswirth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Henson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Herzog</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Janowicz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. D.</given-names>
            <surname>Kelsey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Le</given-names>
            <surname>Phuoc</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Lefort</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Leggieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Neuhaus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Nikolov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Page</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Passant</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sheth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Taylor</surname>
          </string-name>
          ,
          <article-title>The ssn ontology of the w3c semantic sensor network incubator group</article-title>
          ,
          <source>Journal of Web Semantics</source>
          <volume>17</volume>
          (
          <year>2012</year>
          )
          <fpage>25</fpage>
          -
          <lpage>32</lpage>
          . URL: https://www.sciencedirect.com/science/article/pii/S1570826812000571. doi:
          <volume>10</volume>
          .1016/j.websem.
          <year>2012</year>
          .
          <volume>05</volume>
          .003.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>