<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>GPS Trajectory Linked Open Data based on Open POI Information- Through an Experiment in ISWC2016-</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Kouji Kozaki</string-name>
          <email>kozaki@ei.sanken.osaka-u.ac.jp</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Teruaki Yokoyama</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fukami Yoshiaki</string-name>
          <email>yoshiaki@rikkyo.ac.jp</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Kobe Institute of Computing</institution>
          ,
          <addr-line>2-2-7 Kano-cho, Chuo-ku, Kobe 650-0001</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Rikkyo University</institution>
          ,
          <addr-line>3-34-1 Nishi-Ikebukuro,Toshima-ku, Tokyo, 171-8501</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>The Institute of Scientific and Industrial Research, Osaka University 8-1 Mihogaoka</institution>
          ,
          <addr-line>Ibaraki, Osaka, 567-0047</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>We tried an experiment in ISWC 2016, Kobe, as a practical example of data integration of sensor data through IoT devices and semantic information. We provided small GPS devices to some volunteers from participants of ISWC 2016 and collected trajectory data during the conference. Then, we integrated the data with Points of Interest (POI) information collected through existing open data such as open government data by Kobe city, DBpedia and Wikidata. This paper presents the method to integrate GPS trajectory data and existing open data with consideration on usefulness of them as sources of POI information for practical analysis.</p>
      </abstract>
      <kwd-group>
        <kwd>GPS trajectory</kwd>
        <kwd>open data</kwd>
        <kwd>IoT</kwd>
        <kwd>Point of Interest</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Integration of sensor data through IoT devices and semantic information is an important
technology in various fields. We tried an experiment in ISWC 2016, Kobe, as a
practical example of such a semantic data integration. In the examination, we provided small
GPS devices to some volunteers from participants of ISWC 2016 and collected
trajectory data during the conference. Then, we integrated the data with Points of Interest
(POI) information collected through some existing open data. As the result, we
published the integrated trajectory data as Linked Open Data with SPARQL endpoint. It
enables us to analysis trajectory information from various aspects and granularities.</p>
      <p>This paper presents the method to integrate GPS trajectory data and existing open
data from various aspects with multiple granularities. Then, we consider how much
useful are existing open data to analysis real GPS trajectory.</p>
    </sec>
    <sec id="sec-2">
      <title>Experiments for collecting GPS trajectory data in ISWC2016</title>
      <p>There are some methods for collecting trajectory data such as Global Positioning
System (GPS) [1], radio frequency identifier (RDFID), location recognitions from image
[2], Monitoring of WiFi for smartphones [3] etc. In our experiment, we used GPS
because it does not need any special equipment for target area, and it can measure location
more accurately than WiFi monitoring and recognitions from image.</p>
      <p>We conducted an experiment for collecting GPS trajectory in International Semantic
Web Conference 2016 (ISWC2016), Kobe, Japan, October 17-21, 2016. The purpose
of this experiment is to demonstrate of the possibility how we can combine GPS
trajectory data and existing open data.</p>
      <p>We ask some volunteers to receive and bring a small GPS device during the stay in
the ISWC2016 conference. Their movements are recorded on the GPS devices. We
used i-gotU GT-600 which are GPS devices offered commercially for the experiment.
The devises recorded their location per 1 minute. The collected information includes
its position (latitude, longitude), the height above sea level, movement speed and
moving distance. As the result, we collected trajectory data from 11 volunteers during the
ISWC2016. The average of recoded period was 91.8 hours (3.83 days).
3</p>
    </sec>
    <sec id="sec-3">
      <title>GPS trajectory LOD</title>
      <sec id="sec-3-1">
        <title>3.1 Open data for obtaining POI information</title>
        <p>The collected GPS trajectory data is converted into RDF using POI information
obtained through existing open data. We collected POI information in Kobe city since
moving range of the collected all GPS trajectory data, which is shown in Section 2, was
limited in Kobe city. As the result, we collected six datasets. They are classified into
two kinds as follows.</p>
      </sec>
      <sec id="sec-3-2">
        <title>1. Open data published by Kobe city (local government).</title>
        <p>
          We collected four datasets related to sightseeing from open data portal1 by Kobe city
because our experiments aim to analyze interests of participants for the international
conference. They are dataset about (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) Sightseeing facility, (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) Night view spot, (
          <xref ref-type="bibr" rid="ref3">3</xref>
          )
Location site and (4) Outdoor sculpture.
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>2. POI information extracted from (5) DBpedia Japanese and (6) Wikidata.</title>
        <p>We extracted data which has position information (latitude and longitude) in Kobe
city using SPARQL query.</p>
        <p>
          We supposed that the former contain POIs closely related to the local area because
they are provided by the local government. On the other hands, we supposed that the
latter cover wide broad kinds of POIs because they are general open data which
everyone can edit on the web. In addition to them, we prepared two kinds of merged datasets;
merged dataset of open data by Kobe city (
          <xref ref-type="bibr" rid="ref1">1</xref>
          )-(4) and merged dataset of all datasets
(
          <xref ref-type="bibr" rid="ref1">1</xref>
          )(6). When we merged these datasets, we considered that if a distance between POIs in
different datasets is less than 50m, they are treated as one merged POI. It is because
different datasets may contain the same POI information.
        </p>
        <p>
          Table 1 shows POI information which we collected. The merged dataset of (
          <xref ref-type="bibr" rid="ref1">1</xref>
          )–(6)
contains 730 POIs. Distances between POI in the dataset are 234.3m in average and
139.1m in median.
        </p>
        <sec id="sec-3-3-1">
          <title>1 https://data.city.kobe.lg.jp/</title>
          <p>Choose one dataset from multiple datasets of POIs according to    
the purpose of the analysis </p>
          <p>POI information (common to all users)
Lat/Long label1 Lat/Long label5 Lat/Long label24
POI‐1 POI‐5 POI‐24
poi poi poi
user1 stayed next user1 stayed next user1 stayed  next
at POI‐1 at POI‐5 at POI‐24 ・・・
Year/Day/Time Year/Day/Time Year/Day/Time</p>
          <p>GPS trajectory (for each user)</p>
        </sec>
      </sec>
      <sec id="sec-3-4">
        <title>3.2 RDF data model for GPS trajectory</title>
        <p>Fig. 1 shows the overview of RDF data model for GPS trajectory. This model consists
two data models for POI information and trajectory of each user.</p>
        <p>POI information is represented by two classes POI class and MergedPOI class. POI
class represents a primitive POI extracted original six datasets, and MergedPOI class
represents a merged POI discussed in Section 3.1. Each POI has properties such as
rdfs:label, position information (geo:lat, geo:long), data source. Links to other LOD
are represented using rdfs:seeAlso property. In addition to these properties, a merged
POI has contains properties which represent primitive POI merged into it. One of these
POI datasets is chosen and commonly referred by GPS trajectory data according to the
purpose of the analysis.</p>
        <p>The GPS trajectory is represented by a series of RDF resources of StayPOI class.
Each StayPOI resource represents information about a stay that a user stayed a POI
during a time interval. It is described using properties such as the username, POI,
starting/ending time point of the stay, a link to its next stay information. The POI is a
reference to an instance of POI class discussed the above. Based on the data model, a
GPS trajectory is represented as a directed graph which looks like the bottom of Fig. 1.</p>
      </sec>
      <sec id="sec-3-5">
        <title>3.3 Translation from GPS trajectory data to RDF</title>
        <p>We translated GPS trajectory data discussed in Section 2 into RDF based on data
models shown in the Section 3.2. Position information are compared with each POI
information in a selected POI dataset and judged which POI the user stayed. The following
steps show how we judge each staying.
1. Each position data in GPS data and each POI information in the selected POI datasets
are compared. If the distance between the position in GPS and the POI is less than
the threshold d, and the distance is minimum among the position in GPS between
each POI, then it is considered that the user stayed the POI.
2. When a series of GPS data records are judged that the user stayed the same POI,
- The earliest date-time from these records is considered as the time which the user
entered (started to stay) in the POI
- The latest date-time from these records is considered as the time which the user
leaved (stop to stay) in the POI
3. On the series of GPS data records, when the POI the user stayed changes, the new</p>
        <p>POI is considered as the new POI that the user stayed in the next.</p>
        <sec id="sec-3-5-1">
          <title>We developed a Java client program for the translation.</title>
          <p>
            We translated GPS trajectory data collected from the eleven users using eight kinds
of POI datasets. We used threshold d as 100m. In the case that we used the merged
dataset (
            <xref ref-type="bibr" rid="ref1">1</xref>
            ) – (6), the obtained number of instances of StayPOI class is 1,462 and the
number of trajectory data in RDF is 13,158 (generated at May 5, 2017). These dataset
is available at http://lodosaka.jp/iswc2016gtl-exp/ with source data, SPARQL endpoint
and sample web application to visualize the trajectory data.
4
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Discussions and Conclusion</title>
      <p>
        Table 2 shows how many POI
obtained existing open datasets and Data # of # of POI # of GPS data
used to transform GPS trajectory sIDet Dataset name dPaOtaIsient sosmtaeyuedser jwudhgicehd citasnstbaey
into RDF. It shows that 68.58% of P1 (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) Sightseeing facility 103 24 1,585 (39.39%)
trajectory by all users are covered P2 (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) Night view spot 22 12 71 (1.76%)
by open data provided Kobe city P3 (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) Location site 88 36 2,516 (62.52%)
and 82.88% are covered when P4 (4) Outdoor sculpture 35 26 201 (5.00%)
Wikidata and DBpedia are used PM5P1 (M5)eDrgBepdeddiaatJaaspeatn(e1s)e-(4) 122783 7573 2,765991 ((6187..5167%%))
with them. That is, combination of P6 (6)Wikidata 544 97 2,697 (67.02%)
open data by local governments MP2 Merged dataset (
        <xref ref-type="bibr" rid="ref1">1</xref>
        )-(6) 730 146 3,335 (82.88%)
and social open data such as
Wikidata and DBpedia is good to obtain practical POI information in this experiment.
However, subjects in this experiments did not move so wide area, which is most of them
stayed only city are, because we collect their GPS trajectory only during the conference.
We are planning to analysis the proposed method in different area and settings.
      </p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>This work was supported by JSPS KAKENHI Grant Number 17H01789. The authors
are deeply grateful to volunteers who join the experiments in ISWC2016.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>N.</given-names>
            <surname>Bulusu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Heidemann</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Estrin</surname>
          </string-name>
          .
          <article-title>Gps-less low-cost outdoor localization for very small devices</article-title>
          .
          <source>Personal Communications, IEEE</source>
          ,
          <volume>7</volume>
          (
          <issue>5</issue>
          ), (
          <year>2000</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>C.-N.E.</given-names>
            <surname>Anagnostopoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. E.</given-names>
            <surname>Anagnostopoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. D.</given-names>
            <surname>Psoroulas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Loumos</surname>
          </string-name>
          , E. Kayafas,
          <article-title>License Plate Recognition From Still Images and Video Sequences: A Survey</article-title>
          ,
          <source>IEEE Transactions on Intelligent Transportation Systems, v.9 n.3</source>
          , p.
          <fpage>377</fpage>
          -
          <lpage>391</lpage>
          (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>A.B.M.Musa</surname>
          </string-name>
          , Jakob Eriksson,
          <string-name>
            <surname>Tracking Unmodified Smartphones Using Wi-Fi</surname>
            <given-names>Monitors</given-names>
          </string-name>
          ,
          <source>In Proc. of SenSys '12 Proceedings of the 10th ACM Conference on Embedded Network Sensor Systems</source>
          , pp.
          <fpage>281</fpage>
          -
          <lpage>294</lpage>
          , (
          <year>2012</year>
          ) .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>