<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Action Planning based on Open Knowledge Graphs and LOD</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Seiji Koide</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fumihiro Kato</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hideaki Takeda</string-name>
          <email>takeda@nii.ac.jp</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yuta Ochiai</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kenki Ueda</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>National Institute of Informatics</institution>
          ,
          <addr-line>2-1-2 Hitotsubashi, Chiyoda-ku, Tokyo 101-8430</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>SOKENDAI, The Graduate University for Advanced Studies</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Toyota Motor Corporation</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this preliminary report, we show how we can realize action planning by using open knowledge-bases and LOD like Linked Geo Data, DBpedia, and WordNet, etc. To make a recommendation for car drivers and passengers, we combine these open datasets by newly constructed ontologies of facilities and services. Then we develop the inference procedure to translate user requests into SPARQL queries to obtain a recommendation on appropriate facilities and areas for users. Common sense knowledge is also required in the reason process.</p>
      </abstract>
      <kwd-group>
        <kwd>DBpedia</kwd>
        <kwd>LinkedGeoData</kwd>
        <kwd>Knowledge-based system</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>While Linked Data is now gradually growing to be the infrastructure of coming
Knowledge Society, we are still struggling to show the potential of Linked Data
to most people in basic industries. To cope with this situation and propel the
deployment of Semantic Web technology in the society, it is needed to demonstrate
the performance of linking distinct datasets and show the potential and
usefulness of outbound and inbound linking data beyond enterprise data in higher
levels of diverse applications. However, although each collection of large linked
data such as DBpedia, Freebase, and OpenCyc are a kind of isolated showcase
of LOD with internally linked data within their own territory and objective, yet
there is no linking data among them from the viewpoint of LOD applications.</p>
      <p>In this preliminary work, we utilized linked open datasets, DBpedia, Linked
Geo Data, and WordNet for the purpose of making a recommendation system
for car drivers and passengers. We have found that it is required more
goaloriented linked datasets and common sense knowledge as bridge between isolated
LOD datasets. We have also found that Semantic Web technology or specifically
LOD and SPARQL engines are enough as enabling technology to create and
demonstrate new applications based on heterogeneous and diverse datasets.</p>
      <p>In our use-case, the system accepts ambiguous requests from car drivers and
passengers, plans driver actions to achieve goals that satisfies the requests,
including alternatives, and makes a recommendation for the drivers and
passengers.</p>
      <p>To obtain the destination as goal, we utilized Linked Geo Data and DBpedia,
and arranged them with newly constructed facility ontology and service
ontology for linking among such open datasets. WordNet is also utilized as general
knowledge, because it was necessary to make the inference with common sense
to discover driving destinations from user requests. Then, we developed the
inference procedure to translate user requests into SPARQL queries to obtain a
recommendation on appropriate facilities and areas for users.</p>
      <p>The purpose of this preliminary report is to make a clear direction for
development of LOD applications in order to deploy linked data as the infrastructure
of society in future.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Problem Setting for the Use Case</title>
      <p>In setting of the use-case, we firstly made more than ten scenarios of conversation
between users and this system. In each case, a user in a car speaks a single or
a number of requests to do something with driving a car. Then, the system
analyzes the requests under the consideration of current contexts such as time,
location, driving time, etc. At last, the system makes concrete action proposals
to visit specific points (shop, facility, etc.) or areas (sightseeing area, good place
for time-consuming, etc.) with a reasonable visiting order. Basically, the request
may be vague and complex, but the recommendation is specific and concrete.
However, every recommendation is a sequence of actions, and proposed actions
are quite limited within these scenarios, for example, drive somewhere, buy or
eat something, do some sport, and so on. One of the simplest scenarios is as
follows.</p>
      <sec id="sec-2-1">
        <title>Child passenger(hereafter C): I want to see a lion.</title>
        <p>System(hereafter S): How about Ueno Zoo. A baby lion was born
recently.</p>
        <p>C: It sounds good, but I was there last month.</p>
        <p>S: Well, how about Kinoshita Circus. You can see a lion show there.
C: OK. That’s fine.</p>
        <p>In this scenario, the system must discover the knowledge that a lion is a
kind of animal and a zoo is an public entertainment facility for seeing animals.
The system must find out a nearest zoo, that is Ueno Zoo in this case, from
the current location, and must reason that users have enough time to drive to
the destination and walking around the zoo. Furthermore, due to the negative
response of the user, the system must discover a neighboring circus that presents
a lion show as an alternative.</p>
        <p>Ontologies for Facility, Action Target, and Service
Instead of directly searching individual facilities like Ueno Zoo or individual
shops like Yodobashi Akiba store (a home electric appliance mass retailer in
Japan), we considered classes of facilities like zoo or home electric appliance
mass retailer to make the system scalable, then made a facility ontology that
contains typical facilities and we defined typical users’ behavior at such facilities
like “a user sees animals in a zoo” or “a user buys a household appliance at a
home electric appliance mass retailer”. Even if we accidentally fail to guide an
actual facility that satisfies user’s special requests, such a problem will be solved
with the development of more rich and specific datasets that includes individual
facilities.</p>
        <p>The facility ontology is constructed mainly by extracting facility classes
related to leisure and meals in Lined Geo Data (LGD). LGD constructs a shallow
class hierarchy from tags attached to the nodes and ways of OpenStreetMap
(OSM). Therefore, LGD classes makes it easy to incorporate new facilities and
new facility types.</p>
        <p>On the other hand, as a result of adopting LGD / OSM, duplicates of classes
due to notation fluctuation of tags and the low coverage rate of actual facilities
at the instance level could be a big problem. However, we think this approach
is the best for our purpose in our best knowledge, because the LGD / OSM is
the largest facility data that can be freely used at the present. Also note that
actually it is impossible to measure how much the existing facilities are covered
in reality. Regarding duplicates of classes in LGD, we select an entity as primary
class that has both the most information-rich descriptions on the OSM and a
large number of instances, then the rest are associated with owl:equivalentClass
to the primary class.</p>
        <p>The following shows an example of zoo class in the facility ontology. The
meanings of Japanese words are added here in English as turtle comments for
readers. Both a service of “see animal” and “pay admission fee for cultural
facility” are actually described in the service ontology as subclasses of “see” service
and “admission-viewing-gaming” service. Note that each service is described as a
pair of an action and an action target, which users can perform. In this paper, we
manually acquired and created service knowledge of facilities within the scenarios
as necessary. See the statistic numbers in Table 1. As shown below, the lgdo:Zoo
class is linked to the dbo:Zoo class in DBpedia Ontology to make possible to
search related facility instances in DBpedia Japanese. The dbo:Zoo already has
a link to Wikidata’s wikidata:Q43501. Thus, it can be easily expanded when
Wikidata is added.
lgdo:Zoo a owl:Class;
servicevoc:dbpediaClass dbo:Zoo ;
servicevoc:provideService [ servicevoc:hasService [
servicevoc:action action: ;
servicevoc:target target:
# pay
], [ # admission fee
# for cultural facility
servicevoc:action action: ;
servicevoc:target target: ]] ;
rdfs:subClassOf servicevoc:Facility .
# see
# animal</p>
        <p>For the sake of systematical description of actions and action targets, we
used the Household Income Balance Item Classification List (January, 2015) of
the Statistics Bureau of the Ministry of Internal Affairs and Communications,
of which items of statistics data are used to describe purchasing behavior at
facilities. User’s behavior at facilities can be divided into purchasing behavior
(such as buying something or paying for some benefits as service) and the other
actions (see, eat, drink, etc.). This classification is based on a hierarchical
structure of action targets as users’ behavior as consumer, so it is possible to consider
cooperation with statistical data in future, starting with purchase actions. For
actions and action targets other than purchasing behavior, we used Japanese
WordNet, because we want to use WordNet’s knowledge on the relationship
between each verb as action and each noun as an action target. For instance, we
made Action Target Ontology as follows.
target: rdfs:label " ";
servicevoc:wordnet
wnja11instances:word# animal
.
target: a owl:Class; rdfs:label " ";
servicevoc:wordnet
wnja11instances:wordrdfs:subClassOf target: .
# food
;
# purchase object</p>
        <p>The service ontology at the bottom of the table is the ontology we constructed
this time, as explained in the above.</p>
        <p>In the facility ontology, a number of services corresponding to distinct
facilities come up with common abstract services. For example, both museums and
art museums have the same service of “paying entrance fee for cultural facilities”.
In addition, there are hierarchical relationships among users’ action targets, then
we have a similar relationship between services. For example, “seeing animals”
can be regarded as the top of “looking at a lion”. We constructed an ontology of
services apart from facility classes, so that services are independently
recognizable, and it enabled us to expand the performance of inference by applying the
hierarchy of services. In this paper, the part of service ontology is constructed
by using the Classification in the Household Survey of the Ministry of Internal
Affairs and Communications. The top of service ontology is the ‘facility service’
and it is related to aspects of two types of behaviors, namely, ‘purchase service’
focused on purchasing behavior, and an ‘activity service’ focused on the other
behaviors at facilities. The following shows an example of ‘purchase service’
ontology entries.</p>
        <p>service: _ a owl:Class;
rdfs:label " _ ";
servicevoc:action action: ;
servicevoc:target target: ;
# food service
# buy
# food
We have collected a number of open knowledge resources as shown at the upper
part of Table 1, and all of them are stored in one RDF store. However, at the time
of this writing, we have actually used only DBpedia Japanese, LinkedGeoData,
Japanese WordNet, and DBpedia Ontology as open datasets. Wikidata is not
stored because of the capacity.</p>
        <p>The system used one endpoint built with one dedicated RDF store.
5</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Reasoning and Q&amp;A Process</title>
      <p>In this preliminary research, we process natural sentences only within the range
expected at use-cases. Furthermore, in this paper it is assumed that the input
is transcribed as text instead of speech.
5.1</p>
      <sec id="sec-3-1">
        <title>Process Flow and Reasoning</title>
        <p>Work flow of this system is as follows.</p>
        <sec id="sec-3-1-1">
          <title>1. Input a text of user’s requests.</title>
          <p>2. Perform the morphological analysis for the input text.
3. Perform the case analysis starting with surface cases to deep cases.
4. Translate the requests into SPARQL queries.
5. Obtain the reply of SPARQL queries.
6. Generate the answering text from the obtained reply.</p>
          <p>Japanese is a kind of agglutinative languages and a Japanese sentence is
written without a space left among phrases and words. A noun phrase is
composed of a noun and a particle, a verb phrase is composed of a stem of verb and a
grammatical conjugation. So, morphological analysis is requisite in Japanese text
processing in order to separate a sentence into phrases and words. Furthermore,
particles attached to nouns decide the grammar case. For example, in response
to an user’s input “ (I want to see a lion)”, the morphological
analysis and shift-reduce method changes the Japanese sentence into the form of
(( (pos info) 8) (( (pos info) 6) ( (pos info) 5)) (( (pos info) 4) (
(pos info) 0))), here (pos info) stands for a Part-of-Speech information
of each, then case analysis produces the result such as Subject:NIL, Verb:(
(pos info) 5), Object:( (pos info) 0), toPlace:NIL, fromPlace:NIL,
Tool:NIL. Part-of-speech information obtained from morphological analysis is
effectively used in various ways. For example, if there is an auxiliary verb ‘
(want)’ next to a form of a behavioral verb such as ‘ (see)’ or ‘ (eat)’,
the whole sentence is interpreted as request. Thus, a request of seeing a lion is
captured and transformed into a SPARQL query to the endpoints.</p>
          <p>From the interpretation of request (see lion), the system searches facilities
that can see a lion, using action target ontology and facility ontology. However,
we have no common sense as LOD that a lion is in a zoo. When searching fails
here, WordNet is used to generalize the target to more abstract ones by searching
hypernym relations in WordNet until animal is found.</p>
          <p>The SPARQL search picks up a number of facilities that are located near the
current location, and the closest one to the current location is chosen outside of
SPARQL search.
5.2</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Inference with SPARQL</title>
        <p>
          Initially, we attempted to make a plan by introducing IS-A logic function into
planning based on classical state space reasoning and backward reasoning [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
However, more than it, searching combined ontologies using one SPARQL query
easily enabled us to retrieve acceptable instances of appropriate facility from
the action target ontology and the facility ontology without any problems in
execution speed. The LGD class according to the user’s request from the facility
ontology can be found, and once the LGD class is known, SPARQL allows direct
retrieval of the facility instance within the LGD. If there is a DBpedia class linked
from LGD, DBpedia Japanese is also automatically searched in SPARQL queries.
The current system consists of RDF Store search and inference for interpretation
of user’s requests. This configuration is beneficial at usability and re-usability.
Based on SPARQL search and open resources, it is possible to expand and
refine ontology without touching the inference engine of the planning system
in applications. It is meaningful for practical application of reasoning by large
amount of data.
6
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Example of Execution</title>
      <p>The following shows an example of execution by this prototype system, see the
added comments translated into English for readers.</p>
      <p>SYSTEM(4): (eliza)
system&gt;
;; I want to enjoy some sport, after that, I want to go to hot spring.
;; the current location is Toyota Higashfuji Institute.</p>
      <p>; searching a location for sports
......</p>
      <p>; guiding the nearest place
13.37621km ; the distance is 13.37621km</p>
      <p>; place: Numazu City Ball Park
35.1125 ; longitude
138.863 ; latitude
URL "http://linkedgeodata.org/triplify/node2877270449"
(35.1125 . 138.863) ; the current location is (35.1125 . 138.863)
; searching a location for hot spring
......</p>
      <p>; guiding the nearest place
10.426165km ; the distance is 10.426165km</p>
      <p>; place: Izu-Nagaoka Hot Spring
35.0353 ; longitude
138.929 ; latitude
URL "http://ja.dbpedia.org/resource/ "</p>
      <p>Searching for a facility in the vicinity of the current location, the Toyota
Higashifuji Institute, the system made a recommendation to go to Numazu City
Ball Park, then go to Izu-Nagaoka Hot Spring, in response to a request to go to
a hot spring after enjoying some sport.</p>
      <p>While this prototype of action planning by using open knowledge sources and
SPARQL queries is widely applicable to various kind of applications, yet there is
not enough as intelligent agent. Making more intelligent agent remains in future
work.</p>
    </sec>
    <sec id="sec-5">
      <title>Discussion</title>
      <p>In this preliminary research, the following issues are suggested.</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>In this preliminary research, we made a prototype of action planning system for
events of everyday life and world, based on open knowledge of LOD as fact data
and taxonomy as common knowledge. We utilized a number of large-scale open
databases and knowledge-bases. We found that we had already abundant
knowledge about the everyday life and world as diverse open knowledge resources. This
condition is very different at the era of Good-Old-Fashioned-AI (GOGAI) before
the Web age and LOD. However, we also found that we needed the additional
general and common knowledge that connects such different open resources in
reasoning action plans with SPARQL endpoints. It is obvious that it will be
necessary to make open knowledge more available not only in the verification
and validation for each, but also in the combinations of them for applications.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Ghallab</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nau</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Traverso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <source>Automated Planning, theory and practice</source>
          ,
          <source>Elsevier</source>
          (
          <year>2004</year>
          ) .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>