<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Action Planning based on Open Knowledge Graphs and LOD</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Seiji</forename><surname>Koide</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">National Institute of Informatics</orgName>
								<address>
									<addrLine>2-1-2 Hitotsubashi, Chiyoda-ku</addrLine>
									<postCode>101-8430</postCode>
									<settlement>Tokyo</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Fumihiro</forename><surname>Kato</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">National Institute of Informatics</orgName>
								<address>
									<addrLine>2-1-2 Hitotsubashi, Chiyoda-ku</addrLine>
									<postCode>101-8430</postCode>
									<settlement>Tokyo</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author role="corresp">
							<persName><forename type="first">Hideaki</forename><surname>Takeda</surname></persName>
							<email>takeda@nii.ac.jp</email>
						</author>
						<author>
							<persName><forename type="first">Yuta</forename><surname>Ochiai</surname></persName>
							<affiliation key="aff2">
								<orgName type="institution">Toyota Motor Corporation</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Kenki</forename><surname>Ueda</surname></persName>
							<affiliation key="aff2">
								<orgName type="institution">Toyota Motor Corporation</orgName>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff1">
								<orgName type="laboratory">SOKENDAI</orgName>
								<orgName type="institution">The Graduate University for Advanced Studies</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Action Planning based on Open Knowledge Graphs and LOD</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">E2CEBE2F5B7635106DB7749F94B142E2</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T05:39+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>DBpedia</term>
					<term>LinkedGeoData</term>
					<term>Knowledge-based system</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this preliminary report, we show how action planning can be realized with open knowledge bases and LOD such as Linked Geo Data, DBpedia, and WordNet. To make recommendations for car drivers and passengers, we combine these open datasets through newly constructed ontologies of facilities and services. We then develop an inference procedure that translates user requests into SPARQL queries to obtain recommendations of appropriate facilities and areas. Common sense knowledge is also required in the reasoning process.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>While Linked Data is gradually growing into the infrastructure of the coming Knowledge Society, we are still struggling to show its potential to most people in basic industries. To cope with this situation and propel the deployment of Semantic Web technology in society, we need to demonstrate the performance of linking distinct datasets and to show the potential and usefulness of outbound and inbound linking of data beyond enterprise data in higher levels of diverse applications. However, each large collection of linked data, such as DBpedia, Freebase, and OpenCyc, is a kind of isolated showcase of LOD whose data are linked internally within its own territory and objective, and there are still no links among them from the viewpoint of LOD applications.</p><p>In this preliminary work, we utilized the linked open datasets DBpedia, Linked Geo Data, and WordNet to build a recommendation system for car drivers and passengers. We found that more goal-oriented linked datasets and common sense knowledge are required as a bridge between isolated LOD datasets. We also found that Semantic Web technology, specifically LOD and SPARQL engines, is sufficient as an enabling technology to create and demonstrate new applications based on heterogeneous and diverse datasets.</p><p>In our use case, the system accepts ambiguous requests from car drivers and passengers, plans driver actions to achieve goals that satisfy the requests, including alternatives, and makes a recommendation to the drivers and passengers.</p><p>To obtain the destination as the goal, we utilized Linked Geo Data and DBpedia, and linked these open datasets through a newly constructed facility ontology and service ontology. WordNet is also utilized as general knowledge, because inference with common sense was necessary to discover driving destinations from user requests. We then developed an inference procedure that translates user requests into SPARQL queries to obtain recommendations of appropriate facilities and areas for users.</p><p>The purpose of this preliminary report is to set a clear direction for the development of LOD applications, so that linked data can be deployed as an infrastructure of society in the future.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Problem Setting for the Use Case</head><p>To set up the use case, we first wrote more than ten scenarios of conversation between users and the system. In each scenario, a user in a car utters one or more requests to do something that involves driving. The system then analyzes the requests in light of the current context, such as time, location, and driving time. Finally, the system makes concrete action proposals to visit specific points (shop, facility, etc.) or areas (sightseeing area, a good place to spend time, etc.) in a reasonable visiting order. The request may be vague and complex, but the recommendation is specific and concrete. However, every recommendation is a sequence of actions, and the proposed actions are quite limited within these scenarios: for example, drive somewhere, buy or eat something, or do some sport. One of the simplest scenarios is as follows.</p><p>Child passenger (hereafter C): I want to see a lion.</p><p>System (hereafter S): How about Ueno Zoo? A baby lion was born recently.</p><p>C: It sounds good, but I was there last month.</p><p>S: Well, how about Kinoshita Circus? You can see a lion show there.</p><p>C: OK. That's fine.</p><p>In this scenario, the system must discover the knowledge that a lion is a kind of animal and that a zoo is a public entertainment facility for seeing animals. The system must find the zoo nearest to the current location, which is Ueno Zoo in this case, and must reason that the users have enough time to drive to the destination and walk around the zoo. Furthermore, because of the user's negative response, the system must discover a neighboring circus that presents a lion show as an alternative.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Ontologies for Facility, Action Target, and Service</head><p>Instead of directly searching for individual facilities like Ueno Zoo or individual shops like the Yodobashi Akiba store (a home electric appliance mass retailer in Japan), we considered classes of facilities, such as zoo or home electric appliance mass retailer, to make the system scalable. We then built a facility ontology that contains typical facilities and defined typical user behavior at such facilities, like "a user sees animals in a zoo" or "a user buys a household appliance at a home electric appliance mass retailer". Even if the system occasionally fails to guide a user to an actual facility that satisfies a special request, we expect such problems to be solved as richer and more specific datasets that include individual facilities are developed.</p><p>The facility ontology is constructed mainly by extracting facility classes related to leisure and meals from Linked Geo Data (LGD). LGD constructs a shallow class hierarchy from the tags attached to the nodes and ways of OpenStreetMap (OSM). Therefore, LGD classes make it easy to incorporate new facilities and new facility types.</p><p>On the other hand, as a result of adopting LGD / OSM, duplicate classes caused by inconsistent tag notation and the low coverage of actual facilities at the instance level could be a big problem. However, to the best of our knowledge this approach is the best for our purpose, because LGD / OSM is the largest facility dataset that can be freely used at present. Note also that it is impossible in practice to measure how much of the existing facilities are covered. 
For duplicate classes in LGD, we select as the primary class the entity that has both the most informative description in OSM and a large number of instances, and associate the rest with the primary class through owl:equivalentClass.</p><p>The following shows an example of the zoo class in the facility ontology. The meanings of Japanese words are added in English as Turtle comments for readers. The services "see animal" and "pay admission fee for cultural facility" are described in the service ontology as subclasses of the "see" service and the "admission-viewing-gaming" service, respectively. Note that each service is described as a pair of an action and an action target that users can perform. In this paper, we manually acquired and created the service knowledge of facilities within the scenarios as necessary. See Table <ref type="table" target="#tab_0">1</ref> for the statistics. As shown below, the lgdo:Zoo class is linked to the dbo:Zoo class in DBpedia Ontology to make it possible to search for related facility instances in DBpedia Japanese. The dbo:Zoo class already has a link to Wikidata's wikidata:Q43501, so it can be easily expanded when Wikidata is added. In the listing, the final triple, rdfs:subClassOf servicevoc:Facility, declares lgdo:Zoo to be a facility class.</p><p>To describe actions and action targets systematically, we used the Household Income Balance Item Classification List (January, 2015) of the Statistics Bureau of the Ministry of Internal Affairs and Communications, whose statistical items are used to describe purchasing behavior at facilities. User behavior at facilities can be divided into purchasing behavior (such as buying something or paying for some benefit as a service) and other actions (see, eat, drink, etc.). This classification is based on a hierarchical structure of the targets of users' actions as consumers, so integration with statistical data can be considered in the future, starting with purchase actions. 
For actions and action targets other than purchasing behavior, we used Japanese WordNet, because we wanted to use WordNet's knowledge of the relationship between each verb as an action and each noun as an action target. For instance, we built the Action Target Ontology as follows.</p><p>target:動物 rdfs:label "動物";                        # animal
    servicevoc:wordnet wnja11instances:word-動物 .
target:食料 a owl:Class;
    rdfs:label "食料";                                 # food
    servicevoc:wordnet wnja11instances:word-食料 ;
    rdfs:subClassOf target:購買対象 .                  # purchase object</p><p>The service ontologies at the bottom of Table <ref type="table" target="#tab_0">1</ref> are the ontologies we constructed for this work, as explained above.</p><p>In the facility ontology, a number of services corresponding to distinct facilities share common abstract services. For example, both museums and art museums have the same service of "paying entrance fee for cultural facilities". In addition, there are hierarchical relationships among users' action targets, so there are similar relationships between services. For example, "seeing animals" can be regarded as a superclass of "looking at a lion". We constructed the ontology of services separately from the facility classes, so that services are independently recognizable, which let us extend the reach of inference by exploiting the hierarchy of services. In this paper, part of the service ontology is constructed by using the Classification in the Household Survey of the Ministry of Internal Affairs and Communications. The top of the service ontology is the 'facility service', which has two types according to the kind of behavior: the 'purchase service', focused on purchasing behavior, and the 'activity service', focused on other behaviors at facilities. The following shows an example of 'purchase service' ontology entries.</p><p>service:食料_サービス a owl:Class;                     # food service
    rdfs:label "食料_サービス";
    servicevoc:action action:買う;                     # buy
    servicevoc:target target:食料;                     # food</p></div>
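<div xmlns="http://www.tei-c.org/ns/1.0"><p>The role of the service hierarchy in inference can be illustrated with a minimal sketch. It is not the authors' implementation: the English service names, the parent links, and the mapping from services to facility classes (including lgdo:Supermarket) are hypothetical stand-ins for the Japanese ontology entries, chosen only to mirror the examples in the text. A requested service that no facility offers directly is resolved by walking up its rdfs:subClassOf chain until a provided service is found.</p><p><![CDATA[
```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Service:
    """A service is a pair of an action and an action target, plus its superclass."""
    name: str
    action: str
    target: str
    parent: Optional[str] = None


# Hypothetical English stand-ins for entries of the service ontology.
SERVICES: Dict[str, Service] = {
    s.name: s
    for s in [
        Service("facility_service", "use", "facility"),
        Service("purchase_service", "buy", "purchase_object", "facility_service"),
        Service("activity_service", "do", "activity", "facility_service"),
        Service("food_service", "buy", "food", "purchase_service"),
        Service("meat_service", "buy", "meat", "food_service"),
        Service("see_service", "see", "thing", "activity_service"),
        Service("see_animal", "see", "animal", "see_service"),
        Service("see_lion", "see", "lion", "see_animal"),
    ]
}

# Which facility classes provide which service (illustrative assumption).
PROVIDED_BY: Dict[str, List[str]] = {
    "see_animal": ["lgdo:Zoo"],
    "food_service": ["lgdo:Supermarket"],
}


def ancestors(name: str) -> List[str]:
    """Return the superclass chain of a service, nearest ancestor first."""
    chain: List[str] = []
    current = SERVICES[name]
    while current.parent is not None:
        chain.append(current.parent)
        current = SERVICES[current.parent]
    return chain


def facilities_for(name: str) -> List[str]:
    """Find facility classes for a service, generalizing upward if needed."""
    for candidate in [name] + ancestors(name):
        if candidate in PROVIDED_BY:
            return PROVIDED_BY[candidate]
    return []


if __name__ == "__main__":
    print(ancestors("meat_service"))
    print(facilities_for("see_lion"))
```
]]></p><p>Keeping services in their own hierarchy, apart from facility classes, is what allows the lookup to generalize without touching the facility definitions.</p></div>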
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Building Knowledge Graphs</head><p>We collected a number of open knowledge resources, shown in the upper part of Table <ref type="table" target="#tab_0">1</ref>, and stored all of them in one RDF store. However, at the time of this writing, we have actually used only DBpedia Japanese, LinkedGeoData, Japanese WordNet, and DBpedia Ontology as open datasets. Wikidata is not stored because of capacity limits.</p><p>The system used one endpoint built on one dedicated RDF store.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Reasoning and Q&amp;A Process</head><p>In this preliminary research, we process natural-language sentences only within the range expected in the use cases. Furthermore, this paper assumes that the input is given as transcribed text rather than speech.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1">Process Flow and Reasoning</head><p>The workflow of this system is as follows.</p><p>1. Input the text of the user's requests.</p><p>2. Perform morphological analysis on the input text.</p><p>3. Perform case analysis, proceeding from surface cases to deep cases.</p><p>4. Translate the requests into SPARQL queries.</p><p>5. Obtain the results of the SPARQL queries.</p><p>6. Generate the answer text from the obtained results.</p><p>Japanese is an agglutinative language, and a Japanese sentence is written without spaces between phrases and words. A noun phrase is composed of a noun and a particle, and a verb phrase is composed of a verb stem and a grammatical conjugation. Morphological analysis is therefore required in Japanese text processing to separate a sentence into phrases and words. Furthermore, the particles attached to nouns determine the grammatical case. For example, in response to a user's input "ライオンが見たいな (I want to see a lion)", morphological analysis and the shift-reduce method change the Japanese sentence into the form ((な (pos info) 8) ((たい (pos info) 6) (見 (pos info) 5)) ((が (pos info) 4) (ライオン (pos info) 0))), where (pos info) stands for the part-of-speech information of each element. Case analysis then produces a result such as Subject:NIL, Verb:(見る (pos info) 5), Object:(ライオン (pos info) 0), toPlace:NIL, fromPlace:NIL, Tool:NIL. The part-of-speech information obtained from morphological analysis is used effectively in various ways. For example, if the auxiliary verb 'たい (want)' follows a form of a behavioral verb such as '見る (see)' or '食べる (eat)', the whole sentence is interpreted as a request. Thus, a request to see a lion is captured and transformed into a SPARQL query to the endpoint.</p><p>From the interpretation of the request (see lion), the system searches for facilities where one can see a lion, using the action target ontology and the facility ontology. However, no LOD dataset states the common sense fact that lions are found in zoos. When the search fails here, WordNet is used to generalize the target to more abstract ones by following hypernym relations in WordNet until animal is found.</p><p>The SPARQL search picks up a number of facilities located near the current location, and the one closest to the current location is chosen outside the SPARQL search.</p></div>
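<div xmlns="http://www.tei-c.org/ns/1.0"><p>The hypernym-based generalization described above can be sketched as follows. This is an illustration under stated assumptions, not the system's code: the hypernym table is a hand-written stand-in for Japanese WordNet (which links synsets rather than single words, and may offer several hypernyms per sense), and the set of known targets stands in for the action target ontology. The function climbs the hypernym chain until it reaches a target the ontology knows.</p><p><![CDATA[
```python
from typing import Dict, Optional, Set

# Hand-written stand-in for WordNet hypernym links (illustrative, not real WordNet data).
HYPERNYMS: Dict[str, str] = {
    "lion": "big_cat",
    "big_cat": "feline",
    "feline": "carnivore",
    "carnivore": "mammal",
    "mammal": "animal",
}

# Targets assumed to be present in the action target ontology.
KNOWN_TARGETS: Set[str] = {"animal", "food"}


def generalize(
    target: str,
    known: Set[str] = KNOWN_TARGETS,
    hypernyms: Dict[str, str] = HYPERNYMS,
) -> Optional[str]:
    """Climb hypernym links until a target known to the ontology is found.

    Returns None when the chain ends (or loops) without reaching a known target.
    """
    seen: Set[str] = set()
    current: Optional[str] = target
    while current is not None and current not in seen:
        if current in known:
            return current
        seen.add(current)
        current = hypernyms.get(current)
    return None


if __name__ == "__main__":
    print(generalize("lion"))
```
]]></p><p>The visited set guards against cycles in the lookup table, so the search always terminates even on inconsistent data.</p></div>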
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2">Inference with SPARQL</head><p>Initially, we attempted to make a plan by introducing an IS-A logic function into planning based on classical state space reasoning and backward reasoning <ref type="bibr" target="#b0">[1]</ref>. However, we found it more effective to search the combined ontologies with a single SPARQL query, which easily retrieved acceptable instances of appropriate facilities from the action target ontology and the facility ontology without any problems in execution speed. The LGD class that matches the user's request can be found from the facility ontology, and once the LGD class is known, SPARQL allows direct retrieval of the facility instances within LGD. If there is a DBpedia class linked</p><p>In this preliminary research, the following issues are suggested.</p><p>1. It is necessary to understand the coverage and granularity of each dataset, but this is generally hard for large datasets. In this work, we first examined the availability of actual data on the premise of these use-case scenarios, and then made a utilization plan for the whole dataset.</p><p>2. It is generally hard work to find the correct relations between datasets. While simple string matching allows an automatic matching process, ontology mapping cannot avoid manual effort at present. The accuracy of this mapping greatly affects the result, yet mechanical matching is difficult. In addition, we built intermediate ontologies and mapped them to LOD datasets, but building an ontology is generally not easy for a novice.</p><p>3. Since DBpedia and LGD are datasets made by crowdsourcing, we cannot expect them to be complete and valid. Missing or biased data is still problematic for reasoning. In fact, a food shop that had already closed appeared among the results. In this work we eliminated errors as soon as they were found, but we need to consider tools for (semi-)automated error checking.</p><p>4. The inference procedure was designed according to these use-case scenarios. For other problems, different datasets and different workflows may be used. For example, how general knowledge and fact data should be balanced to solve a problem depends on the features of the target problem.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="8">Conclusion</head><p>In this preliminary research, we built a prototype of an action planning system for everyday events, based on the open knowledge of LOD as fact data and on taxonomies as common knowledge. We utilized a number of large-scale open databases and knowledge bases. We found that abundant knowledge about everyday life and the world is already available as diverse open knowledge resources. This situation is very different from the era of Good Old-Fashioned AI (GOFAI), before the Web and LOD. However, we also found that additional general and common knowledge is needed to connect such different open resources when reasoning about action plans with SPARQL endpoints. Open knowledge will clearly need to be made more available, not only through verification and validation of each resource, but also through their combination for applications.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head></head><label></label><figDesc>lgdo:Zoo a owl:Class;
    servicevoc:dbpediaClass dbo:Zoo ;
    servicevoc:provideService [
        servicevoc:hasService
            [ servicevoc:action action:払う;               # pay
              servicevoc:target target:文化施設入場料 ],    # admission fee for cultural facility
            [ servicevoc:action action:見る;               # see
              servicevoc:target target:動物 ]] ;           # animal</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>Outline of Prepared Datasets and Used Datasets</figDesc><table><row><cell>Dataset</cell><cell>Version</cell><cell>Num. triples</cell><cell>Num. classes</cell><cell>Used</cell></row><row><cell>Fact Dataset</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>DBpedia core+en</cell><cell>2016-04-01</cell><cell>1,131,657,931</cell><cell>-</cell><cell>△</cell></row><row><cell>DBpedia Japanese</cell><cell>2017-02-20</cell><cell>113,299,748</cell><cell>-</cell><cell>○</cell></row><row><cell>LinkedGeoData</cell><cell>2015-11-02</cell><cell>1,216,560,762</cell><cell>-</cell><cell>○</cell></row><row><cell>General Ontology</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>DBpedia Ontology</cell><cell>2016-11-01</cell><cell>30,793</cell><cell>758</cell><cell>○</cell></row><row><cell>LGD Ontology</cell><cell>2014-09-09</cell><cell>24,530</cell><cell>1,200</cell><cell>○</cell></row><row><cell>Japanese WordNet</cell><cell>2013-06-26</cell><cell>4,003,288</cell><cell>57,238</cell><cell>○</cell></row><row><cell>Japanese Wikipedia Ontology</cell><cell>2013-11-07</cell><cell>21,863,327</cell><cell>166,397</cell><cell>×</cell></row><row><cell>YAGO</cell><cell>3.0.2</cell><cell>1,001,461,792</cell><cell>5,130,031</cell><cell>×</cell></row><row><cell>OpenCyc</cell><cell>2012-05-10</cell><cell>5,783,451</cell><cell>233,644</cell><cell>×</cell></row><row><cell>UMBEL</cell><cell>1.5</cell><cell>392,728</cell><cell>33,686</cell><cell>×</cell></row><row><cell>Service Ontology</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>Facility Ontology</cell><cell>2017-02-20</cell><cell>3,257</cell><cell>418</cell><cell>○</cell></row><row><cell>Service Ontology</cell><cell>2017-02-20</cell><cell>3,933</cell><cell>750</cell><cell>○</cell></row><row><cell>Action Target Ontology</cell><cell>2017-02-20</cell><cell>2,030</cell><cell>622</cell><cell>○</cell></row><row><cell>Action Ontology</cell><cell>2017-02-20</cell><cell>153</cell><cell>55</cell><cell>○</cell></row><row><cell>subtotal of Service Ontologies</cell><cell></cell><cell>9,373</cell><cell>1,845</cell><cell></cell></row><row><cell>Total</cell><cell></cell><cell>3,495,087,723</cell><cell>5,624,799</cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0"><head></head><label></label><figDesc>    rdfs:subClassOf service:購買_サービス .            # purchase service
service:肉類_サービス a owl:Class;                     # meat service
    rdfs:label "肉類_サービス";
    servicevoc:action action:買う;                     # buy
    servicevoc:target target:肉類;                     # meat
    rdfs:subClassOf service:食料_サービス .            # food service</figDesc></figure>
		</body>
		<back>
			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>from LGD, DBpedia Japanese is also automatically searched in the SPARQL queries. The current system consists of RDF store search and inference for interpreting users' requests. This configuration benefits usability and reusability. Because the system is based on SPARQL search and open resources, the ontologies can be expanded and refined without touching the inference engine of the planning system in applications. This is significant for the practical application of reasoning over large amounts of data.</p></div>
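<div xmlns="http://www.tei-c.org/ns/1.0"><p>A single query over the combined ontologies might be assembled as in the sketch below. It is only an illustration: the property names servicevoc:provideService, servicevoc:hasService, servicevoc:action, servicevoc:target, and servicevoc:dbpediaClass are taken from the zoo listing, but the namespace URIs for servicevoc, action, and target are placeholders assumed here, and the use of rdfs:label and the WGS84 geo:lat and geo:long properties on facility instances is likewise an assumption. The query finds classes whose services match an (action, target) pair, then retrieves instances of the LGD class or of the DBpedia class linked to it.</p><p><![CDATA[
```python
from typing import Dict

PREFIXES: Dict[str, str] = {
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "geo": "http://www.w3.org/2003/01/geo/wgs84_pos#",
    # The three namespaces below are placeholders, not the authors' URIs.
    "servicevoc": "http://example.org/servicevoc#",
    "action": "http://example.org/action/",
    "target": "http://example.org/target/",
}


def build_facility_query(action: str, target: str, limit: int = 50) -> str:
    """Build a SPARQL query for facilities offering the (action, target) service."""
    # Accept only alphanumeric local names so user text cannot alter the query.
    if not (action.isalnum() and target.isalnum()):
        raise ValueError("action and target must be plain alphanumeric names")
    prefix_block = "\n".join(
        f"PREFIX {name}: <{uri}>" for name, uri in PREFIXES.items()
    )
    return f"""{prefix_block}
SELECT DISTINCT ?facility ?label ?lat ?long WHERE {{
  ?cls servicevoc:provideService ?ps .
  ?ps servicevoc:hasService ?svc .
  ?svc servicevoc:action action:{action} ;
       servicevoc:target target:{target} .
  {{ ?facility a ?cls . }}
  UNION
  {{ ?cls servicevoc:dbpediaClass ?dbcls . ?facility a ?dbcls . }}
  ?facility rdfs:label ?label .
  OPTIONAL {{ ?facility geo:lat ?lat ; geo:long ?long . }}
}}
LIMIT {limit}"""


if __name__ == "__main__":
    print(build_facility_query("見る", "動物"))
```
]]></p><p>Because the facility class is bound inside the query rather than in application code, new facility classes added to the ontology are picked up without changing this function.</p></div>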
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Example of Execution</head><p>The following shows an example of execution by this prototype system; comments translated into English are added for readers.</p><p>; latitude URL："http://ja.dbpedia.org/resource/伊豆長岡温泉"</p><p>Searching for facilities in the vicinity of the current location, the Toyota Higashifuji Institute, the system recommended going to Numazu City Ball Park and then to Izu-Nagaoka Hot Spring, in response to a request to go to a hot spring after enjoying some sport.</p></div>
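<div xmlns="http://www.tei-c.org/ns/1.0"><p>Since the closest facility is chosen outside the SPARQL search, that selection step can be sketched with a great-circle distance. The haversine formula, the record layout with lat and long keys, and the sample coordinates are all assumptions made for illustration; the source does not say which distance measure the prototype uses, and the coordinates below are arbitrary values rather than real facility positions.</p><p><![CDATA[
```python
import math
from typing import Dict, List, Optional, Tuple

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def nearest(current: Tuple[float, float], candidates: List[Dict]) -> Optional[Dict]:
    """Return the candidate closest to the current (lat, long), or None if empty."""
    if not candidates:
        return None
    return min(
        candidates,
        key=lambda c: haversine_km(current[0], current[1], c["lat"], c["long"]),
    )


if __name__ == "__main__":
    # Arbitrary illustrative coordinates, not real facility positions.
    here = (35.0, 139.0)
    found = [
        {"name": "facility_a", "lat": 35.1, "long": 139.0},
        {"name": "facility_b", "lat": 35.5, "long": 139.5},
    ]
    print(nearest(here, found)["name"])
```
]]></p><p>Straight-line distance is only a first approximation for drivers; a road-network travel time would be a natural replacement for the key function.</p></div>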
<div xmlns="http://www.tei-c.org/ns/1.0"><p>While this prototype of action planning using open knowledge sources and SPARQL queries is widely applicable to various kinds of applications, it is not yet sufficient as an intelligent agent. Building a more intelligent agent remains future work.</p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Automated Planning: Theory and Practice</title>
		<author>
			<persName><forename type="first">M</forename><surname>Ghallab</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Nau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Traverso</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2004">2004</date>
			<publisher>Elsevier</publisher>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
