<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">A Fundamental Evaluation of Candidate Answers Generation for Question Answering Using Wikidata</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Ryoga</forename><surname>Nakagawa</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Osaka Electro-Communication University</orgName>
								<address>
									<addrLine>18-8 Hatsuchou, Neyagawa-shi</addrLine>
									<postCode>572-0833</postCode>
									<settlement>Osaka</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author role="corresp">
							<persName><forename type="first">Kouji</forename><surname>Kozaki</surname></persName>
							<email>kozaki@osakac.ac.jp</email>
							<affiliation key="aff0">
								<orgName type="institution">Osaka Electro-Communication University</orgName>
								<address>
									<addrLine>18-8 Hatsuchou, Neyagawa-shi</addrLine>
									<postCode>572-0833</postCode>
									<settlement>Osaka</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">A Fundamental Evaluation of Candidate Answers Generation for Question Answering Using Wikidata</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">E0CD96B8C3A9C7DB33451F030230B168</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T17:25+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Knowledge graph</term>
					<term>Wikidata</term>
					<term>Question answering</term>
					<term>Natural language question answering</term>
					<term>SPARQL</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Question answering (QA) is an import application of knowledge graphs (KGs). In order to evaluate knowledge structures within a knowledge graph, we consider an investigation into the details of correspondences between questions and knowledge graph. In this paper, we focus on Wikidata as a knowledge graph in the general domain and evaluate a basic question answering method using Wikidata, aiming to conduct a fundamental evaluation of its content as a knowledge graph.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Knowledge graphs are widely used in various fields and have been applied to tasks such as natural language processing, question answering, information retrieval, and problem-solving. In question answering (QA), a knowledge graph enables users to ask questions in natural language, and the system generates answers based on the knowledge graph. Specifically, the system interprets the meaning of a question, extracts relevant entities and relationships from the knowledge graph, and combines them to generate answers.</p><p>Several existing works focus on developing query answering systems using knowledge graphs. For instance, T. Ploumis et al. developed a query answering system using Wikidata based on the dependency analysis of question text <ref type="bibr" target="#b0">[1]</ref>. X. Hu, et.al proposed a method for question answering over knowledge graph based on SPARQL queries and keyword search <ref type="bibr" target="#b1">[2]</ref>. Such an approach plays an important role in improving the performance of QA systems. These studies utilize datasets from various domains for QA and conduct experiments to evaluate the proposed systems.</p><p>However, to evaluate knowledge structures within a knowledge graph, we are considering a more in-depth investigation into correspondences between questions and knowledge graph. In this paper, we uses Wikidata <ref type="bibr" target="#b2">[3]</ref> as an open knowledge graph in the general domain and evaluate a basic question answering method using Wikidata, aiming to conduct fundamental evaluation of its contents as a knowledge graph.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Test Dataset for Question Answering using Wikidata</head><p>In this study, we have a test dataset for question answering using Wikidata. While there are several datasets for question answering using Wikidata, they do not support Japanese language. Therefore, we utilized the Japanese quiz dataset <ref type="bibr" target="#b3">[4]</ref> which was developed using Japanese Wikipedia for a competition of QA system primarily focused on NLP. This dataset provides provides question text and its answer, which is described as one of Wikipedia articles. Consequently, we can also obtain Wikidata ID corresponding to the article because Wikidata contains resources related to all Wikipedia articles in principle. In order to determine how many questions can be correctly answered using Wikidata, we conducted manual SPARQL queries. It's important to note that some questions cannot be answered using any SPARQL query due to the limitations in the graph structures of Wikidata, which may lack the necessary knowledge to answer certain questions. From the quiz dataset, We choose 50 questions and formulated SPARQL queries. As a result, we were able to create SPARQL queries that provided correct answers for 33 out of the 50 questions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Evaluation of Candidate Answers Generation</head><p>To generate candidate answers, it is necessary to map the words in the question text to Wikidata IDs and then use the mapping results to generate SPARQL queries to find answers using Wikidata. The former corresponds to Entity Linking, and the latter to SPARQL query generation. In the following sections, we evaluate these two methods using the test dataset provided in Section 2.</p><p>As an example, we will use the question shown in (Figure <ref type="figure" target="#fig_0">1</ref> (a)) to illustrate these methods.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Evaluation of Entity Linking</head><p>Out of 50 question in the test dataset, we evaluated entity linking using the 33 questions for which we could obtain correct answers using SPARQL queries on Wikidata. For the entity linking method, we extracted words from question text and searched Wikidata for entities that matched the extracted words using an exact string match. In the example question mentioned above, four entities with Wikidata IDs were obtained from the search results (see Figure <ref type="figure" target="#fig_0">1 (b)</ref>). The evaluation was conducted manually using Wikidata search API to avoid any potential issues caused by morphological analysis.</p><p>Next, we compare the obtained IDs from a question text to the SPARQL query for the question in the test dataset. If all the IDs in the query are included in the obtained IDs, it means that the query could be generated using them. In the case of the example of Figure <ref type="figure" target="#fig_0">1</ref>(a), all the IDs shown in the query in Figure <ref type="figure" target="#fig_0">1</ref>(c) are included in the obtained IDs shown in in Figure <ref type="figure" target="#fig_0">1(b</ref>). However, it's worth noting that Wikidata IDs of properties are not obtained by the entity linking method, so they are replaced with variables such as "?p1", "?p2" in the query. In the evaluation results of entity linking, it was found that for 27 of the 33 questions, all the IDs required to generate the SPARQL queries were obtained. However, for 6 questions, the questions text included some IDs that were not obtained by the entity linking method due to differences in spelling between the question text and Wikidata.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Evaluation of SPARQL Query Generation</head><p>For the 27 questions which all the required IDs could be obtained through entity linking, we evaluate SPARQL query generation to obtain the candidate answers using the graph structure of Wikidata. In this study, we explore an exhaustive combination of the entities obtained through entity linking. When two IDs are obtained through entity linking, four SPARQL patterns are generated. For instance, when two IDs are obtained, such as ID1 and ID2, four patterns of SPARQL queries are generated as illustrated in Figure <ref type="figure" target="#fig_1">2</ref>. In this exampel, variables like "?p1", "?p2" represent properties, and "?ans" represents the answer to the question. If the SPARQL query accurately represents the question's condition, the correct answer can be returned as the value of "?ans" . However, if there are multiple entities that satisfy the search conditions, a single query pattern can yield multiple candidate answers.</p><p>To evaluate the query generation, we executed the generated queries on SPARQL endpoint of Wikidata and verified whether their query results contained the correct answer to the questions. As a results, out of the 27 questions, the method successfully generated SPARQL queries that produced the correct answer for 23 questions. However, in the case of the remaining four questions, timeouts occurred during SPARQL query processing, preventing us from obtaining the correct answers.. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Discussion</head><p>Figure <ref type="figure" target="#fig_2">3</ref> displays an overview of the evaluation results. For our testing, we used 50 questions from the Japanese quiz dataset. Through manual efforts, We obtained SPARQL queries capable of deriving correct answers from Wikidata for 33 of these questions (Figure <ref type="figure" target="#fig_2">3</ref>(1)). These 33 questions with SPARQL queries constitute the correct data for evaluating question answering using Wikidata. Among the remaining 17 questions, 7 required aliases (alternative labels) to obtain the correct answers using SPARQL queries to Wikidata. For the remaining 10 questions, we could obtain the correct answers using SPARQL due to the lack of knowledge defined in Wikidata. We evaluated fundamental methods of entity linking and SPARQL query generation using the 33 questions and their associated queries in the correct dataset.</p><p>Through the evaluation of entity linking, we obtained all the IDs required for the SPARQL queries for 27 of the 33 questions. We then applied these IDs to query generation and obtained the correct answers using one of the generated SPARQL queries for 23 of the 27 questions. In other words, we were able to obtain the correct answers from Wikidata for 82.2% of the questions when we could obtain enough IDs through entity linking. We believe that we could obtain correct answers for the remaining questions by improving the querying performance because the primary reason why we could not obtain the answers was the timeout problem during SPARQL query processing. When we compare the final results in (3) and the 33 questions in the correct dataset, we found that we could obtain correct answers from Wikidata for 69.7% of questions. To improve these results, we need to enhance the accuracy of entity linking.</p><p>In this way, this study conducts a fundamental evaluation of candidate answers generation for question answering using Wikidata. The evaluation results could serve as baseline data to development of question answering methods using Wikidata. As part of our future works, we plan to extend the number of questions for the test datasets and evaluation results. We will consider using not only manual efforts but also automatic methods to prepare them using entity linking and SPARQL query generation, as discussed in Section 3. Additionally, we are in the process of developing a QA system by extending these methods.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: An example of (a)question text, (b)entity linking and (c)SPARQL query</figDesc><graphic coords="2,151.80,575.47,291.68,74.13" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Patterns for SPARQL query generation using two IDs</figDesc><graphic coords="3,130.96,454.50,333.37,71.84" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: The overview of evaluation results</figDesc><graphic coords="4,110.13,84.19,375.03,105.11" type="bitmap" /></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">A factoid based question answering system based on dependency analysis and wikidata</title>
		<author>
			<persName><forename type="first">T</forename><surname>Ploumis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Perikos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Grivokostopoulou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Hatzilygeroudis</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="1" to="7" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Natural language question answering over knowledge graph: the marriage of sparql query and keyword search</title>
		<author>
			<persName><forename type="first">X</forename><surname>Hu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Duan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Dang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Knowledge and Information Systems</title>
		<imprint>
			<biblScope unit="volume">63</biblScope>
			<biblScope unit="page" from="819" to="844" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title/>
		<author>
			<persName><surname>Wikidata</surname></persName>
		</author>
		<ptr target="https://www.wikidata.org/" />
		<imprint>
			<date type="published" when="2022">2022. 2022/10/25</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<ptr target="https://jaqket.s3.ap-northeast-1.amazonaws.com/data/aio_01/dev1_questions.json" />
		<title level="m">Jaqket japanese on the subject of quizzes qa data-set</title>
				<imprint>
			<date type="published" when="2020">2020. 2022/10/25</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
