<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>How to Revert Question Answering on Knowledge Graphs</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Gaurav Maheshwari</string-name>
          <email>gaurav.maheshwari@uni-bonn.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mohnish Dubey</string-name>
          <email>dubey@cs.uni-bonn.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Priyansh Trivedi</string-name>
          <email>priyansh.trivedi@uni-bonn.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jens Lehmann</string-name>
          <email>jens.lehmann@cs.uni-bonn.de</email>
          <email>jens.lehmann@iais.fraunhofer.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Fraunhofer IAIS</institution>
          ,
          <addr-line>Bonn</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Bonn</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <author-notes>
        <fn>
          <p>These three authors contributed equally.</p>
        </fn>
      </author-notes>
      <abstract>
        <p>A large-scale question answering dataset has the potential to enable the development of more robust and accurate question answering systems. In this direction, we introduce a framework for creating such datasets that decreases the manual intervention and domain expertise traditionally needed. We describe in detail the architecture of the framework and the design decisions we made while creating it.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Knowledge bases (KB), such as DBpedia [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], Freebase [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], and Wikidata [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] contain
a large amount of interesting information. However, this information can often only
be queried by users familiar with query languages and the structure of the knowledge
base. To mitigate this problem, numerous question answering (QA) approaches for
knowledge graphs have been devised (see [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]). Many of them rely on machine learning
techniques, which require large amounts of labeled training data. We introduced
LC-QuAD [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] (Large-Scale Complex Question Answering Dataset), consisting of 5000
natural language questions (NLQ) along with the intended SPARQL queries required to
answer questions over DBpedia [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. In this article, we describe the framework used for
creating LC-QuAD and how it can be applied to other data sources.
      </p>
      <p>
        Traditionally, question answering datasets have been created by manually converting a
set of questions to their logical forms (for instance, [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]). Although a dataset created
in this manner can contain varied questions, it requires substantial manual work from
domain experts, and hence is not scalable. Thus, there is a need for an alternative workflow with
(i) less human intervention and (ii) reduced domain expertise.
      </p>
      <p>
        In this direction, we present a framework for generating questions and their
corresponding logical forms. We created our framework by reverse engineering the
architecture of Semantic Parsing based Question Answering systems (SQA) such as [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
These systems convert an NLQ into a formal query language expression, whereas we start
with a formal query language expression and convert it into an NLQ. This reverse task is
easier because formal query languages have well defined semantics, and the entities and
predicates occurring in the query are explicitly mentioned. Moreover, the target language
(NL) is much more resilient to minute errors.
      </p>
      <p>
        We now describe how we reverse engineered the architecture of AskNow [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] to create
a question generation framework. Figure 1 shows an illustration of our architecture and
that of AskNow. Throughout the description, we will use the question: "Name the capital
of Germany?" as our running example, to elaborate our dataset generation process in
contrast to the process of answering this question.
      </p>
      <p>
        AskNow [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] breaks down the process of answering questions into two parts:
conversion of questions into an expression of a semi-formal language, namely Normalized
Query Structure (NQS) and subsequently converting the NQS expression to that of a
formal language. NQS acts like a surface level syntactic template for queries, which
maps NLQs having different syntactic structures into a common structure. Given the
question, AskNow uses rules based on NL features to create a corresponding NQS
instance. Thereafter, it maps the entity (dbr:Germany) and the predicate (dbo:capital)
present in the question to resources in the target KB, i.e. DBpedia. After the mapping, the
NQS instance is converted into SPARQL using machine-generated and handmade rules.
Finally, the query is executed on the query endpoint of DBpedia to retrieve the required
answer: dbr:Berlin.
      </p>
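      <p>To make the pipeline concrete, the two stages can be sketched in a few lines of Python (a toy simplification of our own; AskNow's actual NQS rules are based on rich NL features and are far more general):</p>

```python
# Toy sketch of the two AskNow-style stages (a hypothetical simplification,
# not the system's actual code).

# Stage 1: map an NLQ onto a semi-formal, NQS-like structure.
def to_nqs(question):
    # A single naive rule: "Name the R of E?" yields {relation R, entity E}.
    words = question.rstrip("?").split()
    assert words[:2] == ["Name", "the"] and words[3] == "of"
    return {"relation": words[2], "entity": words[4]}

# Stage 2: ground the NQS slots to KB resources and emit SPARQL.
LEXICON = {"capital": "dbo:capital", "Germany": "dbr:Germany"}

def to_sparql(nqs):
    return "SELECT ?uri WHERE { %s %s ?uri . }" % (
        LEXICON[nqs["entity"]], LEXICON[nqs["relation"]])

print(to_sparql(to_nqs("Name the capital of Germany?")))
# → SELECT ?uri WHERE { dbr:Germany dbo:capital ?uri . }
```

      <p>Executing the emitted query against the DBpedia endpoint would then return the answer dbr:Berlin.</p>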
      <p>We begin our process where AskNow ends, i.e. by collecting the answers of the
questions we aim to generate. For our framework, a list of these answers acts as seed
entities. Let "dbr:Berlin" be one such entity. We then create SPARQL queries which
all have this entity as one of their answers. To do so, we generate a subgraph of
all the triples within two hops of these entities. The triple &lt;dbr:Germany dbo:capital
dbr:Berlin&gt; would be one such triple in the subgraph. Since the number of such triples
increases exponentially with the number of hops, we reduce the size of the subgraph by
stochastically removing predicates. We also remove every triple whose predicate does not occur
in our whitelist (https://figshare.com/articles/White_List_Relations/5008283). We then juxtapose
triples of this subgraph onto a list of SPARQL templates
(https://figshare.com/articles/Templates/5242027) to generate a set of valid SPARQL queries,
all of which have dbr:Berlin as one of their answers. For instance: SELECT ?uri WHERE {
dbr:Germany dbo:capital ?uri. } Note that while, in our example, the SPARQL query comprises
only one triple, we can generate queries having two or more triples in the same manner.</p>
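      <p>This generation step can be sketched over a toy in-memory triple set as follows (an illustrative sketch: the function names, the whitelist, and the single template below are our own assumptions, not the framework's actual code):</p>

```python
import random

# Toy KB as a list of (subject, predicate, object) triples.
KB = [
    ("dbr:Germany", "dbo:capital", "dbr:Berlin"),
    ("dbr:Berlin", "dbo:country", "dbr:Germany"),
    ("dbr:Germany", "dbo:wikiPageID", "11867"),  # a noisy predicate
]

WHITELIST = {"dbo:capital", "dbo:country"}  # predicates worth asking about

def subgraph_2hop(seed):
    """All triples within two hops of the seed entity."""
    one_hop = [t for t in KB if seed in (t[0], t[2])]
    frontier = {x for t in one_hop for x in (t[0], t[2])}
    return [t for t in KB if t[0] in frontier or t[2] in frontier]

def generate_queries(seed, keep_prob=1.0):
    kept = []
    for s, p, o in subgraph_2hop(seed):
        # Drop non-whitelisted predicates and, stochastically, some others.
        if p not in WHITELIST or random.random() > keep_prob:
            continue
        # One illustrative template: the seed entity is the query's answer.
        if o == seed:
            kept.append("SELECT ?uri WHERE { %s %s ?uri . }" % (s, p))
    return kept

print(generate_queries("dbr:Berlin"))
# → ['SELECT ?uri WHERE { dbr:Germany dbo:capital ?uri . }']
```

      <p>With keep_prob below 1.0, the stochastic pruning of the subgraph described above takes effect; the noisy dbo:wikiPageID triple is always discarded by the whitelist.</p>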
      <p>At this point, instead of converting the SPARQL queries to NLQs directly, we
employ a two-step process similar to AskNow's. This helps in further reducing the manual
intervention required for the conversion. We first transform each query into a
Normalized Natural language Question Template (NNQT) instance. Corresponding to every
SPARQL template, there exist NNQTs, which are questions with placeholders.
These placeholders can be replaced with the surface forms of the entities and predicates
present in the query, to form a coherent question. The NNQTs interpret the
semantics of their corresponding SPARQL template and verbalize it, thereby transforming our
aforementioned problem into that of converting a (grammatically) semi-correct NLQ into a
grammatically correct question. These NNQTs are KB-independent and can thus be
appropriated for any target KB. An NNQT instance for our running example would look
like: What is the ⟨capital⟩ of ⟨Germany⟩?</p>
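      <p>The slot-filling step from a SPARQL template to an NNQT instance can be sketched as follows (the label table and the angle-bracket convention are our own illustrative assumptions):</p>

```python
# Surface-form labels for KB resources (an illustrative lookup table;
# in practice these come from the KB's rdfs:label values).
LABELS = {"dbo:capital": "capital", "dbr:Germany": "Germany"}

# One NNQT paired with a SPARQL template; slots are kept in angle
# brackets so the human corrector can see which tokens came from the KB.
NNQT = "What is the ⟨predicate⟩ of ⟨entity⟩?"

def instantiate(nnqt, entity, predicate):
    """Fill the NNQT slots with the resources' surface forms."""
    out = nnqt.replace("⟨predicate⟩", "⟨%s⟩" % LABELS[predicate])
    return out.replace("⟨entity⟩", "⟨%s⟩" % LABELS[entity])

print(instantiate(NNQT, "dbr:Germany", "dbo:capital"))
# → What is the ⟨capital⟩ of ⟨Germany⟩?
```
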
      <p>As is evident, this NNQT instance can easily be converted into a grammatically correct
NLQ by native English speakers, without understanding the SPARQL syntax or having
any knowledge of the underlying KB schema. This allows our process to scale with
minimal effort. The person correcting these questions is also expected to paraphrase
them, in order to increase the diversity of our dataset. The resultant NLQ of our
NNQT instance would at this point be: "Name the capital of germany ?" Finally, an
independent reviewer revises every corrected question, and is expected to make minute
tweaks and edits in case of errors. The reviewer is not expected to paraphrase the queries,
thereby significantly reducing the time required for this step. The final output of our
process would be: "Name the capital of Germany?"</p>
      <p>
        Throughout the process, we use numerous techniques to increase the diversity and
complexity of the generated questions. Some of these techniques are: (i) replacing the
entity surface forms within the NNQT instances with their synonyms, using WordNet [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ],
(ii) similarly, using Wikidata [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] for predicate surface forms, (iii) encouraging the
question corrector to paraphrase the questions while keeping their semantics intact, and (iv)
declaring multiple NNQTs for every SPARQL template and stochastically selecting one
of them. Owing to the modularity of our framework, more such techniques can be added in
the future.
      </p>
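      <p>Technique (iv), for instance, amounts to a uniform random draw over the declared NNQTs; a minimal sketch (the paraphrase list below is hypothetical):</p>

```python
import random

# Several NNQT paraphrases for the same SPARQL template (hypothetical).
NNQT_VARIANTS = [
    "What is the ⟨predicate⟩ of ⟨entity⟩?",
    "Name the ⟨predicate⟩ of ⟨entity⟩.",
    "Tell me the ⟨predicate⟩ of ⟨entity⟩.",
]

def pick_nnqt(variants, rng=random):
    # A uniform stochastic choice adds surface-level diversity
    # to the generated questions.
    return rng.choice(variants)

rng = random.Random(0)  # seeded for reproducibility
print(pick_nnqt(NNQT_VARIANTS, rng) in NNQT_VARIANTS)
# → True
```
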
    </sec>
    <sec id="sec-2">
      <title>Conclusions</title>
      <p>In this article, we described a framework for generating QA datasets containing questions
and their equivalent logical forms. The general design of this framework follows the
reverse process of SQAs, and in effect reduces the effort involved in creating questions.</p>
      <sec id="sec-2-1">
        <p>
          We have successfully used our framework to create a dataset, LC-QuAD [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], containing 5000
questions and their corresponding SPARQL queries. Our framework is available as an open-source
repository (https://github.com/AskNowQA/LC-QuAD), under the GPL 3.0 License
(https://www.gnu.org/licenses/gpl.html).
        </p>
        <p>In the future, we aim to explore more techniques to increase the diversity and
complexity of the generated questions. We will also explore machine-translation-based
techniques to further reduce the need for manually correcting the questions.</p>
        <p>Acknowledgements This work was partly supported by grants from the European
Union’s Horizon 2020 research and innovation programme for the projects
Big Data Europe (GA no. 644564), HOBBIT (GA no. 688227) and WDAqua (GA
no. 642795).</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>K.</given-names>
            <surname>Bollacker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Evans</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Paritosh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Sturge</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Taylor</surname>
          </string-name>
          . Freebase:
          <article-title>a collaboratively created graph database for structuring human knowledge</article-title>
          .
          <source>In Proceedings of the 2008 ACM SIGMOD Conference on Management of Data</source>
          , pages
          <fpage>1247</fpage>
          -
          <lpage>1250</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Q.</given-names>
            <surname>Cai</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Yates</surname>
          </string-name>
          .
          <article-title>Large-scale semantic parsing via schema matching and lexicon extension</article-title>
          .
          <source>In ACL</source>
          , pages
          <fpage>423</fpage>
          -
          <lpage>433</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>M.</given-names>
            <surname>Dubey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Dasgupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sharma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Höffner</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          .
          <article-title>Asknow: A framework for natural language query formalization in sparql</article-title>
          .
          <source>In International Semantic Web Conference</source>
          , pages
          <fpage>300</fpage>
          -
          <lpage>316</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>K.</given-names>
            <surname>Höffner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Walter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Marx</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Usbeck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          , and
          <string-name>
            <given-names>A.-C.</given-names>
            <surname>Ngonga Ngomo</surname>
          </string-name>
          .
          <article-title>Survey on challenges of question answering in the semantic web</article-title>
          .
          <source>The Semantic Web</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>26</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Isele</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jakob</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Jentzsch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kontokostas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. N.</given-names>
            <surname>Mendes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hellmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Morsey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Van Kleef</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer</surname>
          </string-name>
          .
          <article-title>DBpedia-a large-scale, multilingual knowledge base extracted from wikipedia</article-title>
          .
          <source>The Semantic Web</source>
          , pages
          <fpage>167</fpage>
          -
          <lpage>195</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>G. A.</given-names>
            <surname>Miller</surname>
          </string-name>
          .
          <article-title>Wordnet: a lexical database for english</article-title>
          .
          <source>Communications of the ACM</source>
          ,
          <volume>38</volume>
          (
          <issue>11</issue>
          ):
          <fpage>39</fpage>
          -
          <lpage>41</lpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>P.</given-names>
            <surname>Trivedi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Maheshwari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dubey</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          .
          <article-title>Lc-quad: A corpus for complex question answering over knowledge graphs</article-title>
          .
          <source>In International Semantic Web Conference</source>
          . Springer,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>C.</given-names>
            <surname>Unger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bühmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.-C. N.</given-names>
            <surname>Ngomo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Cimiano</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Gerber</surname>
          </string-name>
          .
          <article-title>Template-based question answering over rdf data</article-title>
          .
          <source>In Proceedings of the 21st International World Wide Web Conference</source>
          , pages
          <fpage>639</fpage>
          -
          <lpage>648</lpage>
          . ACM,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>C.</given-names>
            <surname>Unger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.-C. N.</given-names>
            <surname>Ngomo</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E.</given-names>
            <surname>Cabrio</surname>
          </string-name>
          .
          <article-title>6th open challenge on question answering over linked data (qald-6)</article-title>
          .
          <source>In Semantic Web Evaluation Challenge</source>
          , pages
          <fpage>171</fpage>
          -
          <lpage>177</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>D.</given-names>
            <surname>Vrandečić</surname>
          </string-name>
          . Wikidata:
          <article-title>A new platform for collaborative data collection</article-title>
          .
          <source>In Proceedings of the 21st International World Wide Web Conference</source>
          , pages
          <fpage>1063</fpage>
          -
          <lpage>1064</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>