<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Hybrid Similarity Computing Method for KBQA</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Chunpei Wang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xiaowang Zhang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Zhiyong Feng</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>College of Intelligence and Computing, Tianjin University</institution>
          ,
          <addr-line>Tianjin 300350</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <abstract>
<p>With the rapid growth of knowledge bases (KBs), knowledge base question answering (KBQA) has drawn considerable attention in recent years. Most existing KBQA methods translate questions into SPARQL queries to help end users access knowledge bases represented in RDF more naturally. However, a natural language question typically corresponds to multiple candidate SPARQL queries because of the gap between the unstructured question and the structured SPARQL query. To pick the best SPARQL query from the candidate set, in this poster we propose a hybrid similarity computing method to rank the candidate SPARQL queries. First, we employ two attentive recurrent neural networks to capture the semantic similarity between the SPARQL query and the question. Second, we compute the string similarity between the SPARQL query and the question by leveraging a convolutional neural network. Our method thus captures two levels of similarity between the question and the SPARQL query. Experiments show that our method improves the effectiveness of KBQA.</p>
      </abstract>
      <kwd-group>
<kwd>KBQA</kwd>
        <kwd>Semantic Parsing</kwd>
        <kwd>RDF</kwd>
        <kwd>SPARQL</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
<title>Introduction</title>
<p>Generally, we obtain multiple SPARQL queries after the semantic parsing
stage because of the ambiguity between the natural language question and the
knowledge base. For instance, the entity mention St. Lawrence in the question "What
body of water does St. Lawrence flow into?" can be mapped to a set of semantic
instances in the KB, e.g., E = {⟨Saint Lawrence⟩, ⟨Saint Lawrence River⟩}.
Thus, the main challenge in the semantic parsing stage is how to pick the best
SPARQL query from the candidate query set.</p>
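      <p>To make the ambiguity concrete, the single question above already yields one candidate SPARQL query per entity candidate. The ex: prefix and the ex:flowsInto predicate below are illustrative placeholders rather than properties of a particular KB:</p>
      <preformat>
# Candidate SPARQL queries for
# "What body of water does St. Lawrence flow into?"
# (the ex: prefix and the ex:flowsInto predicate are illustrative placeholders)
candidate_queries = [
    "SELECT ?water WHERE { ex:Saint_Lawrence ex:flowsInto ?water . }",
    "SELECT ?water WHERE { ex:Saint_Lawrence_River ex:flowsInto ?water . }",
]
      </preformat>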
<p>Most existing KBQA work maps the question and the KB facts (triples) into
a common embedding space, so that the similarity between the question vector and
the SPARQL vectors can be computed conveniently. However, these methods
tend to lose the original word-interaction information. To preserve more of this
information, we propose a hybrid similarity computing method to pick the best
SPARQL query from the candidate set, which considers both the semantic similarity
and the string similarity between the question and the SPARQL query.</p>
    </sec>
    <sec id="sec-2">
      <title>Our Approach</title>
<p>In this poster, we present a hybrid ranking model to rank the candidate SPARQL queries,
which considers both string similarity and semantic similarity. Given the
natural language question N, for each query q_i in the candidate SPARQL set we
compute a similarity score S(N, q_i) that represents the similarity
between N and q_i. Finally, all candidate queries are ranked by their similarity
scores with N.</p>
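      <p>As a minimal sketch of this ranking step, assuming a scoring function that stands in for the full hybrid model described in the rest of this section:</p>
      <preformat>
# Rank candidate SPARQL queries by their similarity score with the question N.
# `hybrid_score` is an assumed stand-in for the model defined in this section.
def rank_candidates(question, candidate_queries, hybrid_score):
    scored = [(q, hybrid_score(question, q)) for q in candidate_queries]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
      </preformat>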
<p>Semantic-level similarity. We construct an attentive recurrent neural
network for computing the semantic-level similarity between the question N and the
candidate query q_i. The model uses an encoder-compare framework, which encodes
the semantic information of N and q_i into a high-dimensional embedding space
and then estimates their similarity via a multilayer perceptron (MLP).
- Encoding. First, each element in the question N and the query q_i is mapped to
its corresponding embedding vector in {w_1, ..., w_L}, where L is the length
of the question or query. All embeddings are then fed into
a bidirectional GRU network to learn the hidden representations
H_{1:L} = [h_1, ..., h_L], where h_i is the concatenation of the forward and backward
vectors learned at time step i. Since each word contributes differently to the full
sentence semantics, the model pays different attention to each word
and learns informative vectors to represent the question/query sentences. A
self-attention model is used here to learn the weight of each word
for the input sentence. The semantic representation Y of a sentence is
calculated as follows:</p>
      <p>
Y = \sum_{i=1}^{L} a_i h_i,   (1)

a = \mathrm{Attention}(Q, K, V) = \mathrm{softmax}\left( \frac{Q K^\top}{\sqrt{d_k}} \right) V,   (2)

where {Q, K, V} is shorthand for {query, key, value}, three matrices obtained by
mapping the same input. K and V are in a one-to-one key-value correspondence, and
Q can be the hidden state to be processed, such as h_i. First, the dot product
between Q and K is computed and divided by the scale factor \sqrt{d_k} to prevent
the result from becoming too large. The result is then normalized with the softmax
function and multiplied by V to obtain the attention weights. Finally, each hidden
representation h_i is multiplied by its attention weight, and the weighted vectors
are summed to obtain the semantic representation Y.
- Similarity estimation. Given the representations Y_N and Y_{q_i} of the question N and
the query q_i, their similarity is computed by an MLP layer:

z_1 = f\left( W^\top [Y_N; Y_{q_i}] + b \right),   (3)

where W holds the parameters to be learned, b is the bias, and f(\cdot) is
an activation function. The semantic representations extracted from the two
sentences are concatenated as the input of the MLP hidden layer, which nonlinearly
maps them to their similarity. A minimal code sketch of this encoder-compare model is given below.
      </p>
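      <p>The following PyTorch sketch illustrates the encoder-compare architecture above. The single learned query vector for the self-attention pooling, the layer sizes, and the activation are illustrative assumptions, not the authors' exact configuration.</p>
      <preformat>
import torch
import torch.nn as nn
import torch.nn.functional as F

class AttentiveEncoder(nn.Module):
    # BiGRU encoder with self-attention pooling (Eqs. (1)-(2), simplified).
    def __init__(self, vocab_size, emb_dim=100, hidden_dim=128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.bigru = nn.GRU(emb_dim, hidden_dim, bidirectional=True, batch_first=True)
        self.d_k = 2 * hidden_dim
        # Learned query vector that scores each hidden state (an assumption;
        # the text only says Q could be the hidden state to be processed).
        self.query = nn.Parameter(torch.randn(self.d_k))

    def forward(self, token_ids):                    # token_ids: (batch, L)
        h, _ = self.bigru(self.embed(token_ids))     # (batch, L, 2*hidden_dim)
        scores = h @ self.query / self.d_k ** 0.5    # scaled dot product
        a = F.softmax(scores, dim=-1)                # attention weights a_i
        return (a.unsqueeze(-1) * h).sum(dim=1)      # Y = sum_i a_i * h_i

class SemanticSimilarity(nn.Module):
    # Encode question and SPARQL query separately, then compare with an MLP (Eq. (3)).
    def __init__(self, vocab_size, hidden_dim=128):
        super().__init__()
        self.question_encoder = AttentiveEncoder(vocab_size, hidden_dim=hidden_dim)
        self.query_encoder = AttentiveEncoder(vocab_size, hidden_dim=hidden_dim)
        self.mlp = nn.Sequential(
            nn.Linear(4 * hidden_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, 1))

    def forward(self, question_ids, query_ids):
        y_n = self.question_encoder(question_ids)    # Y_N
        y_q = self.query_encoder(query_ids)          # Y_{q_i}
        # z1 = f(W^T [Y_N; Y_{q_i}] + b)
        return self.mlp(torch.cat([y_n, y_q], dim=-1)).squeeze(-1)
      </preformat>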
<p>String-level similarity. The string-level similarity is evaluated via a
text-matching model. Some words or phrases with the same meaning may be
expressed differently in the question and the query; e.g., the pair of words (musical, music)
have similar semantics. Since the high-level semantic embedding cannot preserve
this word-interaction information, we construct a similarity matrix whose
elements represent the similarities between question words and query words,
and regard it as a two-dimensional vector space so that a convolutional layer can
capture the matching features.</p>
      <p>
- Similarity matrix. First, we construct a similarity matrix M, where each
element M_{ij} indicates a basic word-level interaction. M_{ij} is calculated as

M_{ij} = u_i \otimes v_j,   (4)

where u_i and v_j denote the embeddings of the i-th word in the question and the
j-th word in the query, respectively, and \otimes stands for a general operator for
computing the similarity. This matrix can capture the similarity of words with
different surface forms.
- Convolution layer. Different levels of matching patterns are extracted by
convolutional kernels. The k-th kernel w^k, of size r_k \times r_k, scans over the
similarity matrix M to generate a feature map g^k:

g^k_{i,j} = \sum_{s=0}^{r_k - 1} \sum_{t=0}^{r_k - 1} w^k_{s,t} \cdot M_{i+s,\, j+t},   (5)

and the feature maps g^k are then reduced (Eqs. (6) and (7)) to the string-level
features z_2 and z_3 used in the combination below. A minimal code sketch of this
matching component is given below.
      </p>
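      <p>The following sketch illustrates the string-level matching component: a word-by-word similarity matrix followed by a 2-D convolution and pooling. Instantiating the general operator of Eq. (4) as cosine similarity, as well as the kernel size, pooling, and output dimensionality, are illustrative assumptions.</p>
      <preformat>
import torch
import torch.nn as nn
import torch.nn.functional as F

class StringLevelMatcher(nn.Module):
    # Similarity matrix (Eq. (4)) + convolutional feature maps (Eq. (5)),
    # reduced by max pooling and a linear layer to string-level features.
    def __init__(self, num_kernels=8, kernel_size=3, num_features=2):
        super().__init__()
        self.conv = nn.Conv2d(1, num_kernels, kernel_size)
        self.fc = nn.Linear(num_kernels, num_features)   # e.g. (z2, z3), an assumption

    def forward(self, u, v):
        # u: (L_question, d) word embeddings of the question
        # v: (L_query, d) word embeddings of the SPARQL query
        m = F.cosine_similarity(u.unsqueeze(1), v.unsqueeze(0), dim=-1)  # M_ij
        g = F.relu(self.conv(m.unsqueeze(0).unsqueeze(0)))               # feature maps g^k
        pooled = F.adaptive_max_pool2d(g, 1).flatten(1)                  # one value per kernel
        return self.fc(pooled).squeeze(0)                                # string-level features
      </preformat>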
<p>Combination. With the three features (z_1, z_2, z_3) generated from the two levels of
similarity, we utilize a linear layer to learn their respective contributions to the holistic
similarity score:</p>
<p>S(N, q_i) = \mathrm{Sigmoid}\left( W^\top [z_1; z_2; z_3] + b \right).   (8)

Finally, all candidate queries q_i are sorted by their similarity scores S(N, q_i).</p>
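      <p>A minimal PyTorch sketch of the combination layer in Eq. (8), assuming the three features are scalars per question-query pair:</p>
      <preformat>
import torch
import torch.nn as nn

class HybridCombiner(nn.Module):
    # Learned linear weighting of the semantic feature z1 and the string-level
    # features z2, z3, squashed by a sigmoid to produce S(N, q_i).
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(3, 1)

    def forward(self, z1, z2, z3):          # each of shape (batch,)
        z = torch.stack([z1, z2, z3], dim=-1)
        return torch.sigmoid(self.linear(z)).squeeze(-1)
      </preformat>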
    </sec>
    <sec id="sec-3">
      <title>Experiments and Evaluation</title>
<p>Table 1. Accuracy on SimpleQuestions for Our Method and the baselines
BiCNN (Yih et al.), AMPCNN (Yin et al.), HR-BiLSTM (Yu et al.), and
Multiple View Matching (Yu et al.).</p>
<p>SimpleQuestions is a single-relation KBQA dataset. It consists
of questions annotated with the corresponding fact from Freebase that provides
the answer. Following previous studies, we report accuracy, and we verify our proposed
approach on the SimpleQuestions dataset. Table 1 summarizes the experimental
results of the different methods on answer selection and knowledge base question
answering. Our method achieves state-of-the-art results on
SimpleQuestions, which confirms the effectiveness of our solution.</p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgments</title>
      <p>This work is supported by the National Key Research and Development Program
of China (2017YFC0908401) and the National Natural Science Foundation of
China (61972455). Xiaowang Zhang is supported by the Peiyang Young Scholars
in Tianjin University (2019XRX-0032).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>1. A. P. B. Veyseh, "Cross-lingual question answering using common semantic space," in Proceedings of TextGraphs@NAACL-HLT 2016, pp. 15-19, 2016.</mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>2. L. Dong, F. Wei, M. Zhou, and K. Xu, "Question answering over Freebase with multi-column convolutional neural networks," in ACL 2015, July 26-31, 2015, Beijing, China, Volume 1: Long Papers, pp. 260-269, 2015.</mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>3. A. Bordes, S. Chopra, and J. Weston, "Question answering with subgraph embeddings," in EMNLP 2014, October 25-29, 2014, Doha, Qatar, A meeting of SIGDAT, a Special Interest Group of the ACL, pp. 615-620, 2014.</mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>4. X. Yao and B. V. Durme, "Information extraction over structured data: Question answering with Freebase," in ACL 2014, June 22-27, 2014, Baltimore, MD, USA, Volume 1: Long Papers, pp. 956-966, 2014.</mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>5. X. Lu, S. Pramanik, R. S. Roy, A. Abujabal, Y. Wang, and G. Weikum, "Answering complex questions by joining multi-document evidence with quasi knowledge graphs," in Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2019, Paris, France, July 21-25, 2019, pp. 105-114, 2019.</mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>