<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Sememe-based Approach for Knowledge Base Question Answering</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Peiyun Wu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xiaowang Zhang</string-name>
          <email>xiaowangzhangg@tju.edu.cn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>College of Intelligence and Computing, Tianjin University</institution>
          ,
          <addr-line>Tianjin 300350</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this poster, we present a sememe-based approach to semantic parsing in question answering over knowledge base by leveraging a sememe-level semantics to improve the performance of semantic similarity between question and relations. Firstly, we propose a double-channel model to extract both sememe-level semantics and word-level semantics. Moreover, we present a context-based representation to encode the sememe of questions to re ne sememe incorporation for reducing noise. Finally, we introduce a hierarchical representation to encode the sememe representation of relations to remove the ambiguity of words maximally. Experiments evaluated on benchmarks show that our model outperforms o -the-shelf models</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Knowledge base question answering (KBQA) is the task of accurately and
concisely answering a natural language question over knowledge base (KB) by
understanding the intention of the question. As a critical branch of KBQA, semantic
parsing based approaches construct semantic parsing trees or equivalent query
structures (also called query graph) to represent the given question, and then
ranking them by calculating the semantic similarity with the question.</p>
      <p>
        Most current works focus on selecting the semantic relations that most similar
to a question to nd the optimal query graph. Unfortunately, those existing
approaches are limited in di erentiating two relations with the similar word-level
semantics due to the following issues: (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) Polysemous and low-frequency words
often undermine the overall performance of semantic similarity measurement. (
        <xref ref-type="bibr" rid="ref2">2</xref>
        )
More minimal semantics of words between question and relations are ignored.
Existing models heavily dependent on the embeddings closest to the question
representation instead of extracting minimal semantic similarity between them.
      </p>
      <p>To overtake the above limitations, we leverage sememe from external
lexicalsemantic resources. Sememes are minimum semantic units of word meanings [3],
a word may have multiple senses, and a sense consists of several sememes. In
this poster, we present a sememe-based approach to semantic parsing in KBQA
by leveraging a sememe-level semantics.
?? Copyright 2020 for this paper by its authors. Use permitted under Creative
Commons License Attribution 4.0 International (CC BY 4.0).
softmax</p>
      <p>Column-wise
Max-pooling
...</p>
      <p>Sememe</p>
      <p>Cosine similarity
word-channel</p>
      <p>question
representation</p>
      <p>tokens
sememe-channel
question representation
Tanh(STUQ)
...</p>
      <p>LSTM
...</p>
      <p>Question</p>
      <p>LSTM
score
+</p>
      <p>Sense layer
avg
sememe-channel
+ relation
representaavgtion
WHOORMLED HOME WORLD relation</p>
      <p>level</p>
      <p>Cq
Sememe
layer
word-channel</p>
      <p>relation
representation
+</p>
      <p>
        avg
Word level
Our model is shown in Fig. 1. In this paper, based on a heuristic algorithm
in [2], we generate candidate query graphs with considering ve kinds of
semantic constraints: entity, type, temporal (explicit and inexplicit time), order, and
compare.
In this part, we generate word-channel representation of question and
relations. Given a question fw1; w2; :::; wng, we feed it into a bi-directional long
short-term memory network (Bi-LSTM) to generate the hidden representation
Q = (h1; : : : ; hn) and obtain hq after pooling operation. Then we transform
hq with a fully connected layer and a ReLU function to get the word-channel
representation of the question:
qw = ReLU (Wq hq + b1)
(
        <xref ref-type="bibr" rid="ref1">1</xref>
        )
where Wq denotes the linear transformation matrix.
      </p>
      <p>To encode the word-channel relation representation, we take the
relationlevel (e.g. \contained by ") and word-level(e.g. \contained ",\by ") relation names
into consideration. Given relations fr1; r2; :::; rng in a query graph, for
relationlevel representations, we simply take each relation name as a whole unit, and
translate it into vector representation as fr1rl; r2rl; :::; rnrlg. For word-level
representations, we represent the word sequence of each relation using word averaging
as fr1wl; r2wl; :::; rnwlg. We get the nal vector of each word-channel relation
representation as ri = rirl + riwl. Finally, we apply pooling operation over all relations
and obtain the word-channel representation of relations, denoted by rw.
2.2</p>
      <p>Sememe-Channel Representation
Sememe-Channel Question Representation We denote Sm as the set of
all sememes occurring in the question. Then we map Sm into the vectors S =
(s1; : : : ; sn) and adopt a context attention mechanism to deemphasize irrelevant
sememes and focusing on more correlative to context ones. The interactive
context matrix is calculated as Sq = tanh(S&gt;U Q) . Then we obtain the vector sq by
the column-wise max-pooling operation over Sq and use the softmax function.
Finally, we get the sememe-channel question representation as following:
qs = Wsq(softmax(sq) S) + b2
where Wsq is the parameter matrix.</p>
      <p>Sememe-Channel Relation Representation In this part, we adopt a
hierarchical attention method to obtain the sememe-level representation of relations
sense as a set of sense
vecand maximally remove the ambiguity. We denote Rwij
tors of word wij as Rwij sememe as a set of
sense := fseij1; : : : ; seijkg. We denote Rsijk</p>
      <p>sememe := fsmijk1; : : : ; smijkmg.
sememe vectors of sense sijk as Rsijk</p>
      <p>To obtain the context information of the given question, we denote its word
embeddings average as qavg and construct a context representation Cq as
following:</p>
      <p>Cq =
n
X softmax(tanh(wi&gt; qavg)) wi
i=1
sense is represented as below:
Through this, the vector of sense seijk in Rwij</p>
      <p>m
seijk = X softmax(Wsm tanh(smi&gt;jkc Cq) + b3) smijkc</p>
      <p>c=1
where Wsm is a weight matrix. And the representation of the j-th word in
the i-th relation is a weighted sum of its senses fseij1; : : : ; seijkg:
rij =
k
X softmax(Wse tanh(sei&gt;jy Cq) + b4) seijy
y=1</p>
      <p>Finally, we can apply average operation over all words in all relations and
obtain the sememe-channel representation of relations, denoted by rs. In this
way, we can compute the semantic similarity score of two channel as following:
Score = cos (qw; rw) + cos (qs; rs) :
(6)
3</p>
    </sec>
    <sec id="sec-2">
      <title>Experiments and Evaluations</title>
      <p>
        Due to Freebase no longer up-to-date, including the unavailability of APIs and
new dumps, we use the full Wikidata dump as our KB. We conduct our
experiments, namely, WebQuestionSP (WebQSP),QALD-7 (Task 4, English). We use
sememe annotations in HowNet for sememe-channel representation.
(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )
(
        <xref ref-type="bibr" rid="ref3">3</xref>
        )
(
        <xref ref-type="bibr" rid="ref4">4</xref>
        )
(
        <xref ref-type="bibr" rid="ref5">5</xref>
        )
By Table 1, we show that our model is superior to all datasets and
metrics. Our model achieves 51:9%, 39:7%, 7:3%, 6:0% higher F1-score compared
to STAGG, Yu et al. (2017), Sorokin et al. (2018), Maheshwari et al. (2019) on
WebQSP. Analogously, we achieve 40:6%, 33:8%, 22:9%, 7:5% higher F1-score on
QALD-7. We can conclude that our double-channel representation method
performs better than all baselines. We observe that ignoring either word or sememe,
perform worse than the double-channel settings. The comparison demonstrates
that two-channel representation preserve the complementary.
4
      </p>
    </sec>
    <sec id="sec-3">
      <title>Conclusion</title>
      <p>In this poster, we present a sememe-based approach to di erentiate relations
with similar semantics in KBQA, where sememe can be leveraged as the minimal
semantics of words as an extra natural knowledge to enrich semantics for parsing.
In future work, we are interested in maximizing the sememe-level semantics in
overtaking the weakness of the word-level semantics in KBQA.</p>
    </sec>
    <sec id="sec-4">
      <title>5 Acknowledgments</title>
      <p>This work is supported by the National Key Research and Development Program
of China (2017YFC0908401) and the National Natural Science Foundation of
China (61972455). Xiaowang Zhang is supported by the Peiyang Young Scholars
in Tianjin University (2019XRX-0032).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Sorokin</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gurevych</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Modeling semantics with gated graph neural networks for knowledge base question answering</article-title>
          .
          <source>In: COLING'</source>
          <year>2018</year>
          , pp.
          <volume>3306</volume>
          {
          <fpage>3317</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Yih</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chang</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gao</surname>
          </string-name>
          , J.:
          <article-title>Semantic parsing via staged query graph generation: question answering with knowledge base</article-title>
          .
          <source>In: ACL'</source>
          <year>2015</year>
          , pp.
          <volume>1321</volume>
          {
          <fpage>1331</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bloom</surname>
            <given-names>eld</given-names>
          </string-name>
          , L.:
          <article-title>A set of postulates for the science of language</article-title>
          .
          <source>Language</source>
          ,
          <year>1926</year>
          ,
          <volume>2</volume>
          (
          <issue>3</issue>
          ):
          <fpage>153</fpage>
          -
          <lpage>164</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Maheshwari</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trivedi</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lukovnikov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chakraborty</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fischer</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2019</year>
          ).
          <article-title>Learning to rank query graphs for complex question answering over knowledge graphs</article-title>
          .
          <source>In: ISWC'19</source>
          , pp.
          <fpage>487</fpage>
          -
          <lpage>504</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yin</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hasan</surname>
            ,
            <given-names>K.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>C.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xiang</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Improved neural relation detection for knowledge base question answering</article-title>
          .
          <source>In: ACL'</source>
          <year>2017</year>
          , pp.
          <volume>571</volume>
          {
          <fpage>581</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>