<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Xser@QALD-4: Answering Natural Language Questions via Phrasal Semantic Parsing</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Kun Xu</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yansong Feng</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dongyan Zhao</string-name>
          <email>zhaodongyang@pku.edu.cn</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          ,
          <addr-line>Beijing, 100871</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institute of Computer Science Technology, Peking University.</institution>
          <addr-line>Zhongguancun North Road 128</addr-line>
        </aff>
      </contrib-group>
      <fpage>1260</fpage>
      <lpage>1274</lpage>
      <abstract>
        <p>We present a question answering system (Xser) over Linked Data(DBpedia), converting users' natural language questions into structured queries. There are two challenges involved: recognizing users' query intention and mapping the involved semantic items against a given knowledge base (KB), which will be in turn assembled into a structured query. In this paper, we propose an e cient pipeline framework to model a user's query intention as a phrase level dependency DAG which is then instantiated according to a given KB to construct the nal structured query. We evaluate our approach on the QALD-4 test dataset and achieve an F-measure score of 0.72, an average precision of 0.72 and an average recall of 0.71 over 50 questions.</p>
      </abstract>
      <kwd-group>
        <kwd>Phrasal Semantic Parsing</kwd>
        <kwd>Question Answering</kwd>
        <kwd>Knowledge Base</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>As very large structured knowledge bases have become available, e.g.,YAGO [10],
DBpedia [16] and Freebase[15], answering natural language questions over
structured knowledge facts has attracted increasing research e orts, in both natural
language processing and information retrieval communities. Di erent from
keyword based retrieval, the structure of query intentions embedde in a user's
question can be represented by a set of predicate-argument structures, e.g., &lt;subject,
predicate, object &gt; triples, and e ectively retrieved by a structured query engine
from a given knowledge base. Generally, the main challenge of understanding
the query intention in a structural form is to solve two tasks: recognizing the
predicate-argument structures and then instantiating these structures regarding
a given KB.</p>
      <p>Intuitively, the two subtasks would be solved in a joint framework, e.g., [22]
proposed a PCFG-based semantic parser to simultaneously learn the
combination rules among words or phrases and the mappings to speci c KB components.
However, given the size of existing KBs (usually thousands of predicates,
millions of entities and billions of knowledge facts), it makes di cult to jointly train
such a PCFG-based parser (the model of [22] takes several days to train with
[which] [country] did [france] [colonise]
variable category entity relation</p>
      <p>PO</p>
      <p>SC SP
[which]v [country]c did [france]E [colonise]R
phrase detecting</p>
      <p>parsing
instanting</p>
      <p>PO</p>
      <p>SC SP
[which]v [country]c did [france]E [colonise]R
?x fb:location.country fb:en.france fb:colonise
select ?x where
{
}
?x fb:object.type fb:location.country</p>
      <p>fb:en.france fb:colonise ?x
3,000 sentences), and even more di cult to adapt to other KBs, let alone
retrieving multiple KBs within one query, e.g., some queries in the QALD task[17]
are mixed with predicates from both DBpedia and Yago. In contrast, we nd
that recognizing the query intention structure is usually KB-independent. Take
Figure 1 as an example, without grounding to a knowledge base, we can still
guess that a location called france has some relationship, indicated by the verb
\colonise", with some countries, (the queried objects), which can be learned
directly without reliance on a speci ed KB. On the other hand, the task of
mapping semantic phrases from the intention structures to items in a given KB and
producing the nal structured queries is KB-dependent, since one has to solve
these mappings according to the schema of a speci ed KB.</p>
      <p>Given the observations above, we thus assume that the structure of a
question's query intention can be learned independent from a speci c knowledge base,
while grounding and converting a query intention into a structured query is
dependent on a knowledge base. Our assumption will naturally lead to a pipeline
paradigm to translating a natural language question into a structured query,
which can then be directly retrieved by a structured database query engine, e.g.,
Virtuoso 1.</p>
      <p>In this paper, we deal with the task of understanding natural language
questions in a pipeline paradigm, involving mainly two steps: recognizing the query
intention structure inherent in the natural language questions, and then
instantiating the query intention structures by mapping the involved semantic items
into existing KBs. In the rst phase, we build a phrase detector to detect possible
1 http://www.virtuoso.com
semantic phrases, e.g., variables, entity phrases, category phrases and relation
phrases. We then develop a semantic parser to predict the predicate-argument
structures among phrases to represent the structure of query intentions. In the
second phase, given the intention structures, we are then able to adopt a
structured perceptron model to jointly solve the mappings between semantic phrases
and KB items. By taking a two-phase format, our proposed model can bene t
from the separation of KB related components and KB independent steps, and
recognize the intention structures more e ciently while making the KB-related
component exible, e.g., we can only retrain the second phase when adapting
to new KBs, which is similar in sprite with [23], who rely on a CCG parser
to produce an ontological-independent logical representation to express users'
intention. We evaluate our model on the QALD-4, and achieve an F-measure
score of 0.72, an average precision of 0.72 and an average recall of 0.71 over 50
questions.
2</p>
    </sec>
    <sec id="sec-2">
      <title>The Task</title>
      <p>We de ne the task of using a KB to answer natural language questions as follows:
given a natural language question qNL and a knowledge base KB, our goal is to
translate qNL into a structured query in certain structured query language, e.g.,
SPARQL, which consists of multiple triples: a conjunction of &lt;subject, predicate,
object&gt; search conditions.
3</p>
      <p>Recognizing the Structure of Query Intention
Our framework rst employ a pipeline of phrase detection and phrasal semantic
parsing to recognize the inherent structure of user's query intention, and then
instantiates the query intention regarding speci c KBs.
3.1</p>
      <sec id="sec-2-1">
        <title>Phrase Detection</title>
        <p>Before recognizing the query intentions of questions, we rst detect phrases of
interest from the questions that potentially correspond to semantic items. where a
detected phrase is also assigned with a semantic label l 2 fentity; relation; category; variableg.
For entity phrases, we need to recognize phrases that may correspond to entities
of the KB. Similarity, relation phrases may correspond to KB's predicates, and
category phrases may be mapped in KB's classes. This problem can be casted
as a sequence labeling problem, where our goal is to build a tagger whose input
is a sentence, for example:</p>
        <p>V
W ho has T om Cruise been married to</p>
        <p>B none E B E I none R B R</p>
        <p>I
(here, we use B-I scheme for each phrase label: R-B represents the beginning of
a relation phrase, R-I represents the continuation of a relation phrase). We use
structured perceptron[3] to build our phrase tagger. Structured perceptron is an
extension to the standard linear perceptron for structured prediction. Given a
question instance x 2 X, which in our case is a sentence, the structured
perceptron involves the following decoding problem which nds the best con guration
z 2 Y , which in our case is a label sequence, according to the current model w:
z = arg max w f (x; y0)</p>
        <p>y02Y (x)
where f (x; y0) represents the feature vector for instance x along with con
guration y0. We observe that many phrases of a speci c semantic label may share
similar POS tag sequence pattern and NER sequence pattern. For example, the
POS tag sequences of relation phrases \the area of " and \the population in" are
in the same pattern \DT NN IN ". Therefore, we use three types of features:
lexical features, POS tag features and NER features. Table 1 summarizes the
feature templates we used in the phrase detection.
p = pos tag; n = ner tag; w = word; t = phrase type tag; i = current index
1 unigram of POS tag pi
2 bigram of POS tag pipi+1; pi 1pi
3 trigram of POS tag pipi+1pi+2; pi 1pipi+1; pi 2pi 1pi
4 unigram of NER tag ni
5 bigram of NER tag nini+1; ni 1ni
6 trigram of NER tag nini+1ni+2; ni 1nini+1; ni 2ni 1ni</p>
        <sec id="sec-2-1-1">
          <title>7 unigram of word wi</title>
        </sec>
        <sec id="sec-2-1-2">
          <title>8 bigram of word wiwi+1; wi 1wi 9 trigram of word wiwi+1wi+2; wi 1wiwi+1; wi 2wi 1wi 10 previous phrase type ti 1 11 conjunction of previous phrase type and current word ti 1wi</title>
          <p>As shown in Figure 1, query intention can be represented by dependencies
between \country ", \france" and \colonise", forming a phrase DAG, we thus
introduce a transition-based DAG parsing algorithm to perform a structural
prediction process and reveal the inherent structures.</p>
          <p>Phrase DAG In dependency grammar, structures are determined by the
relationship between a head and its dependents, e.g., syntactic or morphological
dependencies. Here, we propose to use predicate-argument dependencies to
capture the query intention, that is, the arguments of a predicate are dependents of
that predicate. Each predicate is either a unary predicate (which characterizes</p>
          <p>SP
PO</p>
          <p>SP
PO
In [what]v [year]C did [harry potter and the goblet of fire]E</p>
          <p>[win]R the [hugo award for best novel]E
fb:time.month
fb:time.day_of_year
fb:time.event
fb:time.year
...</p>
          <p>fb:m.02hm249
fb:m.05q1cnk
fb:m.0dplz5f
fb:m.08vs8qm
fb:m.031786
...</p>
          <p>fb:sports.sports_team.championships
fb:award.award_winning_work.award
s_won..award.award_honor.award
...</p>
          <p>
            Phrase DAG
fb:m.04p4lvx
fb:m.04p3_v_
fb:m.0l_v4kd
fb:m.0gbwdzs
fb:m.0gbwgrp
fb:m.01yz0x
...
candidates for all phrases and the underlined are the gold-standard mappings.)
(
            <xref ref-type="bibr" rid="ref1">1</xref>
            ) Action:SHIFT
          </p>
          <p>
            Current arcs:
(
            <xref ref-type="bibr" rid="ref2">2</xref>
            ) Action:ARCLEFT
          </p>
          <p>
            Current arcs:
(
            <xref ref-type="bibr" rid="ref3">3</xref>
            ) Action:REDUCE
Stack
A
Stack
A
Stack
Stack
B
          </p>
          <p>
            Current arcs:
(
            <xref ref-type="bibr" rid="ref4">4</xref>
            ) Action:SHIFT
          </p>
          <p>Current arcs:</p>
          <p>B
A
B
A
B
A
C
A
Input phrases
Input phrases
Input phrases
Input phrases</p>
          <p>C
B
C
B
C
B
D
B
Desired output
A
Initial state
Stack</p>
          <p>B
Current arcs:</p>
          <p>D
C
D
C
D
C
C</p>
          <p>Input phrases
A
A
C
D
D
D
D</p>
          <p>B
B</p>
          <p>D</p>
          <p>C</p>
          <p>
            C
(
            <xref ref-type="bibr" rid="ref5">5</xref>
            ) Action:ARCLEFT
          </p>
          <p>Stack</p>
          <p>
            B
(
            <xref ref-type="bibr" rid="ref6">6</xref>
            ) Action:SHIFT
          </p>
          <p>Stack
C
B</p>
          <p>
            Current arcs:
(
            <xref ref-type="bibr" rid="ref7">7</xref>
            ) Action:ARCRIGHT
          </p>
          <p>
            Current arcs:
(
            <xref ref-type="bibr" rid="ref8">8</xref>
            ) Action:SHIFT
          </p>
          <p>Stack
C
B
Stack
D
C
B
Current arcs:</p>
          <p>D
D
D
A
D
A
A
C
Input phrases</p>
          <p>D
Input phrases
Input phrases
Input phrases</p>
          <p>B
B
B</p>
          <p>C
C</p>
          <p>C
Current arcs:</p>
          <p>A</p>
          <p>B</p>
          <p>C</p>
          <p>D
D
D
D
its only argument) or a binary predicate (which represents the semantic
relation between its two arguments). Here, category phrases correspond to unary
predicates, and relation phrases correspond to binary predicates.</p>
          <p>We formulate the problem of recognizing the dependency structures in a
phrase level: the detected phrases in a question form the nodes of the phrase
dependency DAG, with four types, variable, entity, relation, and category nodes,
where the former two are referred as content nodes in the following. We draw a
directed edge from the head to its dependent, indicating the predicate-argument
or argument-argument relationship.
The Phrase DAG Parsing Note that, in our setup, one phrase can have more
than one head, as in Figure 2, variable node what has two heads in the resulting
dependency DAG. We thus use the framework proposed by [2], i.e., extending
traditional arc-eager shift-reduce parsing with multiple heads to nd a DAG
directly. Speci cally, given a question with sequence of phrases, our parser uses
a stack of partial DAGs, a queue of incoming phrases, and a series of actions to
build a dependency DAG. We assume that each input phrase has been assigned
a POS-tag (conjunction of each token's POS-tag which composes of the input
phrase) and a semantic label.</p>
          <p>Our semantic parser uses four actions: SHIFT, REDUCE, ARCRIGHT and
ARCLEFT (Table 2).</p>
          <p>
            The SHIFT action follow the standard de nitions that just pushes the next
incoming phrase onto the stack, as shown in Figure 3 action(
            <xref ref-type="bibr" rid="ref1">1</xref>
            ).
          </p>
          <p>
            The REDUCE action pops the stack top, as shown in gure 3 action(
            <xref ref-type="bibr" rid="ref3">3</xref>
            ).
Note that, the standard REDUCE action which is taken on the condition that the
stack top has at least one head. This precondition ensures the dependency graph
is a connected graph. However, our phrase dependency parser only concerns
the predicate-argument structures, and we add a dependency only between the
predicate and argument of our interest. In our case, the dependency graph can
be an unconnected directed graph.
          </p>
          <p>
            The ARCRIGHT action adds a dependency edge from the stack top to the
rst phrase of the incoming queue, where the phrase on the stack is the head
and the phrase in the queue is the dependent (the stack and queue are left
untouched), as long as a left arc does not already exist between these two phrases,
as shown in Figure 3 action(
            <xref ref-type="bibr" rid="ref7">7</xref>
            ).
          </p>
          <p>
            The ARCLEFT action adds a dependency edge from the rst phrase on the
queue to the stack top, where the phrase in the queue is the head and the phrase
on the stack is the dependent (again, the stack and queue are left untouched),
as long as a right arc does not already exist between the two phrases, as shown
in Figure 3 action(
            <xref ref-type="bibr" rid="ref2">2</xref>
            ).
          </p>
          <p>Figure 3 shows an example of shift-reduce parsing process.</p>
          <p>The decoding algorithm for phrase DAG parsing We apply the standard
beam-search along with early-update to perform inexact decoding [9] during
training. To formulate the decoding algorithm, we de ne a candidate item as a
tuple hS; Q; F i, where S represents the stack with partial derivations that have
been built, Q represents the queue of incoming phrases that have not been
processed, and F is a boolean value that represents whether the candidate item has
been nished. A candidate item is nished if and only if the queue is empty, and
no more actions can be applied to a candidate item after it reaches the nished
status. Given an input sentence x, we de ne the start item as the un nished item
with an empty stack and the whole input sentence as the incoming phrases(line
2). A derivation is built from the start item by repeated applications of actions
(SHIFT, REDUCE, ARCLEFT and ARCRIGHT) until the item is nished.</p>
          <p>To apply beam-search, an agenda is used to hold the K-best partial (un
nished) candidate items at each parsing step. A separate candidate output is used
to record the current best nished item that has been found, since candidate
items can be nished at di erent steps. Initially the agenda contains only the
start item, and the candidate output is set to none(line 3). At each step during
parsing, each candidate item from the agenda is extended in all possible ways
by applying one action according to the current status(line 7), and a number of
new candidate items are generated(line 8). If a newly generated candidate is
nished, it is compared with the current candidate output. If the candidate output
is none or the score of the newly generated candidate is higher than the score of
the candidate output, the candidate output is replaced with the newly generated
item(line 11); otherwise the newly generated item is discarded (line 14). If the
newly generated candidate is un nished, it is appended to a list of newly
generated partial candidates. After all candidate items from the agenda have been
processed, the agenda is cleared(line 18) and the K-best items from the list are
put on the agenda(line 19). Then the list is cleared and the parser moves on
to the next step. This process repeats until the agenda is empty (which means
that no new items have been generated in the previous step), and the candidate
output is the nal derivation. Pseudocode for the decoding algorithm is shown
in Algorithm 1.
Features Features play an important role in transition-based parsing. Our
parser takes three types of features: lexical, semantic and structure-related
features. We summarize our feature templates in Table 3, where ST represents
the top node in the stack, N0, N1, N2 represent the three incoming phrases
from the incoming queue, subscript t indicates POS tags, subscript p
indicates lexical surface forms and subscript s represent the semantic label of the
phrase(entity,relation, category and variable).</p>
          <p>Lexical features include features used in traditional word level dependency
parsing with some modi cations: all co-occurrences are built on phrase nodes
and the POS tag of a phrase is de ned as the concatenation of each token's POS
tag in the phrase.</p>
          <p>Note that ARCLEFT and ARCRIGHT actions are considered only when
the top phrase of the stack and the next phrase are variable, entity or
relation phrases.To guide the ARCLEFT and ARCRIGHT actions, we introduce
semantic features indicating the semantic label of a phrase.</p>
          <p>Recall that, our phrase semantic DAG parser allows one phrase to have
multiple heads. Therefore, we modify the ARCLEFT and ARCRIGHT actions so
that they can create new dependency arcs without removing the dependent from
further consideration for being a dependent of other heads. We thus introduce
new structure-related features to indicate whether an arc already exists between
the top phrase on the stack and the next phrase on the queue.
4</p>
          <p>Instantiating Query Intention Regarding Existing KBs
After recognizing the structure of the user's query intention, our parser can
output a KB-independent phrase dependency DAG, as shown in Figure 2. To covert
the the query intention into structured queries, semantic items are grounded to
a speci c KB, e.g., DBpedia. In experiment, we utilize di erent methods to
maintain the candidate sets for di erent types of phrases.</p>
          <p>For the entity phrase, we rst employ the wikipedia miner tool2 to map the
phrase to wikipedia entities, and then generate the candidate set of DBpedia
entities which share the same wikipedia ids with wikipedia entities.</p>
          <p>For the relation and category phrase, we use the PATTY resource to construct
a lexicon that maps phrases to predicates and categories of DBpedia.</p>
          <p>Note that, a nice paradigm will be jointly resolving the mappings of all
semantic items involved: labeling phrase nodes and the dependency relations with
corresponding items in the given KB. Therefore, inspired by [21], we formulate
this problem as a structured learning task, where we choose a candidate for each
phrase and dependency relation, and nd a globally maximum con guration
among multiple local candidates.</p>
          <p>For the dependency relation candidates especially for that between two
content phrases, we develop a function getDepRel to retrieve the possible
dependency relation candidates which are coherent with the domain of the phrases in
the question from the set of all KB predicates.
2 http://wikipedia-miner.cms.waikato.ac.nz/
Algorithm 2 The decoding algorithm to instantiate the query intention
structures
Require: Instance x =&lt; (x1; x2; : : : ; xm); ; o &gt; and the oracle output y if for
training.</p>
        </sec>
        <sec id="sec-2-1-3">
          <title>K:Beam Size.</title>
          <p>[ f?g: phrase candidates.</p>
          <p>[ f?; SP; P O; SCg: dependency relation candidates.</p>
          <p>Ensure: 1-best prediction z for x
1: Set beam ["] //empty con guration
2: for all i 1::m do
3: buf fz0 ljz0 2 B; l 2 [ f?gg //phrase labeling
4: B K-best(buf )
5: if y1:(i 1) m+1 2= B then
6: return B[0] //for early update
7: end if
8: for all (xk; ) 2 do
9: /* search for dependent relation*/
10: buf ;
11: for all z0 2 B do
12: buf buf [ fz0 ?g
13: if dep(i; k; ) 2 o then
14: //this dependency from phrase i to phrase k
15: //exists in the DAG structures
16: /*dependency relation labeling*/
17: buf buf [ fz0 rjr 2 getDepRel( ; z0) [ SP [ P O [ SC[ ?g
18: end if
19: end for
20: B K-best(buf )
21: if y1:i k 2= B then
22: return B[0] // for early update
23: end if
24: end for
25: end for
4.1</p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>Instantiating Query Intention with Structured Perceptron</title>
        <p>
          As shown in Figure 2, each phrase and dependency relation have multiple
candidates and the best con guration is consisted of the blue and underlined
candidates. However, it is intractable to perform exact search in our framework
because: (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) by simultaneously mapping phrase nodes and dependency relations,
the search space becomes much more complex, (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) we propose to make use of
arbitrary global features, which makes it infeasible to perform exact inference
e ciently.
        </p>
        <p>To address this problem, we apply beam-search along with early-update
strategy to perform inexact decoding.
4.2</p>
      </sec>
      <sec id="sec-2-3">
        <title>Label sets</title>
        <p>Here we introduce the label sets for phrase node and dependency relation in the
model. We use [ f?g to denote the phrase candidates, and ? indicates that
the phrase node can not be mapped to an entity or a predicate of the KB.
Similarly, [ fSP; P O; SC; ?g denotes the dependency relation candidates, where
is the set of all KB predicates, SP and PO are the two labels representing
the dependency relation between relation phrase and entity phrase, where SP
indicates that the entity phrase is the subject of the relation phrase, and PO
indicates the other way. SC is the label representing the dependency relation
between content phrase and category phrase, where the category phrase
characterized the content phrase. and ? means that this dependency relation can not
be instantiated.
4.3</p>
      </sec>
      <sec id="sec-2-4">
        <title>The decoding algorithm for instantiating query intentions</title>
        <p>Let x =&lt; (x1; x2; : : : ; xm); ; o &gt; denote the sentence instance, where xi
represents the i-th phrase in the sentence, = ff(xk; ci)gis=1gkm=1 is the set of KB
item candidates for phrases where s denotes the size of candidates for xk, and
o = ffdepl(i; j; ci)gid=1gl=1 is the set of KB predicate candidates for dependency
n
relations where i and j denotes the head and dependent index, d denotes the
size of candidates for the dependency relation. We use</p>
        <p>y = (t1; a1;1; : : : ; a1;m; : : : ; tm; am;1; : : : ; am;m)
to denote the corresponding gold standard structure, where ti represents the KB
item assignment for the phrase xi, and ai;k represents KB item assignment for
the dependency relation between the head xi and dependent xk. Algorithm 2
describes the beam-search procedure with early-update. During each step with
phrase i, there are two sub-steps:
Phrase labeling We enumerate all possible KB item candidates for the current
phrase(line 3). K-best partial con gurations are selected to the beam(line 4),
assuming the beam size is K.</p>
        <p>Dependency relation labeling After the phrase labeling step, we traverse
all con gurations in the beam. Once a phrase candidate for xi is found in
the beam, the decoder searches through the dependency relation candidates
o to label each dependency relation where the current phrase as the head
and xi as the dependent(line11-19). After labeling each dependency relation,
we again score each partial assignment and select the K-best results to the
beam(line 20).</p>
        <p>Note that, the score of a con guration is indeed the sum of two scores from
each sub-step. Therefore the resulting best con guration will jointly solve the
phrase and dependency relation labeling. We use two types of features: local
features and global features. Table 4 summarize these features.</p>
        <sec id="sec-2-4-1">
          <title>Category</title>
        </sec>
        <sec id="sec-2-4-2">
          <title>Phrase</title>
        </sec>
        <sec id="sec-2-4-3">
          <title>Dependency</title>
        </sec>
        <sec id="sec-2-4-4">
          <title>Relation</title>
        </sec>
        <sec id="sec-2-4-5">
          <title>Phrase</title>
        </sec>
        <sec id="sec-2-4-6">
          <title>Dependency</title>
        </sec>
        <sec id="sec-2-4-7">
          <title>Relation</title>
          <p>After instantiating the query intention, our system outputs a phrasal semantic
DAG regarding KB. We convert the DAG into structured queries, e.g., SPARQL,
mainly by the rules shown in Figure 4, where e1 and e2 represent entity phrases,
r denotes a relation phrase and c denotes a category phrase. The left is the
graph pattern which may occur in the semantic DAG, and the right is the query
triple converted from the pattern. Note that, multiple graph patterns may occur
in a semantic DAG, as shown in Figure 3. The nal structured queries are the
conjunction of query triples converted from DAG.
5.1</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Experimental</title>
      <sec id="sec-3-1">
        <title>Data set</title>
        <p>We use the Free917 dataset [6] as our training data, which contains 917 questions
annotated with logical forms. Recall that, in our model, we require gold-standard
phrase dependency DAGs of these questions as our training data, which we
prepare as follows. For each question, we nd its variable, entity, category and
relation phrases, manually annotate the question with phrase dependency DAG
and map phrases and dependencies to Freebase semantic items.</p>
        <p>The QALD-4 task for question answering over linked data consists of 50 test
questions. We use these 50 questions as our test data and directly use the parser
trained on Free917 to predict the query intention and instantiate the query
intention regarding DBpedia.
The following table gives the evaluation results with average precision, average
recall and F-measure. It shows the number of question our system can answer,
the number of right and partially right answers among them.
We notice that in many cases our system can be further improved. For example,
in the question from QALD4 \what is the bridge with the longest span", our
system incorrectly detects \the longest span" as a category phrase, while this
phrase is certainly a relation phrase. This detection error is propagated to phrasal
semantic parsing. A necessary direction for our future step should be jointly solve
phrase detection and phrasal semantic DAG parsing to avoid such errors.
6</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusion and Future Work</title>
      <p>In this paper, we propose a novel framework to translate natural language
questions into structural queries, which can be e ectively retrieved by structured
query engines and return the answers according a KB. The novelty of our
framework lies in modeling the task in a KB-independent and KB-related pipeline
paradigm, where we use phrasal semantic DAG to represent users' query
intention, and develop a KB-independent shift-reduce DAG parser to capture the
structure of the query intentions, which are then grounded to a given KB via
joint mappings. This gives the advantages to analyze the questions independent
from a KB and easily adapt to new KBs without much human involvement.</p>
      <p>Currently, our model requires question-DAG pairs as the training data, though
we do not need to annotate new data for every datasets or KBs, it is still
promising to extend our model to train on question-answer pairs only, further saving
human involvement. Secondly, in the pipeline of phrase detection and phrasal
semantic parsing, error propagated from the upstream to downstream. A joint
model with multiple perceptrons may help to eliminate the error propagation.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>A.</given-names>
            <surname>Fader</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Soderland</surname>
          </string-name>
          , and
          <string-name>
            <given-names>O.</given-names>
            <surname>Etzioni</surname>
          </string-name>
          .
          <article-title>Identifying relations for open information extraction</article-title>
          .
          <source>In EMNLP</source>
          , pages
          <fpage>1535</fpage>
          -
          <lpage>1545</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>K.</given-names>
            <surname>Sagae</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Tsujii</surname>
          </string-name>
          .
          <article-title>Shift-reduce dependency dag parsing</article-title>
          .
          <source>In COLING</source>
          , pages
          <fpage>753</fpage>
          -
          <lpage>760</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>M.</given-names>
            <surname>Collins</surname>
          </string-name>
          .
          <article-title>Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms</article-title>
          .
          <source>In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP-2002)</source>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>K. D. Bollacker</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Evans</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Paritosh</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Sturge</surname>
            , and
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          . Freebase:
          <article-title>a collaboratively created graph database for structuring human knowledge</article-title>
          .
          <source>In SIGMOD Conference</source>
          , pages
          <fpage>1247</fpage>
          -
          <lpage>1250</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Q.</given-names>
            <surname>Cai</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Yates</surname>
          </string-name>
          . Semantic Parsing Freebase:
          <article-title>Towards Open-domain Semantic Parsing</article-title>
          .
          <source>In Proceedings of the Second Joint Conference on Lexical and Computational Semantics (*SEM)</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>M.</given-names>
            <surname>Collins</surname>
          </string-name>
          .
          <article-title>Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms</article-title>
          .
          <source>In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP-2002)</source>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>A.</given-names>
            <surname>Frank</surname>
          </string-name>
          , H.-U. Krieger,
          <string-name>
            <given-names>F.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Uszkoreit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Crysmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Jorg</surname>
          </string-name>
          , and
          <string-name>
            <given-names>U.</given-names>
            <surname>Schafer</surname>
          </string-name>
          .
          <article-title>Question answering from structured knowledge sources</article-title>
          .
          <source>J. Applied Logic</source>
          ,
          <volume>5</volume>
          (
          <issue>1</issue>
          ):
          <fpage>20</fpage>
          -
          <lpage>48</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>T.</given-names>
            <surname>Lin</surname>
          </string-name>
          , Mausam, and
          <string-name>
            <given-names>O.</given-names>
            <surname>Etzioni</surname>
          </string-name>
          .
          <article-title>Entity linking at web scale</article-title>
          .
          <source>In Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction, AKBC-WEKEX 12</source>
          , pages
          <fpage>84</fpage>
          -
          <lpage>88</lpage>
          , Stroudsburg, PA, USA,
          <year>2012</year>
          .
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>M.</given-names>
            <surname>Collins</surname>
          </string-name>
          and
          <string-name>
            <given-names>B.</given-names>
            <surname>Roark</surname>
          </string-name>
          .
          <article-title>Incremental parsing with the perceptron algorithm</article-title>
          .
          <source>In ACL</source>
          , pages
          <fpage>111</fpage>
          -
          <lpage>118</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>F. M. Suchanek</surname>
            ,
            <given-names>G.</given-names>
            Kasneci, and G.
          </string-name>
          <string-name>
            <surname>Weikum.</surname>
          </string-name>
          <article-title>Yago: a core of semantic knowledge</article-title>
          .
          <source>In WWW</source>
          , pages
          <fpage>697</fpage>
          -
          <lpage>706</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>J.</given-names>
            <surname>Krishnamurthy</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Mitchell</surname>
          </string-name>
          .
          <article-title>Weakly supervised training of semantic parsers</article-title>
          .
          <source>In EMNLP-CoNLL</source>
          , pages
          <fpage>754</fpage>
          -
          <lpage>765</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>M. Yahya</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Berberich</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Elbassuoni</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Ramanath</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Tresp</surname>
            , and
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Weikum</surname>
          </string-name>
          .
          <article-title>Natural language questions for the web of data</article-title>
          .
          <source>In EMNLP-CoNLL</source>
          , pages
          <fpage>379</fpage>
          -
          <lpage>390</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>C.</given-names>
            <surname>Unger</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Cimiano</surname>
          </string-name>
          . Pythia:
          <article-title>Compositional meaning construction for ontology-based question answering on the semantic web</article-title>
          .
          <source>In Proceedings of the 16th International Conference on Natural Language Processing and Information Systems, NLDB11</source>
          , pages
          <fpage>153</fpage>
          -
          <lpage>160</lpage>
          , Berlin, Heidelberg,
          <year>2011</year>
          . Springer-Verlag.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>C. Unger</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Buhmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>A.-C. N.</given-names>
          </string-name>
          <string-name>
            <surname>Ngomo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Gerber</surname>
            , and
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Cimiano</surname>
          </string-name>
          .
          <article-title>Template-based question answering over rdf data</article-title>
          .
          <source>In WWW</source>
          , pages
          <fpage>639</fpage>
          -
          <lpage>648</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>K. D. Bollacker</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Evans</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Paritosh</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Sturge</surname>
            , and
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          . Freebase:
          <article-title>a collaboratively created graph database for structuring human knowledge</article-title>
          .
          <source>In SIGMOD Conference</source>
          , pages
          <fpage>1247</fpage>
          -
          <lpage>1250</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer</surname>
          </string-name>
          , G. Kobilarov,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cyganiak</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Z. G.</given-names>
            <surname>Ives. Dbpedia</surname>
          </string-name>
          :
          <article-title>A nucleus for a web of open data</article-title>
          .
          <source>In ISWC/ASWC</source>
          , pages
          <fpage>722</fpage>
          -
          <lpage>735</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <given-names>P.</given-names>
            <surname>Cimiano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Lopez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Unger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Cabrio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.-C. N.</given-names>
            <surname>Ngomo</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Walter</surname>
          </string-name>
          .
          <article-title>Multilingual question answering over linked data (qald-3): Lab overview</article-title>
          .
          <source>In CLEF</source>
          , pages
          <fpage>321</fpage>
          -
          <lpage>332</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <given-names>T.</given-names>
            <surname>Kwiatkowski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. S.</given-names>
            <surname>Zettlemoyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Goldwater</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Steedman</surname>
          </string-name>
          .
          <article-title>Inducing probabilistic ccg grammars from logical form with higher-order uni cation</article-title>
          .
          <source>In EMNLP</source>
          , pages
          <fpage>1223</fpage>
          -
          <lpage>1233</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <given-names>J.</given-names>
            <surname>Berant</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Liang</surname>
          </string-name>
          .
          <article-title>Semantic parsing via paraphrasing</article-title>
          .
          <source>In Association for Computational Linguistics (ACL)</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20. L.
          <string-name>
            <surname>Zou</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>J. X.</given-names>
          </string-name>
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <string-name>
            <surname>He</surname>
            , and
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Zhao</surname>
          </string-name>
          .
          <article-title>Natural language question answering over rdf</article-title>
          .
          <source>In Special Interest Group on Management Of Data(SIGMOD)</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <given-names>Q.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Ji</surname>
          </string-name>
          , and
          <string-name>
            <given-names>L.</given-names>
            <surname>Huang</surname>
          </string-name>
          .
          <article-title>Joint event extraction via structured prediction with global features</article-title>
          .
          <source>In ACL</source>
          , pages
          <fpage>73</fpage>
          -
          <lpage>82</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>J. Berant</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Chou</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Frostig</surname>
            , and
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Liang</surname>
          </string-name>
          .
          <article-title>Semantic parsing on freebase from question-answer pairs</article-title>
          .
          <source>In EMNLP</source>
          , pages
          <fpage>1533</fpage>
          -
          <lpage>1544</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <given-names>T.</given-names>
            <surname>Kwiatkowski</surname>
          </string-name>
          , E. Choi,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Artzi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>L. S.</given-names>
            <surname>Zettlemoyer</surname>
          </string-name>
          .
          <article-title>Scaling semantic parsers with on-the- y ontology matching</article-title>
          .
          <source>In EMNLP</source>
          , pages
          <fpage>1545</fpage>
          -
          <lpage>1556</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>