<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Mutilingual Hierarchical Expected Answer Type Classi cation using the SMART 2021 Dataset</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Aleksandr Perevalov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andreas Both</string-name>
          <email>andreas.bothg@hs-anhalt.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Anhalt University of Applied Sciences</institution>
          ,
          <addr-line>Kothen (Anhalt)</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <abstract>
        <p>The Knowledge Graph Question Answering (KGQA) systems are required to understand natural language in order to transform textual questions into structured queries to a knowledge graph. One of the important tasks of natural-language understanding (NLU) in the context of KGQA systems is the classi cation of the expected answer type (EAT). In this paper, we present our approach on the EAT classi cation within the SeMantic Answer Type Prediction Task 2021. The approach is based on machine-translation-based data augmentation, it supports 104 input languages1 and works over DBpedia and Wikidata. The obtained evaluation results demonstrate reasonable quality in comparison to both last year's and this year's solutions.</p>
      </abstract>
      <kwd-group>
        <kwd>Expected Answer Type Classi cation</kwd>
        <kwd>Target Type Identication</kwd>
        <kwd>Multilingual Question Answering</kwd>
        <kwd>Knowledge Graph Question Answering</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p># without EAT prediction
SELECT (COUNT(DISTINCT ?obj) as ?count)
WHERE {</p>
      <p>dbr:Angela_Merkel ?p ?obj .
}
# ?count = 861
# with EAT prediction
SELECT (COUNT(DISTINCT ?obj) as ?count)
WHERE {
dbr:Angela_Merkel ?p ?obj .
?obj rdf:type ?type .</p>
      <p>FILTER(?type = dbo:City)
}
# ?count = 6
case, the EAT classi cation task can be treated as a multi-class text classi cation
task. In the context of the SMART task, the data structure is more
sophisticated. There are two class levels: answer category (resource, literal, boolean) and
answer type. Hence, the class taxonomy is not at and requires approaches for
hierarchical classi cation.</p>
      <p>The o cial description of the data states4: If the category is \resource",
answer types are ontology classes from either the DBpedia ontology5 or the
Wikidata ontology6. If the category is \literal", answer types are either \number",
\date", or \string". For the category \boolean", no additional specialization is
de ned. The number of unique \resource" classes is high. Moreover, they are
represented in the form of a list.</p>
      <p>
        While following our long-term research agenda of increasing the
accessibility of KGQA systems and their components through multilingualization, the
presented solution is mainly based on the multilingual language models and
datasets (as in our last year iteration [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]). We used open-source machine
translation models [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] to translate the provided data into 10 languages (German,
Spanish, Mandarin Chinese, Italian, Romanian, Vietnamese, Russian, French,
Czech, Japanese). Thereafter, a multilingual BERT-based [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] classi er was
netuned on the original and translated data. We used a multi-level classi cation
pipeline to make the nal predictions. In the conclusion of the paper, we discuss
the nal results as well as our ndings during the working process.
      </p>
      <p>This work is structured as follows, in Section 2 we review the related research,
Section 3 describes the exploratory data analysis of the provided datasets, we
describe our approach in Section 4 and present the evaluation results in Section
5. Section 6 concludes the paper.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        The entity- and type-centric models were introduced in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] to identify the
answer type of a question. These models are used to rank the queries given the
entity- or type-related content [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. The idea of incorporating an additional
context to improve answer type predictions was proposed in work [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. One of the
4 https://smart-task.github.io/
5 http://mappings.dbpedia.org/server/ontology/classes/
6 https://www.wikidata.org/wiki/Wikidata:WikiProject_Ontology
ISWC 2020's Semantic Web challenges was addressing the answer type classi
cation (SeMantic AnsweR Type prediction task, SMART) [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. It has shown that
transformer-based models demonstrate the highest results in this task [
        <xref ref-type="bibr" rid="ref12 ref16">16,12</xref>
        ].
The approach based on using external data (e.g., KGQA datasets) was
introduced in [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Recently, the authors of [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] proposed a system for EAT prediction
in a \distantly supervised fashion" (i.e., no manual data annotation is required).
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Exploratory Data Analysis</title>
      <p>There are two class levels in the datasets: answer type category (resource,
literal, boolean) and answer type. As the class taxonomy is not at it requires
us to use approaches for hierarchical classi cation. The o cial description of the
data states7: If the category is \resource", answer types are ontology classes
from either the DBpedia ontology8 or the Wikidata ontology9. If the category
is \literal", answer types are either \number", \date", or \string". For the
category \boolean" no additional specialization is de ned. Such data has to be
analyzed on di erent levels for class distribution, noise, missing values, etc. In
this section, we demonstrate the analysis of both DBpedia and Wikidata answer
type classi cation datasets.
3.1</p>
      <p>DBpedia Dataset
The DBpedia dataset contains Train (40,621 examples) and Test (10,093
examples) subsets. After removing the null values from the Train subset, the number
of examples decreased to 37,061 (-3,560 examples). The example of the DBpedia
dataset for the answer type prediction is shown in Figure 2. Here, if the category
is resource, the answer type eld is represented as an ordered list of DBpedia
ontology types, where the rst type is the most speci c and the last is the most
general one (see the very rst question in the Figure). While manually analyzing
this year's SMART dataset we were capable to nd several noisy examples (see
Figure 3). In this example, the type eld contains duplicated DBpedia ontology
types.</p>
      <p>We also analyzed the distributions of values for category, literal, type
elds. The results are demonstrated in Figure 4. There is a huge imbalance
towards the resource value while the literal values are more or less balanced.
The distribution of the resource eld values is demonstrated in Figure 5.
Surprisingly, the most frequent values of the answer type eld contain noise.
For example, the second most frequent value has incorrect order of DBpedia
types. The third most frequent value has DBpedia types of di erent hierarchies
(dbo:Place and dbo:Agent).
7 https://smart-task.github.io/
8 http://mappings.dbpedia.org/server/ontology/classes/
9 https://www.wikidata.org/wiki/Wikidata:WikiProject_Ontology
"id": "dbpedia_1",
"question": "Who are the gymnasts coached by Amanda Reddin?",
"category": "resource",
"type": ["dbo:Gymnast", "dbo:Athlete", "dbo:Person", "dbo:Agent"]
"id": "dbpedia_2",
"question": "When did Margaret Mead marry Gregory Bateson?",
"category": "literal",
"type": ["date"]
"id": "dbpedia_3",
"question": "Is Azerbaijan a member of European Go Federation?",
"category": "boolean",
"type": ["boolean"]
"id": "1",
"question": "Who is the child of Ranavalona I s husband?",
"category": "resource",
"type": ["person", "omnivore", "natural person"]
The Wikidata dataset contains Train (43,604 examples) and Test (10,864
examples) subsets. We also cleaned the training data by removing the null values.
Consequently, the number of examples decreased to 43,554 (-50 examples). In
this aspect, the Wikidata data has signi cantly fewer null values rather than
the DBpedia data. An example of the Wikidata dataset for the answer type
prediction is shown in Figure 6.</p>
      <p>Here, if the category is resource, the answer type eld is represented as a
list of Wikidata classes that are retrieved according to the following SPARQL
query:
PREFIX wd: &lt;http://www.wikidata.org/entity/&gt;
PREFIX wdt: &lt;http://www.wikidata.org/prop/direct/&gt;
SELECT ?subClasses WHERE {
wd:Q123456789 wdt:P31 ?x . # subject to be replaced with actual answer entity
?x wdt:P279 ?subClasses .
}</p>
      <p>While manually analyzing the training subset of Wikidata we were capable
to nd several noisy examples of di erent nature in comparison to DBpedia (see
Figure 7). In this example, the type eld contains duplicated DBpedia ontology
types.</p>
      <p>We also analyzed the distributions of values for category, literal, type
elds. The results are demonstrated in Figure 8. For Wikidata we observed the
same data distribution patterns: there is a huge imbalance towards the resource
"id":10395,
"question":"what is the grammatcal mood of turkish",
"category":"resource",
"type":[</p>
      <p>
        "grammatical category", "Q26869183", "grammatical mood"
value while the literal values are more or less balanced. The distribution of the
resource eld values is demonstrated in Figure 9. Obviously, the answer type
values are extremely imbalanced towards the ones related to the person class.
We assume that the observed noise in the data was unintended and there was a
risk of having the data quality in the test dataset. Hence, we decided not to use
this year's SMART Task data. In our local training and evaluation process, the
data from the previous year was used [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. The corresponding data analysis for
the previous year's data is available in our paper [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Approach</title>
      <p>
        While following our long-term research agenda on enhancing multilingual
accessibility of KGQA systems [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], we base our approach on multilingual data
augmentation. We used only the data from SMART 2020 (the previous year's
challenge). For both DBpedia and Wikidata, all the textual questions in the
corresponding datasets were machine-translated from English into German, Spanish,
Chinese, Italian, Romanian, Vietnamese, Russian, French, Czech, and Japanese
using Helsinki NLP tool [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. For the DBpedia dataset, we fetched additional
data from LC-QuAD 1.0 [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] dataset. The same was done w.r.t. the Wikidata
dataset and LC-QuAD 2.0 [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>We used a multi-level hierarchical classi cation pipeline for both DBpedia
and Wikidata. The pipeline consists of the following models: (1) category
classi er, (2) literal classi er { are common models for both knowledge graphs, and
(3) resource classi er { trained separately for DBpedia and Wikidata. The
architecture of the classi cation pipeline is shown in Figure 10. The resource classi er
for DBpedia was trained in the multi-class classi cation setting to predict the
most speci c type of the hierarchy. When the prediction is executed, the rest of
the hierarchy is fetched from DBpedia via the following SPARQL query based
on the predicted type:
PREFIX owl: &lt;http://www.w3.org/2002/07/owl#&gt;
PREFIX rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt;
PREFIX dbo: &lt;http://dbpedia.org/ontology/&gt;
SELECT DISTINCT ?parentClass WHERE {
&lt;predictedType&gt; rdfs:subClassOf* ?parentClass .</p>
      <p>FILTER(?parentClass != &lt;predictedType&gt; &amp;&amp; CONTAINS(STR(?parentClass), "dbpedia"))
}}
The resource classi er for Wikidata was trained in the multi-label classi cation
setting. It works without any additional steps in the prediction phase.</p>
      <sec id="sec-4-1">
        <title>DBpedia</title>
      </sec>
      <sec id="sec-4-2">
        <title>Wikidata</title>
      </sec>
      <sec id="sec-4-3">
        <title>Accuracy NDCG@5 NDCG@10</title>
        <p>0.991 0.643 0.577</p>
      </sec>
      <sec id="sec-4-4">
        <title>Accuracy MRR</title>
        <p>
          0.980 0.430
For the classi cation model, BERT multilingual model [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] was used. We utilized
\transformers"10 Python library for implementing the classi cation pipeline. For
the multi-class classi ers (category, literal, resource for DBpedia), one
fullyconnected layer of size n was added to the BERT model, where n { is the number
of classes. The input for this layer was the last hidden state of the BERT model
(i.e., [CLS] token). We used categorical cross-entropy [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] as a loss function for
the multi-class models. The multi-label classi er { resource for Wikidata { was
also provided with a fully connected layer of the same size, however, binary
cross-entropy loss [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] was used as each of the output neurons represents the
probability of predicted label. The training of the models was done using early
stopping criteria targeted at minimizing the loss with the patience of one epoch.
        </p>
        <p>
          The prediction results were evaluated with the following metrics. For both
DBpedia and Wikidata, the quality of the category predictions was measured
using accuracy. The quality of the answer type predictions was measured using
Mean Reciprocal Rank (MRR) for Wikidata and lenient Normalized Discounted
Cumulative Gain @k with a Linear decay (NDCG@k) for DBpedia, where k 2
5; 10 [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
        </p>
        <p>The evaluation was done by organizers on the private dataset using an
internal process. The nal results are shown in Table 1. The obtained results are
demonstrating reasonable quality in comparison to the other participants as well
as to the results of the last year. The Accuracy score of the solution demonstrates
one of the best results on both DBpedia and Wikidata, while metrics related to
the ranking of the answer type hierarchies (NDCG@k, MRR) achieved relatively
poor results. The source code of our solution is available online11. In addition,
we also have deployed a demo-interface for EAT classi cation over DBpedia
online12.
6</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Discussion and Conclusion</title>
      <p>We would like to raise the following questions concerning the evaluation process.
First, as we observed a signi cant imbalance in the data w.r.t. the category
values (see Section 3), we think that the usage of accuracy score to measure
10 https://huggingface.co/bert-base-multilingual-cased
11 https://github.com/Perevalov/smart-2021
12 https://webengineering.ins.hs-anhalt.de:41009/eat-classification
the results is not the best option since that it is not robust to the imbalanced
data. Instead, we propose to use precision and recall scores computed in the
classi cation setting. Secondly, as the answer types of the resource category
questions in Wikidata are not ordered and do not form a hierarchy, we think
that the usage of mean reciprocal rank is not acceptable as this metric is used to
evaluate ordered result sets. Hence, the measure for unordered lists is naturally
applicable to this task, such as precision and recall calculated in an
informationretrieval setting.</p>
      <p>
        In this paper, we demonstrated our approach for hierarchical EAT prediction
based on a multi-level classi cation pipeline. As we used multilingual data and
models for training, the classi cation pipeline supports input in 104 languages.
The evaluation process demonstrated reasonable results w.r.t. the quality
metrics. For future work, we are targeting on improving the quality of the resource
answer types classi cation, and we are planning to create a study on the impact
of EAT classi cation on QA quality for multiple QA systems using a
componentoriented QA framework (e.g., [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]).
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Auer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kobilarov</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cyganiak</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ives</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>DBpedia: A nucleus for a web of open data</article-title>
          . In: Aberer,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Choi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.S.</given-names>
            ,
            <surname>Noy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            ,
            <surname>Allemang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.I.</given-names>
            ,
            <surname>Nixon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Golbeck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Mika</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Maynard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Mizoguchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Schreiber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Cudre-Mauroux</surname>
          </string-name>
          , P. (eds.)
          <article-title>The Semantic Web</article-title>
          . pp.
          <volume>722</volume>
          {
          <fpage>735</fpage>
          . Springer Berlin Heidelberg, Berlin, Heidelberg (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Balog</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neumayer</surname>
          </string-name>
          , R.:
          <article-title>Hierarchical target type identi cation for entity-oriented queries</article-title>
          .
          <source>In: Proceedings of the 21st ACM international conference on Information and knowledge management</source>
          . pp.
          <volume>2391</volume>
          {
          <fpage>2394</fpage>
          . CIKM '12,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2012</year>
          ). https://doi.org/10.1145/2396761.2398648
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Balog</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neumayer</surname>
          </string-name>
          , R.:
          <article-title>Hierarchical target type identi cation for entity-oriented queries</article-title>
          . pp.
          <volume>2391</volume>
          {
          <issue>2394</issue>
          (10
          <year>2012</year>
          ). https://doi.org/10.1145/2396761.2398648
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Both</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Diefenbach</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shekarpour</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cherix</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lange</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Qanary { a methodology for vocabulary-driven open question answering systems</article-title>
          . In: Sack,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Blomqvist</surname>
          </string-name>
          , E.,
          <string-name>
            <surname>d'Aquin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ghidini</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ponzetto</surname>
            ,
            <given-names>S.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lange</surname>
          </string-name>
          , C. (eds.)
          <article-title>The Semantic Web</article-title>
          .
          <source>Latest Advances and New Domains</source>
          . pp.
          <volume>625</volume>
          {
          <fpage>641</fpage>
          . Springer International Publishing,
          <string-name>
            <surname>Cham</surname>
          </string-name>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Cox</surname>
            ,
            <given-names>D.R.:</given-names>
          </string-name>
          <article-title>The regression analysis of binary sequences</article-title>
          .
          <source>Journal of the Royal Statistical Society: Series B (Methodological)</source>
          <volume>20</volume>
          (
          <issue>2</issue>
          ),
          <volume>215</volume>
          {
          <fpage>232</fpage>
          (
          <year>1958</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Dash</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mihindukulasooriya</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gliozzo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Canim</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Type prediction systems</article-title>
          .
          <source>CoRR</source>
          (
          <year>2021</year>
          ), https://arxiv.org/abs/2104.01207
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Devlin</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chang</surname>
            ,
            <given-names>M.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Toutanova</surname>
            ,
            <given-names>K.N.: BERT</given-names>
          </string-name>
          :
          <article-title>Pre-training of deep bidirectional transformers for language understanding</article-title>
          .
          <source>ArXiv</source>
          e-prints (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Dubey</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Banerjee</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Abdelkawi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>LC-QuAD 2.0: A large dataset for complex question answering over Wikidata and DBpedia</article-title>
          . In: Ghidini,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Hartig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            ,
            <surname>Maleshkova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Svatek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Cruz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Hogan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Song</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Lefrancois</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Gandon</surname>
          </string-name>
          ,
          <string-name>
            <surname>F</surname>
          </string-name>
          . (eds.)
          <source>The Semantic Web { ISWC 2019</source>
          . pp.
          <volume>69</volume>
          {
          <fpage>78</fpage>
          . Springer International Publishing,
          <string-name>
            <surname>Cham</surname>
          </string-name>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Garigliotti</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hasibi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Balog</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Target type identi cation for entity-bearing queries</article-title>
          .
          <source>In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          . pp.
          <volume>845</volume>
          {
          <fpage>848</fpage>
          . SIGIR '17,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2017</year>
          ). https://doi.org/10.1145/3077136.3080659
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Mihindukulasooriya</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dubey</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gliozzo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ngomo</surname>
            ,
            <given-names>A.C.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Usbeck</surname>
          </string-name>
          , R.:
          <article-title>SeMantic AnsweR Type prediction task (SMART) at ISWC 2020 Semantic Web Challenge</article-title>
          . CoRR/arXiv abs/
          <year>2012</year>
          .00555 (
          <year>2020</year>
          ), https://arxiv. org/abs/
          <year>2012</year>
          .00555
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Mihindukulasooriya</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dubey</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gliozzo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ngonga</surname>
            <given-names>Ngomo</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>A.C.</given-names>
            ,
            <surname>Usbeck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Rossiello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <surname>U.</surname>
          </string-name>
          :
          <article-title>Semantic answer type and relation prediction task</article-title>
          (smart
          <year>2021</year>
          ). arXiv (
          <year>2022</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Nikas</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fafalios</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tzitzikas</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Two-stage semantic answer type prediction for question answering using BERT and class-speci city rewarding</article-title>
          . In:
          <article-title>Proceedings of the SeMantic AnsweR Type prediction task (SMART) at ISWC 2020</article-title>
          .
          <source>CEUR Workshop Proceedings</source>
          , vol.
          <volume>2774</volume>
          , pp.
          <volume>19</volume>
          {
          <fpage>28</fpage>
          .
          <string-name>
            <surname>CEUR-WS.org</surname>
          </string-name>
          (
          <year>2020</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2774</volume>
          /paper-03.pdf
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Perevalov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Both</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Augmentation-based answer type classi cation of the SMART dataset</article-title>
          . In:
          <article-title>Proceedings of the SeMantic AnsweR Type prediction task (SMART) at ISWC 2020</article-title>
          .
          <source>CEUR Workshop Proceedings</source>
          , vol.
          <volume>2774</volume>
          , pp.
          <volume>1</volume>
          {
          <issue>9</issue>
          . CEURWS.org (
          <year>2020</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2774</volume>
          /paper-01.pdf
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Perevalov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Both</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Improving answer type classi cation quality through combined question answering datasets</article-title>
          .
          <source>In: Knowledge Science, Engineering and Management</source>
          . pp.
          <volume>191</volume>
          {
          <fpage>204</fpage>
          . Springer International Publishing,
          <string-name>
            <surname>Cham</surname>
          </string-name>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Perevalov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Diefenbach</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Usbeck</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Both</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          : QALD-9
          <article-title>-plus: A multilingual dataset for question answering over DBpedia and Wikidata translated by native speakers</article-title>
          .
          <source>In: 2022 IEEE 16th International Conference on Semantic Computing (ICSC)</source>
          .
          <source>IEEE</source>
          (
          <year>2022</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Setty</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Balog</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Semantic answer type prediction using BERT IAI at the ISWC SMART task 2020</article-title>
          .
          <article-title>In: Proceedings of the SeMantic AnsweR Type prediction task (SMART) at ISWC 2020</article-title>
          .
          <source>CEUR Workshop Proceedings</source>
          , vol.
          <volume>2774</volume>
          , pp.
          <volume>10</volume>
          {
          <fpage>18</fpage>
          .
          <string-name>
            <surname>CEUR-WS.org</surname>
          </string-name>
          (
          <year>2020</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2774</volume>
          /paper-02.pdf
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Tiedemann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thottingal</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>OPUS-MT | Building open translation services for the World</article-title>
          .
          <source>In: Proceedings of the 22nd Annual Conference of the European Association for Machine Translation (EAMT)</source>
          . Lisbon, Portugal (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Tonon</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Catasta</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Prokofyev</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Demartini</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aberer</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cudre-Mauroux</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Contextualized ranking of entity types based on knowledge graphs</article-title>
          .
          <source>Journal of Web Semantics 37-38</source>
          ,
          <issue>170</issue>
          {
          <fpage>183</fpage>
          (
          <year>2016</year>
          ). https://doi.org/10.1016/j.websem.
          <year>2015</year>
          .
          <volume>12</volume>
          .005
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Trivedi</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maheshwari</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dubey</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>LC-QuAD: A corpus for complex question answering over knowledge graphs</article-title>
          . In: d'Amato,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Fernandez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Tamma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Lecue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Cudre-Mauroux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Sequeda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Lange</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>He</surname>
          </string-name>
          <string-name>
            <surname>in</surname>
          </string-name>
          , J. (eds.)
          <source>The Semantic Web { ISWC 2017</source>
          . pp.
          <volume>210</volume>
          {
          <fpage>218</fpage>
          . Springer International Publishing,
          <string-name>
            <surname>Cham</surname>
          </string-name>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>