<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>PVLDB Reference Format:
Louise Parkin. Cooperative Techniques for Dealing with Unsatisfactory
Answers in RDF Knowledge Bases. PVLDB</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Cooperative Techniques for Dealing with Unsatisfactory Answers in RDF Knowledge Bases</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Louise Parkin supervised by Brice Chardin</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stéphane Jean</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>France louise.parkin@ensma.fr</string-name>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2020</year>
      </pub-date>
      <volume>14</volume>
      <issue>1</issue>
      <abstract>
        <p>When querying Knowledge Bases, users are faced with large sets of data, often without knowing their underlying structures. It follows that users may make mistakes when formulating their queries, therefore receiving an unhelpful response. These unhelpful responses have been categorized into five types that consider either the number of results returned or the content of the results. These problems have been studied individually, and a category of proposed solutions involves modifying the original query in order to produce answers better suited to the user requirement. Similarities between the five problems suggest that techniques developed for one problem could be used to improve the treatment of the others. The goal of this PhD thesis is to propose a unified framework to deal with unexpected or unsatisfactory answers. In this paper, we analyse the state of the art on the unexpected answer problems and discuss the similarities between them, present our first contribution towards generalising the problems and present our future work.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        A Knowledge Base (KBs) is a collection of entities and facts about
them. With the development of the Semantic Web, numerous KBs
have been created in academic and industrial fields. Well known
examples of KBs include DBpedia [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], and LinkedGeoData [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. These
KBs store information using RDF triples (subject, predicate, object)
and are queried using the SPARQL language [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. KBs typically
contain billions of facts and are often structured using an ontological
schema and rules, such as those provided by RDFS [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] or OWL [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        A new end user querying a KB is often unfamiliar with the KB’s
structure and the data within it. As such, mistakes or misconceptions
can manifest in queries, and cause unexpected or unsatisfactory
answers. Mistakes refer to the user incorrectly writing their query,
for example creating an unwanted Cartesian product by omitting
a triple pattern, or misspelling a term. Misconceptions represent
the diference between a user’s view of a KB, and its reality [
        <xref ref-type="bibr" rid="ref33">33</xref>
        ].
For instance if in a hospital database, the property treats can only
link a Doctor to a Patient, and a user writes a query based on the
patients that a Nurse treats, they will be frustrated to receive no
answers. Alternatively, a user may believe that the property
birthPlace uniquely describes a person’s town of birth whereas in the
KB birthPlace is used for the country, county, town, and address
of birth. A query involving birthPlace may overwhelm the user by
producing four times as many answers as expected. The issue of
unexpected answers is one of the challenges to database system
usability [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
      </p>
      <p>There are five basic types of unexpected or unsatisfactory answer
problems, each associated with a why-question.</p>
      <p>(1) The query returns no answers (why-empty).
(2) The query returns too few answers (why-so-few).
(3) The query returns too many answers (why-so-many).
(4) An expected answer is missing from the result (why-not).
(5) An unwanted answer is included in the result (why-so).
In the first three cases, the issue is with the number of answers,
and in the last two cases it is with the content of the answers. In all
cases, the user does not understand the result of their query, and is
usually forced to modify their query through frustrating trial and
error, before achieving the desired output.</p>
      <p>
        In every unexpected answer problem, the answer produced by
executing a query does not meet a user requirement. We use the
formalism of Perez et al. [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] for the result set of a query Q
executed on a triplestore D: [[Q]]D . For each type of problem, we can
define a boolean function on the query answer, such that a query
either succeeds (the answers meet the user requirement) or fails
(the answers do not meet the user requirement). It is important to
note that while query failure has been used in literature to refer to
a query producing no answers, our definition of query failure relies
on the boolean property specific for each type of unexpected answer
problem. For instance, for the why-so-few problem, with a threshold
of K answers, the boolean failure property is |[[Q]]D | &lt; K , which
means a query fails if it produces fewer than K answers, and
succeeds otherwise. In the why-so problem, with an unwanted answer
mapping µ , the boolean property is µ &lt; [[Q]]D , which means that
a query fails if µ is included in its answers and succeeds otherwise.
      </p>
      <p>There are links between the five problems. The why-not and
whyso have strictly opposite failure conditions. This is also the case
for the why-so-few and why-so-many problems. The why-empty
problem is also an extreme case of the why-so-few problem, with
a threshold for insuficient answers set to 0. Furthermore, some
problems require a combination of failure conditions. This is the
case for cardinality problems where the number of answers must
be between two limits, requiring a combination of why-so-few and
why-so-many. Content-based problems may also need combining
when a user wants to include some answers and exclude others.</p>
      <p>Several studies have considered these problems separately, but a
unified framework to deal with unsatisfactory answers has yet to
be proposed. Our goal is to build upon the ideas proposed for
individual problems, and produce a unified explanation and rewriting
strategies for any query producing unexpected answers.
2</p>
    </sec>
    <sec id="sec-2">
      <title>STATE OF THE ART</title>
      <p>
        Our work falls in the scope of cooperative query answering, where
the focus is on making a database response understandable and
useful to users, even if they are unfamiliar with its content and
technical requirements. Motro categorized cooperative query answering
methods into two groups [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]. The first seeks to aid users when they
formulate their requirements and queries, and includes methods
such as example queries [
        <xref ref-type="bibr" rid="ref25 ref35">25, 35</xref>
        ]. The second category analyses user
queries to detect anomalies. The latter methods can be used when
a user is not satisfied with their query’s answer. Existing work has
two main focuses: answer explanation, and query modification .
2.1
      </p>
    </sec>
    <sec id="sec-3">
      <title>Answer explanation</title>
      <p>Answer explanations can either be based on the data manipulated,
or on the query formulated by a user.</p>
      <p>
        Data based explanations exist for the why-not and why-so
problems. In the case of why-so, several techniques can be used to
identify the provenance of a piece of data [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ]. In the why-not problem,
a data-based explanation can identify operations on a database that
lead to some missing information [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. Since data based methods do
not consider the user query, they cannot help to fix it if it contains
a mistake or misconception. In that case, query based explanations
are particularly useful, as the root cause of unsatisfactory answers is
the user’s lack of understanding of the database’s structure, content,
or query process.
      </p>
      <p>
        Failure causes were introduced to deal with the why-empty
problem [
        <xref ref-type="bibr" rid="ref19 ref23">19, 23</xref>
        ]. Rather than return a potentially misleading answer,
users are provided with the false presuppositions included in their
query. This led to the definition of Minimal Failing Subqueries
(MFS) in relational databases [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. The MFS are the smallest parts
of a query that cause it to fail. They have then been used in the
context of certain and uncertain KBs [
        <xref ref-type="bibr" rid="ref10 ref11">10, 11</xref>
        ]. Query-based failure
causes have also been considered for the why-not problem, first
in relational databases [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] then in KBs [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ]. In the RDF context,
a divide-and-conquer approach is used, first studying the query’s
triple patterns, then its SPARQL operators. A failure cause shows
users which triple pattern or operator causes an answer to be
absent. To our knowledge, no query based failure causes have been
studied for the why-so-many, why-so-few and why-so problems.
2.2
      </p>
    </sec>
    <sec id="sec-4">
      <title>Query modification</title>
      <p>Query modification involves rewriting a query submitted by the
user in order to remove the mistakes and misconceptions it may
contain. This step is successful if the results of the modified query
are better suited to the user requirements. There are three ways
to redefine a query: relaxation (the new query’s answers contain
the answers of the previous query), refinement (the new query’s
answers are contained in the answers of the previous query) or
modification (the new query’s answers contain some of the answers
of the previous query and add others). The query rewriting process
can have varying user involvement, from simply providing users
with explanations and leaving rewriting up to them, to a purely
automatic process where the user is absent, through interactive
processes where users guide the process by providing preferences.</p>
      <p>
        For the why-not problem in relational databases, the ConQueR
system [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ] modifies queries to obtain the missing answer as part
of the results of the new query. A similarity metric based on editing
distance, and imprecision metric are jointly used to rank modified
queries. First, the system attempts to find a modified query where
only the selection predicates are changed, and if none exist then
modifies other parts of the query. In the field of knowledge graphs,
recent work on the why-not and why-so problems has proposed
exact algorithms and heuristics to refine a user’s query [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ]. They
incrementally modify a query in order to obtain some of the answers
originally missed. They also define metrics to measure the rewriting
cost, and the answer closeness or precision.
      </p>
      <p>
        For the plethoric answers problem (why-so-many), in the field
of fuzzy queries, intensification strategies are used to strengthen
patterns present in the user query to make them more restrictive
[
        <xref ref-type="bibr" rid="ref22 ref6">6, 22</xref>
        ]. Alternatively, new patterns are added to the query [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. They
are chosen based on a measure of correlation between predicates
so that they are semantically close to the original query and reduce
the number of answers. In the field of graph queries, a method
based on removing elements of the initial query finds the largest
parts of the query that succeed. They are called maximum common
connected subgraphs (MCCS) and are provided to the user as
alternative non failing queries [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ]. A similar notion is considered in the
empty answers problem (why-empty), called maXimal Succeeding
Subqueries. They are the largest succeeding queries obtained by
removing parts of the original query [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
      <p>
        The MFS defined for the empty answers problem have been used
as part of several automated query modification systems dealing
with relational databases [
        <xref ref-type="bibr" rid="ref18 ref7">7, 18</xref>
        ]. In the context of RDF, the
additional cost of computing MFS has been weighed against the time
saved by avoiding executing queries that are known to fail once
the MFS are known, and a hybrid method computing the MFS at
key points in the relaxation has been presented [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. This work
uses an existing measure of similarity between queries based on
information content measure [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Another use of MFS in the empty
answer problem is as part of an interactive query rewriting
framework [
        <xref ref-type="bibr" rid="ref18 ref21">18, 21</xref>
        ]. At each step in the query relaxation users choose the
parts to be relaxed. These works show that the eficiency of query
modification methods can be improved if an answer explanation
step has been previously performed.
3
      </p>
    </sec>
    <sec id="sec-5">
      <title>FAILURE CAUSE DEFINITION</title>
      <p>In existing work, attempts to explain query failure and to modify
queries have been suggested for individual problems. Starting by
identifying failure causes can improve the performance of query
modification strategies. However, this failure cause identification
step has not been studied for all the unexpected answer problems.
As such, the existing query modification methods rely on trial and
error. Our first goal is to establish a method of identifying failure
SELECT * WHERE {
?d treats ?p . # t1
?d experience ?e . # t2
?d supervises ?n . # t3
?n providesCare ?pt . # t4
?n service ERNurse } # t5
t1t3
t1
causes that will support any unexpected answer problem. Our first
contribution is the extension of the definition of failure causes used
for the empty answers problem to the plethoric answers problem.</p>
      <p>
        Several existing approaches rely on decomposing an initial
failing query into parts to identify failure causes. In both the why-empty
problem [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] and the why-not problem [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ] conjunctive queries
are decomposed by removing triple patterns incrementally. We
have used this decomposition of SPARQL queries as the base of our
approach. From an original query Q given in figure 1, written as a
conjunction of its triple patterns t1 AN D t2 AN D t3 AN D t4 AN D t5
or t1t2t3t4t5 for short, we build its lattice of subqueries as shown
in figure 2. From this decomposition of the initial query, the next
step is defining failure causes. To that end, a boolean condition
indicating query failure is defined for every query in the lattice of
subqueries. Its exact definition depends on the unexpected answer
problem we are dealing with.
      </p>
      <p>
        A widely used notion in the empty answers problem, is the
minimal failing subquery (MFS) [
        <xref ref-type="bibr" rid="ref10 ref11 ref13 ref18 ref19 ref21 ref23 ref4 ref7">4, 7, 10, 11, 13, 18, 19, 21, 23</xref>
        ]. MFS
are subqueries that fail, that have no failing subqueries. In the empty
answers problem, when dealing with conjunctive queries, Godfrey
showed that if a query fails (in this case fails means produces no
answers) then all its superqueries fail. In this case, we say that the
failure condition is monotonic. With a monotonic failure condition,
any query containing an MFS fails, so MFS are a good way to
describe failure causes.
      </p>
      <p>When studying the plethoric answers problem, we noted that
its failure condition (a query produces more than a threshold K
answers) is not monotonic. This is also the case for the insuficient
answers problem, whose failure condition is that a query produces
fewer than a threshold K answers. For a problem whose failure
condition is not monotonic, a failing query can have a succeeding
superquery. As such, an MFS can be a part of a succeeding query
so the notion of MFS is no longer adequate to describe a failure
cause. We have introduced a new concept, that of minimal failure
inducing subquery (MFIS) to describe failure causes in cases where
the failure cause is not monotonic. A failure inducing subquery
(FIS) of a query Q is one of its subqueries Q ′ such that all subqueries
of Q that are superqueries of Q ′ fail. An MFIS is then defined as
an FIS that does not have any other FIS as a subquery. In the case
of a monotonic failure condition, the definitions of MFIS and MFS
are strictly equivalent. Since this definition of failure cause no
longer relies on a monotonic failure condition it can be used in any
unsatisfactory answer problem.</p>
      <p>
        The dual notion of MFS which was also introduced to deal with
the empty answers problem, is called maximum succeeding
subquery (XSS). An XSS of a query Q is one of its succeeding subqueries
that is not a subquery of any other succeeding query. The concept
of XSS does not depend on the monotony of the failure condition,
so it can also be used in every answer problem. In terms of query
failure, the set of MFIS and the set of XSS provide the same
information. They do not necessarily determine the success or failure of
every subquery of the initial query, but do identify all failure causes
(parts of a query that if present mean that the query will fail). There
can be exponentially many MFS and XSS [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], and we have shown
that is also the case for MFIS. So no algorithm can compute MFIS
and XSS in polynomial time in the worst case.
      </p>
      <p>
        For the plethoric answers problem, we have shown that
leveraging query and data properties can improve performance in certain
cases [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]. These properties allow us to determine that a query fails
without executing it, based on the failure of another query. They
require conditions based on the variables contained in the queries,
or the cardinalities of predicates. We created improved algorithms
based on these properties and implemented them in Java with three
triplestores (JenaTDB, Jena Fuseki, and Virtuoso). These algorithms
have been experimentally evaluated first using synthetic data and
queries from the WatDiv benchmark [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], then real data from
DBpedia and query logs from the Linked SPARQL Queries Dataset project
[
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. Our two algorithms ran respectively 36% and 44% faster than
a baseline algorithm which executes all subqueries. Our
implementation is available at https://forge.lias-lab.fr/projects/tma4kb with a
tutorial to reproduce our experiments. Previous work on the empty
answers problem in the RDF context produced similar results. It
showed that query properties can be used to compute MFS and XSS
efectively [
        <xref ref-type="bibr" rid="ref10 ref11">10, 11</xref>
        ]. These experimental results are promising in
regards to the applicability of an approach based on failure causes
to deal with unexpected answer problems.
4
      </p>
    </sec>
    <sec id="sec-6">
      <title>FUTURE WORK</title>
      <p>Our work so far has involved taking a method used for the
whyempty problem and apply it to the plethoric answers problem. The
existing notions were too restrictive for this new problem, so we
introduced some new definitions and properties. We end up with
a method applicable to both problems, which we have shown is
usable in practise.</p>
      <p>Our goal is to build a framework identifying failure causes in
unsatisfactory answer problems, through the following steps:
(1) to determine the search space, i.e. the set of queries under
consideration,
(2) to establish a partial order relationship between the queries
in the search space,
(3) to define a failure condition, a boolean property expressed
over the set of queries,
(4) to provide a failure cause definition, i.e. the MFIS,
(5) to develop particular inference rules describing the links
between queries and query failure.</p>
      <p>
        Currently, the search space is the lattice of subqueries, which
contains the original query and its subqueries obtained by removing
patterns. In the long run, this search space can be extended by
considering queries with additional triple patterns or modified
triple patterns from the original query. The partial order is also
related to triple pattern inclusion. This supposes that the queries
considered are conjunctive queries, which means that they do not
contain SPARQL operators such as UNION, OPTIONAL, FILTER.
This hypothesis is used in some existing solutions[
        <xref ref-type="bibr" rid="ref11 ref13">11, 13</xref>
        ], but it is a
significant restriction, as a study of SPARQL queries has shown that
a large proportion of SPARQL queries contain these operators [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
We have therefore started the extension of subquery definition to
support these operators.
      </p>
      <p>For each problem, the failure condition is linked to the result set
of a query [[Q ]]D and considers either its cardinality or its content.
We have shown that the MFIS can be used as failure causes for
the empty answers and plethoric answers problems, and we are
working on applying them to the other three problems. The last
step of determining inference rules will allow us to determine the
success or failure of a query based on the success or failure of one
of the queries it is linked to. This can make the computation of
MFIS and XSS much more eficient.</p>
      <p>
        After completing the answer explanation step, we will consider
query modification based on these explanations, drawing from
existing query rewriting algorithms. We plan to ofer users a varying
degree of participation in the query modification. Expert users
can simply be provided with failure causes and fix their queries
themselves, whereas more novice users may prefer an automatic
approach where the modified query is computed with minimal input
from them. Evaluation of the query modification will be performed
with various metrics introduced in existing work, such as similarity,
imprecision [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ], answer closeness and editing cost [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ].
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Güneş</given-names>
            <surname>Aluç</surname>
          </string-name>
          , Olaf Hartig,
          <string-name>
            <given-names>M Tamer</given-names>
            <surname>Özsu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Khuzaima</given-names>
            <surname>Daudjee</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Diversified stress testing of RDF data management systems</article-title>
          .
          <source>In ISWC'14</source>
          . Springer,
          <fpage>197</fpage>
          -
          <lpage>212</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Sören</given-names>
            <surname>Auer</surname>
          </string-name>
          , Jens Lehmann, and
          <string-name>
            <given-names>Sebastian</given-names>
            <surname>Hellmann</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>LinkedGeoData: Adding a Spatial Dimension to the Web of Data</article-title>
          .
          <source>In ISWC'09</source>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Sean</given-names>
            <surname>Bechhofer</surname>
          </string-name>
          , Frank van Harmelen,
          <string-name>
            <surname>Jim Hendler</surname>
          </string-name>
          , Ian Horrocks, Deborah L.
          <string-name>
            <surname>McGuinness</surname>
          </string-name>
          ,
          <string-name>
            <surname>Peter F. Patel-Schneider</surname>
          </string-name>
          , and Lynn Andrea Stein.
          <year>2004</year>
          .
          <article-title>OWL Web Ontology Language Reference</article-title>
          .
          <source>W3C Recommendation</source>
          (
          <year>2004</year>
          ). https://www.w3. org/TR/owl-ref/.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Chourouk</given-names>
            <surname>Belheouane</surname>
          </string-name>
          , Stéphane Jean, Hamid Azzoune, and
          <string-name>
            <given-names>Allel</given-names>
            <surname>Hadjali</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Cooperative treatment of failing queries over uncertain databases: a matrixcomputation-based approach</article-title>
          .
          <source>Journal of Intelligent Information Systems</source>
          <volume>52</volume>
          ,
          <issue>1</issue>
          (
          <year>2019</year>
          ),
          <fpage>211</fpage>
          -
          <lpage>238</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Nicole</given-names>
            <surname>Bidoit</surname>
          </string-name>
          , Melanie Herschel, and
          <string-name>
            <given-names>Katerina</given-names>
            <surname>Tzompanaki</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Query-based why-not provenance with nedexplain</article-title>
          .
          <source>In Extending database technology (EDBT).</source>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Patrick</given-names>
            <surname>Bosc</surname>
          </string-name>
          , Allel HadjAli, and
          <string-name>
            <given-names>Olivier</given-names>
            <surname>Pivert</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>About overabundant answers to flexible queries</article-title>
          .
          <source>In IPMU'06</source>
          , Vol.
          <volume>6</volume>
          .
          <fpage>2221</fpage>
          -
          <lpage>2228</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Patrick</given-names>
            <surname>Bosc</surname>
          </string-name>
          , Allel Hadjali, and
          <string-name>
            <given-names>Olivier</given-names>
            <surname>Pivert</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Incremental controlled relaxation of failing flexible queries</article-title>
          .
          <source>Journal of Intelligent Information Systems</source>
          <volume>33</volume>
          ,
          <issue>3</issue>
          (
          <year>2009</year>
          ),
          <fpage>261</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Patrick</given-names>
            <surname>Bosc</surname>
          </string-name>
          , Allel Hadjali, Olivier Pivert, and
          <string-name>
            <given-names>Grégory</given-names>
            <surname>Smits</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Une approche fondée sur la corrélation entre prédicats pour le traitement des réponses pléthoriques.</article-title>
          .
          <source>In EGC'10</source>
          .
          <fpage>273</fpage>
          -
          <lpage>284</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Dan</given-names>
            <surname>Brickley</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.V.</given-names>
            <surname>Guha</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <source>RDF Schema 1.1. W3C Recommendation</source>
          (
          <year>2014</year>
          ). https://www.w3.org/TR/rdf-schema/.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Ibrahim</surname>
            <given-names>Dellal</given-names>
          </string-name>
          , Stéphane Jean, Allel Hadjali, Brice Chardin, and
          <string-name>
            <given-names>Mickaël</given-names>
            <surname>Baron</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>On Addressing the Empty Answer Problem in Uncertain Knowledge Bases</article-title>
          . In DEXA'
          <volume>17</volume>
          .
          <fpage>120</fpage>
          -
          <lpage>129</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Géraud</surname>
            <given-names>Fokou</given-names>
          </string-name>
          , Stéphane Jean, Allel Hadjali, and
          <string-name>
            <given-names>Mickaël</given-names>
            <surname>Baron</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>RDF query relaxation strategies based on failure causes</article-title>
          .
          <source>In European semantic web conference</source>
          . Springer,
          <fpage>439</fpage>
          -
          <lpage>454</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Mario</given-names>
            <surname>Arias</surname>
          </string-name>
          <string-name>
            <given-names>Gallego</given-names>
            ,
            <surname>Javier D Fernández</surname>
          </string-name>
          ,
          <string-name>
            <surname>Miguel A Martínez-Prieto</surname>
          </string-name>
          , and Pablo de la Fuente.
          <year>2011</year>
          .
          <article-title>An Empirical Study of Real-World SPARQL Queries</article-title>
          .
          <source>In Proceedings of the USEWOD workshop co-located with WWW'11.</source>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>Parke</given-names>
            <surname>Godfrey</surname>
          </string-name>
          .
          <year>1997</year>
          .
          <article-title>Minimization in Cooperative Response to Failing Database Queries</article-title>
          .
          <source>International Journal of Cooperative Information Systems 6</source>
          ,
          <issue>2</issue>
          (
          <year>1997</year>
          ),
          <fpage>95</fpage>
          -
          <lpage>149</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>Steve</given-names>
            <surname>Harris</surname>
          </string-name>
          and
          <string-name>
            <given-names>Andy</given-names>
            <surname>Seaborne</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>SPARQL 1.1 Query Language</article-title>
          .
          <source>W3C Recommendation</source>
          (
          <year>2013</year>
          ). https://www.w3.org/TR/sparql11-query/.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Hai</surname>
            <given-names>Huang</given-names>
          </string-name>
          , Chengfei Liu, and
          <string-name>
            <given-names>Xiaofang</given-names>
            <surname>Zhou</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Approximating query answering on RDF databases</article-title>
          .
          <source>World Wide Web 15</source>
          ,
          <issue>1</issue>
          (
          <year>2012</year>
          ),
          <fpage>89</fpage>
          -
          <lpage>114</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Jiansheng</surname>
            <given-names>Huang</given-names>
          </string-name>
          , Ting Chen, AnHai Doan, and Jefrey F Naughton.
          <year>2008</year>
          .
          <article-title>On the provenance of non-answers to queries over extracted data</article-title>
          .
          <source>Proceedings of the VLDB Endowment 1</source>
          ,
          <issue>1</issue>
          (
          <year>2008</year>
          ),
          <fpage>736</fpage>
          -
          <lpage>747</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>H. V.</given-names>
            <surname>Jagadish</surname>
          </string-name>
          , Adriane Chapman, Aaron Elkiss, Magesh Jayapandian,
          <string-name>
            <given-names>Yunyao</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Arnab</given-names>
            <surname>Nandi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Cong</given-names>
            <surname>Yu</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>Making Database Systems Usable</article-title>
          . In SIGMOD'
          <volume>07</volume>
          .
          <fpage>13</fpage>
          -
          <lpage>24</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>Dietmar</given-names>
            <surname>Jannach</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>Techniques for fast query relaxation in content-based recommender systems</article-title>
          .
          <source>In Annual Conference on Artificial Intelligence</source>
          . Springer,
          <fpage>49</fpage>
          -
          <lpage>63</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>S Jerrold</given-names>
            <surname>Kaplan</surname>
          </string-name>
          .
          <year>1982</year>
          .
          <article-title>Cooperative responses from a portable natural language query system</article-title>
          .
          <source>Artificial Intelligence</source>
          <volume>19</volume>
          ,
          <issue>2</issue>
          (
          <year>1982</year>
          ),
          <fpage>165</fpage>
          -
          <lpage>187</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Jens</surname>
            <given-names>Lehmann</given-names>
          </string-name>
          , Robert Isele, Max Jakob, Anja Jentzsch, Dimitris Kontokostas,
          <string-name>
            <given-names>Pablo N.</given-names>
            <surname>Mendes</surname>
          </string-name>
          , Sebastian Hellmann, Mohamed Morsey, Patrick van Kleef,
          <string-name>
            <surname>Sören Auer</surname>
            , and
            <given-names>Christian</given-names>
          </string-name>
          <string-name>
            <surname>Bizer</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>DBpedia - A Large-scale, Multilingual Knowledge Base Extracted from Wikipedia</article-title>
          .
          <source>Semantic Web</source>
          <volume>6</volume>
          ,
          <issue>2</issue>
          (
          <year>2015</year>
          ),
          <fpage>167</fpage>
          -
          <lpage>195</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>David</given-names>
            <surname>McSherry</surname>
          </string-name>
          .
          <year>2004</year>
          .
          <article-title>Incremental relaxation of unsuccessful queries</article-title>
          .
          <source>In European Conference on Case-Based Reasoning</source>
          . Springer,
          <fpage>331</fpage>
          -
          <lpage>345</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>Samyr</given-names>
            <surname>Abrahão Moises and S do L Pereira</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Dealing with empty and overabundant answers to flexible queries</article-title>
          .
          <source>Journal of Data Analysis and Information Processing</source>
          (
          <year>2014</year>
          ),
          <fpage>12</fpage>
          -
          <lpage>18</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>Amihai</given-names>
            <surname>Motro</surname>
          </string-name>
          .
          <year>1986</year>
          .
          <article-title>Seave: A mechanism for verifying user presuppositions in query systems</article-title>
          .
          <source>ACM Transactions on Information Systems (TOIS) 4</source>
          ,
          <issue>4</issue>
          (
          <year>1986</year>
          ),
          <fpage>312</fpage>
          -
          <lpage>330</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>Amihai</given-names>
            <surname>Motro</surname>
          </string-name>
          .
          <year>1996</year>
          .
          <article-title>Cooperative database systems</article-title>
          .
          <source>International Journal of Intelligent Systems</source>
          <volume>11</volume>
          ,
          <issue>10</issue>
          (
          <year>1996</year>
          ),
          <fpage>717</fpage>
          -
          <lpage>731</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Davide</surname>
            <given-names>Mottin</given-names>
          </string-name>
          , Matteo Lissandrini, Yannis Velegrakis, and
          <string-name>
            <given-names>Themis</given-names>
            <surname>Palpanas</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Exemplar queries: Give me an example of what you need</article-title>
          .
          <source>Proceedings of the VLDB Endowment 7</source>
          ,
          <issue>5</issue>
          (
          <year>2014</year>
          ),
          <fpage>365</fpage>
          -
          <lpage>376</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <surname>Louise</surname>
            <given-names>Parkin</given-names>
          </string-name>
          , Ibrahim Dellal, Brice Chardin, Stéphane Jean, and
          <string-name>
            <given-names>Allel</given-names>
            <surname>Hadjali</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>A Cooperative Approach to Address the Overabundant Answers Problem in RDF Knowledge Bases</article-title>
          . In 36ème Conférence sur la Gestion de Données - Principes, Technologies et Applications.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <surname>Jorge</surname>
            <given-names>Pérez</given-names>
          </string-name>
          , Marcelo Arenas, and
          <string-name>
            <given-names>Claudio</given-names>
            <surname>Gutierrez</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Semantics and Complexity of SPARQL</article-title>
          .
          <source>ACM Trans. Database Syst</source>
          .
          <volume>34</volume>
          ,
          <issue>3</issue>
          ,
          <string-name>
            <surname>Article</surname>
            <given-names>Article</given-names>
          </string-name>
          16 (
          <year>2009</year>
          ),
          <volume>45</volume>
          pages.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <surname>Muhammad</surname>
            <given-names>Saleem</given-names>
          </string-name>
          , Muhammad Intizar Ali, Aidan Hogan, Qaiser Mehmood, and
          <string-name>
            <surname>Axel-Cyrille Ngonga Ngomo</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>LSQ: The Linked SPARQL Queries Dataset</article-title>
          . In ISWC'
          <volume>15</volume>
          .
          <fpage>261</fpage>
          -
          <lpage>269</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Song</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. H.</given-names>
            <surname>Namaki</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wu</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Answering Why-Questions for Subgraph Queries in Multi-attributed Graphs</article-title>
          .
          <source>In ICDE'19</source>
          .
          <fpage>40</fpage>
          -
          <lpage>51</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>Quoc</given-names>
            <surname>Trung</surname>
          </string-name>
          Tran and
          <string-name>
            <surname>Chee-Yong Chan</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>How to conquer why-not questions</article-title>
          .
          <source>In Proceedings of the 2010 ACM SIGMOD International Conference on Management of data. 15-26.</source>
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <surname>Elena</surname>
            <given-names>Vasilyeva</given-names>
          </string-name>
          , Maik Thiele, Christof Bornhövd, and
          <string-name>
            <given-names>Wolfgang</given-names>
            <surname>Lehner</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Answering “why empty?” and “why so many?” queries in graph databases</article-title>
          .
          <source>J. Comput. System Sci. 82</source>
          ,
          <issue>1</issue>
          (
          <year>2016</year>
          ),
          <fpage>3</fpage>
          -
          <lpage>22</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <surname>Meng</surname>
            <given-names>Wang</given-names>
          </string-name>
          , Jun Liu, Bifan Wei, Siyu Yao, Hongwei Zeng, and
          <string-name>
            <given-names>Lei</given-names>
            <surname>Shi</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Answering why-not questions on SPARQL queries</article-title>
          .
          <source>Knowledge and Information Systems</source>
          <volume>58</volume>
          (
          <year>2019</year>
          ),
          <fpage>169</fpage>
          -
          <lpage>208</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>Bonnie</given-names>
            <surname>Lynn</surname>
          </string-name>
          Webber and
          <string-name>
            <given-names>Eric</given-names>
            <surname>Mays</surname>
          </string-name>
          .
          <year>1983</year>
          .
          <article-title>Varieties of user misconceptions: Detection and correction</article-title>
          .
          <source>In IJCAI'83</source>
          , Vol.
          <volume>2</volume>
          .
          <fpage>650</fpage>
          -
          <lpage>652</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>Allison</given-names>
            <surname>Woodruf</surname>
          </string-name>
          and
          <string-name>
            <given-names>Michael</given-names>
            <surname>Stonebraker</surname>
          </string-name>
          .
          <year>1997</year>
          .
          <article-title>Supporting fine-grained data lineage in a database visualization environment</article-title>
          .
          <source>In Proceedings 13th International Conference on Data Engineering. IEEE</source>
          ,
          <fpage>91</fpage>
          -
          <lpage>102</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <surname>Moshe</surname>
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Zloof</surname>
          </string-name>
          .
          <year>1977</year>
          .
          <article-title>Query-by-example: A data base language</article-title>
          .
          <source>IBM systems Journal</source>
          <volume>16</volume>
          ,
          <issue>4</issue>
          (
          <year>1977</year>
          ),
          <fpage>324</fpage>
          -
          <lpage>343</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>