<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Sparklis: a SPARQL Endpoint Explorer for Expressive Question Answering</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>IRISA, Universite de Rennes 1 Campus de Beaulieu</institution>
          ,
          <addr-line>35042 Rennes cedex</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Sparklis is a Semantic Web tool that helps users explore SPARQL endpoints by guiding them in the interactive building of questions and answers, from simple ones to complex ones. It combines the ne-grained guidance of faceted search, most of the expressivity of SPARQL, and the readability of (controlled) natural languages. No endpoint-speci c con guration is necessary, and no knowledge of SPARQL and the data schema is required from users. This demonstration paper is a companion to the research paper [2].</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Motivation</title>
      <p>
        Sparklis re-uses and generalizes the interaction model of Faceted Search
(FS) [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], where users are guided step-by-step in the selection of items. At each
      </p>
      <sec id="sec-1-1">
        <title>1 Online at http://www.irisa.fr/LIS/ferre/sparklis/osparklis.html</title>
        <p>
          step, the system gives a set of suggestions to re ne the current selection, and
users only have to pick a suggestion according to their preferences. The
suggestions are speci c to the selection, and therefore support exploratory search [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]
by providing overview and feedback during the search process.
        </p>
        <p>
          To overcome expressivity limitations of FS and existing extensions for the
Semantic Web (e.g., gFacet [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], VisiNav [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ], SemFacet [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]), we have generalized it
to Query-based Faceted Search (QFS), where the selection of items is replaced
by a structured query. The latter is built step-by-step through the successive
choices of the user. This makes Sparklis a kind of Query Builder (QB), like
SemanticCrystal [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. QBs have the advantage to allow for a high expressivity
while assisting users about syntax, e.g. avoiding syntax errors, listing eligible
constructs. However, the FS-based guidance of Sparklis is more ne-grained
than in QBs. Sparklis avoids vocabulary errors by retrieving the URIs and
literals right from the SPARQL endpoint. It needs not be con gured for a
particular dataset, and dynamically discovers the data schema. In fact, Sparklis
only allows the building of queries that do return results, preventing users to fall
on empty results. That is because system suggestions are computed for the
individual results, not for their common class. In fact, Sparklis is as much about
building answers as about building questions.
        </p>
        <p>
          To overcome the lack of readability of SPARQL queries for most users,
Sparklis queries and suggestions are verbalized in natural language so that SPARQL
queries never need to be shown to users. This makes Sparklis a kind of Natural
Language Interface (NLI), like PowerAqua [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. The important di erence is that
questions are built through successive user choices in Sparklis instead of
being freely input in NLIs. Sparklis interaction makes question formulation more
constrained, slower, and less spontaneous, but it provides guidance and safeness
with intermediate answers and suggestions at each step. Moreover, it avoids
the hard problem of NL understanding: i.e., ambiguities, out-of-scope questions.
A few NLI systems, like Ginseng [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], are based on a controlled NL and
autocompletion to suggest the next words in a question. However, their suggestions
are not ne-grained like with FS, and less exible because they only apply to the
end of the question. In Sparklis, questions form complete sentences at any step
of the search; and suggestions are not words but meaningful phrases (e.g., that
has a director), and can be inserted at any position in the current question.
3
        </p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>User Interface and Interaction</title>
      <p>highest-to-lowest/focus on a Writer/that has a nationality. Note that
di erent insertion orderings are possible for a same question. Navigation
buttons allow to move backward/forward in the construction history. A permalink
to the current navigation state (endpoint+question) can be generated at any
time. To switch to another SPARQL endpoint, it is enough to input its URL in
the entry eld. The query focus is moved simply by clicking on di erent parts of
the question, or on di erent table column headers. Every suggestion in the three
lists, as well as every table cell, can be inserted or applied to the current focus
by clicking it. The rst suggestion list contains entities (individuals and
literals). The second list contains concepts (classes and properties). The third list
contains logical connectives, sorting modi ers, and aggregation operators. Each
suggestion list is equipped with an immediate-feedback ltering mechanism to
quickly locate suggestions in long lists. With the rst list, lters can be inserted
into the query with di erent lter operators listed in a drop-down menu (e.g.,
matches, higher or equal than, before). Questions and suggestions use
indentation to disambiguate di erent possible groupings and improve readability,
and syntax coloring to distinguish between the di erent kinds of words.
4</p>
    </sec>
    <sec id="sec-3">
      <title>Performances and Limitations</title>
      <p>Portability. Sparklis conforms to the SPARQL standard, and requires no
preprocessing or con guration to explore an endpoint. It entirely relies on the
endpoint to discover data and its schema. The main limitation is that URIs are
displayed through their local names, which is not always readable.</p>
      <p>Expressivity. Sparklis covers many features of SPARQL: basic graph
patterns (including cycles), basic lters, UNION, OPTIONAL, NOT EXISTS, SELECT,
ORDER BY, multiple aggregations with GROUP BY. Almost all queries of the
QALD2 challenge can be answered. Uncovered features are expressions, named
graphs, nested queries, queries returning RDF graphs, and updates.</p>
      <p>Scalability. Sparklis is responsive on the largest well-known endpoint:
DBpedia. Among the 100 QALD-3 questions, half can be answered in less than 30
seconds (wall-clock time including user interaction and system computations).
5</p>
    </sec>
    <sec id="sec-4">
      <title>Demonstration</title>
      <p>The demonstration has shown to participants how QALD questions over
DBpedia can be answered in a step-by-step process. Those questions cover various
retrieval tasks: basic facts (Give me the homepage of Forbes), entity lists (Which
rivers ow into a German lake?), counts (How many languages are spoken in
Colombia?), optimums (Which of Tim Burton's lms had the highest budget?).
More complex analytical question answering has also been demonstrated (Give
me the total runtime, from highest to lowest, of lms per director and per
country). Participants were also given the opportunity to explore any SPARQL
endpoint of their choice.</p>
      <sec id="sec-4-1">
        <title>2 http://greententacle.techfak.uni-bielefeld.de/~cunger/qald/</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Arenas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grau</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kharlamov</surname>
            , E.,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Marciuska</surname>
            , Zheleznyakov,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>JimenezRuiz</surname>
          </string-name>
          , E.:
          <article-title>SemFacet: Semantic faceted search over YAGO</article-title>
          .
          <source>In: World Wide Web Conf. Companion</source>
          . pp.
          <volume>123</volume>
          {
          <fpage>126</fpage>
          .
          <string-name>
            <given-names>WWW</given-names>
            <surname>Steering Committee</surname>
          </string-name>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Ferre</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Expressive and scalable query-based faceted search over SPARQL endpoints</article-title>
          . In: Mika,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Tudorache</surname>
          </string-name>
          , T. (eds.)
          <source>Int. Semantic Web Conf</source>
          . Springer (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Harth</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>VisiNav: A system for visual search and navigation on web data</article-title>
          .
          <source>J. Web Semantics</source>
          <volume>8</volume>
          (
          <issue>4</issue>
          ),
          <volume>348</volume>
          {
          <fpage>354</fpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Heim</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ertl</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ziegler</surname>
          </string-name>
          , J.:
          <article-title>Facet graphs: Complex semantic querying made easy</article-title>
          . In: et al.,
          <string-name>
            <surname>L.A</surname>
          </string-name>
          . (ed.)
          <source>Extended Semantic Web Conference</source>
          . pp.
          <volume>288</volume>
          {
          <fpage>302</fpage>
          . LNCS 6088, Springer (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Kaufmann</surname>
          </string-name>
          , E.,
          <string-name>
            <surname>Bernstein</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Evaluating the usability of natural language query languages and interfaces to semantic web knowledge bases</article-title>
          .
          <source>J. Web Semantics</source>
          <volume>8</volume>
          (
          <issue>4</issue>
          ),
          <volume>377</volume>
          {
          <fpage>393</fpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Lopez</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fernandez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motta</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stieler</surname>
          </string-name>
          , N.:
          <article-title>PowerAqua: Supporting users in querying and exploring the semantic web</article-title>
          .
          <source>Semantic Web</source>
          <volume>3</volume>
          (
          <issue>3</issue>
          ),
          <volume>249</volume>
          {
          <fpage>265</fpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Marchionini</surname>
          </string-name>
          , G.:
          <article-title>Exploratory search: from nding to understanding</article-title>
          .
          <source>Communications of the ACM</source>
          <volume>49</volume>
          (
          <issue>4</issue>
          ),
          <volume>41</volume>
          {
          <fpage>46</fpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Sacco</surname>
            ,
            <given-names>G.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tzitzikas</surname>
            ,
            <given-names>Y</given-names>
          </string-name>
          . (eds.):
          <article-title>Dynamic taxonomies and faceted search</article-title>
          .
          <source>The information retrieval series</source>
          , Springer (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>