<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>SPARQL Query Formulation and Execution using FedViz</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Syeda Sana e Zainab</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Muhammad Saleem</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Qaiser Mehmood</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Durre Zehra</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefan Decker</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ali Hasnain</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Insight Centre for Data Analytics, National University of Ireland</institution>
          ,
          <addr-line>Galway</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Universita ̈t Leipzig, IFI/AKSW</institution>
          ,
          <addr-line>PO 100920, D-04009 Leipzig</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Health care and life sciences research heavily relies on the ability to search, discover, formulate and correlate data from distinct sources. Although the Semantic Web and Linked Data technologies help in dealing with data integration problem, there remains a barrier adopting these for non-technical research audiences. In this paper we present FedViz, a visual interface for SPARQL query formulation and execution. FedViz facilitates both federated and non-federated SPARQL queries formulation through a visual interface and uses FedX for query execution and the results retrieval. We evaluate the usability of our system by using the Standard System Usability scale. Overall usability score of 74.16 suggests that FedViz interface is easy to learn, consistent, and adequate for frequent use.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>FedViz allows the user to select concepts and properties from multiple datasets using
nodes and edges and makes SPARQL query in a background. Furthermore, it also
allow users to edit the resultant SPARQL query before sending it to the the query
execution engine. The assembled SPARQL query is then executed through FedX [12], a
well-known SPARQL query federation engine. Finally, the query execution results are
displayed by FedViz. Note FedX can executes both federated (without using SPARQL
SERVICE clause) and non-federated SPARQL queries. Current version of FedViz
supports six real time Life Science data sources namely Kegg (http://www.genome.jp/kegg/),
Drugbank (http://www.drugbank.ca/), DailyMed (http://dailymed.nlm.nih.gov/dailymed/
index.cfm), Medicare (http://wifo5-03.informatik.uni-mannheim.de/medicare/), Sider
(http://wifo5-03.informatik.uni-mannheim.de/sider/), and Diseasome (http://wifo5-03.
informatik.uni-mannheim.de/diseasome/).
2</p>
    </sec>
    <sec id="sec-2">
      <title>FedViz</title>
      <p>FedViz (Demo available at: http://srvgal86.deri.ie/FedViz/index.html) is a browser based
application currently supported by IE, Firefox and Chrome, that provide users with a
flexible visual interface to formulate and execute SPARQL queries. FedViz visualises
six life sciences datasets and facilitate users to draw meaningful biological correlations
including drug-drug interaction, drug-disease interaction, drug-side effect interaction
and disease-gene interaction. FedViz supports full range of SPARQL1.1 constructs either
selected through interface or can be directly added to the query in SPARQL Editor.
2.1</p>
      <p>Query Formulation
We demonstrate our visual query formulation and execution process by using an example
scenario, i.e., find the drug-disease interaction from the aforementioned data sources.
Scenario: Drug-Disease Interaction: Drugs with their inchi key, molecular weight
average and chemical structure for curing disease Alzheimer:</p>
      <p>This query requires data integration from Drugbank (containing drugs
information) and Diseasome (containing disease information) and be formulated by using the
following step-by-step approach (ref., Fig. 1):
1. The first step is to identify how Drugbank and Diseasome datasets are connected to
each other? This connectivity (i.e., via classes drugbank:drug and diseasome
:disease) can be found by using the Class visualisation View of FedViz.
2. User selects Drugbank from the Datasets Selection box (window A).
3. The visualisation for Drugbank dataset can be seen in Window B where user
selects drugbank:drug class and its properties (i.e., inchi key, molecular
weight average and possible disease target).
4. Step 2 and 3 are now followed for Diseasome dataset, i.e., select diseasome:disease
class and it’s name property (window C).
5. Next, FedViz SPARQL Query Editor allows user to add constraints to the
formulated query such as select projection variables, apply SPARQL LIMIT, FILTER,
ORDERY BY clauses, and can further edit the query according to his choice (window
D(a) and D(b)).
6. The final query can be seen on submission (window E).</p>
      <p>A: Datasets selection</p>
      <p>B: Drugbank Concept &amp; Property selection
D(a):Query editor</p>
      <p>D(b):Query editor</p>
      <p>E:Formulated Query
F:Federated query results</p>
      <p>G:Instance data exploration
7. Query is executed over FedX and the retrieve results are displayed by FedViz (Result
window F).
8. Finally, by selecting any URI from the retrieved result, FedViz provided detailed
information regarding that entity (window G).</p>
    </sec>
    <sec id="sec-3">
      <title>3 Evaluation</title>
      <p>To assess the usability of FedViz, we used the standardised, System Usability Scale
(SUS) [10] questionnaire2. The SUS is a low-cost, reliable usability scale that can
be used for global assessments of systems usability[10]. The survey was filled by 15
users3 including researchers and engineers in Semantic Web (by 10th July 2015).
We achieved a mean usability score of 74.16 indicating a high level of usability
according to the SUS score. The average scores (out of 5) for each question along
with standard deviation is shown in Figure 2. The responses to question 1 suggests
that FedViz is adequate for frequent use (average score to question 1 = 3.8 0.86) by
users all of type. The responses to question 7 (average score 4.06 0.96) suggests
that most people would learn to use this system very quickly and the responses to
question 3 indicates that FedViz is easy to use (average score 4 0.84).
2 Our survey can found at: http://goo.gl/forms/bhReuNgd6O
3 Users from AKSW, University of Leipzig and INSIGHT Centre, National University of Ireland,
Galway. Summary of the responses can be found at: https://goo.gl/ZOrJx9</p>
      <p>
        I needed to learn a lot of things before I could get going with this system (
        <xref ref-type="bibr" rid="ref10">10</xref>
        )
      </p>
      <p>
        I felt very confident using the system (
        <xref ref-type="bibr" rid="ref9">9</xref>
        )
      </p>
      <p>
        I found the system very cumbersome to use (
        <xref ref-type="bibr" rid="ref8">8</xref>
        )
I would imagine that most people would learn to use this system very quickly (
        <xref ref-type="bibr" rid="ref7">7</xref>
        )
      </p>
      <p>
        I thought there was too much inconsistency in this system (
        <xref ref-type="bibr" rid="ref6">6</xref>
        )
      </p>
      <p>
        I found the various functions in this system were well integrated (
        <xref ref-type="bibr" rid="ref5">5</xref>
        )
I think that I would need the support of a technical person to be able to use this system (
        <xref ref-type="bibr" rid="ref4">4</xref>
        )
      </p>
      <p>
        I thought the system was easy to use (
        <xref ref-type="bibr" rid="ref3">3</xref>
        )
      </p>
      <p>
        I found the system unnecessarily complex (
        <xref ref-type="bibr" rid="ref2">2</xref>
        )
I think that I would like to use this system frequently (
        <xref ref-type="bibr" rid="ref1">1</xref>
        )
0
      </p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgement</title>
      <p>The work presented in this DEMO paper has been partly funded by Science
Foundation Ireland under Grant No. SFI/08/CE/I1380 (Lion-2).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>J.</given-names>
            <surname>Borsje</surname>
          </string-name>
          and
          <string-name>
            <given-names>H.</given-names>
            <surname>Embregts</surname>
          </string-name>
          .
          <article-title>Graphical query composition and natural language processing in an rdf visualization interface</article-title>
          .
          <source>Erasmus School of Economics and Business Economics</source>
          , Vol. Bachelor. Erasmus University, Rotterdam,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>B.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. J.</given-names>
            <surname>Wild</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Ding</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Dong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Sankaranarayanan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sun. Chem2bio2rdf</surname>
          </string-name>
          :
          <article-title>A linked open data portal for chemical biology</article-title>
          .
          <source>arXiv:1012.4759</source>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>H.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Yu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J. Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          .
          <article-title>Semantic web meets integrative biology: a survey</article-title>
          . Briefings in bioinformatics,
          <volume>14</volume>
          (
          <issue>1</issue>
          ):
          <fpage>109</fpage>
          -
          <lpage>125</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>F.</given-names>
            <surname>Haag</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lohmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Siek</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Ertl</surname>
          </string-name>
          .
          <article-title>Visual querying of linked data with QueryVOWL</article-title>
          .
          <source>In Joint Proceedings of SumPre 2015</source>
          and
          <article-title>HSWI 2014-15</article-title>
          . CEUR-WS, to appear.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>A.</given-names>
            <surname>Hasnain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fox</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Decker</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H. F.</given-names>
            <surname>Deus</surname>
          </string-name>
          .
          <article-title>Cataloguing and linking life sciences LOD Cloud</article-title>
          .
          <source>In 1st International Workshop on Ontology Engineering in a Data-driven World collocated with EKAW12</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>A.</given-names>
            <surname>Hasnain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Kamdar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hasapis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Zeginis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. N. Warren</given-names>
            <surname>Jr</surname>
          </string-name>
          , et al.
          <article-title>Linked Biomedical Dataspace: Lessons Learned integrating Data for Drug Discovery</article-title>
          . In International Semantic Web Conference (In-Use Track),
          <year>October 2014</year>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>A.</given-names>
            <surname>Hasnain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Mehmood</surname>
          </string-name>
          , S. S. e Zainab, and
          <string-name>
            <given-names>S.</given-names>
            <surname>Decker</surname>
          </string-name>
          .
          <article-title>A provenance assisted roadmap for life sciences linked open data cloud</article-title>
          .
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>A.</given-names>
            <surname>Hasnain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. S. E.</given-names>
            <surname>Zainab</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Kamdar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Mehmood</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. Warren</given-names>
            <surname>Jr</surname>
          </string-name>
          , et al.
          <article-title>A roadmap for navigating the life scinces linked open data cloud</article-title>
          .
          <source>In International Semantic Technology (JIST2014) conference</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Kamdar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Zeginis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hasnain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Decker</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H. F.</given-names>
            <surname>Deus</surname>
          </string-name>
          .
          <article-title>Reveald: A user-driven domain-specific interactive search platform for biomedical research</article-title>
          .
          <source>Journal of biomedical informatics</source>
          ,
          <volume>47</volume>
          :
          <fpage>112</fpage>
          -
          <lpage>130</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>J. R.</given-names>
            <surname>Lewis</surname>
          </string-name>
          and
          <string-name>
            <surname>J. Sauro.</surname>
          </string-name>
          <article-title>The factor structure of the system usability scale</article-title>
          .
          <source>In HCD</source>
          .
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>M. Saleem</surname>
            ,
            <given-names>S. S.</given-names>
          </string-name>
          <string-name>
            <surname>Padmanabhuni</surname>
            ,
            <given-names>A.-C. N.</given-names>
          </string-name>
          <string-name>
            <surname>Ngomo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Iqbal</surname>
            ,
            <given-names>J. S.</given-names>
          </string-name>
          <string-name>
            <surname>Almeida</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Decker</surname>
            , and
            <given-names>H. F.</given-names>
          </string-name>
          <string-name>
            <surname>Deus</surname>
          </string-name>
          . Topfed:
          <article-title>Tcga tailored federated query processing and linking to lod</article-title>
          .
          <source>Journal of Biomedical Semantics</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>A.</given-names>
            <surname>Schwarte</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Haase</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hose</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Schenkel</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          . Fedx:
          <article-title>Optimization techniques for federated query processing on linked data</article-title>
          .
          <source>In ISWC</source>
          .
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>