<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Linked Data services for Scientists exemplified using Drosophila melanogaster datasets</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>M.P. van Iersel</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>N. Anwar</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>L. Reynolds</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>General Bioinformatics</institution>
          ,
          <addr-line>Reading</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Semantic technologies allow scientists to rapidly integrate data from experimental datasets and online databases. On the computational side, the massive scale of these datasets poses some challenges. We find that there are additional challenges on the human side. Problems we encounter in practice include translating human questions to SPARQL queries, adjusting data models and ontologies to match the problem space of scientists, visualising data, and providing user-friendly access.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>We have integrated several Drosophila melanogaster datasets and built a public
showcase. This showcase integrates diverse data types such as genes, proteins,
FlyAtlas expression data and BioPAX pathways. This data is stored in a triple store
running in a cloud instance. We provide several options for visualizing the data. First,
a custom Cytoscape plugin can extract triple data and visualize it in network form.
Secondly, a PathVisio plugin can be used to create visualisations on top of pathway
diagrams derived from WikiPathways (See Fig. 1).</p>
      <p>By following the demo, you should get a clear idea how linked data can be used to
integrate diverse Drosophila datasets, how this can be visualized in pathway and
network context, and how this can help to answer research questions.</p>
    </sec>
    <sec id="sec-2">
      <title>Availability</title>
      <p>The integrated data, SPARQL tutorial and videos, RDF downloads, cytoscape
session and live SPARQL endpoint are all available at
http://fly.cloud.generalbioinformatics.com/</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>