<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Ondex: data integration and visualisation for the Semantic Web</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Catherine Canevet</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrea Splendiani</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Steve Kuo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Robert Stevens</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Phil Lord</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chris Rawlings</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Centre for Mathematical and Computational Biology, Rothamsted Research</institution>
          ,
          <addr-line>Harpenden</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>School of Computer Science, University of Manchester</institution>
          ,
          <addr-line>Manchester</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>School of Computing Science, University of Newcastle</institution>
          ,
          <addr-line>Newcastle</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Many systems approaches to biology need to identify, integrate and analyse information that is captured in a myriad of databases which use a wide variety of different formats and access methods. The Ondex data integration platform [1] (www.ondex.org) enables data from diverse biological data sets to be linked together, integrated, analysed and visualised using graph-based techniques. At the basis of Ondex is a graph data structure where entities and properties are associated to classes [2]. This data structure is closely related to the data model of RDF and supports a limited representation of ontologies. In the context of the SABR project (http://www.ondex.org/sabr.html), we are investigating ways to create a mapping between the Ondex data structure and the RDF/OWL. Our objective is to allow Ondex to query, visualise and analyse Semantic Web based knowledge bases [3]. The challenges are to maintain acceptable performance when translating between the RDF and Ondex data models. Once this mapping is implemented, Ondex could also be used to build workflows for the curation and management of such knowledge bases. The Ondex data model shares many similarities with RDF. For example, it has equivalents for Object Properties, Data Type properties, Types and Named graphs. On the other hand, one of the differences is that there are no global identities such as URIs available in Ondex. In fact, different information sources are aggregated by representing the original entities in “blank nodes” first and then by computing mappings among these nodes using their attributes. We wish to devise a mapping strategy that will allow Semantic Web based knowledge bases to be exposed using the Ondex data model because of the rich feature set that Ondex supports. Among the features already implemented in Ondex that could be deployed on a generic Semantic Web based knowledge bases are:</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p> 
 </p>
      <p>Methods to compute mappings between information resources, based on lexical
information, unique identifiers, sequence similarity or text mining (mapping free
text to entities and relations using information extraction).</p>
      <p>Methods to perform network analysis using betweenness and degree centrality
measures (how influential an entity is in the network, how many relations it has)
as well as a statistics module.</p>
      <p>Methods to interactively explore the content of integrated datasets as shown in
Figure 1. A set of filters help users narrow down their integrated data sets to
regions of interest in the network.
Top-left: AraCyc, DRASTIC and microarray data. Pie charts show when genes are up/down
regulated (red/green) given the selected treatments.</p>
      <p>Top-right: Poplar genome sequence, PoplarCyc, UniProt, Pfam, GO, GOA and Medline. The
genomic view allows user to select QTLs of interest and search their neighbourhood based on
keywords and a given neighbourhood depth.</p>
      <p>
        Bottom-left: GOA, UniProt, Medline and GO of Arabidopsis thaliana. Data integration and
literature analysis methods are used to predict the function of previously unannotated genes.
Bottom-right: TAIR, IntAct and BioGRID. The resulting network shows the Arabidopsis
interactome. The threshold filter displays the distribution of the degree centrality calculated for
each protein and added to each protein as an attribute (previously by an annotator).
As a reference use case, we have started the investigation of how Ondex can be used
to interact with BioGateway [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], which implements a Sparql endpoint. Apart from
creating a mapping that would make BioGateway accessible in Ondex, we are
defining a set of guidelines to harmonise the development of Ondex modules to
ensure a common representation compliant with this mapping [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Köhler</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baumbach</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taubert</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Specht</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Skusa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruegg</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rawlings</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Verrier</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Philippi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Graph-based analysis and visualization of experimental results with Ondex</article-title>
          .
          <source>Bioinformatics</source>
          <volume>22</volume>
          (11):
          <fpage>1383</fpage>
          -
          <lpage>1390</lpage>
          (
          <year>2006</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Taubert</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sieren</surname>
            ,
            <given-names>K.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hindle</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoekman</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Winnenburg</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Philippi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rawlings</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Köhler</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>The OXL format for the exchange of integrated datasets</article-title>
          .
          <source>Journal of Integrative Bioinformatics</source>
          <volume>4</volume>
          ,
          <issue>62</issue>
          (
          <year>2007</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Antezana</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuiper</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mironov</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Biological knowledge management: the emerging role of the Semantic Web technologies</article-title>
          .
          <source>Briefings in Bioinformatics 10</source>
          ,
          <fpage>392</fpage>
          --
          <lpage>407</lpage>
          (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Antezana</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blondé</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Egaña</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rutherford</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stevens</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Baets</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mironov</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuiper</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>BioGateway: a semantic systems biology tool for the life sciences</article-title>
          .
          <source>BMC Bioinformatics 10 (Suppl</source>
          <volume>10</volume>
          )
          <article-title>:S11 (</article-title>
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Splendiani</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuiper</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rawlings</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Toward a new paradigm for user interaction on the Semantic Web to support life sciences investigation</article-title>
          . To appear
          <source>in Proceedings of ISWCSWUI (CEUR)</source>
          (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>