<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>DC-THERA Directory, a Knowledge Management System for the support of the European Dendritic Cell Immunology Community</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marco Brandizi</string-name>
          <email>marco@leafbioscience.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michaela Gündel</string-name>
          <email>michaela.guendel@leafbioscience.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ciro Scognamiglio</string-name>
          <email>ciro.scognamiglio@leafbioscience.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrea Splendiani</string-name>
          <email>andrea@leafbioscience.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Leaf Bioscience s.r.l.</institution>
          ,
          <addr-line>Via G. Puccini 3, 20121 Milan</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>DC-THERA Directory is a web portal to support collaboration, communication and knowledge sharing within DC-THERA, a community focused on immunology. We show how we have faced the problem of representing and managing highly heterogeneous and interconnected knowledge. One aspect of the application interface is the search and navigation through web ontologies. Another aspect is the dynamic representation of information entities having variable sets of properties. These results have been achieved by adopting a modelling approach that combines traditional object-oriented modelling with a triple-based knowledge representation. We discuss advantages of such an approach, especially for what concerns the possible future integration of other information sources.</p>
      </abstract>
      <kwd-group>
        <kwd>Immunology Community</kwd>
        <kwd>Knowledge Management Systems</kwd>
        <kwd>Bio-Ontologies</kwd>
        <kwd>Dendritic Cells</kwd>
        <kwd>On Line Communities</kwd>
        <kwd>Semantic Web</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Knowledge Management Systems based on web technologies are tools widely used
for promoting collaboration and information flow among organizations, including
scientific communities [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Moreover, standard terminology and formal ontologies in
knowledge applications are particularly important for the life sciences domain [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], as
this is characterised by highly heterogeneous and interconnected information. These
technologies are also useful for supporting cooperation in communities put together
by research projects [
        <xref ref-type="bibr" rid="ref3 ref4">3,4</xref>
        ]. An example of that is DC-THERA1, a European project,
focusing on integrating a large community of researchers and therapists on dendritic
cell research, an important topic in immunology.
      </p>
      <p>In the DC-THERA context, people often need to answer questions like: are certain
entities (e.g., blood samples of a certain kind or a purification method) available in the
network? Who is maintaining them? Who is expert in their usage? How can I contact
them? These requirements are addressed in the DC-THERA Directory, a web-based
knowledge management system that allows to collect summary information about the
bio-medical resources available among the DC-THERA researchers. The Directory
eases knowledge sharing and exchange by providing a single, unified access point to a
variety of bio-entities. It promotes collaboration by representing links between
bio-resources and people who are related to them (for example, someone having experience
in applying a given laboratory protocol, or someone else studying a certain data set).
1 http://www.dc-thera.org
Furthermore, by integrating other biological repositories (e.g., about specific
experimental data, or about scientific publications), it allows to narrow down an initial
search or find desired details that are managed outside the scope of the Directory
itself. Finally, having an electronic repository where to store reference information
about the work and achievements produced in the context of an EU-funded research
project helps in not dispersing such information and potentially keeping it available to
the general public.</p>
    </sec>
    <sec id="sec-2">
      <title>The Directory</title>
      <p>The application currently supports the main categories visible in fig. 1. Clicking on
one of them, or their sub-categories, a list of existing resources is shown, together
with a taxonomy of sub-types defined for that category, which is built based on the
underlying ontologies. From here, it is possible to open the details form of a particular
resource in a “subject centric” view, that puts together resource properties, such as
title or description, and links to other items (fig. 1, bottom). Both the details view and
the list of properties can flexibly vary, depending on the specific resource.</p>
      <p>
        Another use case is the classical keyword-based search. An auto-completion
feature suggests search hints while typing, by dynamically querying (via AJAX) the
knowledge base and proposing expressions such as term variants, associated
keywords and synonyms (again, the knowledge contained in the ontologies is used for
that). Results are presented by highlighting the terms in the text. Depending on the
resource category that is queried, additional related external data are shown (e.g.: list of
publications in fig. 1; gene expression information, coming from EBI's Gene
Expression Atlas2 or BASE installations [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]).
      </p>
      <p>
        Architecture and Implementation. DC-THERA Directory has been designed by
adopting a modelling approach that is similar to the one proposed in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. On the one
hand, an object model has been defined, implemented in PHP (fig. 2), containing
classes for the main resource types and defining general and specific properties in
toplevel and pertinent classes respectively. On the other hand, the representation of
property/value pairs and helpers for querying the knowledge base by using triple patterns
has been embedded in the object model (e.g.: &lt;r, p, *&gt; finds all resources that are
prelated to r).
      </p>
      <p>
        This “mixed” model makes use of the advantages of both the world of structured
object-oriented architecture and the world of semi-structured “RDF-like” triples. The
PHP classes can contain implementation-specific code, such as methods invoked by
the Symfony3 framework (e.g.: the Dataset class in fig. 2). The “triplified” part allows
to dynamically define any set of properties and links for a resource, e.g.,
characterising the resource with terms from external ontologies, or embedding triple statements
obtained by external services. For example, we have selected and imported those
Open Biomedical Ontologies [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] that are suitable to represent the kind of information
2 http://www.ebi.ac.uk/gxa
3 http://www.symfony-project.org
currently available in the Directory and the tackled domain – especially OBI, the
Ontology for Biomedical Investigations4.
      </p>
      <p>
        Discussion and future work
A first stable version of the Directory was released to the project's participants in July
2009. Before, a test had been done with a selected panel of pilot users, who helped in
evaluating and further improving the quality of the tool and its contents by answering
a questionnaire (unpublished). This had confirmed that the application and the way it
has been developed is found useful by the users. As an example of improvement
suggested by the survey, we are using the BioLexicon [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] text mining tool to extend
ontology classes with term variants, extracted from literature text mining. This will
improve keyword-based searches, which already use ontologies for aspects like
synonyms or semantically close terms.
      </p>
      <p>
        Another feature under development is an export of the Directory content to the
RDF/OWL format. This could, for instance, be useful to integrate and compare the
knowledge base with similar on line resources, to find useful information without the
need to query many repositories one by one [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Another example is comparing the
Directory by semantic similarity, analysing ontologically related terms [10].
      </p>
      <p>In conclusion, as the first user feedback suggests, the DC-THERA Directory
provides an effective way to make available the knowledge asset developed in the
five-year project time. As far as we know, it is the first time that the hereby described
hybrid modelling approach is used to build a rapid web development infrastructure.
This allows both the development of similar bio-medical applications and, thanks to
the fact it is compatible with Semantic Web technologies, to integrate them together.
4 http://purl.obofoundry.org/obo/obi</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Das</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , et al.:
          <article-title>Building biomedical web communities using a semantically aware content management system</article-title>
          .
          <source>Brief Bioinform</source>
          ;
          <volume>10</volume>
          (
          <issue>2</issue>
          ):
          <fpage>129</fpage>
          -
          <lpage>38</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Coskun</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , et al.:
          <article-title>Towards Corporate Semantic Web: Requirements</article-title>
          and
          <string-name>
            <given-names>Use</given-names>
            <surname>Cases</surname>
          </string-name>
          . Freie Universität Berlin,
          <source>Tech Rep TR-B-08-09</source>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Clark</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kinoshita</surname>
          </string-name>
          , J.:
          <article-title>Alzforum and SWAN: The Present and Future of Scientific Web Communities</article-title>
          .
          <source>Brief. in Bioinformatics</source>
          <volume>8</volume>
          (
          <issue>3</issue>
          ):
          <fpage>163</fpage>
          -
          <lpage>171</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Gaines</surname>
            ,
            <given-names>B. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shaw</surname>
            ,
            <given-names>M. L. G.</given-names>
          </string-name>
          :
          <article-title>Knowledge management for research communities</article-title>
          .
          <source>Proc AAAI Spring Symposium on A.I. in Knowledge Management</source>
          , Stanford University, pp
          <fpage>55</fpage>
          -
          <lpage>62</lpage>
          (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Saal</surname>
            ,
            <given-names>L. H.</given-names>
          </string-name>
          , et al.:
          <article-title>BioArray Software Environment: A Platform for Comprehensive Management and Analysis of Microarray Data</article-title>
          .
          <source>Genome Biology</source>
          <volume>3</volume>
          (
          <issue>8</issue>
          ):
          <year>software0003</year>
          .
          <fpage>1</fpage>
          -
          <issue>0003</issue>
          .6 (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Puleston</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , et al.:
          <article-title>Integrating object-oriented and ontological representations: A case study in Java and OWL</article-title>
          .
          <source>The Semantic Web - ISWC</source>
          <year>2008</year>
          , pp.
          <fpage>130</fpage>
          -
          <lpage>145</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , et al.:
          <article-title>The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration</article-title>
          .
          <source>Nat Biotechnol</source>
          . (
          <volume>11</volume>
          ):
          <fpage>1251</fpage>
          -
          <lpage>5</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Rebholz-Schuhmann</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , et al.:
          <article-title>BioLexicon: Towards a Reference Terminological Resource in the Biomedical Domain</article-title>
          .
          <source>Proc ISMB-2008</source>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Heng</surname>
            ,
            <given-names>T. S. P.</given-names>
          </string-name>
          , et al.:
          <article-title>The Immunological Genome Project: networks of gene expression in immune cells</article-title>
          .
          <source>Nat Immunology</source>
          <volume>9</volume>
          ,
          <fpage>1091</fpage>
          -
          <lpage>1094</lpage>
          (
          <year>2008</year>
          )
          <fpage>10</fpage>
          .
          <string-name>
            <surname>Pedersen</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          , et al.:
          <article-title>Measures of semantic similarity and relatedness in the biomedical domain</article-title>
          .
          <source>J of Biomedical Informatics</source>
          , Vol.
          <volume>40</volume>
          , Issue 3. (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>