<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Knowledge Management System in Preclinical Radiooncology / Radiobiology Research</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Wahyu W. Hadiwikarta</string-name>
          <email>w.hadiwikarta@dkfz-heidelberg.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nadja Ebert</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mareike Roscher</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ina Kurth</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michael Baumann</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Radiotherapy and Radiation Oncology, Faculty of Medicine and University Hospital Carl Gustav Carus, Technische Universität</institution>
          ,
          <addr-line>Fetscherstraße 74, 01307 Dresden</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Division of Radiooncology / Radiobiology, Deutsches Krebsforschungszentrum - German Cancer Research Center</institution>
          ,
          <addr-line>Im Neuenheimer Feld 280, 69120 Heidelberg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Animal research is integral in the settings of clinical research as it provides preclinical evidence to support clinical studies. Unfortunately, these preclinical information and knowledge are not always readily available and presented through a proper knowledge management system. In this poster publication, a concept for such system, that is able to support radiotherapy research for cancer treatment is described. To optimize the value of the data, the concept of Linked Data and the Semantic Web technology is utilized and the interface for scientific users to query through standard SQL query and the more advanced SPARQL query is made available. External applications can connect through WEB-API support of the system. Type of submission. This poster describes a concept of a software system.</p>
      </abstract>
      <kwd-group>
        <kwd>cancer</kwd>
        <kwd>preclinical</kwd>
        <kwd>animal experiment</kwd>
        <kwd>knowledge management</kwd>
        <kwd>semantic web</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Radiooncology and radiobiology research works on the translational aspects of
integrating biological findings of cellular radiotherapy responses into clinical
radiooncology studies and to uncover the biological mechanisms behind the observed clinical
responses of radiotherapy or combined radiotherapy treatments. Henceforth, knowledge
of preclinical research from animal experiments is one of the most valuable assets,
particularly when aggregated e.g. compared and combined, with clinical radiotherapy data.
From animal welfare aspects and the fastest possible use of research data for clinical
applications, it is absolutely essential and necessary to extract as much information as
possible from a preclinical translational dataset. Such a knowledge base offers
continuity and data security; complete data on a trial is obtained and serve as a planning basis
for future trials, avoids redundant experiments and provides a data pool for novel
reanalysis. Institutional-level to multicentric pooling of raw preclinical data become an
important activity for the development of an application-specific prediction model.
Here, the concept of Linked Data and the Semantic Web technology may be utilized.
2
2.1</p>
    </sec>
    <sec id="sec-2">
      <title>State of the Art</title>
      <p>
        To observe the current state of the art, a bibliometric analysis tool, i.e. bibliometrix [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ],
was employed to analyze a record of publications downloaded from Web of Science
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. This collection was characterized based on the keywords preclinical, cancer and
database. These words were chosen because a preclinical cancer database is the
foundation for subsequent implementation of knowledge management strategies in
preclinical cancer research.
      </p>
      <p>
        By running the words co-occurrence analysis on the words category ‘Keywords
Plus’ [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], we found that the word mouse model that represents a keyword in preclinical
research is an isolated node and in a distant range from another important word
database. This is shown in Figure 1. The word cancer is the central concept in the overall
collected publications. This result provides an information that in the selected published
works, preclinical research and preclinical database development are not commonly
associated concepts, let alone the development of a knowledge base in preclinical
research. This is in accordance with what we have observed and experienced in the past
years. We are aware of the existence of the Animal Study Registry (ASR) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], launched
in January 2019. It is operated by the German Centre for the Protection of Laboratory
Animals (Bf3R) and the German Federal Institute for Risk Assessment (BfR).
However, there is still an urgent need for a knowledge management system that go beyond
a registration system, especially in the field of cancer radiotherapy and radiobiology.
This increases the motivation for the research and development of a preclinical
knowledge management system as described in this publication.
2.2
      </p>
      <sec id="sec-2-1">
        <title>Ontology</title>
        <p>
          Ontology is central in the development and utilization of a knowledge base. The Open
World Assumption (OWA) highlights the importance of reusability of domain specific
ontologies. Dekker et al. [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] proposed the Radiation Oncology Ontology (ROO)
optimized for clinical radiotherapy cancer research. Upon examination, this is currently one
of the many ontologies that is proximal into our use-case. Nevertheless, development
of extension is required to accommodate preclinical research settings.
2.3
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Clinical Trial Platform RadPlanBio</title>
        <p>
          The German Cancer Consortium (DKTK) as a joint initiative of more than 20
institutions and university hospitals, under the umbrella of the German Cancer Research
Center (DKFZ) has started the project for the development of a clinical trial data
management platform ‘RadPlanBio’ to support multicentric cancer radiotherapy studies in
2012 [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. In the time of this writing, the platform is running dozens of multicentric
clinical studies across DKTK sites and external partners beyond Germany. Currently,
it is under development to expand its support for preclinical studies data management.
The knowledge management system that is described in this poster publication will be
a further expansion to the RadPlanBio system.
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Concepts</title>
      <p>The knowledge management system (see Figure 2 for the concept illustration of the
architecture) that currently under development is comprised of three central
repositories. The first repository is a relational database system that stores the preclinical
datasets in digital format. Currently, efforts of digitization from paper are done through
manual labor. The scheme in this relational database accommodates the requirement
that data need to be stored in a structure reflecting the uniqueness of each project study
protocol, the category of tumor model and the model of treatment. Each of the entered
data, models an animal subject tagged by a unique subject identifier. The identifier will
allow identification not just of the unique subject, but also of the treatment cohort and
the type of treatment or drug combination used for this particular unique mouse subject.</p>
      <p>The second repository is a metadata repository. To ensure consistency of the study
parameters and their meaning across projects, we employ a metadata repository that
operates as a reference of registered annotations for the preclinical parameters used in
a study. Knowing that different study projects can have different parameters, including
novel parameters, the metadata repository is constantly updated for new information.</p>
      <p>The third repository is a semantic repository comprising RDF stores exposing the
preclinical datasets to a set of ontologies that links multiple data sources e.g. projects,
that will allow for potential new knowledge discovery. The semantic repository
supports the implementation formats of the RDF/RDFS/OWL standards.</p>
      <p>To interact with the repositories, the system supports a user web interface as an
application accessible through the web browser which will allow a scientific user to do a
query on the database by using SQL query and in the same way to the knowledge base
by using SPARQL query. To increase usability on the interface, a translator from
natural language i.e. English, to SPARQL will be utilized. The system will also have
WEB-API support for external applications to connect and to access the data and the
knowledge base.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>Preclinical research data has proven to be indispensable in translational cancer research.
Unfortunately, guidelines for proper practice of preclinical cancer data and knowledge
management in radiotherapy are scarce. Meanwhile, animal research itself has been
under pressure over the last years. Every time new regulations come out, the procedure
and the required ethical administration to do animal experiment become more stringent.
Hence, there is an urgency to have a preclinical knowledge management system that
may support future preclinical research in cancer radiotherapy. The existence of such
system may potentially reduce the scale of required preclinical study results and hence
the number of animals needed as redundant experiments no longer occur and even more
prediction for radiotherapy responses can be acquired through machine-supported
analysis on the knowledge base.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Aria</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cuccurullo</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <article-title>: bibliometrix: An R-tool for comprehensive science mapping analysis</article-title>
          ,
          <source>Journal of Informetrics</source>
          ,
          <volume>11</volume>
          (
          <issue>4</issue>
          ),
          <fpage>959</fpage>
          -
          <lpage>975</lpage>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. Web of Science Homepage, http://www.webofknowledge.com,
          <source>last accessed</source>
          <year>2020</year>
          /05/14.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3. Web of Science Support Page, https://support.clarivate.com/ScientificandAcademicResearch/s/article/KeyWords-Plus-
          <article-title>generation-creation-and-changes?language=en_US, last accessed</article-title>
          <year>2020</year>
          /08/14.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Bert</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heinl</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chmielewska</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schwarz</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grune</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hensel</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Greiner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schönfelder</surname>
          </string-name>
          , G.:
          <article-title>Refining animal research: The Animal Study Registry</article-title>
          ,
          <source>PloS Biology</source>
          ,
          <volume>17</volume>
          (
          <issue>10</issue>
          ),
          <year>e3000463</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Traverso</surname>
            , A., van Soest,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wee</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dekker</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>The radiation oncology ontology (ROO): Publishing linked data in radiation oncology using semantic web and ontology techniques</article-title>
          ,
          <source>Medical physics</source>
          ,
          <volume>45</volume>
          (
          <issue>10</issue>
          ),
          <fpage>e854</fpage>
          -
          <lpage>e862</lpage>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Skripcak</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          , et. al.:
          <article-title>Creating a data exchange strategy for radiotherapy research: towards federated databases and anonymised public datasets</article-title>
          ,
          <source>Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology</source>
          ,
          <volume>113</volume>
          (
          <issue>3</issue>
          ),
          <fpage>303</fpage>
          -
          <lpage>309</lpage>
          (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>