<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic Representation of Preclinical Data in Radiation Oncology</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Olga Giraldo</string-name>
          <email>olga.giraldo@dkfz-heidelberg.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Abumansur Sabyrrakhim</string-name>
          <email>sabyrrakhim.amd@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mareike Roscher</string-name>
          <email>mareike.roscher@dkfz-heidelberg.de</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rosemarie Euler-Lange</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michael Baumann</string-name>
          <email>michael.baumann@dkfz-heidelberg.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ina Kurth</string-name>
          <email>ina.kurth@dkfz-heidelberg.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Wahyu Wijaya Hadiwikarta</string-name>
          <email>w.hadiwikarta@dkfz-hiedelberg.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Division of Radiooncology/Radiobiology, German Cancer Research Center (DKFZ)</institution>
          ,
          <addr-line>69120 Heidelberg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>German Cancer Consortium (DKTK), Core Center Heidelberg</institution>
          ,
          <addr-line>69120 Heidelberg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Heidelberg Institute of Radiation Oncology (HIRO)</institution>
          ,
          <addr-line>69120 Heidelberg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Service Unit for Radiopharmaceuticals and Preclinical Studies, German Cancer Research Center (DKFZ)</institution>
          ,
          <addr-line>69120 Heidelberg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Background: In radiation oncology, the data generated from preclinical trials serve as initial validation for treatment effectiveness and optimizing clinical approaches by unraveling molecular mechanisms underlying different treatment responses. Therefore, it is important to standardize the practice in managing preclinical trial data to ensure consistency and reproducibility across studies, promoting collaboration, and facilitating regulatory review. The primary goal of this work is to standardize the representation of data collected from preclinical radiobiology and radiation oncology studies as a way to facilitate knowledge discovery. To achieve this goal, we combined ontology with semantic Web techniques to publish mapped data and easily query them using SPARQL Protocol and RDF Query Language (SPARQL). Results: We expanded the Radiation Oncology Ontology (ROO) to include terminology related to the exposure of animal models to treatment, animal model's demographic characteristics; as well as clinical information in live animals. The extended ROO contains 123 new entities (89 classes, 29 data properties and 5 object properties). We combined the extended ontology with Semantic Web technologies to demonstrate how to integrate and query data from different relational databases. Discussion: The use of ontologies and semantic web tools are a way to comply to the FAIR principles. FAIR preclinical data improve collaboration, transparency, and reproducibility in radiotherapy research. 1 15th International Conference on Biological and Biomedical Ontology, July 17-19 2024, Enschede, The Netherlands ∗ Corresponding author.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>In radiation oncology, preclinical trials are conducted in animals prior to clinical trials to
evaluate the safety and efficacy of radiation therapy effects, taking into account various aspects
such as new radiation treatment techniques, radiation delivery methods, and novel therapeutic
agents. The data generated from preclinical trials are very important, because they serve as
initial validation for treatment effectiveness. Furthermore, considering the ethical and
economical aspects of performing animal studies, preclinical data are highly valuable.</p>
      <p>Therefore, it is important to standardize the practice in managing preclinical trial data to
ensure consistency and reproducibility across studies, which is critical for advancing scientific
knowledge, promoting collaboration, and facilitating regulatory review.</p>
      <p>
        Some efforts focused on standardization of preclinical data exist in the field. An example is
the Standard for Exchange of Nonclinical Data (SEND) from the Clinical Data Interchange
Standards Consortium (CDISC), aimed at standardizing collected individual animal data in
tabular data structures according to different nonclinical domains e.g., animal demographics,
laboratory test results, treatment procedures, etc.[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Conversely, ontologies such as the
Dependency Layered Ontology for Radiation Oncology (DLORO) [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], the Radiation Oncology
Structures Ontology (ROS) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], and the Radiation Oncology Ontology (ROO) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], were
developed for use cases in the clinical radiation oncology domain. Unfortunately, these
aforementioned ontologies were designed to support human clinical trials and their
relationships, therefore, unfortunately lack the required representation of preclinical data.
      </p>
      <p>
        The primary goal of this work is to standardize the representation of data collected from
preclinical radiobiology and radiation oncology studies as a way to facilitate knowledge
discovery. Formalized preclinical data will serve as a critical basis for the conduct and
interpretation of clinical trial data stored in the database system RadPlanBio [
        <xref ref-type="bibr" rid="ref5 ref7">5,7</xref>
        ]. The data are
stored according to the CDISC SEND standard. To achieve this goal, we propose populating the
ROO with terminologies related to the exposure of animal models to treatment, animal model’s
demographic characteristics; as well as clinical information in live animals. We decided to reuse
and extend the ROO because this ontology contains classes that cover the most common
concepts in radiation oncology, including oncological diseases, cancer staging systems, and
oncological treatments. To reuse and extend an existing ontology is in principle aligned to the
open world assumption (OWA).
      </p>
      <p>In this article we: i) present the materials and methods used to populate the ROO with
preclinical terminologies; ii) describe the validation and evaluation process of the ontology, iii)
show the current state of the extended ontology, and iv) conclude with the discussion and
outlook for future work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Materials and Methods</title>
      <sec id="sec-2-1">
        <title>2.1. Preclinical database</title>
        <p>
          As a use case, we analyze information from databases collected in a study focused on
investigating the effect of nimorazole combination treatments on hypoxic tumor areas in mice
[
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. The databases are available in the German Cancer Consortium (DKTK) RadPlanBio
platform, a web-based platform which supports the collection and the exchange of radiotherapy
research data in clinical and preclinical studies [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
        </p>
        <p>The databases analyzed in this work include &gt;2000 mice and contain information on: i)
demographic characteristics of each individual animal; ii) details of an animal’s exposure to
treatment; iii) body weights of animals during the study and at the end of the study; iv) diagnosis
of the cause of death of animals, and; v) laboratory test data per animal. The information in
these databases is organized and structured according to the format suggested by the SEND
standard for tabulation of nonclinical datasets.</p>
        <p>Due to the heterogeneous nature of the data, they provide a good validation for the extended
ROO. The extended ROO was applied to represent each value in the database and to map them
through the concepts available in the ontology.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. The ROO extension process</title>
        <p>The process of enriching and extending the Radiation Oncology Ontology (ROO) with
preclinical concepts consists of three steps: i) collection of preclinical concepts; ii) semantic
analysis of existing vocabularies, and; iii) ontology extension.</p>
        <p>
          In the first step, terms were collected from the preclinical databases mentioned in section
2.1. The next step was to identify reusable terminologies from other ontologies. BioPortal [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]
and the UMLS Metathesaurus Browser [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] were used throughout this stage to find references
and definitions for each terminology. In the last step, the ROO was extended with terminologies
that come primarily from the National Cancer Institute Thesaurus (NCIt) [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. New terms that
are not coming from existing ontologies use the prefix ‘roo’ and a local ID that starts with the
letters DKFZ followed by 6-digit numbers; as an example, the identifier for the class ‘Animal
Identifier’ is roo:DKFZ000006. Protégé v. 5.6.1 [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] was used to create new concepts and manage
the ontology.
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Ontology validation and evaluation</title>
        <p>The ontology validation procedure ensures that the ontology can effectively represent and
capture the knowledge and data from the preclinical relational databases. This validation
process involves mapping the elements (rows, columns, and values) of the database to the
concepts and properties (predicates) in the ontology. Figure 1 shows a correspondence between
the columns in the relational database and the ontology entities. At the top (rectangle a) the
hierarchical structure of the extended ROO is illustrated. White boxes represent existing
concepts from original ROO. Grey boxes represent new concepts proposed in this work.
(a)
(b)</p>
        <p>Animal
sty:T008</p>
        <p>Person
ncit:C25190
ROO classes
New classes
Instances</p>
        <p>Mouse
ncit:C14238
is a
Property
Mapping</p>
        <p>Entity
sty:T071
dkfz:Unique_Subject
_Identifier
“N150a009” rdf:type Subject Unique Identifier</p>
        <p>ncit:C69256
dkfz:Study_Identifier
dkfz:Age</p>
        <p>dkfz:Age_Unit
“Xeno Nimo Hypox;FaDu”
74.0^^xsd:decimal</p>
        <p>“days”</p>
        <p>Extended ROO
Identifier
ncit:25364
Subject Id</p>
        <p>N150a009
Preclinica(Mldoautsaebainsfeorsmcahteiomna) …</p>
        <p>Study identifier Age
Xeno Nimo Hypox;FaDu 74.0</p>
        <p>Age Unit
days</p>
        <p>The added concept “Mouse (ncit:C14238)” is a subclass of “Animal (sty:T008)” in description
logic syntax, it can be expressed as Mouse ⊑ Animal. In addition, mouse or person are
animals (Mouse ⊔ Person ⊑ Animal) and mouse is not a person (Mouse ⊑ ¬ Person).</p>
        <p>Boxes with rounded corners represent instances or individuals. Hierarchical relationships
(“is subclass of”) between classes, are represented by dotted arrows. Properties are represented
with arrows; they connect classes or instances between each other.</p>
        <p>At the bottom (rectangle b) demographic information about the mouse (e.g., age, age unit)
and the study identifier of which the animal was enrolled are presented as examples. In the
extended ROO, the column “Subject Id” is mapped to the concept “Subject Unique Identifier
(ncit:C69256)”. The link between a mouse and the subject identifier is the property “Unique
Subject Identifier (DKFZ000009)”. In description logic syntax, any mouse that has a unique
subject identifier can be expressed as Mouse ⊓ ∃ Unique_Subject_Identifier.⊤.</p>
        <p>
          Several languages and software tools are available to perform the mapping procedure from
relational databases to RDF triples [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. We use RDF Mapping Language (RML), an extension
of R2RML to map columns and rows of preclinical databases and our ontology. R2RML is a W3C
standard for mapping relational databases to RDF. RML follows exactly the same syntax as
R2RML; therefore, RML mappings are themselves RDF graphs [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. The stages we implemented
to generate linked data between the extended ROO and our preclinical relational databases are
illustrated in Figure 2 and explained below.
        </p>
        <sec id="sec-2-3-1">
          <title>Starting phase</title>
          <p>List of SPARQL
queries, Expected
outcomes, and
Ontology validation.</p>
        </sec>
        <sec id="sec-2-3-2">
          <title>Linked data generation</title>
          <p>RMLMapper.
In the first stage, we gathered a set of SPARQL queries and the corresponding expected
outcomes (triples and query result). We focused on the functional aspects that we wanted the
ontology to represent. The queries we gathered include, “retrieve the Subject Unique Identifier of
the animals tested”.</p>
        </sec>
      </sec>
      <sec id="sec-2-4">
        <title>2.3.2. Preparation</title>
        <p>As preparation, we exported the analyzed preclinical databases to CSV formats. Then, we
created turtle files specifying expected triples. Some of the expected triples we specified include,
&lt;N150a009&gt; rdf:type &lt;Subject Unique Identifier (ncit:C69256)&gt;; &lt;N150a009&gt; study identifier
(roo:DKFZ000008) &lt;Xeno Nimo Hypox;FaDu&gt;.</p>
        <p>30/09/2022</p>
        <p>List of RDF files.</p>
        <sec id="sec-2-4-1">
          <title>Output</title>
          <p>Page 6</p>
        </sec>
      </sec>
      <sec id="sec-2-5">
        <title>2.3.3. Linked data generation</title>
        <p>
          To generate linked data, we use the RMLMapper [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] which executes RML rules to achieve its
task. We used Docker [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] to run RMLMapper and storing data.
2.3.4. Output
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Results</title>
      <p>An ontology validation process was considered done with valid result if the generated outcomes
are not different from the expected outcomes. Therefore, in this stage the ontology evaluation
is done and we compare the generated triples with the expected triples specified in the
preparation step.</p>
      <sec id="sec-3-1">
        <title>3.1. Extended Radiation Oncology Ontology (ROO)</title>
        <p>
          The extended ROO contains 123 new entities (89 classes, 29 data properties and 5 object
properties). The new terminologies represent: i) attributes that are common across the used
databases e.g., “subject unique identifier (ncit:C69256)”, study identifier ( roo:DKFZ000008); ii)
demographic characteristics e.g., “strain (roo:DKFZ000041)”; iii) findings or information
collected during a study e.g., “body weight (roo:DKFZ000017)”, “cause of death
(roo:DKFZ000021)” and “clinical observation (roo:DKFZ000036)”; iv) exposure information e.g.,
“treatment name (roo:DKFZ000013)”, “route of administration (roo:DKFZ000011)” and
“treatment vehicle (roo:DKFZ00012)”. We followed the design principles from ROO. The
extended ROO is saved as OWL and available on GitHub [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ].
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Ontology validation and evaluation</title>
      </sec>
      <sec id="sec-3-3">
        <title>3.2.1. SPARQL queries</title>
        <p>
          The ontology represents and captures the knowledge and data from the preclinical relational
databases. The expected SPARQL queries were executed by using a Protégé desktop plug-in
that provides support for writing and executing SPARQL queries. All the queries returned the
expected results. The complete list of the queries is available on GitHub [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ].
        </p>
      </sec>
      <sec id="sec-3-4">
        <title>3.2.2. Linked data</title>
        <p>Based on Linked Data principles, an ontology enables semantic interoperability across
preclinical data available in relational databases. Our ontology facilitates data sharing and
transparent access to data. Figure 3 shows a database transformed into CSV format and the RDF
triples produced for the first and second subject Ids of the transformed database. The database
presents demographic information of two mice. The first mouse has the unique subject
identifier “N150a009”. The second mouse has the unique subject identifier “N150a011”. Both
were registered in the same study “Xeno Nimo Hypox.FaDu”. Each mouse was given an
identifier used within the study; “9” is the study identifier for the first mouse and “11” for the
second mouse. The age is available for the first mouse “74.0”; the age unit is “days”. Both mice
are male (represented as “0”) and belong to the strain/substrain “Nude Mouse”. As seen in Figure
3, the RDF triples obtained after running RMLMapper capture the data described above.</p>
        <p>
          CSV tables reflecting the content of the databases, the Turtle files specifying expected triples,
and the RDF files generated by running RMLMapper are available on GitHub [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ].
RDF triples produced for the first Subject Id of the database.
        </p>
        <p>Database transformed into CSV
RDF triples produced for the second Subject Id of the database.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Discussion and future work</title>
      <p>Semantic representation of preclinical data in radiobiology and radiation oncology involves
structuring and encoding information about demographic characteristics of animals, findings
and treatments in a machine-readable format that facilitates data integration, analysis, and
interpretation of outcomes, such as, overall survival or toxicities after treatment.</p>
      <p>
        To achieve this goal, we have expanded the ROO to describe preclinical data [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. This
ensures semantic interoperability and enabling integration with other datasets and knowledge
resources. Our extended ontology supports publishing preclinical data as linked data using RDF
to enable integration and interoperability with other datasets. The use of ontologies and
semantic web tools are a way of adhering to the FAIR principles [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. FAIR preclinical data
enhances collaborations, transparency, and reproducibility in preclinical research.
      </p>
      <p>In this work, we were able to map all the entities present in the analysed databases with
concepts and properties from the extended ROO. Nevertheless, it is not without limitations. The
extended ontology should be validated against other preclinical data to ensure robustness.
Additionally, improving the ontology extension strategy is crucial, e.g., by utilizing
owl:imports. Currently, the extension was performed manually, while preserving the existing
ROO entities to maintain the organizational structure of preclinical terminology derived from
our relational databases. Improving the extension strategy will address issues such as the lack
of unique URIs for the preclinical entities from the analyzed RDBs, and enabling to index the
ontology on a repository such as BioPortal. Further step includes testing our ontology against
competency questions that retrieve information from two or more databases and establish
interconnections. For example, “survival of mice when are exposed to a particular treatment such
as cisplatin”. Then will be to integrate the ontology to the semantic layer of the RadPlanBio
platform, through a knowledge graph to allow semantic querying, reasoning, and inference.
The final step will be to develop a plan to maintain the ontology over time. This plan will
involves addressing issues such as ontology evolution, version control, and alignment with
evolving domain knowledge.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>The first author of this paper has received funding from the European Federation for Cancer
Images (EUCAIM), Project ID: 101100633. The authors acknowledge the DKTK funding for the
operation of RadPlanBio platform in DKFZ. We also acknowledge Dr. Freddy Priyatna for his
support in the creation of mapping files and in the transformation/generation of triples. The
authors also wish to thank to Thomas Früchtel, Myta Pristanty and Betül Çakir, for the
digitization and standardization of the preclinical data, and the reviewers of this article for their
valuable comments.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>CDISC</given-names>
            <surname>SEND</surname>
          </string-name>
          <article-title>Standard</article-title>
          . URL: https://www.cdisc.org/standards/foundational/send
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Kalet</surname>
            <given-names>AM</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Doctor</surname>
            <given-names>JN</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gennari</surname>
            <given-names>JH</given-names>
          </string-name>
          , et al.
          <article-title>Developing Bayesian networks from a dependency-layered ontology: A proof-of-concept in radiation oncology</article-title>
          .
          <source>Med Phys</source>
          .
          <year>2017</year>
          . doi:
          <volume>10</volume>
          .1002/mp.12340.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Bibault</surname>
            <given-names>JE</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zapletal</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rance</surname>
            <given-names>B</given-names>
          </string-name>
          , et al.
          <article-title>Labeling for Big Data in radiation oncology: The Radiation Oncology Structures ontology</article-title>
          .
          <source>PLoS One</source>
          .
          <year>2018</year>
          . doi:
          <volume>10</volume>
          .1371/journal.pone.
          <volume>0191263</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Traverso</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>van Soest</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wee</surname>
            <given-names>L</given-names>
          </string-name>
          , et al.
          <article-title>The radiation oncology ontology (ROO): Publishing linked data in radiation oncology using semantic web and ontology techniques</article-title>
          .
          <source>Med Phys</source>
          .
          <year>2018</year>
          . doi:
          <volume>10</volume>
          .1002/mp.12879.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <article-title>[5] RadPlanBio</article-title>
          . URL: https://helmholtz.software/software/radplanbio
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Koi</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bitto</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weise</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          et al.
          <article-title>Prognostic biomarkers for the response to the radiosensitizer nimorazole combined with RCTx: a pre-clinical trial in HNSCC xenografts</article-title>
          .
          <source>J Transl Med</source>
          <volume>21</volume>
          ,
          <issue>576</issue>
          (
          <year>2023</year>
          ). https://doi.org/10.1186/s12967-023-04439-2
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>T.</given-names>
            <surname>Skripcak</surname>
          </string-name>
          et al.,
          <article-title>"Toward Distributed Conduction of Large-Scale Studies in Radiation Therapy and Oncology: Open-Source System Integration Approach,"</article-title>
          <source>in IEEE Journal of Biomedical and Health Informatics</source>
          , vol.
          <volume>20</volume>
          , no.
          <issue>5</issue>
          , pp.
          <fpage>1397</fpage>
          -
          <lpage>1403</lpage>
          , Sept.
          <year>2016</year>
          , doi: 10.1109/JBHI.
          <year>2015</year>
          .
          <volume>2450833</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Whetzel</surname>
            <given-names>PL</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Noy</surname>
            <given-names>NF</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shah</surname>
            <given-names>NH</given-names>
          </string-name>
          , et al.
          <article-title>BioPortal: enhanced functionality via new Web services from the National Center for Biomedical Ontology to access and use ontologies in software applications</article-title>
          .
          <source>Nucleic Acids Res</source>
          .
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Bodenreider</surname>
            <given-names>O.</given-names>
          </string-name>
          <article-title>The Unified Medical Language System (UMLS): integrating biomedical terminology</article-title>
          .
          <source>Nucleic Acids Res</source>
          .
          <year>2004</year>
          . doi:
          <volume>10</volume>
          .1093/nar/gkh061.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>NCI</surname>
          </string-name>
          <article-title>Thesaurus (NCIt)</article-title>
          . URL: https://ncithesaurus.nci.nih.gov/ncitbrowser/
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Protégé</surname>
          </string-name>
          . URL: https://protege.stanford.edu/
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Hert</surname>
            <given-names>M.</given-names>
          </string-name>
          et al.
          <article-title>A comparison of RDB-to-RDF mapping languages</article-title>
          , in (ACM Press;
          <year>2011</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>RDF</given-names>
            <surname>Mapping</surname>
          </string-name>
          <article-title>Language (RML)</article-title>
          . URL: https://rml.io/specs/rml/
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <article-title>RMLMapper</article-title>
          . URL: https://github.com/RMLio/rmlmapper-java
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Docker</surname>
          </string-name>
          . URL: https://www.docker.com
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <article-title>ROOext.owl</article-title>
          . URL: https://github.com/DKFZ-E220/ROOx/blob/main/ROOext_v0.1.owl
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <article-title>SPARQL queries from ROOx</article-title>
          . URL: https://github.com/DKFZE220/ROOx/blob/main/SPARQL%20queries%
          <article-title>20examples%20from%20ROOx</article-title>
          .txt
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <article-title>Linked data</article-title>
          . URL: https://github.com/DKFZ-E220/ROOx/tree/main/mapping_ROOx_v0.
          <fpage>1</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Wilkinson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumontier</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aalbersberg</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          et al.
          <article-title>The FAIR Guiding Principles for scientific data management and stewardship</article-title>
          .
          <source>Sci Data</source>
          <volume>3</volume>
          ,
          <issue>160018</issue>
          (
          <year>2016</year>
          ). https://doi.org/10.1038/sdata.
          <year>2016</year>
          .18
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>