<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Ontological Modelling and Reasoning of Phenotypes</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Alexandr UCITELI</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Christoph BEGER</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Toralf KIRSTEN</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Frank A. MEINEKE</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Heinrich HERRE</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Faculty of Applied Computer and Biological Sciences, University of Applied Sciences Mittweida</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Growth Network CrescNet, University of Leipzig</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Institute for Medical Informatics, Statistics and Epidemiology (IMISE), University of Leipzig</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The successful determination and analysis of phenotypes plays a key role in the diagnostic process, the evaluation of risk factors and the recruitment of participants for clinical and epidemiological studies. The development of computable phenotype algorithms to solve these tasks is a challenging problem, caused by various reasons. Firstly, the term 'phenotype' has no generally agreed definition and its meaning depends on context. Secondly, the phenotypes are most commonly specified as non-computable descriptive documents. Recent attempts have shown that ontologies are a suitable way to handle phenotypes and that they can support clinical research and decision making. The SMITH Consortium is dedicated to rapidly establish an integrative medical informatics framework to provide physicians with the best available data and knowledge and enable innovative use of healthcare data for research and treatment optimization. In the context of a methodological use case “phenotype pipeline” (PheP), a technology to automatically generate phenotype classifications and annotations based on electronic health records (EHR) is developed. A large series of phenotype algorithms will be implemented. This implies that for each algorithm a classification scheme and its input variables have to be defined. Furthermore, a phenotype engine is required to evaluate and execute developed algorithms. In this article we present a Core Ontology of Phenotypes (COP) and a software Phenotype Manager (PhenoMan), which implements a novel ontology-based method to model and calculate phenotypes. Our solution includes an enhanced iterative reasoning process combining classification tasks with mathematical calculations at runtime. The ontology as well as the reasoning method were successfully evaluated based on different phenotypes (including SOFA score, socioeconomic status, body surface area and WHO BMI classification) and several data sets.</p>
      </abstract>
      <kwd-group>
        <kwd />
        <kwd>Phenotype definition</kwd>
        <kwd>phenotype classification</kwd>
        <kwd>phenotype calculation</kwd>
        <kwd>phenotype ontology</kwd>
        <kwd>phenotype reasoning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Despite its long ago introduction in 1909 by Wilhelm Johannsen, the term ‘phenotype’
still has no generally agreed definition [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Usually, a phenotype is considered as an
observable characteristic or trait of an organism, such as its morphology, function,
behaviour, or its biochemical and physiological properties [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1–3</xref>
        ]. Correct determination
of phenotypes plays a key role for diagnosis of diseases, evaluation of risk factors and
recruitment of patients for clinical and epidemiological studies [
        <xref ref-type="bibr" rid="ref4 ref5">4,5</xref>
        ]. One challenge is to
translate phenotype algorithms, which “are most commonly represented as
noncomputable descriptive documents and knowledge artifacts” [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], into machine-readable
form. Recent attempts have shown that ontologies are suitable to handle phenotypes and
that they can support clinical research and decision making [
        <xref ref-type="bibr" rid="ref7 ref8 ref9">7–9</xref>
        ].
      </p>
      <p>
        The main goal of the German Medical Informatics Initiative (MII) [
        <xref ref-type="bibr" rid="ref10 ref11">10,11</xref>
        ] is making
clinical data available for research. Most German university hospitals participate in one
of the four funded consortia. Smart Medical Information Technology for Healthcare
(SMITH) is one of these consortia [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Within the ongoing SMITH project, a
phenotyping pipeline (PheP) will be established to systematically develop, evaluate and
execute validated algorithms and models for classifying and annotating patient data
based on routine EHR. These annotations and derivatives will be provided for triggering
alerts and actions, data sharing and deep analyses of patient care and outcomes.
Phenotype engines and factories are required as an overall infrastructure to specify, set
up and execute phenotype algorithms.
      </p>
      <p>In this article, we propose a novel ontology-based method to model and calculate
phenotypes. Our approach provides an extended reasoning combining phenotypic data
to derive complex phenotypes based on calculations and classifications. The developed
tools are designed to work as phenotype engine and factory in SMITH context.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Methods</title>
      <p>This section outlines the embedding of the PhenoMan in the SMITH infrastructure
(Figure 1).</p>
      <p>
        The required EHR data will be integrated in a Health Data Storage (HDS) in a
standardized manner based on HL7 FHIR [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Structured data from different source
systems in hospitals as well as unstructured documents are taken into account. Natural
Language Processing (NLP) techniques are used to extract and transform relevant data
from unstructured EHR documents into structured form. For the specification of the HDS
schema (i.e., metadata including single data elements, data element groups, value sets,
referenced terminologies, etc.) required to transform and integrate data from various
sources, the software ART-DECOR® [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] is used. ART-DECOR® is an open-source
tool suite that enables creation and maintenance of HL7 templates, value sets, scenarios
and data sets and supports, inter alia, FHIR capabilities.
      </p>
      <p>
        The PhenoMan imports the data elements from ART-DECOR® and inserts them
into the ontology. The phenotype designer uses the Phenotype Editor to develop
phenotype algorithms/models based on the source data elements. Each phenotype
algorithm is saved as a Phenotype Algorithm Specification Ontology (PASO) by
PhenoMan. For the communication with the FHIR Server, the PhenoMan Service is
established, which encapsulates the PhenoMan API. The service generates subscriptions
(rest-hook) [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] for each PASO and transmits them to the FHIR Server. As soon as FHIR
resources (e.g., patient or observation resources) are present that fulfil the criterion of a
subscription (e.g., after update or create), the FHIR Server sends the resources to the
PhenoMan Service. Additionally, the PhenoMan Service can request further resources
(e.g., observations, conditions or medications) required for phenotypes
calculation/reasoning. After receiving required resources, the PhenoMan Service
calculates phenotypes (using PhenoMan API and PASOs) and writes the results as
observation resources back to the FHIR Server. For the specification of the subscription
criteria and querying the FHIR Server, FHIR Search [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] is used.
      </p>
      <p>This work focusses on the ontology-based modelling and reasoning of phenotypes
using PhenoMan. The SMITH infrastructure components as well as the integration of
PhenoMan in SMITH will be described in details in further papers.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Results</title>
      <sec id="sec-3-1">
        <title>3.1. Core Ontology of Phenotypes (COP)</title>
        <p>
          We developed the Core Ontology of Phenotypes (COP, Figure 2) to model, classify and
calculate phenotypes based on instance data sets (e.g., of a patient). In this article, we
consider a phenotype as an individual (in sense of General Formal Ontology, GFO [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]),
for example, the weight of a specific person. Hereinafter, abstract instantiable entities
that are instantiated by phenotypes are called phenotype classes. For instance, the
abstract property ‘weight’ possess individual weights as instances. We distinguish
between single and composite properties (traits), and correspondingly, between single
and composite phenotypes. A composite property is defined as a property that has single
properties as parts [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. Based on the definitions of single and composite properties [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ],
we define single phenotypes as single properties (e.g., age, weight, height) and composite
phenotypes as composite properties (e.g., height and weight, BMI, SOFA score [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]) of
an organism 2 or of one of its subsystems. Composite phenotypes are divided into
combined and derived phenotypes. A combined phenotype is only a combination of
corresponding phenotypes (e.g., a combination of height and weight), whereas a derived
phenotype is an additional property (e.g., BMI) derived from the corresponding
phenotypes (height and weight). In the framework of GFO we modelled properties or
traits using the class gfo:Property. In the present article, composite phenotype classes are
modelled using a Boolean expression based on has_part relation (e.g., weight and height:
has_part some height and has_part some weight). Derived phenotype classes additionally
define a calculation rule/mathematical formula (e.g., BMI = weight[kg] / height[m]²).
Furthermore, combined phenotype classes can associate certain conditions with specific
predefined values (scores), which can be used, e.g., in further formulas. For example, if
bilirubin value is greater than 12 mg/dl, then the value 4 is used for the calculation of the
SOFA score [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ].
        </p>
        <p>Additionally, we distinguish between restricted and non-restricted phenotype classes,
depending on whether their extensions (set of instances) are restricted to a certain range
of individual phenotypes by defined conditions or all instances are allowed. For example,
the phenotype class ‘age’ is instantiated by the ages of all living beings (non-restricted),
whereas the phenotype class ‘young age’ is instantiated by the ages of the young ones,
e.g., if the age is below 30 years (restricted).</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Phenotype Algorithm Specification Ontologies (PASO)</title>
        <p>Specific phenotypes (algorithms) are modelled in Phenotype Algorithm Specification
Ontologies (PASO)3 using the COP. PASOs are embedded in the COP in such a way that
the classes of the PASO are subclasses of the COP classes. Every PASO subclass of the
COP classes cop:Single_Phenotype, cop:Combined_Phenotype or
cop:Derived_Phenotype is a phenotype class and is instantiated by phenotypes. The
direct subclasses are non-restricted (e.g., Bilirubin, Figure 4), while the subclasses of
the non-restricted phenotype classes are restricted (e.g., Bilirubin_s_ge_2_0_l_6_0, i.e.,
bilirubin between 2 and 6 mg/dL).</p>
        <p>Phenotype classes possess various common attributes (e.g., labels, descriptions and
links to external concepts). Other attributes vary depending on the type of the phenotype
2 Properties of an organism are considered as all documentable information about it, whereby the
modeller is left to decide what is relevant to the current situation.</p>
        <p>3 A PASO is not a usual domain ontology describing a domain by suitable concepts, different relations
between them and axioms (like "patient is treated in some hospitals", "patient has some diseases" or "disease
was diagnosed by some doctors"). The main purpose of a PASO is to efficiently model concrete phenotypes
(algorithms) that should be calculated by the software based on relevant patient characteristics.
class. Non-restricted single phenotype (NSiP) classes, for example, define the datatype,
a unit of measure and an optional aggregate function; non-restricted derived phenotype
(NDeP) classes – a mathematical formula; restricted single (RSiP) and derived
phenotype (RDeP) classes – a restriction; and restricted combined phenotype (RCoP)
classes – an optional score value. The logical relations between phenotype classes as well
as range restrictions are represented in OWL by anonymous equivalent classes or general
class axioms based on property restrictions.</p>
        <p>
          The modelling procedure is illustrated by means of an example for calculating the
SOFA (Sequential (or Sepsis-related) Organ Failure Assessment) score [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. The SOFA
score plays an important role in medicine to quantitatively describe the degree of multiple
organ dysfunction/failure over time in patients. The total score is calculated as a sum of
the 6 single organ scores (respiration, coagulation, liver, cardiovascular, central nervous
system and renal). Each single organ score may take values from 0 (normal) to 4 (most
abnormal), so that the maximum SOFA score is 24 (Figure 3).
        </p>
        <p>First, we model the NSiP classes, e.g., Bilirubin, Dopamine and Eye_Opening
representing single patient characteristics relevant for calculating the SOFA score as
subclasses of cop:Single_Phenotype (Figure 4). Labels, descriptions, related concepts,
etc. can be specified as annotations. Next, the RSiP classes (e.g.,
Bilirubin_s_ge_2_0_l_6_0, Dopamine_s_g_5_0_le_15_0 or
Eye_opening_to_verbal_command) for value ranges are defined as subclasses of the
NSiP classes. For every RSiP class, the anonymous equivalent class is created that
represents the corresponding restriction (Figure 4: B, C). The single organ scores can be
modelled using combined phenotype classes. For each score a subclass of
cop:Combined_Phenotype is defined (e.g., SOFA_Liver_Score,
SOFA_Cardiovascular_System_Score or GCS_Eye_Opening_Score). The subclasses of
these non-restricted combined phenotype (NCoP) classes represent the single score
values (e.g., SOFA_Cardiovascular_System_Score_3). These classes reference the
corresponding RSiP range classes using a general class axiom and define the score values
(Figure 4: D1, D2).</p>
        <p>
          The score for nervous system, the Glasgow Coma Scale (GCS) [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ], is calculated as
a sum of three single scores “Eye opening”, “Verbal response” and “Motor response”.
We model the GCS as a NDeP class. The formula is defined as annotation using the
names of NCoP classes (Figure 4: E). Now, the RDeP classes for GCS ranges are defined
(e.g., GCS_Score_s_ge_10_0_le_12_0). Then, the overall nervous system score is
modelled as NCoP class SOFA_Nervous_System_Score with RCoP classes (e.g.,
SOFA_Nervous_System_Score_3), which reference the GCS range classes and define
the score values.
        </p>
        <p>The final step is to define the SOFA score as NDeP class and to specify the formula
‘SOFA_Cardiovascular_System_Score + SOFA_Coagulation_Score +
SOFA_Kidneys_Score + SOFA_Liver_Score + SOFA_Nervous_System_Score +
SOFA_Respiratory_System_Score’.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Phenotype Manager (PhenoMan)</title>
        <p>We developed the software Phenotype Manager (PhenoMan), which implements a
multistage reasoning approach combining standard reasoners (e.g., Pellet or HermiT) and
mathematical calculations. This section briefly outlines the main ideas of our solution
based on the example from section 3.2.</p>
        <p>First, an instance data set received from the FHIR Server as FHIR resources (Figure
5: A-C) is interpreted by PhenoMan and inserted into the ontology. On the one hand, the
individual properties (single phenotypes) are inserted as instances of the direct subclasses
of cop:Single_Phenotype (Bilirubin, Dopamine, Eye_Opening, etc.) and the values are
modelled as property assertions based on the has_value relation (e.g., “has_value 10” for
Dopamine). On the other hand, a composite phenotype is defined as instance of the class
cop:Composite_Phenotype, which combines all the single phenotype instances using
property assertions based on has_part relation. In the first step (classification step), a
standard reasoner classifies the single phenotype instances in restricted classes. In our
example, the instance of Eye_Opening is classified in the class
Eye_opening_to_verbal_command, the instance of Bilirubin – in the class
Bilirubin_s_ge_2_0_l_6_0 (i.e., the Bilirubin value is &gt;= 2.0 and &lt; 6.0 mg/dL), the
instance of Dopamine – in the class Dopamine_s_g_5_0_le_15_0, etc.</p>
        <p>Next, the composite phenotype instance is classified in the suitable score value
classes. For instance the cardiovascular system score has the score value 3, because the
composite phenotype instance is classified in the class
SOFA_Cardiovascular_System_Score_3 (Figure 4: D1, D2). In the next step
(calculation step), the formula of the derived phenotype class GCS_Score can be
calculated by PhenoMan. It inserts the determined score values for “Eye opening”,
“Verbal response” and “Motor response” in the formula and calculated the sum. After
the calculation the classification step must be performed again. The GCS_Score instance
is classified in the class GCS_Score_s_ge_10_0_le_12_0, so that the score value of the
nervous system score can be determined. In the final calculation step, the overall SOFA
score value is calculated based on the six single organ scores.</p>
        <p>In the case of complex phenotypes (e.g., SOFA) the classification and calculation
steps can be executed several times. That is the case if a NDeP class has subclasses, i.e.,
RDeP classes, which are in turn used in combined phenotypes. Both steps are repeated
until all formulas are calculated and all phenotypes are classified. Then, all derived and
calculated phenotypes are returned by PhenoMan as FHIR resources (Figure 5: D).</p>
        <p>The PhenoMan supports 4 primitive datatypes xsd:decimal, xsd:string, xsd:boolean
and xsd:date. All other complex datatypes (e.g., FHIR code or quantity) are mapped to
the primitive datatypes (e.g., code to xsd:string with additional attributes and quantity to
xsd:decimal with additional unit attribute). Furthermore, the PhenoMan provides, inter
alia, aggregate functions, Boolean, date and measurement unit arithmetic, integration of
external terminologies as well as reading and writing FHIR resources. Nevertheless, it is
not our aim to completely model the EHR. Instead, our approach can support the
modelling and calculation of selected phenotypes in a user-friendly standardized manner.</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Phenotype Editor</title>
        <p>The Phenotype Editor is an interactive user interface for managing and developing
PASOs. In Figure 6 you can see how the phenotype
SOFA_Cardiovascular_System_Score_3 is defined with the Phenotype Editor forms.
The phenotype is a restricted combined phenotype and thus, requires a Boolean
expression, which was built by drag-and-dropping the phenotypes from the left site into
the expression form field. The form data is transferred to the backend service via JSON
and the service uses the PhenoMan API to insert the phenotype metadata into a PASO.</p>
      </sec>
      <sec id="sec-3-5">
        <title>3.5. Implementation</title>
        <p>
          The PhenoMan is implemented in Java using OWL API [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ] and two reasoners, HermiT
[
          <xref ref-type="bibr" rid="ref22">22</xref>
          ] and Openllet [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]. For calculations we utilize the Java Expression Evaluator
(EvalEx) [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ], but the integration of other libraries (e.g., for executing R scripts) or rule
systems (e.g., SWIRL or Drools) is also possible. The EvalEx enables evaluating
mathematical and Boolean (inter alia, Boolean operators and IF-THEN-ELSE structures)
expressions and supports defining custom functions and operators.
        </p>
        <p>
          The Phenotype Editor4 is a desktop app, designed with JavaScript and is shipped as
cross platform Electron [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ] app with an integrated lightweight web browser
(Chromium). We decided to outsource the logic (i.e., creation/update of a phenotype and
reasoning) into a backend service 5 , which provides information and management
functionalities of a PASO via REST interface. The backend is a DropWizard [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ]
application, which serves as a mediator to the PhenoMan API. The advantage of splitting
the phenotype managing application into frontend and backend is, that users are able to
work on one ontology collaboratively and all created ontologies are centrally stored. The
ontology service could also be executed on the local machine, so that the user could use
it to create his own ontologies. Additional features like access control or audit logging
are currently not available, but we plan to add them in future releases.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Related Work</title>
      <p>
        We developed a novel approach to support ontological modelling and reasoning of
phenotypes. In contrast to [
        <xref ref-type="bibr" rid="ref7 ref8">7,8</xref>
        ], our solution serves to determine and to classify
phenotypes based on instance data (e.g., EHR). Moreover, the proposed reasoning
process includes calculation of mathematical formulas at runtime.
      </p>
      <p>
        Very similar to our approach, Fernández-Breis et al. [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] propose to take advantage
of the best features of EHR standards and ontologies. The authors developed methods
allowing a direct use of EHR data for the identification of patient cohorts leveraging
current EHR standards and semantic web technologies. In [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ], openEHR [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]
archetypes were used as EHR standard. An ontological infrastructure was designed
including different ontologies for representing domain entities (colorectal-domain), the
rules for determining the risk level and the data. The mappings between the phenotyping
archetype and the colorectal-domain ontology were defined and are automatically
executed on the archetyped data instances to generate the OWL dataset. The data is then
transformed into OWL, where the classification is performed. We use HL7 FHIR as a
standard for exchanging healthcare information in the SMITH infrastructure. But the
main difference to the approach of Fernández-Breis et al. lies in our three-level
ontological architecture. The COP is founded by GFO and provides a framework for
developing PASOs. In this way, each particular phenotype algorithm specified as a
PASO has the same standardized structure and can be executed by PhenoMan in the same
manner. A further advantage of our solution is that the PhenoMan supports classification
as well as calculation tasks and works directly with FHIR format, so that no further
transformations are required. The mapping between EHR data and ontology is performed
by PhenoMan automatically using terminology associations, which are defined for each
data element in ART-DECOR® (and imported into ontology) as well as in FHIR
resources (e.g., Observation).
      </p>
      <p>
        The main objective of SHARPn [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ] is to develop methods and modular open-source
resources for enabling secondary use of EHR data for high-throughput phenotyping. The
4 Source code and releases of the Phenotype Editor are available on GitHub under the GPL-3.0 license:
https://github.com/ChristophB/phenotype_editor
      </p>
      <p>
        5 Source code and releases of the Ontology Service (backend) are available on GitHub under the
GPL3.0 license: https://github.com/ChristophB/ontology_service
phenotype algorithms are specified based on Quality Data Model (QDM) [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ] and
represented in the HL7 Health Quality Measures Format (HQMF or eMeasure) [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ].
According to the authors, there are two main challenges. Firstly, data elements in an EHR
may not be represented in a format consistent with the QDM. Secondly, an EHR typically
does not natively have the capability to automatically consume and execute eMeasure
logic. To address these challenges, a translator tool was developed that converts
QDMdefined phenotyping algorithm criteria into executable Drools rules scripts.
      </p>
      <p>
        The Phenotype Execution and Modeling Architecture (PhEMA) [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ] is an
opensource infrastructure for standards-based authoring, sharing, and execution of
phenotyping algorithms. Similarly to SHARPn, PhEMA uses QDM and HQMF to model
phenotype definitions. Phenotyping algorithms are represented using the PhEMA
Authoring Tool (PhAT), are exported from the PhAT into executable KNIME [
        <xref ref-type="bibr" rid="ref33">33</xref>
        ]
workflows and are executed against data warehouses or data repositories.
      </p>
      <p>In contrast to the rule- or workflow-based description of phenotyping algorithms,
we use an ontology-based one. Our approach is rather generic and enables a standardized
and structured modelling as well as the reuse of phenotyping algorithms and their parts
(e.g., concepts and restrictions). Furthermore, the PhenoMan is compatible with the
native representation of EHR data (HL7 FHIR) in the SMITH infrastructure and does not
need an additional import of the data into a data warehouse.</p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ] a FHIR-compatible model was designed to support capture of cancer clinical
data. Our approach allows the modelling of different phenotypes based on a core
ontology (COP) and is independent of the EHR representation standards. The
interpretation of FHIR data and the mapping to specified phenotypes using terminology
associations are provided by PhenoMan.
      </p>
      <p>
        A method to enable automated transformation of clinical data into OWL ontologies
is presented in [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ]. The developed system generates OWL representations of openEHR
archetypes and automatically transforms openEHR data to OWL individuals. In our
approach, the phenotypes are directly modelled in the ontology and are automatically
mapped to the EHR data. Moreover, our solution supports classification as well as
calculation of phenotypes.
      </p>
      <p>
        As described in section 3.1, phenotypes can possess links to concepts of external
ontologies. For instance, they may be annotated with concepts of anatomic structures
(e.g., Foundational Model of Anatomy [
        <xref ref-type="bibr" rid="ref36">36</xref>
        ]), or situations, respective processes, where
phenotypes are observed (e.g., electrocardiographic monitoring). The linkage is similar
to the Entity-Quality method [
        <xref ref-type="bibr" rid="ref37">37</xref>
        ] (entity: anatomic structure or process, quality:
phenotype) and may improve comparison of COP across multiple domains.
      </p>
      <p>
        Hoehndorf et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] proposed the PhenomeNET for incorporation of phenotype
ontologies from different species. PhenomeNET can predict orthologous genes with
common pathways and common related diseases. Apart from the different interpretation
of the term ‘phenotype’, the main focus of our attempt is to deduce complex phenotypes
from a set of basic phenotypes of an individual.
      </p>
      <p>
        The Human Phenotype Ontology (HPO) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] associates phenotypic abnormalities
with underlying diseases and participating genes, whereas COP can contain all sorts of
properties of an organism (including non-abnormalities). Currently, COP does not offer
weights for phenotype-disease relations, like HPO does to sort diseases for a phenotype
set by relevance. We will investigate ways to add this functionality to COP in future.
      </p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion and Future Work</title>
      <p>
        We developed a novel ontology-based method to model phenotypes of living beings with
the aim of automated phenotype reasoning based on instance data (e.g., patient data).
Our solution includes an enhanced reasoning process, which is iterative and combines
classification tasks with mathematical calculations at runtime. This new approach can be
used in clinical context, e.g., for supporting the diagnostic process, evaluating risk factors
or recruiting appropriate participants for clinical or epidemiological studies. About 20
phenotype algorithms have already been modelled and the ontology as well as the
reasoning method were successfully evaluated based on several data sets. Some
algorithms (such as socio-economic status6, SES [
        <xref ref-type="bibr" rid="ref38">38</xref>
        ]) were evaluated in comparison
with the corresponding SPSS derivatives based on the research database of the LIFE
study [
        <xref ref-type="bibr" rid="ref39">39</xref>
        ].
      </p>
      <p>
        An integration of more complex algorithms into the reasoning process is possible
and has to be investigated in respect of accessing external libraries (e.g., R scripts). The
current formalism will be extended in the future to include the further desiderata
expounded by Mo et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. PhenoMan and Phenotype Editor will function as phenotype
engine and factory in SMITH context.
      </p>
    </sec>
    <sec id="sec-6">
      <title>6. Conflict of Interest</title>
      <p>The authors state that they have no conflict of interests.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Acknowledgment References</title>
      <p>This work was supported by the German Federal Ministry of Education and Research
(SMITH: 01ZZ1803A; Leipzig Health Atlas: 031L0026).</p>
      <p>6 We consider SES in the broadest sense as a composite property of a person, i.e., as a kind of derivative
that can be modelled and calculated in the same way as a phenotype.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Mahner</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Kary</surname>
          </string-name>
          ,
          <article-title>What exactly are genomes, genotypes and phenotypes? And what about phenomes?</article-title>
          ,
          <source>J. Theor. Biol</source>
          .
          <volume>186</volume>
          (
          <year>1997</year>
          )
          <fpage>55</fpage>
          -
          <lpage>63</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R.</given-names>
            <surname>Hoehndorf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Oellrich</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Rebholz-Schuhmann</surname>
          </string-name>
          ,
          <article-title>Interoperability between phenotype and anatomy ontologies</article-title>
          ,
          <source>Bioinformatics</source>
          .
          <volume>26</volume>
          (
          <year>2010</year>
          )
          <fpage>3112</fpage>
          -
          <lpage>3118</lpage>
          . doi:
          <volume>10</volume>
          .1093/bioinformatics/btq578.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Uciteli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Groß</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kireyev</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Herre</surname>
          </string-name>
          ,
          <article-title>An ontologically founded architecture for information systems in clinical and epidemiological research</article-title>
          ,
          <source>J. Biomed. Semant</source>
          .
          <volume>2</volume>
          (
          <issue>2011</issue>
          )
          <article-title>S1</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.R.</given-names>
            <surname>Deans</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.E.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Huala</surname>
          </string-name>
          , et al.,
          <source>Finding Our Way through Phenotypes</source>
          ,
          <source>PLOS Biol</source>
          .
          <volume>13</volume>
          (
          <year>2015</year>
          )
          <article-title>e1002033</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>P.N.</given-names>
            <surname>Robinson</surname>
          </string-name>
          ,
          <article-title>Deep phenotyping for precision medicine</article-title>
          ,
          <source>Hum. Mutat</source>
          .
          <volume>33</volume>
          (
          <year>2012</year>
          )
          <fpage>777</fpage>
          -
          <lpage>780</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>H.</given-names>
            <surname>Mo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.K.</given-names>
            <surname>Thompson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.V.</given-names>
            <surname>Rasmussen</surname>
          </string-name>
          , et al.,
          <article-title>Desiderata for computable representations of electronic health records-driven phenotype algorithms</article-title>
          ,
          <source>J. Am. Med</source>
          . Inform. Assoc. JAMIA.
          <volume>22</volume>
          (
          <year>2015</year>
          )
          <fpage>1220</fpage>
          -
          <lpage>1230</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Köhler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.A.</given-names>
            <surname>Vasilevsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Engelstad</surname>
          </string-name>
          , et al.,
          <source>The Human Phenotype Ontology in 2017, Nucleic Acids Res</source>
          .
          <volume>45</volume>
          (
          <year>2017</year>
          )
          <fpage>D865</fpage>
          -
          <lpage>D876</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>R.</given-names>
            <surname>Hoehndorf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.N.</given-names>
            <surname>Schofield</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.V.</given-names>
            <surname>Gkoutos</surname>
          </string-name>
          ,
          <article-title>PhenomeNET: a whole-phenome approach to disease gene discovery</article-title>
          ,
          <source>Nucleic Acids Res</source>
          .
          <volume>39</volume>
          (
          <year>2011</year>
          )
          <article-title>e119</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>F.</given-names>
            <surname>Loebe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Stumpf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Hoehndorf</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Herre</surname>
          </string-name>
          ,
          <article-title>Towards improving phenotype representation in OWL</article-title>
          ,
          <source>J. Biomed. Semant</source>
          .
          <volume>3</volume>
          (
          <issue>2012</issue>
          )
          <article-title>S5</article-title>
          . doi:
          <volume>10</volume>
          .1186/2041-1480-3-S2-S5.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>S.</given-names>
            <surname>Gehring</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Eulenfeld</surname>
          </string-name>
          , German Medical Informatics Initiative:
          <article-title>Unlocking Data for Research and Health Care, Methods Inf</article-title>
          .
          <source>Med</source>
          .
          <volume>57</volume>
          (
          <year>2018</year>
          )
          <fpage>e46</fpage>
          -
          <lpage>e49</lpage>
          . doi:
          <volume>10</volume>
          .3414/ME18-13-0001.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S.C.</given-names>
            <surname>Semler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Wissing</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Heyder</surname>
          </string-name>
          , German Medical Informatics Initiative,
          <source>Methods Inf. Med</source>
          .
          <volume>57</volume>
          (
          <year>2018</year>
          )
          <fpage>e50</fpage>
          -
          <lpage>e56</lpage>
          . doi:
          <volume>10</volume>
          .3414/ME18-03-0003.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A.</given-names>
            <surname>Winter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Stäubert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Ammon</surname>
          </string-name>
          , et al.,
          <source>Smart Medical Information Technology for Healthcare (SMITH)</source>
          ,
          <source>Methods Inf. Med</source>
          .
          <volume>57</volume>
          (
          <year>2018</year>
          )
          <fpage>e92</fpage>
          -
          <lpage>e105</lpage>
          . doi:
          <volume>10</volume>
          .3414/ME18-02-0004.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>HL7</surname>
            <given-names>FHIR</given-names>
          </string-name>
          , (n.d.). https://www.hl7.org/fhir/.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>ART-DECOR</surname>
            <given-names>®</given-names>
          </string-name>
          , (n.d.). https://www.art-decor.org/.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>FHIR</given-names>
            <surname>Subscription</surname>
          </string-name>
          , (n.d.). https://www.hl7.org/fhir/subscription.html.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>FHIR</given-names>
            <surname>Search</surname>
          </string-name>
          , (n.d.). https://www.hl7.org/fhir/search.html.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>H.</given-names>
            <surname>Herre</surname>
          </string-name>
          ,
          <article-title>General Formal Ontology (GFO): A Foundational Ontology for Conceptual Modelling</article-title>
          , in: R. Poli,
          <string-name>
            <given-names>M.</given-names>
            <surname>Healy</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A</given-names>
            .
            <surname>Kameas</surname>
          </string-name>
          (Eds.),
          <source>Theory Appl. Ontol. Comput. Appl.</source>
          , Springer, Netherlands,
          <year>2010</year>
          : pp.
          <fpage>297</fpage>
          -
          <lpage>345</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>A.</given-names>
            <surname>Uciteli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Neumann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Tahar</surname>
          </string-name>
          , et al.,
          <article-title>Ontology-based specification, identification and analysis of perioperative risks</article-title>
          ,
          <source>J. Biomed. Semant</source>
          .
          <volume>8</volume>
          (
          <year>2017</year>
          )
          <fpage>36</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>J.-L. Vincent</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Moreno</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Takala</surname>
          </string-name>
          , et al.,
          <string-name>
            <surname>The</surname>
            <given-names>SOFA</given-names>
          </string-name>
          (
          <article-title>Sepsis-related Organ Failure Assessment) score to describe organ dysfunction/failure,</article-title>
          <source>Intensive Care Med</source>
          .
          <volume>22</volume>
          (
          <year>1996</year>
          )
          <fpage>707</fpage>
          -
          <lpage>710</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>G.</given-names>
            <surname>Teasdale</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Jennett</surname>
          </string-name>
          ,
          <article-title>ASSESSMENT OF COMA AND IMPAIRED CONSCIOUSNESS: A Practical Scale, The Lancet</article-title>
          .
          <volume>304</volume>
          (
          <year>1974</year>
          )
          <fpage>81</fpage>
          -
          <lpage>84</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>OWL</surname>
            <given-names>API</given-names>
          </string-name>
          ,
          <article-title>(n</article-title>
          .d.). http://owlcs.github.io/owlapi/.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>HermiT</surname>
            <given-names>Reasoner</given-names>
          </string-name>
          , (n.d.). http://www.hermit-reasoner.com/.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>Openllet</given-names>
            <surname>Reasoner</surname>
          </string-name>
          ,
          <year>2018</year>
          . https://github.com/Galigator/openllet.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>U.</given-names>
            <surname>Klimaschewski</surname>
          </string-name>
          , EvalEx - Java
          <source>Expression Evaluator</source>
          ,
          <year>2019</year>
          . https://github.com/uklimaschewski/EvalEx.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Electron</surname>
          </string-name>
          , (n.d.). https://electronjs.org/.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <surname>Dropwizard</surname>
          </string-name>
          , (n.d.). https://www.dropwizard.io.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>J.T.</given-names>
            <surname>Fernández-Breis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.A.</given-names>
            <surname>Maldonado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Marcos</surname>
          </string-name>
          , M. del
          <string-name>
            <given-names>C.</given-names>
            <surname>Legaz-García</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Moner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Torres-Sospedra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Esteban-Gil</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Martínez-Salvador</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Robles</surname>
          </string-name>
          ,
          <article-title>Leveraging electronic healthcare record standards and semantic web technologies for the identification of patient cohorts</article-title>
          ,
          <source>J. Am. Med</source>
          . Inform. Assoc.
          <volume>20</volume>
          (
          <year>2013</year>
          )
          <fpage>e288</fpage>
          -
          <lpage>e296</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>[28] openEHR, (n.d.). https://www.openehr.org/.</mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>J.</given-names>
            <surname>Pathak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.R.</given-names>
            <surname>Bailey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.E.</given-names>
            <surname>Beebe</surname>
          </string-name>
          , et al.,
          <article-title>Normalization and standardization of electronic health records for high-throughput phenotyping: the SHARPn consortium</article-title>
          ,
          <source>J. Am. Med</source>
          . Inform. Assoc. JAMIA.
          <volume>20</volume>
          (
          <year>2013</year>
          )
          <fpage>e341</fpage>
          -
          <lpage>348</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <surname>QDM - Quality Data</surname>
            <given-names>Model</given-names>
          </string-name>
          , (n.d.). https://ecqi.healthit.
          <article-title>gov/qdm-quality-data-model.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <surname>HQMF - Health Quality Measure</surname>
            <given-names>Format</given-names>
          </string-name>
          , (n.d.). https://ecqi.healthit.
          <article-title>gov/hqmf-health-quality-measureformat.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>J.A.</given-names>
            <surname>Pacheco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.V.</given-names>
            <surname>Rasmussen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.C.</given-names>
            <surname>Kiefer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.R.</given-names>
            <surname>Campion</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Speltz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.J.</given-names>
            <surname>Carroll</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.C.</given-names>
            <surname>Stallings</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Mo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ahuja</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.R.</given-names>
            <surname>LaRose</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.L.</given-names>
            <surname>Peissig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Shang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Benoit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.S.</given-names>
            <surname>Gainer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Borthwick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.L</given-names>
            .
            <surname>Jackson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sharma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.Y.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.N.</given-names>
            <surname>Kho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.M.</given-names>
            <surname>Roden</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pathak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.C.</given-names>
            <surname>Denny</surname>
          </string-name>
          , and
          <string-name>
            <given-names>W.K.</given-names>
            <surname>Thompson</surname>
          </string-name>
          ,
          <article-title>A case study evaluating the portability of an executable computable phenotype algorithm across multiple institutions and electronic health record environments</article-title>
          ,
          <source>J. Am. Med</source>
          . Inform. Assoc. JAMIA.
          <volume>25</volume>
          (
          <year>2018</year>
          )
          <fpage>1540</fpage>
          -
          <lpage>1546</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <surname>KNIME</surname>
          </string-name>
          , (n.d.). https://www.knime.com/.
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>H.</given-names>
            <surname>Hochheiser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Castine</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Harris</surname>
          </string-name>
          , G. Savova, and
          <string-name>
            <given-names>R.S.</given-names>
            <surname>Jacobson</surname>
          </string-name>
          ,
          <article-title>An information model for computable cancer phenotypes</article-title>
          ,
          <source>BMC Med</source>
          . Inform. Decis. Mak.
          <volume>16</volume>
          (
          <year>2016</year>
          )
          <fpage>121</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>B.</given-names>
            <surname>Haarbrandt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Jack</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Marschollek</surname>
          </string-name>
          ,
          <source>Automated Transformation of openEHR Data Instances to OWL</source>
          , Stud.
          <source>Health Technol. Inform</source>
          .
          <volume>223</volume>
          (
          <year>2016</year>
          )
          <fpage>63</fpage>
          -
          <lpage>70</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>C.</given-names>
            <surname>Rosse</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.L.V.</given-names>
            <surname>Mejino</surname>
          </string-name>
          ,
          <article-title>A reference ontology for biomedical informatics: the Foundational Model of Anatomy</article-title>
          , J. Biomed. Inform.
          <volume>36</volume>
          (
          <year>2003</year>
          )
          <fpage>478</fpage>
          -
          <lpage>500</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>N.L.</given-names>
            <surname>Washington</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.A.</given-names>
            <surname>Haendel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.J.</given-names>
            <surname>Mungall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ashburner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Westerfield</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.E.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <article-title>Linking human diseases to animal models using ontology-based phenotype annotation</article-title>
          ,
          <source>PLoS Biol</source>
          .
          <volume>7</volume>
          (
          <year>2009</year>
          )
          <article-title>e1000247</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>T.</given-names>
            <surname>Lampert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Müters</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Stolzenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.E.</given-names>
            <surname>Kroll</surname>
          </string-name>
          , and KiGGS Study Group,
          <article-title>Measurement of socioeconomic status in the KiGGS study: first follow-up (KiGGS Wave 1</article-title>
          ), Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz.
          <volume>57</volume>
          (
          <year>2014</year>
          )
          <fpage>762</fpage>
          -
          <lpage>770</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>LIFE</given-names>
            <surname>Health</surname>
          </string-name>
          Study - University of Leipzig, (n.d.). http://life.uni-leipzig.de/en/life_health_study.html.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>