<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>R. M.S. Carvalho);</journal-title>
      </journal-title-group>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Graph for Electronic Health Records</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ricardo M.S. Carvalho</string-name>
          <email>rmscarvalho@fc.ul.pt</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andreia Sofia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Teixeira</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Catia Pesquita</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Workshop</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Ontologies, Semantic Annotation, Clinical Temporal Knowledge Graphs, Temporal Annotation</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>LASIGE, Faculdade de Ciências, Universidade de Lisboa</institution>
          ,
          <country country="PT">Portugal</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>1891</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>The integration of Machine Learning (ML) into healthcare solutions has been accelerated by the proliferation of Electronic Health Records (EHRs). While various studies have leveraged ML to predict clinical outcomes, they often overlook the semantic context of clinical data. However, this context could be easily captured using ontologies and knowledge graphs (KG) as tools in ML development. EHRs are particularly suitable for KG representation, afording resources to improve predictive clinical AI models and generate new data discoveries. Traditional KGs often neglect the temporal dynamics crucial for maintaining understanding over time, however, recent advancements suggest incorporating temporal dimensions into KGs, termed Temporal Knowledge Graphs (TKGs), as a promising approach for modeling patient data. Despite the potential for TKGs, few resources exist to produce such representations within the healthcare domain.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        The surge in Machine Learning (ML) applications within healthcare, particularly for clinical
decisionmaking, has been notable due to the rise of Electronic Health Records (EHRs) systems [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. Studies
have successfully applied ML to predict various clinical outcomes, including mortality, sepsis, and
risk of readmission [
        <xref ref-type="bibr" rid="ref3 ref4 ref5 ref6">3, 4, 5, 6</xref>
        ], using data mining methods ranging from random forests to recurrent
neural networks. However, most of these approaches largely ignore the context of clinical data, relying
primarily on direct EHRs data extraction.
      </p>
      <p>
        Ontologies play a crucial role in connecting data with scientific context by representing domain
entities and their relationships in a formalized manner [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. By linking EHR data to ontologies through
semantic annotations, one can provide additional information about the meaning of the clinical data
to ML systems. When data is organized according to the schema of one or more ontologies, it forms
a Knowledge Graph (KG). A KG is composed of entities and relations structured as triples in graph
format [
        <xref ref-type="bibr" rid="ref10 ref8 ref9">8, 9, 10</xref>
        ]. KGs go beyond mere knowledge bases, as they can represent complex entities
and relations in a way that reflects real-world domains. This enables the incorporation of domain
ontology knowledge and mappings across real data entities [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], enhancing domain representation,
interpretability, and explainability.
      </p>
      <p>
        Health-related data such as EHRs are particularly well-suited for representation through KGs. These
KGs can enhance predictive clinical AI models by adding semantic information [
        <xref ref-type="bibr" rid="ref12 ref13 ref2 ref3">2, 12, 3, 13</xref>
        ],
potentially revealing new data correlations and improving performance. However, one critical oversight in
traditional KG approaches is the neglect of time as a dynamic factor. Real-world data, especially in
      </p>
      <p>CEUR</p>
      <p>
        ceur-ws.org
the clinical field, changes over time and requires temporal consideration to maintain logical coherence
and accuracy. Non-KG-based ML techniques have highlighted the significance of including time for
more precise predictions [
        <xref ref-type="bibr" rid="ref14">14, 15, 16</xref>
        ]. In the clinical field, there is a growing interest in using KGs to
represent clinical data temporally [17, 18], particularly for modelling patient data. Still, there is a lack
of resources that are able to combine both the semantic richness aforded by biomedical ontologies and
temporal aspects.
      </p>
      <p>In this work, we target this critical gap in healthcare data resources by proposing an approach to
encapsulate the semantic and temporal context of clinical entities in a KG representation. As a case
study, we apply this methodology to the MIMIC-III EHR data set [19]. Our approach integrates several
relevant knowledge engineering methods such as ontology selection, semantic annotation and ontology
alignment, and in particular, pairs semantic annotation with temporal annotation so that every fact
describing a patient has temporal context, enabling the tracking of the patient’s progression. Our
method focuses on building edge-centric TKGs, given the abundance of downstream approaches built
following that paradigm [20, 21, 22, 23, 24], thus aiding in improving data interoperability and future
ML applications in healthcare.</p>
      <p>The resulting TKG comprises four biomedical ontologies representing about 60,000 hospital stays
with more than 27 million temporal facts.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <sec id="sec-2-1">
        <title>2.1. KG Construction</title>
        <p>The methodology for constructing a KG should be tailored to relevant factors such as the domain, the
intended application, any pre-existing data sources, and the construction theme. Building a KG from
data sources includes two main steps: entity discovery, where entities acquired from the source data
constitute the nodes of the KG, and triple extraction, where these entities are linked through their
semantic relationships. These steps are usually preceded by data pre-processing and data cleaning.
Many KGs include a semantic backbone composed of ontologies, and as such, the entity discovery
step includes linking the entities to their types in the ontologies that compose the KG. Entity linking,
also known as semantic annotation, involves associating data objects with ontology entities with
well-defined semantics. Both more classical NLP [ 25, 26, 27] and neural approaches have been employed
in entity linking [28, 29, 30, 31, 32].</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Entity linking in EHRs</title>
        <p>EHRs can contain both highly structured data such as clinical codes and terminology (e.g.,
SNOMEDCT), as well as short-form and long form free text variables. While controlled vocabularies allow for
a straightforward inclusion in the KG, the annotation of free text poses additional challenges due to
the complexity of the biomedical vocabulary, the abundance of polysemy, synonymy, acronyms and
abbreviations, which result in ambiguity, and clinical free text variables often contain spelling mistakes.
The typical entity linking process takes text as input and enriches it with entity mentions linked to
nodes in a KG (or concepts in an ontology). The task is commonly split into entity mention detection
and entity disambiguation sub-tasks.</p>
        <p>Many clinical semantic annotators rely on a combination of strategies, including text processing,
large-scale knowledge bases, semantic similarity measures and ML techniques [33, 34, 35, 36, 37]. Large
Language Models (LLMs) have recently played a growing role in this task (e.g., [38, 39, 40]).</p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Temporal Knowledge Graphs</title>
        <p>TKGs are an extension of traditional KGs [41] where facts about a domain include a temporal dimension.
There are two major paradigms in capturing time in KGs: edge-centric and node-centric. In the
edgecentric paradigm, the approaches are based on the notion of time-stamping an edge to capture the
validity of a fact. Most of the work that employs this paradigm is not specifically concerned with the
challenges in modelling the temporal dimension in KGs but rather with how to explore it in the context
of mining and ML applications. There are well-established data sets, such as YAGO or DBPedia, that
already have published temporal knowledge graph resources for downstream tasks [42, 43, 44].</p>
        <p>The node-centric paradigm captures the temporal dimension by modelling facts not as triples but
as instances of an event class with properties that capture the validity of the event. An early notable
efort in event modelling was the Simple Event Model (SEM) [ 45]. SEM represents data from the Web as
events, in and across various domains, driven by minimal commitment (e.g., no cardinality restrictions,
no functional or inverse functional properties, and Simple Knowledge Organization System (SKOS) [46]
is used to link to other vocabularies or event models to avoid inheriting their constraints.) SEM has been
used as the foundation for several event-based KGs, including EventKG[47] and Open EventKG [48].</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Methods</title>
      <p>Our decision to model the TKG as edge-centric was motivated both by the fact that there is no single
central event in a hospital stay (but rather a sequence of several occurrences) and the fact that the
majority of temporal KG mining approaches are applicable to edge-centric TKGs [49].</p>
      <p>Following [50], we define an edge-centric TKG as  = (, ,  ,  ) , where  ,  , and  are the sets
of entities, relations, and timestamps, respectively.  ∈  ×  ×  ×  is the set of all possible facts.
Respecting this definition, facts can take two forms: time-point facts, denoted as  = {ℎ,  , ,  } , where ℎ,  ,
 , and  are the head entity, the relation, the tail entity, and the timestamp, respectively; and time-interval
facts, denoted as  = {ℎ,  , , &lt;   ,   &gt;}, where ℎ,  ,  , and   ,   are the head entity, the relation, the tail
entity, the timestamp for the start of the time interval, and the timestamp for the end of the interval.</p>
      <p>The methodology we designed includes five components, as shown in Figure 1. The first component
is Data pre-processing and cleaning where textual terms from selected tables from the EHR database
are prepared for entity linking. The second component is Entity Linking, which includes selecting the
most appropriate ontologies using an external ontology repository and linking the textual terms to
the most appropriate ontology classes. The third component is Ontology Alignment, which aligns the
selected ontologies to increase their connectivity in the TKG. The fourth component, Fact Creation,
creates KG facts to represent entries in the EHR, linking a patient stay to the appropriate ontology class
that captures that specific entry. Finally, the fith component is dedicated to the Temporal annotation of
these facts, where every created fact is annotated with a time-point or time interval.</p>
      <p>The resulting KG describes patients under multiple ontologies that cover the appropriate domains,
with each fact about a patient including a temporal dimension. A subgraph representing a patient in
the final TKG is given in Figure 2.</p>
      <sec id="sec-3-1">
        <title>3.1. Data Pre-processing and Cleaning</title>
        <p>Although some information in EHRs is recorded using controlled vocabularies or terminologies, such
as the ICD terminology, many entries are filled in manually by clinicians. The goal of this step is to
minimize the impact of errors and inconsistencies found in EHR entries in the form of clinical free text
or terminology codes.</p>
        <p>For clinical free text, we apply the following criteria:
• The empty and inconsistent entries (e.g., very short or with irrelevant symbols) are excluded.
• Incomplete entries are kept because the annotators will try to generate insightful annotations.
• Entries with more than one term are decoupled (e.g., multiple diagnoses split by comma).
For terminological codes, cleaning focuses on adherence to terminological constraints:
• Empty entries are excluded.
• Faulty formatted codes are corrected to follow the terminology guidelines (e.g.: E3924 is corrected
to E392.4).</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Entity Recognition and Linking</title>
        <p>When EHR entries are in free text, the entities they refer to need to be linked to relevant biomedical
ontologies. This step requires both Ontology Selection — to identify the relevant ontologies that best
describe the text entries — and Semantic Annotation — to link each text entry to one or more relevant
ontology classes.</p>
        <sec id="sec-3-2-1">
          <title>3.2.1. Ontology Selection</title>
          <p>Our approach employs an ontology selection method that, given a list of input textual terms and an
ontology repository, outputs based on coverage, a sorted set of ontologies.</p>
          <p>While diferent ontology selection methods can be used in this step, we employed the BioPortal
Recommender platform [51] since it is linked to the largest repository available, which contains more
than 1000 biomedical ontologies. The BioPortal Recommender successfully handles misspellings and
uses a straightforward dictionary-based approach to create annotations of textual terms to ontology
classes, which are then employed to assess how well an ontology describes a set of terms. We utilize
the standard text processing capabilities and weight configurations of the Recommender. To operate
the BioPortal Recommender, we use input information from a cleaned set, in line with BioPortal’s
guidelines, and an ontology repository that has been pre-selected to ensure domain relevance. This
pre-selection facilitates the choice of an ontology for annotation based on its coverage, selecting the
one that ofers the best coverage.</p>
        </sec>
        <sec id="sec-3-2-2">
          <title>3.2.2. Semantic Annotation</title>
          <p>We propose two alternatives to the annotation process for the free text variable: a lexical similarity
approach designed for full-text search, and a large language model-based approach that uses
sentencetransformers for semantic search.</p>
        </sec>
        <sec id="sec-3-2-3">
          <title>Lexical similarity-based approach</title>
          <p>Lexical similarity can be measured through string similarity algorithms such as Jaccard Similarity,
Cosine Similarity, Levenshtein Distance, etc. Given the large size of the vocabularies both in the EHR
data and for each selected ontology, we developed an annotation approach based on the high-performing
search engine ElasticSearch (ES)[52] to find the best matches between an ontology’s vocabulary (i.e.,
its textual component in the form of entity labels) and the EHR textual entries. The search step for
each EHR term outputs the six best-matching ontology classes. To select the single best matching class,
we measure the Levenshtein Distance between the class label and the input term of every candidate
match, and select the best scoring class provided its similarity is above a threshold of 0.6 (empirically
determined).</p>
        </sec>
        <sec id="sec-3-2-4">
          <title>Transformer-based approach</title>
          <p>Lexical matching methods rely on keyword-based linking between input entities and document
spaces [53], ignoring contextual and semantic information. However, vocabulary ambiguities, which
are very common in clinical data, are challenging and lead to mismatches. An alternative for more
informed matches is semantic search.</p>
          <p>Semantic search is an alternative to traditional keyword-based searches [54] that incorporates
semantic information in the search process [53]. Semantic search employs LLMs to generate numerical
vector embeddings that represent textual terms, and uses vector operations to compare them. We used
a pre-trained sentence transformer [55] — multi-qa-mpnet-base-dot-v11 – to generate dense vector
representations for both ontology class labels and EHR terms and computed the similarity for each
1a MPNet [56] family model designed specifically for semantic search and trained on a large and multi-source question
answering set. Fine tuned model available at https://huggingface.co/sentence-transformers/multi-qa-mpnet-base-dot-v1
pair using the dot product. Each EHR entry was annotated to the class ontology with the most similar
vector.</p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Ontology Alignment</title>
        <p>Ontology matching (OM) and alignment are operations to enhance the interoperability and integration
of diferent domain data sources. OM focuses on establishing correspondences between diferent
ontology classes, taking two distinct ontologies, and producing an alignment that is a collection of
equivalence semantic relations or mappings of their entities [57, 58].</p>
        <p>The necessity for ontology alignment in EHR-related data comes from the diversity of domains and
ontologies. The alignment allows a gate between the diferent ontologies for interoperability and to
facilitate downstream tasks that establish paths in a KG.</p>
        <p>We employed the AgreementMakerLight (AML) system — one of the best performing OM systems [59]
— to generate a fast and reliable alignment between all the biomedical ontologies used. For every
alignment, each mapping is transformed into an equivalent statement between the two classes formulated as
’owl: equivalent class’ efectively linking the ontologies.</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Triple Extraction</title>
        <p>Triple extraction transforms clinical data structured according to the EHR model into the KG model.
All the acquired and annotated entities will be mapped to the KG through relations to the patients and
their hospital stays. Each patient is represented by a node (instance) in the KG with associated triples
for properties.</p>
        <p>For each patient, we first acquire the set of related EHR entries according to their type (e.g., diagnosis,
drug prescriptions, etc.). For each entry, if there is a semantic annotation to an ontology class, a
simple triple is constructed according to the KG model  = {,  ,  _ } and using
the appropriate relation (see Figure 2).</p>
      </sec>
      <sec id="sec-3-5">
        <title>3.5. Temporal Annotation</title>
        <p>The temporal annotation adds temporal information to the patient’s triples, capturing the validity
period of the relation described by the triple. The process involves stamping the facts with the time
stamps contained in the patient’s entries on the data set.</p>
        <p>On EHR data, the temporal information is represented in datetime variables formatted as  −−,  ∶
 ∶  . Each relation is described in the patient’s data entry with either one or two time entries
representing instantaneous facts or facts with a given duration or validity.</p>
        <p>Following the definitions in section 3, to model instantaneous facts, we add a single time stamp to
the original triple, resulting in a time-point fact, and to model a fact with a specific duration, we add
two timestamps marking the beginning and end of the fact’s validity, resulting in a time-interval fact.</p>
        <p>The set of all temporal facts, ontologies and alignments represents the final TKG.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <p>4.1. Data
We applied the proposed methodology to the generation of an ontology-rich TKG based on the
MIMICIII dataset. We created two versions of the TKG, one describing full hospital stays and the other only
ICU stays.</p>
      <p>The medical data used in this project is sourced from the MIMIC-III dataset, a substantial, openly
accessible database containing de-identified health data from patients admitted to the critical care units
of Beth Israel Deaconess Medical Center between 2001 and 2012. MIMIC-III includes data from 53,423
unique hospital admissions for adult patients (aged 16 and older) and provides a wide array of information
such as demographics, hourly bedside vital sign measurements, laboratory test results, procedures,
medications, caregiver notes, imaging reports, and mortality data (including post-discharge) [19].</p>
      <sec id="sec-4-1">
        <title>4.1.1. Data Acquisition, Cleaning and Pre-Processing</title>
        <p>The MIMIC-III data is available as a set of CSV files. These files were processed to extract the necessary
features to describe each patient. We focus on five specific features to allow a simplistic yet realistic
patient representation: Initial Diagnosis, Drug Prescriptions, Laboratory Events, Procedures, and Final
Diagnosis.</p>
        <p>The diagnosis is collected at admission and at discharge. At admission, the initial diagnosis is provided
as a preliminary, free text entry and at discharge, the final diagnosis is coded using The International
Classification of Diseases, 9th Revision, Clinical Modification (ICD9CM) glossary. Laboratory Events
respect the Logical Observation Identifier Names and Codes (LOINC) terminology and contain
information regarding laboratory-based measurements and include both in-hospital and out-of-hospital
laboratory measurements from clinics the patient has visited. Drug Prescriptions detail the medication
orders attributed to a patient, formatted according to National Drug Code (NDC). Procedures are coded
and include the procedures undergone by patients, specifically ICD9CM procedures.</p>
        <p>The five types of features extracted from MIMIC-III are thus primarily structured as clinical codes,
except the initial diagnosis in short-form free text.</p>
        <p>The cleaning process (section 3.1) results showcased in Table 1 point to a reduced data loss. Regarding
the initial diagnosis, the cleaning process resulted in more diagnoses, since some entries had more than
one diagnosis.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.1.2. Ontology Selection</title>
        <p>The only feature type that required ontology selection is the Initial Diagnosis, since all others respect
particular controlled vocabularies. To match the clinical domain we pre-select NCIT, SNOMEDCT, MeSH,
MedDRA and EFO as the biomedical ontologies to select from.</p>
        <p>After running the BioPortal’s Recommender, the ontology with the best coverage is the NCIT (Figure
3), reaching about 30% coverage. This apparently low coverage is not unexpected due to the limited
dictionary-based approach of the Recommender. The selected set of ontologies to use in the TKG is
then NCIT, LOINC, ICD9CM and the Drug Ontology (DRON)2. Their characteristics are described in
Table 2</p>
        <sec id="sec-4-2-1">
          <title>4.2. Semantic Annotation</title>
          <p>The annotation process is conducted on the entire set of cleaned and unique diagnoses collected from
the MIMIC-III dataset, totaling 10, 117 diagnoses. In our implementation, we perform two alternative
processes: the Lexical similarity-based approach and the Transformer-based approach. The Initial
Diagnosis is matched to the NCIT.</p>
        </sec>
      </sec>
      <sec id="sec-4-3">
        <title>4.2.1. Lexical similarity-based Annotation</title>
        <p>The final similarity distribution for the NCIT-Diagnosis matches is shown in Figure 4. Lexical similarity
has a median similarity score of 0.78, with the annotations skewed towards the higher lexical similarity
values, meaning that the diagnosis are likely matched to a class with an appropriate label. The lexical
annotation covers 83.5% of the full diagnosis set.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.2.2. Transformer-based annotation</title>
        <p>The coverage obtained by the Transformer-based annotation is higher than the lexical approach. The
similarity distribution for the transformer-based approach has a median similarity score around 0.8
and as in the lexical approach, annotations are skewed towards the higher similarity values. The two
approaches measure two diferent types of similarity. As such one can only argue that; the lexical
similarity-based approach is capable of providing annotations that match the ontology class labels and
2Because NDC is a private repository, we replace it with DRON. Both concern the same information, and DRON is already
prepared with information mapping class matches. Despite this change we were able to retain 75% of the full drugs set.
that the transformer approach is able to match entities to conceptually appropriate ontology classes
based on their semantics. This recognition paired with the better coverage, motivated our decision to
use the transformer annotations to build the TKG.</p>
        <sec id="sec-4-4-1">
          <title>4.3. Ontology alignment</title>
          <p>We generated alignments between all selected ontologies. The alignment results are shown in Table 3
and show a substantially larger alignment between ontologies with more related domains.</p>
        </sec>
        <sec id="sec-4-4-2">
          <title>4.4. Temporal Annotation</title>
          <p>The initial diagnosis, laboratory events, procedures, and final diagnosis are all instantaneous, and were
represented as time-point facts. Drug prescriptions, however, have an associated duration and were
captured as time-interval facts.</p>
          <p>For the ICU stay, we apply the following criteria:
• Instantaneous annotations are selected to the ICU stay set if their temporal stamp precedes or
falls within the period of the patient’s ICU stay.
• Prolonged annotations are selected for the ICU stay set if all or a part of their validity period falls
within the stay period or if their validity period precedes the ICU stay completely.
ICU Stay
79,644
15,212,374
129,203
54,605
15,475,826
1,606,234
1,606,234</p>
          <p>The resulting set sizes are shown in Tables 4 and illustrate a substantially large set of temporal facts
for the MIMIC-III data set. The diference in size is expected since ICU stays tend to be shorter than the
overall hospital stays.</p>
        </sec>
        <sec id="sec-4-4-3">
          <title>4.5. TKG description</title>
          <p>After applying our methodology to the MIMIC-III clinical data set we were able to generate the two TKGs.
Table 5 describes some relevant statistics, with totals for the hospital stay TKG (which encapsulates the
ICU stay plus additional information recorded outside of the ICU).</p>
          <p>The resulting TKG represents patients and their hospital stays (including ICU stays) relating them to
their diagnoses, prescribed drugs, conducted procedures and lab analyses which are in turn described
by a relevant biomedical ontology. Each of these relations contains a temporal dimension, either a
time-point or a time-interval. The TKG contains a total of 1.2 million nodes and more than 27 million
edges that describe every patient stay. All code and resource links required to produce the TKGs are
freely available 1. Researchers are required to formally request access to the MIMIC-III dataset via a
process documented on the MIMIC website 2.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions</title>
      <p>One of the most challenging aspects of building a clinical TKG is the modelling decision regarding
temporal data. Our methodology focuses on edge-centric TKGs with both time-point and time-interval
facts to tackle the diversity in temporal data formats aforded by EHRs. Also challenging is the handling
of text-based EHR entries that are not bound by any terminological constraints. The processing of such
data is complex and entity recognition and linking requires the identification of suitable ontologies.
Our methodology emphasizes the importance of proper ontology selection and alignment to maximize
coverage and interoperability. Semantic and temporal annotation are crucial steps for linking clinical
entities to semantically relevant ontology concepts, placing them at the appropriate time points or
intervals.</p>
      <p>We aim for the MIMIC-III TKG to serve as a resource for the community, supporting the development
of TKG mining approaches for predictive tasks. This involves exploring the connections and nuances
between the semantic context provided by biomedical ontologies and the temporal aspects extracted
from the EHR. In particular, by transforming MIMIC-III data into a KG which includes several biomedical
ontologies, we aim to support the application of knowledge-based explanations [60]. We believe this
ability is crucial to support the uptake of artificial intelligence approaches in a clinical setting.
1at https://github.com/liseda-lab/clinical-temporal-kg under a GPL-3.0 license
2https://physionet.org/content/mimiciii/1.4/</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>”This work was partially supported by Fundação para a Ciência e Tecnologia
through the PhD scholarship ref. 2022.10769.BD, and the LASIGE Research Unit, ref.
UIDB/00408/2020 (https://doi.org/10.54499/UIDB/00408/2020) and ref.
UIDP/00408/2020 (https://doi.org/10.54499/UIDP/00408/2020). This work was also partially supported by the KATY project, which
has received funding from the European Union’s Horizon 2020 research and innovation programme
under grant agreement No 101017453, and in part by project 41, HfPT: Health from Portugal, funded by
the Portuguese Plano de Recuperação e Resiliência.
Conference, DASFAA 2021, Taipei, Taiwan, April 11–14, 2021, Proceedings, Part III 26, Springer,
2021, pp. 429–444.
[15] F. Xie, H. Yuan, Y. Ning, M. Ong, M. Feng, W. Hsu, B. Chakraborty, N. Liu, Deep learning for
temporal data representation in electronic health records: A systematic review of challenges
and methodologies, Journal of Biomedical Informatics 126 (2021) 103980. doi:1 0 . 1 0 1 6 / j . j b i . 2 0 2 1 .
1 0 3 9 8 0 .
[16] A. Woods, C. Meyer, B. Sauer, B. Cohen, Mining time-stamped electronic health records using
referenced sequences, arXiv preprint arXiv:2007.14336 (2020).
[17] J. Schrodt, A. Dudchenko, P. Knaup-Gregori, M. Ganzinger, Graph-representation of patient data:
a systematic literature review, Journal of medical systems 44 (2020) 86.
[18] R. M. Carvalho, D. Oliveira, C. Pesquita, Knowledge graph embeddings for icu readmission
prediction, BMC Medical Informatics and Decision Making 23 (2023) 12.
[19] A. Johnson, T. Pollard, L. Shen, L.-w. Lehman, M. Feng, M. Ghassemi, B. Moody, P. Szolovits,
L. Celi, R. Mark, Mimic-iii, a freely accessible critical care database, Scientific Data 3 (2016) 160035.
doi:1 0 . 1 0 3 8 / s d a t a . 2 0 1 6 . 3 5 .
[20] D. Liben-Nowell, J. Kleinberg, The link prediction problem for social networks, in: Proceedings
of the twelfth international conference on Information and knowledge management, 2003, pp.
556–559.
[21] L. Yao, L. Wang, L. Pan, K. Yao, Link prediction based on common-neighbors for dynamic social
network, Procedia Computer Science 83 (2016) 82–89.
[22] Z. Han, Y. Ma, Y. Wang, S. Günnemann, V. Tresp, Graph hawkes neural network for forecasting
on temporal knowledge graphs, in: Automated Knowledge Base Construction, 2020.
[23] R. Goel, S. M. Kazemi, M. Brubaker, P. Poupart, Diachronic embedding for temporal knowledge
graph completion, in: Proceedings of the AAAI conference on artificial intelligence, volume 34,
2020, pp. 3988–3995.
[24] S. S. Dasgupta, S. N. Ray, P. Talukdar, Hyte: Hyperplane-based temporally aware knowledge graph
embedding, in: Proceedings of the 2018 conference on empirical methods in natural language
processing, 2018, pp. 2001–2011.
[25] J. Jovanovic, E. Bagheri, Semantic annotation in biomedicine: The current landscape, Journal of</p>
      <p>Biomedical Semantics 8 (2017). doi:1 0 . 1 1 8 6 / s 1 3 3 2 6 - 0 1 7 - 0 1 5 3 - x .
[26] P. Stenetorp, S. Pyysalo, G. Topić, T. Ohta, S. Ananiadou, J. Tsujii, Brat: a web-based tool for
nlp-assisted text annotation, in: Proceedings of the Demonstrations at the 13th Conference of the
European Chapter of the Association for Computational Linguistics, 2012, pp. 102–107.
[27] V. Basile, J. Bos, K. Evang, N. Venhuizen, Developing a large semantically annotated corpus, in:</p>
      <p>LREC 2012, Eighth International Conference on Language Resources and Evaluation, 2012.
[28] W. Shen, Y. Li, Y. Liu, J. Han, J. Wang, X. Yuan, Entity linking meets deep learning: Techniques
and solutions, IEEE Transactions on Knowledge and Data Engineering 35 (2021) 2556–2578.
[29] Ö. Sevgili, A. Shelmanov, M. Arkhipov, A. Panchenko, C. Biemann, Neural entity linking: A survey
of models based on deep learning, Semantic Web 13 (2022) 527–570.
[30] M. C. Phan, A. Sun, Y. Tay, J. Han, C. Li, Neupl: Attention-based semantic matching and
pairlinking for entity disambiguation, in: Proceedings of the 2017 ACM on Conference on Information
and Knowledge Management, 2017, pp. 1667–1676.
[31] L. Chen, G. Varoquaux, F. M. Suchanek, A lightweight neural model for biomedical entity linking,
in: Proceedings of the AAAI conference on artificial intelligence, volume 35, 2021, pp. 12657–12665.
[32] J. Raiman, O. Raiman, Deeptype: multilingual entity linking by neural type system evolution, in:</p>
      <p>Proceedings of the AAAI Conference on Artificial Intelligence, volume 32, 2018.
[33] H. Wu, G. Toti, K. I. Morley, Z. M. Ibrahim, A. Folarin, R. Jackson, I. Kartoglu, A. Agrawal, C. Stringer,
D. Gale, et al., Semehr: A general-purpose semantic search system to surface semantic data from
clinical notes for tailored care, trial recruitment, and clinical research, Journal of the American
Medical Informatics Association 25 (2018) 530–537.
[34] E. Soysal, J. Wang, M. Jiang, Y. Wu, S. Pakhomov, H. Liu, H. Xu, Clamp–a toolkit for eficiently
building customized clinical natural language processing pipelines, Journal of the American
Medical Informatics Association 25 (2018) 331–336.
[35] R. Gazzotti, C. Faron-Zucker, F. Gandon, V. Lacroix-Hugues, D. Darmon, Injection of automatically
selected dbpedia subjects in electronic medical records to boost hospitalization prediction, 2020,
pp. 2013–2020. doi:1 0 . 1 1 4 5 / 3 3 4 1 1 0 5 . 3 3 7 3 9 3 2 .
[36] Z. Kraljevic, T. Searle, A. Shek, L. Roguski, K. Noor, D. Bean, A. Mascio, L. Zhu, A. A. Folarin,
A. Roberts, et al., Multi-domain clinical natural language processing with medcat: the medical
concept annotation toolkit, Artificial intelligence in medicine 117 (2021) 102083.
[37] A. Arbabi, D. Adams, S. Fidler, M. Brudno, Identifying Clinical Terms in Free-Text Notes Using</p>
      <p>Ontology-Guided Machine Learning, 2019, pp. 19–34. doi:1 0 . 1 0 0 7 / 9 7 8 - 3 - 0 3 0 - 1 7 0 8 3 - 7 _ 2 .
[38] N. Gupta, S. Singh, D. Roth, Entity linking via joint encoding of types, descriptions, and context,
in: Proceedings of the 2017 conference on empirical methods in natural language processing, 2017,
pp. 2681–2690.
[39] H. Yuan, Z. Yuan, R. Gan, J. Zhang, Y. Xie, S. Yu, Biobart: Pretraining and evaluation of a
biomedical generative language model, in: Proceedings of the 21st Workshop on Biomedical
Language Processing, 2022, pp. 97–109.
[40] D. Kartchner, J. Deng, S. Lohiya, T. Kopparthi, P. Bathala, D. Domingo-Fernández, C. S. Mitchell, A
comprehensive evaluation of biomedical entity linking models, in: Proceedings of the Conference
on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in
Natural Language Processing, volume 2023, NIH Public Access, 2023, p. 14462.
[41] J. Zhang, S. Liang, Y. Sheng, J. Shao, Temporal knowledge graph representation learning with
local and global evolutions, Knowledge-Based Systems 251 (2022) 109234.
[42] Z. Li, X. Jin, W. Li, S. Guan, J. Guo, H. Shen, Y. Wang, X. Cheng, Temporal knowledge graph
reasoning based on evolutional representation learning, in: Proceedings of the 44th international
ACM SIGIR conference on research and development in information retrieval, 2021, pp. 408–417.
[43] H. Dong, P. Wang, M. Xiao, Z. Ning, P. Wang, Y. Zhou, Temporal inductive path neural network
for temporal knowledge graph reasoning, Artificial Intelligence 329 (2024) 104085.
[44] H. Sun, J. Zhong, Y. Ma, Z. Han, K. He, Timetraveler: Reinforcement learning for temporal
knowledge graph forecasting, arXiv preprint arXiv:2109.04101 (2021).
[45] W. R. Van Hage, V. Malaisé, R. Segers, L. Hollink, G. Schreiber, Design and use of the simple event
model (sem), Journal of Web Semantics 9 (2011) 128–136.
[46] A. Miles, S. Bechhofer, Skos simple knowledge organization system reference, W3C
recommendation (2009).
[47] S. Gottschalk, E. Demidova, Eventkg–the hub of event knowledge on the web–and biographical
timeline generation, Semantic Web 10 (2019) 1039–1070.
[48] S. Gottschalk, E. Kacupaj, S. Abdollahi, D. Alves, G. Amaral, E. Koutsiana, T. Kuculo, D. Major,
C. Mello, G. S. Cheema, et al., Oekg: The open event knowledge graph, in: CLEOPATRA 2021
Cross-lingual Event-centric Open Analytics 2021, April 12 2021, Ljubiljana, Slovenia, volume 2829,
Aachen, Germany: RWTH Aachen, 2021.
[49] S. Ji, S. Pan, E. Cambria, P. Marttinen, S. Y. Philip, A survey on knowledge graphs: Representation,
acquisition, and applications, IEEE transactions on neural networks and learning systems 33 (2021)
494–514.
[50] B. Cai, Y. Xiang, L. Gao, H. Zhang, Y. Li, J. Li, Temporal knowledge graph completion: A survey,
arXiv preprint arXiv:2201.08236 (2022).
[51] M. Martínez-Romero, C. Jonquet, M. O’Connor, J. Graybeal, A. Pazos, M. Musen, Ncbo ontology
recommender 2.0: An enhanced approach for biomedical ontology recommendation, Journal of
Biomedical Semantics 8 (2017). doi:1 0 . 1 1 8 6 / s 1 3 3 2 6 - 0 1 7 - 0 1 2 8 - y .
[52] O. Kononenko, O. Baysal, R. Holmes, M. Godfrey, Mining modern repositories with elasticsearch
(2014). doi:1 0 . 1 1 4 5 / 2 5 9 7 0 7 3 . 2 5 9 7 0 9 1 .
[53] F. Lashkari, E. Bagheri, A. A. Ghorbani, Neural embedding-based indices for semantic search,
Information Processing &amp; Management 56 (2019) 733–755. URL: https://www.sciencedirect.com/
science/article/pii/S0306457318302413. doi:h t t p s : / / d o i . o r g / 1 0 . 1 0 1 6 / j . i p m . 2 0 1 8 . 1 0 . 0 1 5 .
[54] R. Guha, R. McCool, E. Miller, Semantic search, in: Proceedings of the 12th international conference
on World Wide Web, 2003, pp. 700–709.
[55] T. Wolf, L. Debut, V. Sanh, J. Chaumond, C. Delangue, A. Moi, P. Cistac, T. Rault, R. Louf, M.
Funtowicz, et al., Huggingface’s transformers: State-of-the-art natural language processing, arXiv
preprint arXiv:1910.03771 (2019).
[56] K. Song, X. Tan, T. Qin, J. Lu, T.-Y. Liu, Mpnet: Masked and permuted pre-training for language
understanding, Advances in neural information processing systems 33 (2020) 16857–16867.
[57] J. Euzenat, P. Shvaiko, J. Euzenat, P. Shvaiko, Matching strategies, Ontology Matching (2013)
149–197.
[58] D. Faria, C. Pesquita, E. Santos, M. Palmonari, I. Cruz, F. Couto, The agreementmakerlight ontology
matching system, volume 8185, 2013. doi:1 0 . 1 0 0 7 / 9 7 8 - 3 - 6 4 2 - 4 1 0 3 0 - 7 _ 3 8 .
[59] D. Faria, E. Santos, B. S. Balasubramani, M. C. Silva, F. M. Couto, C. Pesquita, Agreementmakerlight,</p>
      <p>Semantic Web (2023) 1–13.
[60] S. Chari, D. M. Gruen, O. Seneviratne, D. L. McGuinness, Directions for explainable
knowledgeenabled systems, in: Knowledge graphs for explainable artificial intelligence: Foundations,
applications and challenges, IOS Press, 2020, pp. 245–261.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>P.</given-names>
            <surname>Jensen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Jensen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Brunak</surname>
          </string-name>
          ,
          <article-title>Mining electronic health records: Towards better research applications and clinical care, Nature reviews</article-title>
          .
          <source>Genetics</source>
          <volume>13</volume>
          (
          <year>2012</year>
          )
          <fpage>395</fpage>
          -
          <lpage>405</lpage>
          .
          <source>doi:1 0 . 1 0 3 8 / n r g 3 2</source>
          <volume>0 8 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>B.</given-names>
            <surname>Goldstein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Navar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Pencina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ioannidis</surname>
          </string-name>
          ,
          <article-title>Opportunities and challenges in developing risk prediction models with electronic health records data: A systematic review</article-title>
          ,
          <source>Journal of the American Medical Informatics Association</source>
          <volume>24</volume>
          (
          <year>2016</year>
          )
          <article-title>ocw042</article-title>
          .
          <source>doi:1 0 . 1 0 9 3 / j a m i a / o c w 0</source>
          <volume>4</volume>
          <fpage>2</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Scherpf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Gräßer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Malberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Zaunseder</surname>
          </string-name>
          ,
          <article-title>Predicting sepsis with a recurrent neural network using the mimic iii database</article-title>
          ,
          <source>Computers in Biology and Medicine</source>
          <volume>113</volume>
          (
          <year>2019</year>
          )
          <article-title>103395</article-title>
          .
          <source>doi:1 0 . 1 0</source>
          <volume>1 6</volume>
          / j . c o m p
          <source>b i o m e d . 2 0</source>
          <volume>1 9 . 1 0 3 3 9 5 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>B.</given-names>
            <surname>Beaulieu-Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Orzechowski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Moore</surname>
          </string-name>
          ,
          <article-title>Mapping patient trajectories using longitudinal extraction and deep learning in the mimic-iii critical care database</article-title>
          ,
          <source>Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing</source>
          <volume>23</volume>
          (
          <year>2018</year>
          )
          <fpage>123</fpage>
          -
          <lpage>132</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Y.-W.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Faghri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Shaw</surname>
          </string-name>
          , R. Campbell,
          <article-title>Analysis and prediction of unplanned intensive care unit readmission using recurrent neural networks with long short-term memory</article-title>
          ,
          <source>PLOS ONE 14</source>
          (
          <year>2019</year>
          )
          <article-title>e0218942. doi:1 0 . 1 3 7 1 / j o u r n a l</article-title>
          .
          <source>p o n e . 0 2</source>
          <volume>1 8 9 4 2 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Lu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. H.</given-names>
            <surname>Nguyen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Dou</surname>
          </string-name>
          ,
          <article-title>Predicting Patient Readmission Risk from Medical Text via Knowledge Graph Enhanced Multiview Graph Convolution, Association for Computing Machinery</article-title>
          , New York, NY, USA,
          <year>2021</year>
          , p.
          <fpage>1990</fpage>
          -
          <lpage>1994</lpage>
          . URL: https://doi.org/10.1145/3404835.3463062.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>F. Z.</given-names>
            <surname>Smaili</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Hoehndorf</surname>
          </string-name>
          ,
          <article-title>OPA2Vec: combining formal and informal content of biomedical ontologies to improve similarity-based prediction</article-title>
          ,
          <source>Bioinformatics</source>
          <volume>35</volume>
          (
          <year>2018</year>
          )
          <fpage>2133</fpage>
          -
          <lpage>2140</lpage>
          . URL: https://doi.org/10.1093/bioinformatics/bty933.
          <source>doi:1 0 . 1 0 9</source>
          <article-title>3 / b i o i n f o r m a t i c s / b t y 9 3 3 . a r X i v : h t t p s : / / a c a d e m i c . o u p . c o m / b i o i n f o r m a t i c s / a r t i c l e - p d f / 3 5 / 1 2 / 2 1 3 3 / 4 8 9 3 5 0 4 5 / b i o i n f o r m a t i c s _ 3 5 _ 1 2 _ 2 1 3 3</article-title>
          . p d f .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Kovtun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Prinz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kasprzik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stocker</surname>
          </string-name>
          , M.-E. Vidal,
          <article-title>Towards a knowledge graph for science</article-title>
          ,
          <year>2018</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          .
          <source>doi:1 0 . 1 1</source>
          <volume>4 5 / 3 2 2 7 6 0 9 . 3 2 2 7 6 8 9 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Mao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <article-title>Knowledge graph embedding: A survey of approaches and applications, IEEE Transactions on Knowledge and Data Engineering PP (</article-title>
          <year>2017</year>
          )
          <fpage>1</fpage>
          -
          <lpage>1</lpage>
          .
          <source>doi:1 0 . 1 1</source>
          0 9 / T K D E .
          <volume>2 0 1 7 . 2 7 5 4 4 9 9 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>L.</given-names>
            <surname>Ehrlinger</surname>
          </string-name>
          , W. Wöß,
          <article-title>Towards a definition of knowledge graphs</article-title>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kiryakov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Popov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Terziev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Manov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Ognyanof</surname>
          </string-name>
          , Semantic annotation, indexing, and retrieval,
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          <volume>2</volume>
          (
          <year>2004</year>
          )
          <fpage>49</fpage>
          -
          <lpage>79</lpage>
          .
          <source>doi:1 0 . 1 0</source>
          <volume>1 6</volume>
          / j . w e b s e
          <source>m . 2 0 0 4 . 0 7 . 0 0 5 .</source>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>L.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Shea</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Qian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Masurkar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Deng</surname>
          </string-name>
          , D. Liu,
          <article-title>Patient clustering improves eficiency of federated machine learning to predict mortality and hospital stay time using distributed electronic medical records</article-title>
          ,
          <source>Journal of Biomedical Informatics</source>
          <volume>99</volume>
          (
          <year>2019</year>
          )
          <article-title>103291</article-title>
          .
          <source>doi:1 0 . 1 0 1 6 / j . j b i . 2 0</source>
          <volume>1 9 . 1 0 3 2 9 1 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>H.</given-names>
            <surname>Suresh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Guttag</surname>
          </string-name>
          ,
          <article-title>Learning tasks for multitask learning: Heterogenous patient populations in the icu (</article-title>
          <year>2018</year>
          ).
          <source>doi:1 0 . 1 1</source>
          <volume>4 5 / 3 2 1 9 8 1 9 . 3 2 1 9 9 3 0 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>R.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , W. Zhang, N. Liu,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Susceptible temporal patterns discovery for electronic health records via adversarial attack</article-title>
          ,
          <source>in: Database Systems for Advanced Applications:</source>
          26th International
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>