<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Learning semantic rules for intelligent transport scheduling in hospitals</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Pieter Bonte</string-name>
          <email>Pieter.Bonte@intec.ugent.be</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Femke Ongenae</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Filip De Turck</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IBCN research group, INTEC department, Ghent University - iMinds</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>The financial pressure on the health care system forces many hospitals to balance budget while struggling to maintain quality. The increase of ICT infrastructure in hospitals allows to optimize various workflows, which offer opportunities for cost reduction. This work-in-progress paper details how the patient and equipment transports can be optimized by learning semantic rules to avoid future delays in transport time. Since these delays can have multiple causes, semantic clustering is used to divide the data into manageable training sets.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Due to the continuing financial pressure on the health care system in Flanders, many
hospitals are struggling to balance budget while maintaining quality. The increasing
amount of ICT infrastructure in hospitals enables cost reduction, by optimizing various
workflows. In the AORTA project1, cost is being reduced by optimizing transport
logistics of patients and equipment through the use of smart devices, self-learning models
and dynamic scheduling to enable flexible task assignments. The introduction of smart
wearables and devices allows the tracking of transports and convenient notification of
personnel.</p>
      <p>
        This paper presents how semantic rules, in the form of OWL-axioms, can be learned
from historical data, to avoid future delays in transport time. These learned axioms are
used to provide accurate data to a dynamic transport scheduler, allowing an optimized
scheduling accuracy. For example, the system could learn that certain transports
during the visiting hour on Friday are often late and more time should be reserved for
those transport during that period. Since transport delays can have multiple causes,
semantic clustering is performed to divide the data in more manageable training sets.
The increasing amount of integrated ICT infrastructure in hospitals allows all facets of
these transport to be captured for thorough analysis. To learn accurate rules, a complete
overview of the various activities in the hospital is mandatory. Since this data is
resulting from various heterogeneous sources, ontologies are utilized that have proven their
strengths in data integration [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The incorporation of the domain knowledge modeled in
the ontology, allows to learn more accurate rules. Furthermore, learning semantic rules
allows to understand and validate the learned results.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>2.1</p>
      <p>
        Learning Rules
Learning rules from semantic data can be accomplished through various methods.
The most prevalent are association rule mining [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] and Inductive Logic Programming
(ILP) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. ILP is able to learn rules as OWL-axioms and fully exploits the semantics
describing the data. Incorporating this domain knowledge makes this method more
accurate. Statistical relational learning is an extension of ILP that incorporates probabilistic
data and can handle observations that may be missing, partially observed, or noisy [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
However, since our data is not noisy or possible missing, ILP was used in this research.
      </p>
      <p>
        DL-Learner [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] is an ILP framework for supervised learning in description logics
and OWL. Its Class Expression Learning for Ontology Engineering (CELOE)
algorithm [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] is a promising learning algorithm. It is a class expression learning algorithm
for supervised machine learning that follows a generate and test methodology. This
means that class expressions are generated and tested against the background knowledge
to evaluate their relevance. Furthermore, no explicit features need to be defined, since
the algorithms uses the structure of the ontology to select its features.
2.2
      </p>
      <p>Semantic Similarity
Clustering algorithms use a distance measure to have a notion of how similar two
data points are. Traditional distance measures, such as the Euclidean measure, are not
applicable to semantic data. Therefore, a semantic similarity measure is used to calculate
the semantic distance (1 semantic_similarity).</p>
      <p>
        Semantic similarity measures defines a degree of closeness or separation of target
objects [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Various semantic similarity measures exist, e.g. the Link Data Semantic
Distance [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] uses the graph information in the RDF resources, however it cannot deal
with literal values in the RDF data set.
      </p>
      <p>
        The closest to our approach is the The Maedche and Zacharias (MZ) [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] similarity
measure because it fully exploits the ontology structure. The MZ similarity differentiates
three dimensions when comparing two semantic entities (i) the taxonomy similarity, (ii)
the relation similarity and (iii) the attribute similarity. However, MZ does not take into
account that some relations between instances hold more information than others.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Data set and Ontology</title>
      <p>Real data sets were received from two hospitals describing all transports and related
information over a timespan of several months. A tailored ontology has been created
to model all transport information. It describes the transports, the hospital layout, the
patients, the personnel and their relations.</p>
      <p>Based on the characteristics of the received data, a data set was generated to conduct
our experiments on. For example, about 25% of the scheduled transports do not arrive
on time. The relevant use cases, such as described in Section 5, were provided by the
hospitals as well. An elaborate description and example of the ontology and the generated
data set can be found on http://users.intec.ugent.be/pieter.bonte/aorta/ontology/.
Notification
Manager
Wearable
Devices</p>
      <p>Message Bus</p>
      <p>Log In
Information
Location
Updates</p>
      <p>Triplestore
Context
Layer</p>
      <p>Dynamic
Scheduler</p>
      <p>Rule
Learner
– Popular visiting hours: transports taking place during the visitor hours on Friday or
during the weekends have a considerable chance of delay.</p>
      <p>LateTransport1 executedAtDay some (fFridayg)
– Busy passages: transports that use busy passages, such as a very busy elevator, are
often late and thus should be avoided on busy moments or more time should be
planned when taking this route.</p>
      <p>LateTrans port2 hasRoute some (hasSegment some (fBusyElevatorg))
– Personalized transport times: not all staff members are able to do the transports in the
same amount of time.</p>
      <p>LateTrans port3 executedBy some (fSlowNurseg)
The CELOE algorithm from DL-Learner was utilized to discover hidden axioms, such
as those mentioned above, from the knowledge base. Once the rules are found with a
reasonable accuracy, they are added to the Context Layer, so they can be taken into
consideration when scheduling future transports.
5.1</p>
      <p>Obstacles
Two obstacles occurred when learning expressions with the CELOE algorithm:</p>
      <p>As one can assume, there are various possible explanations why a set of transports
were late. The CELOE algorithm should thus result in a Class expression containing
multiple unifications to cover all possibilities. However, CELOE gives by design priority
to shorter expressions. It thus has difficulties to capture all the causes, even after
finetuning its parameters. To resolve this matter, we performed semantic clustering on the
set of late transports. Each resulting cluster then captures one of the possible causes
of the delay. If a detected cluster is sufficient large and represent a subset of data, it is
fed to CELOE separately to find a suitable explanation. These explanation are merged
afterward. The clustering of semantic data is further discussed in Section 5.2.</p>
      <p>Furthermore, learning causes such as in the first example (visitor hours on Friday
are busy) require a notion of time. However, days of the week or interesting periods,
e.g., vacations, are difficult to derive from the dateTimeStamp data type by the CELOE
algorithm. Therefore, we extended our semantic model with various concepts and
individuals to incorporate more knowledge about the time points at which the transport was
planned and eventually executed. For example, the day of the week, the period of the
day (morning, noon, etc.) and the shifts of the staff are modeled.
5.2</p>
      <p>Semantic Clustering
Calculating the semantic distance: Compared to the MZ similarity measure, we
propose a semantic similarity measure that takes into account that some relations between
certain individuals might hold more information than others. Comparable to Term
Frequency-Inverse Document Frequency (TF-IDF), which reflects how important a
word is to a document in a collection, some relations from one individual to another, are
more important to identify the similarity between two instances in a semantic knowledge
base. Since clustering takes place on the subset of transport data that were late, relations
to more frequently occurring individuals might hold more information. For example,
referring back to the first example, executedOnDay relationships to Friday occur more
frequently in the data set of late transports than other days. As such, the executedOnDay
relationships to Friday should get more weight. Formally, this means that when two
individuals have the same relation to a third individual (of type C) which is more
frequently referred to than other individuals of type C, extra weight is added to the
similarity of this relation. The similarity can be measured as:</p>
      <p>PBSim(i; j) = å
taxSim(i; j) + relSim(i; j) + attSim(i; j)
3
The calculation of taxonomy (taxSim) and attribute similarity (attSim) is similar to
those proposed by the MZ measure. The relational similarity is further elaborated upon.
To explain this similarity some additional functions and terminalogy are introduced.
D = document describing all Classes (C), Relations (Rel) and Individuals (Inds).
linkFreq(x) = jf(s; p; x)j(s; p; x) 2 D; p 2 Rel; s; x 2 Indsgj
typeFreq(x) = jf(s; p; y)j(s; p; y) 2 D; p 2 Rel; s; x; y 2 Inds;C(x) = C(y)gj
distinctTypeFreq(x) = jfyjx; y 2 Inds;C(x) = C(y)gj</p>
      <p>R(x) = f pj(x; p; y) 2 D; p 2 Rel; x; y 2 Indsg</p>
      <p>ER(x; q) = fyj(x; q; y) 2 D; q 2 Rel; x; y 2 Indsg
The similarity itself can be defined as min(1; rSim(i; j)):</p>
      <p>( linkFreq(e) distinctTypeFreq(e)
rSim(i; j) = år2R(i);r2R( j) åe2ER(i;r) typeFreq(e)
år2R(i);r2R( j) åe2ER(i;r) PBSim(e; ER( j; r)
if e 2 ER( j; r)
otherwise</p>
      <p>
        It gives more weight to relations to individuals that occur more often than others.
Clustering semantic data: To cluster the data in more managable subset, the K-Means
clustering algorithm is utilized. It calculates centroids to compute to which cluster the
various data points belong to. In the original algorithm, the centroid is an averaged
vector of all the data points in that cluster. Since there are no vectorized data points
in our cluster, this is not possible. Therefore, a mean individual is calculated for each
cluster and used as centroid, similar to the mean schema proposed by Fleischhacker et
al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The type of the centroid is based on the most occurring type of individuals in
the cluster. When no consensus can be made, the class hierarchy is taken into account.
The most occurring attributes are selected, while the specific value of the literals can be
averaged over the occurring values. When selecting the relations, the relations occurring
with more than half of the individuals in the cluster are added.
6
      </p>
    </sec>
    <sec id="sec-4">
      <title>Results</title>
      <p>A data set was generated reflecting the characteristics of a real hospital setting. Various
reasons why the transport is late are integrated in the data over various clusters. Since
DL-Learner can only discover one of the reason with an accuracy of 80.46%, and a
F-measure of 41.77%, clustering was performed. The K-Means algorithm is utilized
to compare the MZ similarity measure and our own PB similarity. Table 1 shows
the clustering precision and recall and the accuracy (DL-Acc) and F-measure (DL-F)
T P
retrieved from DL-learner on the clusters. The precision is calculated as T P+FP and the</p>
      <p>T P
recall as T P+FN . With the true positives (TP), false positives (FP) and false negatives
(FN) defines as follows:
K-Means
T P = number of transports correctly assigned to clusteri.</p>
      <p>F P = number of transports incorrectly assigned to clusteri.</p>
      <p>F N = number of transports incorrectly assigned to a cluster j, instead of clusteri.</p>
      <p>Note that the results are averages over the various clusters. The combination of
KMeans and our own similarity measures allows DL-Learner to learn the expected rules
with acceptable accuracy, which was not the case before clustering the data. Using the
MZ similarity measure, clustering often fails to identify the correct clusters. This results
in an unmanageable subset of data for DL-Learner to perform its rule learning on.
7</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion &amp; Future Work</title>
      <p>Scheduling transports can be optimized by learning from past delays. The possible
high number of various reasons for this delay can be handled by performing semantic
clustering on the data set, producing more manageable data sets for learning algorithms
such as DL-Learner’s CELOE.</p>
      <p>In future work, we will investigate how the learned rules can be incorporated to
influence the assignment. Furthermore, a statistical relational learning solution will be
researched, to be able to handle possible missing and noisy data.</p>
      <p>Acknowledgment: This research was partly funded by the AORTA project, co-funded
by the IWT, iMinds, Xperthis, Televic Healthcare, AZMM and ZNA.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bergamaschi</surname>
          </string-name>
          , et al.:
          <article-title>Semantic integration of heterogeneous information sources</article-title>
          .
          <source>Data &amp; Knowledge Engineering</source>
          <volume>36</volume>
          ,
          <fpage>215</fpage>
          -
          <lpage>249</lpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>De Raedt</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kersting</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Statistical relational learning</article-title>
          .
          <source>In: Encyclopedia of Machine Learning</source>
          , pp.
          <fpage>916</fpage>
          -
          <lpage>924</lpage>
          . Springer (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Fleischhacker</surname>
          </string-name>
          , et al.:
          <article-title>Mining rdf data for property axioms</article-title>
          .
          <source>In: On the Move to Meaningful Internet Systems: OTM</source>
          <year>2012</year>
          , pp.
          <fpage>718</fpage>
          -
          <lpage>735</lpage>
          . Springer (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Konstantopoulos</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , et al.:
          <article-title>Formulating description logic learning as an inductive logic programming task</article-title>
          .
          <source>In: FUZZ-IEEE</source>
          . pp.
          <fpage>1</fpage>
          -
          <lpage>7</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Lehmann</surname>
          </string-name>
          , J.:
          <article-title>Dl-learner: Learning concepts in description logics</article-title>
          .
          <source>J. Mach. Learn. Res</source>
          .
          <volume>10</volume>
          ,
          <fpage>2639</fpage>
          -
          <lpage>2642</lpage>
          (
          <year>Dec 2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , et al.:
          <article-title>Class expression learning for ontology engineering</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          <volume>9</volume>
          (
          <issue>1</issue>
          ),
          <fpage>71</fpage>
          -
          <lpage>81</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , et al.:
          <article-title>Sherlock: A Semi-automatic Framework for Quiz Generation Using a Hybrid Semantic Similarity Measure</article-title>
          .
          <source>Cognitive Computation</source>
          <volume>7</volume>
          (
          <issue>6</issue>
          ),
          <fpage>667</fpage>
          -
          <lpage>679</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Nebot</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Berlanga</surname>
          </string-name>
          , R.:
          <article-title>Finding association rules in semantic web data</article-title>
          .
          <source>Knowledge-Based Systems 25(1)</source>
          ,
          <fpage>51</fpage>
          -
          <lpage>62</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Ng</surname>
          </string-name>
          , R.T., et al.:
          <article-title>Clustering ontology-based metadata in the semantic web</article-title>
          .
          <source>Principles of Data Mining and Knowledge Discovery</source>
          <volume>1704</volume>
          (
          <issue>February</issue>
          ),
          <fpage>262</fpage>
          -
          <lpage>270</lpage>
          (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Passant</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Measuring semantic distance on linking data and using it for resources recommendations</article-title>
          .
          <source>In: AAAI spring symposium: linked data meets artificial intelligence</source>
          . vol.
          <volume>77</volume>
          , p.
          <volume>123</volume>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>