<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Leiden, Netherlands
sepideh.hooshafza@adaptcentre.ie (S. Hooshafza); gaye.stephens@adaptcentre.ie (G. Stephens);
mark.little@adaptcentre.ie (M. Little); beyza.yaman@adaptcentre.ie (B. Yaman)</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Temporal Data Modelling Evaluation in Knowledge Graphs: A Healthcare Use Case</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sepideh Hooshafza</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gaye Stephens</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mark A. Little</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Beyza Yaman</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ADAPT Research Centre, School of Computer Science and Statistics, Trinity College Dublin</institution>
          ,
          <addr-line>Dublin</addr-line>
          ,
          <country country="IE">Ireland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Trinity Health Kidney Centre, Trinity College Dublin</institution>
          ,
          <addr-line>Dublin</addr-line>
          ,
          <country country="IE">Ireland</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>Healthcare data, such as patients' symptoms, laboratory test results, and various clinical measurements are temporal in nature, and are associated with a time. Modelling temporal healthcare data could benefit healthcare practitioners in healthcare decision making and support patient care. One method for modelling data that has been used in academia and industry is RDF-based knowledge graphs (KGs). Many approaches have been proposed to model temporal data in RDF-based KGs which have not been evaluated systematically in the healthcare domain. In this paper, an evaluation framework is proposed for the evaluation of temporal data modeling approaches in KGs and has been applied in a healthcare use case.1 Modelling temporal healthcare data supports medical professionals to comprehend disease patterns, evaluate patient histories, identify relationships in clinical events, and make informed predictions for improved patient care [1, 2]. A number of RDF-based approaches to modelling temporal data in KGs exist, including “standard reification”, and “singleton property”[3, 4]. However, to the best of our knowledge, the current RDF based temporal data modelling approaches have not been systematically evaluated using a healthcare use case. Existing approaches have produced different results in terms of complexity, and performance which require further evaluation and comparison [5]. In this study, an evaluation framework is proposed to evaluate temporal data modelling approaches in KGs. The evaluation framework components consist of six phases including data modeling approaches identification, use case identification, KG creation, KG hosting, KG deployment, metrics identification and evaluation. The evaluation framework was applied to a healthcare use case that addresses modeling medication data for patients with the rare disease anti-neutrophil cytoplasmic antibody (ANCA) Associated Vasculitis (AAV) in FAIRVASC [6, 7] project. The framework will guide data and knowledge engineers in evaluating various temporal data modeling approaches within KGs.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>2. Experiment</title>
      <p>Two well-known approaches for adding temporal data to a KG were chosen including singleton
property and standard reification. The evaluation was performed based on the six phases of the
evaluation framework and the healthcare use case. The dataset contains a total of 600 patients.
The data included details regarding the medications utilized for patients, including both the start
and stop dates for each medication. Two ontologies were designed based on identified
approaches, RDF data was generated using the R2RML engine. RDF data were imported into a
triple store. A competency question was designed and RDF data was queried using SPARQL.</p>
      <p>The comparative analysis between the singleton property and standard reification approaches
provides insights into their performance and complexity within the realm of RDF data modeling.
In terms of modeling complexity, the standard reification approach is more complex than the
singleton property approach. While assessing performance metrics, there isn't a notable
difference between the two approaches, primarily attributed to the limited size of the dataset.
Modelling Number of Statements
Complexity Additional triple generation</p>
      <p>Resource redundancy
Performance Data load time in triple store</p>
      <p>Query length requirement to execute a particular task
Query response time</p>
    </sec>
    <sec id="sec-3">
      <title>3. Conclusion</title>
      <p>This study proposed an evaluation framework for evaluating temporal data modeling approaches
in KGs. The framework can guide data and knowledge engineers in evaluating various temporal
data modeling approaches within KGs. With this knowledge, they will be able to choose the
methods that will best meet their needs when modeling temporal health data in graph databases.</p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgements References</title>
      <p>This work was funded by the ADAPT Centre for Digital Content Technology under the SFI
Research Centres Programme (Grant13/RC/2106-P2).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <source>[1] [2] [3] [4] [5] [6]</source>
          [7]
          <string-name>
            <given-names>N</given-names>
            <surname>Poh</surname>
          </string-name>
          , N.,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tirunagari</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Windridge</surname>
          </string-name>
          .
          <article-title>Challenges in designing an online healthcare platform for personalised patient analytics</article-title>
          .
          <source>in 2014 IEEE Symposium on Computational Intelligence in Big Data (CIBD)</source>
          .
          <year>2014</year>
          Combi,
          <string-name>
            <surname>C.</surname>
          </string-name>
          , G. Cucchi, and
          <string-name>
            <given-names>F.</given-names>
            <surname>Pinciroli</surname>
          </string-name>
          ,
          <article-title>Applying object-oriented technologies in modeling and querying temporally oriented clinical databases dealing with temporal granularity and indeterminacy</article-title>
          .
          <source>IEEE Trans Inf Technol Biomed</source>
          ,
          <year>1997</year>
          .
          <volume>1</volume>
          (
          <issue>2</issue>
          ): p.
          <fpage>100</fpage>
          -
          <lpage>27</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Hernández</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hogan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Krötzsch. Reifying</surname>
          </string-name>
          <string-name>
            <surname>RDF</surname>
          </string-name>
          :
          <article-title>What works well with wikidata? 2015.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Nguyen</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Bodenreider</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Sheth</surname>
          </string-name>
          .
          <article-title>Don't like RDF reification? Making statements about statements using singleton property</article-title>
          .
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Magkanaraki</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , et al.
          <article-title>Benchmarking RDF schemas for the Semantic Web</article-title>
          .
          <source>in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)</source>
          . 2002 Yaman,
          <string-name>
            <surname>B.</surname>
          </string-name>
          , et al.,
          <article-title>Towards A Rare Disease Registry Standard: Semantic Mapping of Common Data Elements Between FAIRVASC and the European Joint Programme for Rare Disease McGlinn</article-title>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          , et al.,
          <article-title>FAIRVASC: A semantic web approach to rare disease registry integration</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <source>Computers in Biology and Medicine</source>
          ,
          <year>2022</year>
          .
          <volume>145</volume>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>