<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards A Rare Disease Registry Standard: Semantic Mapping of Common Data Elements Between FAIRVASC and the European Joint Programme for Rare Disease</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Beyza Yaman</string-name>
          <email>beyza.yaman@adaptcentre.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kris McGlinn</string-name>
          <email>kris.mcglinn@adaptcentre.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lucy Hederman</string-name>
          <email>lucy.hederman@adaptcentre.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Declan O'Sullivan</string-name>
          <email>declan.osullivan@adaptcentre.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mark A. Little</string-name>
          <email>mark.little@adaptcentre.ie</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ADAPT Centre, SCSS, Trinity College Dublin</institution>
          ,
          <addr-line>Dublin</addr-line>
          ,
          <country country="IE">Ireland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Trinity Translational Medicine Institute, Trinity College Dublin</institution>
          ,
          <addr-line>Dublin</addr-line>
          ,
          <country country="IE">Ireland</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper describes the extension of the FAIRVASC rare disease ontology, with Joint Research Council Common Data Elements (CDE), and mapping to the European Joint Programme on Rare Diseases (EJPRD) CDE ontology. We use the rare autoimmune condition ANCA vasculitis as a model disease to illustrate this. Semantic modelling of CDE for Rare Diseases over registry data is important to represent the specific concepts around these conditions. We describe the development of an ontology which facilitates the simultaneous uplift of tabular data into a common RDF format from several registries. The ontology allows the data to be integrated across the registries and increases the interoperability and standardisation among datasets, thus enhancing collaboration with external stakeholders. The ontology, therefore, creates an efective rare disease research environment which enables the disease and its impact on the patient to be investigated in an efective manner across national borders. This paper presents the methodology and road map to implement the CDE ontology for the health domain.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;ontology</kwd>
        <kwd>health informatics</kwd>
        <kwd>rare disease</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The quantity of digital health data sources has been growing immensely, with a resultant
increase in global complexity and size. These data sources provide structured or unstructured
clinical information which makes data management and analysis a highly important task in the
medical domain. With improved management and analysis of health data, there is significant
potential to discover new solutions for dificult health challenges, particularly in the rare disease
space, where datasets are sparse and distributed across multiple countries.</p>
      <p>
        Autoimmune disease is one such area, where the cause of the disease is unknown and
management unclear. Immune systems that are overactive are directed against self-antigens
which harm and damage the body’s own tissues (autoimmune diseases)[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The immune system
may produce antibodies that, instead of defending against infections, attack the body’s own
tissues in reaction to an unknown stimulus. The goal of autoimmune disease treatment is to
reduce the immune system activity; however, this treatment can cause the patient to have
severe infections and in some cases increases the risk of cancer. Large datasets are required to
study these challenges, but these are rarely available for use in one location, necessitating a
combination of multiple datasets.
      </p>
      <p>The European Joint Programme on Rare Diseases (EJP-RD) unites 130 institutions from
35 countries to create an efective ecosystem between research, care and medical innovation.
EJP-RD has two major objectives: i) Through the creation, demonstration, and promotion of
Europe/worldwide sharing of research and clinical data, materials, processes and expertise to
increase the integration, efectiveness, production, and social impact of rare disease research.
ii) Implement and further develop an eficient model of financial support for all categories of
rare disease research as well as rapid utilisation of research findings for patient benefit 1. This
will enhance the lives of individuals with rare diseases by giving new and improved treatment
choices and diagnostic tools. The Common Data Elements ontology is one such tool to increase
the research eficiency in the domain.</p>
      <p>
        The PersonAlisation of RelApse risk in autoimmune DISEase (PARADISE) project is a specific
example of a programme that studies the precise tailoring of immunosuppressive drugs and the
prediction of disease reactivation. PARADISE is an interdisciplinary project bringing computer
scientists, clinicians and health informatics together to solve this problem. It builds upon the
FAIRVASC EU project2 and the AVERT project3, which have established the foundation for
the proposed semantic web technology approach. The use of semantic web based ontologies
underpins the integration of diferent data sources in these projects [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Ontologies are used to
describe a domain with formalised definitions and axioms to infer more meaningful information
from the data [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The standard definition of the concepts in the domain through an ontology
can enable the straightforward integration of the data with the least efort possible. Such
definition of concepts and relations through a W3C standard based representation of the
ontology also allows for the use of pre-existing tools and applications for health data and
maximises the interoperability of systems in the domain. This is in line with the European
Commission recommendation on interoperability of electronic health record systems across
borders such that any other system or application in Europe can comprehend and interpret
the information that has been shared4. It is also significant to re-use the ontologies created in
a specific domain to increase the interoperability and reduce the engineering time and efort.
Interoperability is important in the health domain. Because as opposed to machines, which are
unable to distinguish between the same illness even when it has been documented diferently
in other registries, clinicians are able to understand the illness. Thus, semantic interoperability
establishes a shared understanding that allows computers to communicate reliably. The capacity
1https://www.ejprarediseases.org/what-is-ejprd/project-structure/
2https://fairvasc.eu/
3https://www.tcd.ie/medicine/thkc/avert/
4https://eur-lex.europa.eu/legal-content/EN/TXT/PDF/?uri=CELEX:32008H0594&amp;from=EN
of multiple ontologies to map diverse concepts to shared semantics, or meaning, is essential
for machine communication. Communicating data in an efective way is a challenging task
without semantic interoperability across diverse healthcare IT systems. In our case, we are
using declarative mapping as a way of specifying how data can be transformed from a data
model according to one ontology to a data model according to a diferent ontology.
      </p>
      <p>The use of standard-based approaches also has a huge impact on the understanding of
the data as well as concluding meaningful solutions. Another proposal of relevance is the
ISO/IEC 11179 Metadata Registry5 standard, which is a global ISO standard for expressing
metadata in a metadata registry for an organisation, which is also used for the health registry
domain. It outlines the process of standardising and registering metadata in order to make data
more comprehensible and shared. The Common Data Elements (CDE) ontology, developed by
the EU Joint Research Centre and implemented by The European Joint Programme on Rare
Diseases6(EJP-RD) is another approach to standardise the modelling of a rare disease.</p>
      <p>The primary focus of this paper is to map the CDE ontology to the FAIRVASC ontology
using the ontology developed by the EJP-RD, thereby linking FAIRVASC and EJP-RD. The
development of the mapping demonstrates the utility of taking an ontology-based approach to
support interoperability in the health domain. The CDE extension of the FAIRVASC ontology,
which represents all of the EJP-RD Common Data Elements, is therefore the study’s contribution.
Because the CDE ontology is broad and generic, we worked from the CDE elements pertaining
to our project first and created the mappings to the generic CDE ontology. Creating these
mappings enables rare disease registry interoperability to inform clinical care and increases
understanding among registries and clinicians. Thus, in this project, FAIRVASC ontology is
adopted and extended to build on the PARADISE project.</p>
      <p>The paper is structured as follows: Section 2 presents the FAIRVASC Project and EJP-RD
CDEs upon which this work is built. Section 3 introduces the PARADISE project. Section 4
discusses the newly created attributes for FAIRVASC ontology, needed to comply with the
EJP-RD proposal and the mappings between FAIRVASC and CDE ontologies. Section 5 discusses
the lessons learned from the implementation process. Finally, Section 6 presents the conclusions
and future work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. FAIRVASC Project</title>
      <p>This section presents the concepts which the PARADISE project is depending on. Section
2.1 presents the elements that underpin the relationship between FAIRVASC, PARADISE and
EJP-RD. Section 2.2 introduces the FAIRVASC ontology. Section 2.3 provides a brief introduction
to the Common Data Elements ontology proposed by EJP-RD.</p>
      <sec id="sec-2-1">
        <title>2.1. Relationship between FAIRVASC, PARADISE and EJP-RD</title>
        <p>The Rare Kidney Disease Registry and Biobank7 (RKD) is one of the seven FAIRVASC registries
and is also the source registry for the PARADISE project. It was founded in 2012 to conduct</p>
        <sec id="sec-2-1-1">
          <title>5https://www.iso.org/standard/60341.html 6https://www.ejprarediseases.org/ 7https://www.tcd.ie/medicine/thkc/research/rare.php</title>
          <p>research on rare kidney diseases in Ireland.</p>
          <p>The RKD Registry records data on most patients with ANCA vasculitis in Ireland. Clinical
study cases are distributed and tracked over multiple hospitals, research institutes and clinics.
Due to the rare nature of the disease, the low patient numbers require all the research institutes
and clinics to be involved in the study. The RKD Registry data is collected from the patients
manually through hospital registration and a mobile patient application called patientMpower8.
Using both infrastructures enables longitudinal tracking of the patient’s condition. The data
is stored in the REDCap web platform9 which provides a secure platform for managing and
maintaining online databases. REDCap provides automated export procedures as well as
common statistical packages, and ad-hoc reporting tools.</p>
          <p>As one of the rare disease registries embedded within the European Reference Network for
rare immune disorders, ERN-RITA10, the RKD registry seeks to be fully interoperable with
the registry structure envisaged by EJP-RD through its engagement with FAIRVASC. When
designing the PARADISE project, the FAIRVASC ontology was adapted and mapped onto the
CDE of the EJP-RD ontology. Figure 2 presents the relation between FAIRVASC and EJP-RD.
Registry datasets are uplifted using R2RML mappings and FAIRVASC ontology. Uplifted data
is stored in a triplestore which is then later queried via a query interface. The end-user can
pose queries using FAIRVASC and CDE ontologies using created mappings. PARADISE project
employs the extended FAIRVASC ontology for its use case.</p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. FAIRVASC Ontology</title>
        <p>FAIRVASC is a research project of the European Vasculitis Society and RITA European
Reference Network, bringing together computer scientists, clinicians and patient organisations. It
comprises ten partners across Europe that represent all aspects of care of patients with the rare</p>
        <sec id="sec-2-2-1">
          <title>8https://info.patientmpower.com/ 9https://projectredcap.org/software/ 10https://ern-rita.org/</title>
          <p>disease ANCA vasculitis. Seven national registries are partners in the project, namely, Ireland’s
RKD registry, the UK’s UKIVAS registry, the French Vasculitis Study Group registry, the Czech
Vasculitis Registry, the Polish Vasculitis Registry POLVAS, the GEVAS German/Austrian/Swiss
registry and Sweden’s Skåne Vasculitis Inception Cohort.</p>
          <p>
            The FAIRVASC ontology was created to manage data related to ANCA vasculitis and it is
based on the harmonisation of terms in the seven registries. Each registry provides feedback
on proposed harmonised clinical terms. Once agreed, these harmonised terms are formally
integrated into the FAIRVASC ontology. Protege[
            <xref ref-type="bibr" rid="ref4">4</xref>
            ], an ontology building tool, was used to
create the ontology11 (Figure 1) for a snapshot of the FAIRVASC class hierarchy in Protege. This
tool was used to define each class, its connections, and data characteristics (those in bold are
FAIRVASC classes, the rest are from the Birmingham Vasculitis Activity Score12 ontology which
was developed to standardise the representation of Birmingham Vasculitis Activity Scoring
across the registries as RDF). There are 9 top-level classes, 13 classes total, 9 object attributes,
and 24 data properties in the ontology. Patient, Patient Overview, Diagnosis, Clinical Outcomes,
Encounter, Clinical Test, and Organ Pattern are the top-level classes.
          </p>
          <p>Rather than creating a new ontology, existing ontologies (or a part of an existing ontology)
are employed in the project. NCIT, SNOMED-CT, Medical Dictionary for Regulatory Activities
Terminology (MedDRA) and the Orphanet nomenclature (ORPHAcode) ontologies are explored.
However, it was seen that some of the ontologies had highly restrictive licensing (e.g.
SNOMEDCT), thus mappings were created between the less restrictive ontologies (NCIT and Orphanet)
in order to add rich semantics to the data.
11http://ontologies.adaptcentre.ie/fairvasc/
12http://ontologies.adaptcentre.ie/bvas/</p>
        </sec>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Common Data Elements for Rare Diseases</title>
        <p>The European Joint Programme on Rare Diseases (EJP-RD) aims to enhance treatment for
individuals with rare diseases as part of wider European Union-wide initiatives to coordinate
actions to address data fragmentation concerns in European medical registries. To stimulate
and facilitate research in the rare illnesses sector, EJP-RD has designed a virtual platform
architecture, which is a service-oriented eco-system of interconnected online services. They
identified five key services: authentication and authorization, rare disease data discovery and
elaboration, data request and access, dataset enhancement (e.g. pseudonymization), and services
for making resources FAIR for federated use, such as catalogues of ontological models for rare
disease data and metadata13. EJP-RD emphasises the use of standard data representations such
as ontologies and their reuse, as well as the use case driven design. The work in this article is
well-suited to the EJP’s general structure.</p>
        <p>
          The EU Joint Research Centre initially created Common Data Elements for Rare Diseases[
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]
which comprises 16 data components that must be reported by each European rare illness
registry as they are considered crucial for future study. The elements are grouped around
8 main subjects namely pseudonymization of the patient, a patient’s personal information,
status, diagnosis, disease history, care pathway, disability as well as information on consent
for research purposes. A Common Data Elements ontology [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] was created to model the
components semantically14. The authors proposed a generic semantic data model of the set of
common data elements for rare disease registration. CDE ontology is based on Semantic Science
Integrated Ontology (SIO) [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] as the core framework for representing the entities and their
relationships. The ontology also integrates the Orphanet Rare Disease Ontology [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], Human
Phenotype Ontology [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] and National Cancer Institute Thesaurus [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] to describe the diagnoses.
The authors also provide a list of templates to convert the CDE data CSV files to RDF in a
semi-automatic way, as well as a SHeX model to enable the validation of the converted files.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. PARADISE Project</title>
      <p>The PARADISE project stems from data generated in the Irish RKD registry, which is also a
FAIRVASC registry. It addresses the greatest challenge in autoimmune disease, which is to
devise personalised strategies for precise tailoring of immunosuppressive drugs to prevent
disease relapse or “flare". PARADISE specifically focuses on ANCA vasculitis which is archetypal
heterogeneous relapsing and remitting chronic autoimmune disease. Due to the lack of strong
prognostic techniques, clinicians treating ANCA vasculitis use conventional dosage regimens
that give time-based immunosuppressive drugs dose modifications but presume no disease
progression and provide limited customised accounting. Deviations from these regimens are
based on clinician intuition and experience, as well as recognized biomarkers (e.g.,
urinalysis/autoantibody levels) and an estimate of past immunosuppressive drugs exposure. Future
lfare risk predictions are subjective, inconsistent, and frequently inaccurate.</p>
      <p>The PARADISE project is being led by the SFI ADAPT Centre and Trinity School of Medicine.
The goal is to investigate a novel solution for individualised flare risk prediction in autoimmune
13https://www.ejprarediseases.org/
14https://github.com/ejp-rd-vp/CDE-semantic-model
disease and prevention of over-treatment with immunosuppressive drugs. The PARADISE
consortium aims at solving this problem by combining semantic web technologies, clinical
expertise, targeted biomarker analysis, and patient-sourced health data by integrating readily
implementable data streams in the physician workflow and patient self-management tools.</p>
    </sec>
    <sec id="sec-4">
      <title>4. FAIRVASC-CDE Mappings</title>
      <p>In order to develop the mapping between the FAIRVASC and CDE ontologies, we adopted a
consultation process bringing together health informaticians, clinicians and computer scientists
deeply knowledgeable of the main concepts in the rare kidney disease domain. A three-step
methodology was followed to extend this ontology: i) EJP-RD CDE ontology was compared with
the FAIRVASC ontology and missing data elements in FAIRVASC ontology were detected. ii)
Missing CDE related to the existing concepts were created for the FAIRVASC ontology iii) The
mappings between the ontologies were created. Moreover, following the creation of ontology
concepts, Relational Database to RDF Mapping Language (R2RML) scripts were written to uplift
the registry data. As the FAIRVASC ontology is aligned with the RKD database, the database
has already had most of the relevant concepts in the data. During the design of the FAIRVASC
ontology, the EJP-RD concepts were constantly reviewed and, where possible, we adopted the
concepts suggested by the CDE ontology in order to increase the potential for interoperability
of RKD datasets and therefore PARADISE project. The review of the ontologies was undertaken
manually.</p>
      <p>A set of mappings (Table 1) was then developed to ensure the interoperability of the datasets
within the consortium and with CDE ontology-based datasets. Thirteen of the 16 common
elements of the CDE ontology align well with the FAIRVASC ontology. The other three
elements are not recorded in the FAIRVASC ontology: “Undiagnosed case", “Genetic case" and
“Classification of functioning, disability" attributes. The undiagnosed case does not exist in the
database because FAIRVASC data is specialised in ANCA vasculitis, so any other diagnosis is
not recorded in the database. ANCA vasculitis is not a genetic disease, thus that option also
does not exist. Similarly, the disability option is not recorded in the FARVASC ontology. This
diference is not surprising as the CDE ontology is targeted at representing diseases in a generic
manner, whereas the FAIRVASC ontology has been created to support concepts specific to
ANCA vasculitis.</p>
      <p>In particular, there are two types of mappings created for the interoperability of the datasets.
One of them is property equivalence (owl:equivalentProperty) which describes that two
properties have same“values". Other one is subproperty declaration (rdfs:subPropertyOf)
meaning property extension of the property (e.g. death) are also members of the property
extension of the high level property (e.g. status output)15. On the other hand, fvc:death
requires transformation because the death information is kept as a boolean value while CDE
ontology stores this piece of information as a string. Transformation will allow the boolean data
type to be converted to a string before the relation mapping. Another important point in the
table is that the last three rdfs:subpropertyOf relations are described as inverse relations
which means CDE semantic model concept is a subproperty of the FAIRVASC concept due to
its broadness.</p>
      <p>Listing 1: Example R2RML Mapping for Schema Uplifting
&lt;# d i a g n o s i s &gt;
r r : s u b j ec t M a p [
r r : t e m p l a t e " h t t p : / / d a t a . f a i r v a s c . i e / r e s o u r c e /
rkd / d i a g n o s i s / { RECORD_ID } { DIAGNOSIS_DATE } " ;
r r : c l a s s f v c : D i a g n o s i s ;
] ;
r r : p r e d i c a t e O b j e c t M a p [
r r : p r e d i c a t e f v c : m a i n D i a g n o s i s ;
r r : objectMap [
r r : column " DIAGNOSIS " ;
r r : d a t a t y p e xsd : s t r i n g ;
] ;
] ;
r r : p r e d i c a t e O b j e c t M a p [
r r : p r e d i c a t e f v c : hasDateAtOnset ;
r r : objectMap [
r r : column " DATE_OF_SYMPTONS " ;
r r : d a t a t y p e xsd : dateTime ;</p>
      <p>Listing 1 presents the R2RML script for the Diagnosis concept. FAIRVASC ontology already
has the Diagnosis concept, thus, missing CDE components were added to the script to be able
to produce these triples as well. FAIRVASC ontology and R2RML scripts are available in an
open repository16.</p>
      <p>The mappings are evaluated in 2 ways: i) Semantic validation is approved by the group of
15https://www.w3.org/TR/owl-ref/#subPropertyOf-def
16https://opengogs.adaptcentre.ie/yamanbey/PARADISE
10 people at the end of the discussions. ii) Technical validation is conducted by producing the
triples from the mappings which have been around 50000 triples.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Lessons Learned</title>
      <p>We conducted collaborative meetings with clinicians and computer scientists. Even if the
pandemic has been a non-pleasant experience for everyone, it paved the way for regular and timely
online meetings and discussions. Communicating computer science terms with researchers who
have medical backgrounds could be challenging, however, due to their experience with Semantic
Web since the beginning of the FAIRVASC project it was a straightforward task. Analysis of the
RKD data was conducted manually and this is a time-consuming process but it is a necessary
step to create R2RML mappings and uplift data. In our use case, the adoption of semantic
technologies and creating mappings between FAIRVASC and CDE ontologies for rare diseases
has shown the following: i) EJP-RD ontology has been implemented in a generic way to be able
to model all the rare diseases, thus, some concepts do not apply to our registry (e.g. genetic
disease concept) ii) interdisciplinary nature of the project shows that overlong problems in
health domain could be solved via mixed-proficient interest group collaborations iii) created
mappings increases the understanding between registries from various countries by specifically
declaring equality and hierarchy among concepts.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusions and Future Work</title>
      <p>The health domain data is increasing as well as its complexity and volume. Thus, it is highly
important to keep semantic interoperability as rich as possible to reduce semantic heterogeneity
and increase the understanding among registries. There is a high demand for ontologies from
the health domain and for putting the data in context. We have described the development of
a model which can be used to uplift tabular data into a common RDF format. Linkage of the
created ontology will help the data to be integrated and increase the interoperability among
datasets including collaboration with external groups and projects. Other researchers could
follow the proposed steps and create the mappings for their systems to enrich their data. On
the other hand, users who have experience with the mapped ontology could benefit from these
mappings and query data according to their knowledge. This will provide a more complex
SPARQL query feature to the system without losing the practicality.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgements</title>
      <p>This work is funded by grant EJPRD19-200, the Meath Foundation 208591 and also the Health
Research Board / Irish Nephrology Society (MRCG-2016-12). Research is also supported by
ADAPT SFI Research Centre (grant number 13/RC/2106_P2).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>P.</given-names>
            <surname>Travers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Walport</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Shlomchik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Janeway</surname>
          </string-name>
          ,
          <article-title>Immunobiology: the immune system in health and disease</article-title>
          ,
          <source>Churchill Livingstone</source>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>K.</given-names>
            <surname>McGlinn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hussey</surname>
          </string-name>
          ,
          <article-title>An analysis of demographic data in irish healthcare domain to support semantic uplift</article-title>
          ,
          <source>in: International Conference on Computational Science</source>
          , Springer,
          <year>2020</year>
          , pp.
          <fpage>456</fpage>
          -
          <lpage>467</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>R.</given-names>
            <surname>Hoehndorf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. N.</given-names>
            <surname>Schofield</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. V.</given-names>
            <surname>Gkoutos</surname>
          </string-name>
          ,
          <article-title>The role of ontologies in biological and biomedical research: a functional perspective, Briefings in bioinformatics 16 (</article-title>
          <year>2015</year>
          )
          <fpage>1069</fpage>
          -
          <lpage>1080</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>N. F.</given-names>
            <surname>Noy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Crubézy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. W.</given-names>
            <surname>Fergerson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Knublauch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. W.</given-names>
            <surname>Tu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Vendetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Musen</surname>
          </string-name>
          , Protégé-2000:
          <article-title>an open-source ontology-development and knowledge-acquisition environment</article-title>
          ., in: AMIA...
          <source>Annual Symposium proceedings. AMIA Symposium</source>
          ,
          <year>2003</year>
          , pp.
          <fpage>953</fpage>
          -
          <lpage>953</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <article-title>[5] Set of Common Data Elements , EU RD Platform , https://eu-rd-platform.jrc.ec.europa.eu/ set-of-common-dataelements_en</article-title>
          , ???? [Online; accessed
          <volume>24</volume>
          .02.
          <year>2022</year>
          ].
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kaliyaperumal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. D.</given-names>
            <surname>Wilkinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. A.</given-names>
            <surname>Moreno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Benis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cornet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. dos Santos</given-names>
            <surname>Vieira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumontier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. H.</given-names>
            <surname>Bernabe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Jacobsen</surname>
          </string-name>
          ,
          <string-name>
            <surname>C. M. Le Cornec</surname>
          </string-name>
          , et al.,
          <article-title>Semantic modelling of common data elements for rare disease registries, and a prototype workflow for their deployment over registry data</article-title>
          ,
          <source>medRxiv</source>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumontier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Baker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Baran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Callahan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chepelev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Cruz-Toledo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. R.</given-names>
            <surname>Del Rio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Duck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. I.</given-names>
            <surname>Furlong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Keath</surname>
          </string-name>
          , et al.,
          <article-title>The semanticscience integrated ontology (sio) for biomedical research and knowledge discovery</article-title>
          ,
          <source>Journal of biomedical semantics 5</source>
          (
          <year>2014</year>
          )
          <fpage>1</fpage>
          -
          <lpage>11</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>D.</given-names>
            <surname>Vasant</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chanas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Malone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hanauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Olry</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jupp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. N.</given-names>
            <surname>Robinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Parkinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rath</surname>
          </string-name>
          ,
          <article-title>Ordo: an ontology connecting rare disease, epidemiology and genetic data</article-title>
          ,
          <source>in: Proceedings of ISMB</source>
          , volume
          <volume>30</volume>
          , researchgate. net,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>P. N.</given-names>
            <surname>Robinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mundlos</surname>
          </string-name>
          ,
          <article-title>The human phenotype ontology</article-title>
          ,
          <source>Clinical genetics 77</source>
          (
          <year>2010</year>
          )
          <fpage>525</fpage>
          -
          <lpage>534</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Golbeck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Fragoso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Hartel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hendler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Oberthaler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Parsia</surname>
          </string-name>
          ,
          <article-title>The national cancer institute's thesaurus and ontology</article-title>
          ,
          <source>Journal of Web Semantics First Look</source>
          <volume>1</volume>
          _
          <issue>1</issue>
          _
          <issue>4</issue>
          (
          <year>2003</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>