<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Version control for interdependent ontologies: Challenges and first propositions</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sherbrooke</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Quebec</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Canada</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paul Fabry</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Adrien Barton</string-name>
          <email>adrien.barton@irit.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jean-Francois Ethier</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institut de Recherche en Informatique de Toulouse (IRIT)</institution>
          ,
          <addr-line>CNRS</addr-line>
          ,
          <institution>Université de Toulouse</institution>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <fpage>154</fpage>
      <lpage>165</lpage>
      <abstract>
        <p>The variety and quantity of health data have significantly increased due to healthcare making data interoperability Biomedical ontologies are a useful tool to support this interoperability. However, to provide an adequate coverage, ontological models can advantageously leverage a network of ontologies or part of ontologies, from various origins and strongly interdependent. While it fosters interoperability and minimizes duplication, this structure is very sensitive and any evolution of one of its parts requires more and more maintenance efforts. The challenge of tracking changes and assessing their effects is addressed in computer science through version control and while it has been applied to ontologies, applying this approach while importing parts of external ontologies and managing the potential impact of their evolution is a challenging task. This article focuses on OWL ontologies and two questions regarding their versioning are addressed: what should be versioned in an ontology and how to determine its identity? The current processes rely on a versioning at the level of the ontology, which raises problems that are discussed. Versioning at a more fine-grained level of socalled “OWL components” could mitigate such problems. We identify two kinds of entities as potentially relevant to their identity: the associated cognitive representations and assertive statements; being able to track both separately would be an effective way to mitigate the aforementioned problems. While further work is warranted to provide an operational definition of “OWL component”, an approach along the lines proposed here can support not only at a better change management through a cohesive, complete and fined-grained version control, but also a better import process while contributing to support ontology engineering methods.</p>
      </abstract>
      <kwd-group>
        <kwd>improvements</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Advances in modern healthcare have significantly increased the variety and quantity of
health data generated. Ontologies are well positioned to serve as mediation models to support
interoperability between diverse clinical data sources as they provide formal,
sourceindependent representations and do not rely on specific data models that might be tied to
specific technologies or formats. One such example is PARS3 [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], a distributed data access
platform that is currently being deployed to support activities of the Health Data Research
Network Canada [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and to enable the implementation of a mortality prediction tool in hospitals
in Quebec. Implementing these projects implies of series of tasks from ontology development,
to pivot relational model generation, to mappings with data sources. While this approach has
      </p>
      <p>
        2023 Copyright for this paper by its authors.
CEUR
ISSN1613-0073
been demonstrated and validated to enable the support of various tools like ReflexD [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], an
audit-feedback tool for diabetes treatment in primary care, the next step is to support the
evolution of users’ needs. Science evolves, new requirements emerge, mistakes are identified,
ideas are refined, etc. This will often imply changes in the knowledge representation generated
at the inception of the project. This ontological evolution has a significant effect on the activities
downstream of ontology development. It is critical to be able to identify what changes constitute
a modification that can affect a user’s understanding of what is represented or what should be
instantiated in a table.
      </p>
      <p>To follow on the questions above, if an ontology changes, how can one identify which of its
parts might need to be reviewed for potential adjustments? If the answer is always “all of them”,
scalability is obviously at risk. What could be the consequences of this change on the
applications which depend on it? In the context of PARS3, what are the impacts of a change in
the ontology on the mappings between a data source and the pivot relational model? If the
answer here again is to ask every source to review or redo every mapping between their assets
and the pivot model, maintenance, scalability and as a result, acceptability and sustainability
will be markedly reduced.
1.1.</p>
    </sec>
    <sec id="sec-2">
      <title>Background</title>
      <p>
        This paper focuses on ontologies expressed in OWL2, as it is the most used
DescriptionLogic language for biomedical ontologies. Good engineering principles have been proposed to
mitigate the impact of ontology change. The Open Biomedical Ontologies Foundry [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] has
proposed guidelines [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] including an emphasis on reusing the existing classes, a clearly defined
scope for ontologies to ensure orthogonality of ontologies representing different domains and
unique textual definitions following an Aristotelian form [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. These principles were followed
for the development of the ontologies supporting PARS3. A modular approach was chosen for
the creation of the ontology model with several domain ontologies addressing specific topics
such as drug prescriptions [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], laboratory results [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] or clinical forms [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. All these ontologies
are built upon the upper-level ontology Basic Formal Ontology (BFO) [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and reuse classes
and properties from reference ontologies in the biomedical domain like the Ontology of
Biomedical Investigations (OBI) [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Finally, these ontologies are linked together via
midlevel ontologies describing relevant parent categories pertaining to e.g. health procedures [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]
or services [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
      <p>Consequently, the clinical model structure consists of a network of ontologies or part of
ontologies, which are either imported or created within our group, and are strongly
interdependent. On the one hand it adds great flexibility and allows us to add representations
of various domains as needed. In addition, ontology reuse enables us to leverage the domain
expertise of other groups. On the other hand this structure is very sensitive and any change in
its parts requires more and more maintenance efforts as the model grows and evolves, which
may compromise its sustainability in the long run. Interestingly, the challenge of tracking
change, evaluating its significance, and identifying the cascading effects is also an issue in
computer science and is often addressed through version control.</p>
      <p>
        Version control has already been discussed in the context of ontologies and is even part of
the OBO Foundry principles [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] as well as FAIR [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. However, in this context it consists in
adding a version date in the metadata of the ontology. The change of the version date is at the
discretion of the authors. Yet, different types of modifications in the ontology, such as the
addition or removal of a class or the correction of an obvious typo in annotations such as a
comment, can have very different impacts and it would be useful to differentiate between them.
This is especially true since a version date refers to the whole ontology and does not in itself
reveal which classes have been modified, which is all the more challenging in the case where a
user imports only a few from a source ontology that may have several thousands.
      </p>
      <p>The objective of this article is to lay the foundation for a versioning methodology that would
allow the safe and efficient identification of what is impacted (or not) by a change in a context
of highly interconnected ontologies. The core principles concerning both ontologies and
versioning upon which this work is based and the issues at stake will be outlined, and some
recommendations will be proposed and discussed. The resulting methodology and its
implementation will be addressed in subsequent works.</p>
    </sec>
    <sec id="sec-3">
      <title>2. Versioning challenges for ontologies</title>
      <p>
        Version control as it pertains to computer science encompasses all the tools and methods for
tracking and providing control over changes to a given project [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. This control is performed
at a granularity level deemed pertinent to manage the project most efficiently without creating
an undue maintenance burden. To achieve this, artefacts of interest that are relevant to the
project will be tracked individually. In many software projects, these artefacts are text files
parts of the project’s source code or documentation for example.
      </p>
      <p>To ensure the tracking of an artefact, it is necessary to endorse a diachronic identity criterion
on artefacts that allow us to identify some of them as evolutions of the same entity. An artefact
like a computer file is identified (that is, uniquely denoted in a given context) by a resource
identifier composed by the location in the file tree and the name of the file. This resource
identifier provides a criterion of diachronic identity of the file it refers to, i.e. two files are
considered as two states (versions) of the same informational artefact if they are identified by
the same resource identifier. From a versioning perspective, even if the content of a file
completely changes over time, the different versions will be considered as the same file at
different states over time as long as they are identified by the same resource identifier. When
trying to apply this methodology to ontologies, challenges emerge.
2.1.</p>
    </sec>
    <sec id="sec-4">
      <title>Ontology as the artefact of interest for versioning</title>
      <p>One option could be to choose the ontology as the artefact of interest for the versioning.
This calls for the characterization of what an ontology is. The word “ontology” can refer,
depending on the context, to artefacts of various kinds, belonging to the fields of philosophy
and computer science, which have as a common goal the representation of entities of the world
or a particular domain of it, the categories to which they belong, their properties and the
relations that hold between them.</p>
      <p>
        In knowledge engineering, ontologies have been classically defined [
        <xref ref-type="bibr" rid="ref17 ref18">17,18</xref>
        ] as follows :
An ontology is a formal explicit specification of a shared conceptualization.
A definition grounded in a realist interpretation of the world is proposed by Smith et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]:
Ontology = def. “A representational artefact, comprising a taxonomy as proper part, whose
representations are intended to designate some combination of universals, defined classes, and
certain relations between them.”
      </p>
      <p>
        The definition from [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] above assumes the existence of universals independently of the
conceptualizations that we can make of the reality. In addition, an important element of Gruber
and Guarino's definitions is the notion of shared conceptualization, that is an ontology is meant
to be shared, usually between its authors and a community that is using it [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ].
      </p>
      <p>
        Many formalisms have been proposed for ontologies; among them, OWL is a widely used
and recognized standard [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. An OWL ontology is uniquely denoted with an internationalized
resource identifier (IRI), which can be used for versioning purposes. For example, OBI is
associated with the IRI: “http://purl.obolibrary.org/obo/obi.owl” that is specific to this
ontology. While the ontology will undergo modifications over time, e.g. by adding and deleting
classes or properties, it will be considered as the same ontology as long as its IRI remains the
same. However, using the ontology as the versioning artefact may not be the best suited method.
      </p>
      <p>Firstly, this approach considers an ontology as a monolithic artefact, one that is therefore
used as a whole. This is a questionable assumption, especially in a context of interconnected
ontologies where sometimes only parts of an ontology might be reused in another. Let’s
consider for example an ontology A that imports some part of an ontology B. If B changes,
should a new version of A be declared even if not a single class declared directly in ontology
A has been modified? If a new version of B is proposed, how to know if the classes imported
in A are affected? If they are not, should this entail the creation of a new version of A?</p>
      <p>
        Secondly, according to the W3C OWL2 reference documents [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], an ontology is defined
as: “[…] a certain kind of computational artefact – i.e., something akin to a program, an XML
schema, or a web page – generally presented as a document.” While this allows a
documentbased approach to versioning, facilitated by tools such as Git [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], this might lead to
counterintuitive consequences. For example, two classes with different namespaces located in
the same computational artefact would be part of the same ontology according to the W3C
definition above; however, in a different conception of ontology that would emphasize
namespaces, they would belong to different ontologies.
      </p>
      <p>Thirdly, the size of an ontology may represent a significant challenge for its versioning. The
Gene Ontology [22] for example has more than 40 000 classes. If the modification of one of
them leads to a new version of the whole ontology, this implies the tedious task for all its users
of checking for possible changes in all the classes at each new version. Therefore, using the
whole ontology as the most fine-grained artefact of interest for versioning is not an ideal
solution.
2.2.</p>
    </sec>
    <sec id="sec-5">
      <title>Parts of ontology as the artefact of interest for versioning</title>
      <p>An ontology proposes a representation of a domain by one or several authors. Such a
representation takes place initially in the form of a collection of cognitive representations2 of
its author on the basis of which they create publicly available informational content entities
(ICE) [23]. In the context of OWL2 ontologies, such ICEs will be called “OWL component”
(OC). At least some of those OCs are associated with an identifier, such as an International
Resource Identifier (IRI) that consists of a permanent location, a namespace and a unique string
(for example: “http://purl.obolibrary.org/obo/OBI_0000011” is an IRI from OBI), which is of
particular interest for versioning. Also at least3 some of those OCs refer to a portion of reality
that is part of the domain, in a way that can be understood by other users. These can be analyzed
using a three level framework inspired by Ceusters and Smith [24]:
• Level L1: a portion of reality;
• Level L2: the cognitive representation of this portion of reality;
• Level L3: a publicly accessible ICE, the OWL component, that emanates4 from the</p>
      <p>L2 cognitive representation and is also about the same portion of reality.</p>
      <p>What is named in the literature as a ‘class’ or ‘term’ is an important type of such OCs. OCs
can also include annotations such as definitions, or elucidations. Neuhaus [25] characterizes
some of the annotations as “assertive annotations”, i.e. “…the kind of annotations that are
intended to assert a true proposition about the domain of the ontology.” Neuhaus also regroup
assertive annotations and logical axioms under the term of “assertive statements”. A definition
would specify in natural language a way to group certain particulars, while logical axioms can
be automatically processed by a reasoner to classify the OWL components.</p>
      <p>An OC can be associated with some logical axioms. For example, a class might be
mentioned by some logical axioms.</p>
      <p>For example, the OC with the IRI “http://purl.obolibrary.org/obo/OBI_0000011” (that has
for label “planned process”) is associated to the following assertive statements (among others):
2 Cognitive representations are also concretizations of ICEs according to IAO [23].
3 “at least” because it remains a complex open question whether all OCs are about a portion of reality (e.g. object properties).
4 The precise nature of the relation of emanation is outside the scope of this work.</p>
      <p>Definition: “A process that realizes a plan which is the concretization of a plan
specification.”
planned process SubclassOf process</p>
      <p>Two kinds of relations between a class and a portion of reality are proposed. Firstly, a
relation of aboutness that is determined by the cognitive acts of its author: in the IAO
framework, inspired by Chisholm [26], the “aboutness of those of our representations
formulated in speech or writing […] is to be understood by reference to the cognitive acts with
which they are or can in principle be associated” [23]. Secondly, a relation that is determined
by the interpretation of the assertive statements by another competent reader who will make
their own cognitive representation of a portion of reality on the basis of this interpretation. In
this case, the assertive statements can be considered as describing that portion of reality (we
will presume here that any competent external user will have the same interpretation). The goal
when creating a class is that its assertive statements describe a portion of reality identical to the
one pointed to by its aboutness.
2.3.</p>
    </sec>
    <sec id="sec-6">
      <title>Identity criteria for ontological components</title>
      <p>•</p>
      <p>One may propose two identity criteria of OC based on cognitive representations and
assertive statements, depending on whether one relies on what we will call (inspired by theories
in philosophy of language [27]) the “intentionalist” or “descriptivist” conceptions of the identity
of an OC:
•</p>
      <p>R1: In the intentionalist conception, the identity is determined by its author’s related
cognitive representation.</p>
      <p>R2: In the descriptivist conception, the identity is determined by its associated
assertive statements.</p>
      <p>Hence, in the case of R1, even though the assertive statements might change significantly,
as long as the OC emanates from the same cognitive representation of the author, the identity
of the OC does not change and so the IRI should remain the same. Meanwhile in R2, even
though the author’s cognitive representation associated with an OC might change, as long as
the assertive statements are not significantly modified5, the identity of the OC does not change
and so the IRI should remain the same.</p>
      <p>The distinction between these conceptions is very rarely made explicit when creating and
sharing an ontology, and this may negatively affect its evolution or its external reuse. The
cognitive representation of the author (level 2) and the assertive statements are therefore two
aspects that need to be managed appropriately to evaluate an OC as a potential artefact of
interest for versioning, and as described above, two possible approaches, namely R1 and R2,
need to be considered.</p>
      <p>Let’s consider the following example about a hypothetical ontology about fruits, the Fruit
Ontology (FO). At time t1 the author of this ontology adds an OWL component that refers to
its own cognitive representation of the class of oranges (Citrus sinensis) in the reality. This OC
(let’s call it “OC1”) is associated with the IRI: “FO_004” and is associated with the following
assertive statements:
• Definition D1: “A citrus fruit which is the edible fruit of the orange tree and that is
orange in color.”
• orange subClassOf citrus fruit
5 We are talking about changes that might affect the understanding of the OC’s meaning, not merely a typo fix or a logically
equivalent reformulation of a statement.</p>
      <p>At time t2, the author is made aware that the definition D1 does not explicitly mention unripe
oranges, which are green, and therefore that some of the ontology’s users assume OC1 is about
only ripe oranges (Figure 1).</p>
      <p>The author is now faced with two options: either A) he considers that his initial cognitive
representation is still the relevant one and so the assertive statements need to be updated
accordingly, or B) he considers that what needs to be represented is instead ripe oranges and so
he is now working with a new cognitive representation but as it turns out, the assertive
statements derived from this new representation are similar to those initially derived.</p>
      <p>Each of these options may have an impact on the identity of OC1 that must be evaluated
according to the R1 and R2 conceptions discussed above, we thus have four distinct scenarios:
• Scenario R1A: The cognitive representation of the class of oranges is not modified
between t1 and t2 and a new definition D2 (“A citrus fruit which is the edible fruit
of the orange tree and that is orange in color when ripe, and green when unripe.”) is
proposed at t2 for clarification. In this scenario, OC1 is still conformant to the same
cognitive representation, but one of the associated assertive statements has been
modified. As the identity of OC1 is determined by the cognitive representation in
R1 and this representation didn’t change, what comes out of this scenario is a new
version of OC1, which is thus associated with the same IRI “FO_004”. (Figure 2).
• Scenario R1B: A new cognitive representation of interest (about ripe oranges) has
been identified by the author of OC1. Therefore, a new OWL component (OC2) and
thus a new IRI, is created by reusing assertive statements of OC1 (in this case the
definition). (Figure 2).</p>
      <p>Scenario R2A: R2A describes the same reality as R1A but endorses the R2
conception rather than the R1 conception. A new definition D2 has been formulated
at t2, and consequently the assertive statements have changed. In this case, a new
OC (OC3), and thus a new IRI, is created along the new assertive statements (Figure
3).</p>
      <p>Scenario R2B: Here also, R2B describes the same reality as R1B, but endorses the
R2 conception rather than the R1 conception. The assertive statements have not
been modified and thus the identity of OC1 has not changed even though OC1 is
now related to another cognitive representation (Figure 3).</p>
      <p>Let’s consider that OC1 is imported at time t1 in two fictive distinct ontologies:
• The Plant Parasites ontology (PPO), created by the author of OC1, imported it in
the axiom: medfly subClassOf (is_parasite_of some orange). As a matter of fact, the
author of PPO considered that FO_004 was about oranges at any stage of
development.
• The Juice Ontology (JO) imported it in the axiom: orange juice subClassOf
(is_made_of some orange). The authors of JO have initially imported “FO_004” as
they understood the assertive statements as describing ripe oranges.</p>
      <p>For these two ontologies, the previously mentioned scenarios will have various implication.
For example, in the scenario R1A, as OC1 is now about oranges at any stage of development,
JO needs to modify its axiom and include another OC while PPO doesn’t need to. On the
contrary, in the scenario R2B, as OC1 is now about ripe orange, JO could keep the axiom as is
while PPO need to modify it. In context of a direct import, this will lead to incompatible OCs
that might be difficult to identify, and the “live” change might affect users before the problem
can be identified and fixed.</p>
      <p>These scenarios are found in practice without being explicitly identified as such, which leads
to many ambiguities when reusing ontologies. However, each conception has its advantages
and disadvantages depending on whether one favors the author’s cognitive representation
importance in contributing to the coherence of the OC over the users’ perspective which is
based on their understanding of the assertive statements.</p>
      <p>The examples above illustrate that OCs can be good candidates as artefacts of interest for
versioning, provided that we are able to track both the cognitive representation of the author
(via the operationalization of R1) and the assertive statements (via the operationalization of
R2).</p>
    </sec>
    <sec id="sec-7">
      <title>3. Discussion</title>
      <p>Despite emerging tools that provide an invaluable help [28], managing the evolution of
interconnected ontologies is a tedious and error-prone task. Versioning processes are a way
forward to support the management of ontology evolution. In this work, two issues at the heart
of this question are addressed.
3.1.</p>
    </sec>
    <sec id="sec-8">
      <title>Artefact of interest for ontology versioning</title>
      <p>The first issue concerns the artefacts of interest, whose evolution needs to be tracked
individually. As mentioned above, the current processes imply a versioning at the level of the
ontology, but this presents some important problems, if only because it is not simple to outline
the target of versioning in a context of interconnected modular ontologies where sometimes
only parts of an ontology might be reused in another. Therefore, ontologies should no longer
be considered as standalone and monolithic but rather as highly interconnected informational
constructs, and a focus on a more fine-grained artefact is required, leading to the introduction
of so-called “OWL components” as the versioned artefacts.</p>
      <p>In the current work, we only discussed one kind of OC, namely classes. However, other
components of an ontology such as object properties, data properties or even ontological
instances are possible candidates as OCs of interest for versioning. Indeed, all these elements
can be associated with assertive statements that can change over time. Defining OCs in the least
ambiguous and most comprehensive way possible is of the utmost importance and will be the
subject of subsequent work. In addition, not all parts of an ontology are of interest for versioning
and therefore it will be important to determine whether certain OCs are not relevant for
versioning.</p>
      <p>
        It seems also relevant to consider what an ontology is in relation to OCs. On the one hand,
ontologies are conceptually envisioned through high-level definitions such as the one
previously mentioned [
        <xref ref-type="bibr" rid="ref10 ref18">10,18</xref>
        ]. On the other hand, ontologies as defined by W3C are described
as computerized constructs and manipulated via files or namespaces. As it has been discussed
in this work, the relationship between the two is not so clear cut and this calls for approaching
OWL ontologies according to principles more in line with those of software engineering.
3.2.
      </p>
    </sec>
    <sec id="sec-9">
      <title>Identity of the artefact of interest</title>
      <p>The next issue is the determination of the identity of the artefact of interest. Even tough the
OCs relevant for versioning have only been characterized through their role for versioning, two
possibly relevant aspects for their identity have been identified: their cognitive representations
and their associated assertive statements (logical axioms and assertive annotations).</p>
      <p>
        While the formal aspect of an ontology allows the use of reasoners and query tools to
automatically process an ontology, an ontology is not limited to its formal elements. An
ontology is also a way to share knowledge between communities of humans and for this goal,
assertive annotations such as natural language definitions may have a role as significant as the
logical axioms [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ].
      </p>
      <p>However, not all annotations are assertive and one needs to distinguish assertive annotations
from annotations concerning the OC itself, such as its author or the date of creation. An
annotation that will require additional exploration to arbitrate its inclusion or not as an assertive
statement is the label of an OC. Given the uncertainty around its status, it is not included in the
examples above. A natural language label in itself may be too ambiguous for playing an
assertive role: experience shows that a given label could be associated with several distinct OC
from distinct ontologies. However, labels are not randomly chosen either and the eventual
import that the semantics of natural language labels should have into ontologies will need to be
analyzed.</p>
      <p>In an ideal world, one could envision a one-to-one relation between cognitive
representations and assertive statements, as the cognitive representation of the author of an OC
would be exactly translated in its assertive statements, which in turn would be exactly
interpreted in a similar cognitive representation by the users that have only access to its assertive
statements. However, OCs may retain the same IRI while their assertive statements undergo
significant changes through time. Despite everyone’s best effort, discrepancies exist and will
likely remain and therefore the chosen versioning process must be able to handle these
situations.</p>
      <p>However, regardless of the quality of the assertive statements, an evolution of knowledge
may imply a modification of the cognitive representation associated with an OC. Since OCs are
developed in interlinked coherent groups (ontologies), the cognitive representation of the author
is likely to play an important role in keeping this group coherent. Being able to track the
cognitive representation of the author would allow the user community to know when
something fundamental changed in the way the author structures the knowledge representation
(e.g. because of a shift in science) versus when they attempt to improve the assertive statements
to better match their cognitive representation.</p>
      <p>To this end, being able to individually track the changes of both the cognitive representations
and assertive statements would be a significant advantage. As it stands though only one IRI is
associated with an OC and it is sometimes used to track the author’s cognitive representation
(R1) and sometimes the assertive statements (R2). However, it cannot do both in parallel.
Associating two IRIs, one that tracks OC identity according to conception R1, and another one
that tracks OC identity according to conception R2, would allow the tracking of both the
cognitive representation and the assertive statements. Further work should elaborate the details
of such a proposition.</p>
    </sec>
    <sec id="sec-10">
      <title>4. Conclusion and Future Work</title>
      <p>The work presented here aims at laying the foundation for a safe handling of changes to
OWL components when they are reused. Being able to track changes in the cognitive
representations of the authors or the assertive statements is the first step to further scalability
and sustainability while diminishing the incentive to build “yet another model”.</p>
      <p>Other challenges lie ahead. It will likely be desirable to identify various types of changes
without semantic implications including:
1. Changes to the computational representations e.g. changes of OWL syntax or in the
order of elements;
2. Changes in non-assertive annotations e.g. additions of contact information
annotations;
3. Changes in natural language assertive statements e.g. correcting a typographical
error or replacing a word by a perfect synonym;
4. Changes in logical axioms producing a logically equivalent result e.g. replacing
“OCA OR OCB” by “OCB OR OCA”.</p>
      <p>Automatically detecting changes of type 3 or 4 is not trivial in all cases.</p>
      <p>Finally, when the changes include the cognitive representation of the author or assertive
statements, it will be greatly beneficial to be able to have an approach to navigate through OCs
of interest and identify those not impacted by the changes, leaving only a subset for manual
review and adjustments as needed.</p>
      <p>Overall, an approach based on the principles presented here will ensure not only a better
change management through a cohesive, complete, and fined-grained version control, but also
a better import process while contributing to supporting ontology engineering methods.</p>
    </sec>
    <sec id="sec-11">
      <title>5. References</title>
      <p>[22] Carbon S, Douglass E, Good BM, Unni DR, Harris NL, Mungall CJ, et al. The Gene
Ontology resource: Enriching a GOld mine. Nucleic Acids Research 2021;49:D325–34.
https://doi.org/10.1093/nar/gkaa1113.
[23] Ceusters W, Smith B. Aboutness: Towards Foundations for the Information Artifact
Ontology. Proceedings of the Sixth International Conference on Biomedical Ontology
(ICBO), CEUR vol. 1515; 2015, p. 1–5.
[24] Ceusters W, Smith B. Foundations for a realist ontology of mental disease. Journal of</p>
      <p>Biomedical Semantics 2010;1:10. https://doi.org/10.1186/2041-1480-1-10.
[25] Neuhaus F. What is an Ontology? 2018. https://doi.org/10.48550/arXiv.1810.09171.
[26] Chisholm RM. The primacy of the intentional. Synthese 1984;61:89–109.</p>
      <p>https://doi.org/10.1007/BF00485490.
[27] Michaelson E, Reimer M. Reference. The Stanford Encyclopedia of Philosophy 2022.
https://plato.stanford.edu/archives/sum2022/entries/reference/ (accessed March 31,
2023).
[28] Matentzoglu N, Goutte-Gattat D, Tan SZK, Balhoff JP, Carbon S, Caron AR, et al.</p>
      <p>Ontology Development Kit: a toolkit for building, maintaining and standardizing
biomedical ontologies. Database 2022;2022:baac087.
https://doi.org/10.1093/database/baac087.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>[1] PARS3 • Solutions • GRIIS</article-title>
          . GRIIS n.d. https://griis.ca/en/solutions/pars3/ (accessed June 16,
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>McGrail</surname>
            <given-names>K</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Diverty</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lix</surname>
            <given-names>L</given-names>
          </string-name>
          .
          <article-title>Introducing Health Data Research Network Canada (HDRN Canada): A New Organization to Advance Canadian And International Population Data Science</article-title>
          .
          <source>IJPDS</source>
          <year>2020</year>
          ;
          <article-title>5</article-title>
          . https://doi.org/10.23889/ijpds.v5i5.
          <fpage>1493</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <article-title>[3] ReflexD • Primary health care and follow-up for people with diabetes • Solutions • GRIIS</article-title>
          . GRIIS n.d. https://griis.ca/en/solutions/reflexd/ (accessed June 16,
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Smith</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ashburner</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosse</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bard</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bug</surname>
            <given-names>W</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ceusters</surname>
            <given-names>W</given-names>
          </string-name>
          , et al.
          <article-title>The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration</article-title>
          .
          <source>Nat Biotechnol</source>
          <year>2007</year>
          ;
          <volume>25</volume>
          :
          <fpage>1251</fpage>
          . https://doi.org/10.1038/nbt1346.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>OBO</given-names>
            <surname>Foundry - Principles overview</surname>
          </string-name>
          n.d. http://obofoundry.org/principles/fp-000- summary.
          <source>html (accessed June 16</source>
          ,
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Seppälä</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruttenberg</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            <given-names>B</given-names>
          </string-name>
          .
          <article-title>The Functions of Definitions in Ontologies</article-title>
          .
          <source>Formal Ontology in Information Systems</source>
          <year>2016</year>
          :
          <fpage>37</fpage>
          -
          <lpage>50</lpage>
          . https://doi.org/10.3233/978-1-
          <fpage>61499</fpage>
          -660- 6-37.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Ethier</surname>
            <given-names>J-F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goyer</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fabry</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barton</surname>
            <given-names>A</given-names>
          </string-name>
          .
          <article-title>The prescription of drug ontology 2.0 (PDRO): More than the sum of its parts</article-title>
          .
          <source>International Journal of Environmental Research and Public Health</source>
          <year>2021</year>
          ;
          <volume>18</volume>
          . https://doi.org/10.3390/ijerph182212025.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Barton</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fabry</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lavoie</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ethier J-F. LABO</surname>
          </string-name>
          :
          <article-title>An ontology for laboratory test prescription and reporting. 2019 Joint Ontology Workshops Episode V: The Styrian Autumn of Ontology</article-title>
          ,
          <source>JOWO</source>
          <year>2019</year>
          , vol.
          <volume>2518</volume>
          ,
          <string-name>
            <surname>Graz</surname>
          </string-name>
          ; Austria:
          <article-title>CEUR-WS</article-title>
          .org;
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Fabry</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barton</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ethier J-F. QUESTO -</surname>
          </string-name>
          <article-title>An ontology for questionnaire</article-title>
          .
          <source>ICBO|ODLS 2020 International Conference on Biomedical Ontologies</source>
          <year>2020</year>
          , vol.
          <volume>2807</volume>
          ,
          <string-name>
            <surname>Bolzano</surname>
          </string-name>
          ,
          <source>Italy: CEUR-WS.org; 2020</source>
          , p.
          <source>B</source>
          .
          <volume>1</volume>
          -
          <fpage>12</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Arp</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spear</surname>
            <given-names>AD</given-names>
          </string-name>
          .
          <article-title>Building ontologies with Basic Formal Ontology</article-title>
          . Cambridge, Massachusetts: Massachusetts Institute of Technology;
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Bandrowski</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brinkman</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brochhausen</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brush</surname>
            <given-names>MH</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bug</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chibucos</surname>
            <given-names>MC</given-names>
          </string-name>
          , et al.
          <article-title>The Ontology for Biomedical Investigations</article-title>
          .
          <source>PLOS ONE</source>
          <year>2016</year>
          ;
          <volume>11</volume>
          :e0154556. https://doi.org/10.1371/journal.pone.
          <volume>0154556</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Fabry</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goyer</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barton</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ethier</surname>
            <given-names>J-F</given-names>
          </string-name>
          .
          <article-title>An Ontological Analysis of Health Procedure Information</article-title>
          . vol.
          <volume>3073</volume>
          ,
          <year>2021</year>
          , p.
          <fpage>36</fpage>
          -
          <lpage>47</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Fabry</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goyer</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barton</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ethier</surname>
            <given-names>J-F</given-names>
          </string-name>
          .
          <article-title>An informational perspective on the ontology of services</article-title>
          . vol.
          <source>CEUR Workshop Proceedings, CEUR-WS.org; 2022</source>
          , p.
          <source>paper 5.</source>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14] OBO Foundry n.d. https://obofoundry.org/principles/fp-004-versioning.
          <source>html (accessed March 30</source>
          ,
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Wilkinson</surname>
            <given-names>MD</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumontier</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aalbersberg</surname>
            <given-names>IjJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Appleton</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Axton</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baak</surname>
            <given-names>A</given-names>
          </string-name>
          , et al.
          <article-title>The FAIR Guiding Principles for scientific data management and stewardship</article-title>
          .
          <source>Sci Data</source>
          <year>2016</year>
          ;
          <volume>3</volume>
          :
          <fpage>160018</fpage>
          . https://doi.org/10.1038/sdata.
          <year>2016</year>
          .
          <volume>18</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Plaice</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wadge</surname>
            <given-names>WW</given-names>
          </string-name>
          .
          <article-title>A New Approach to Version Control</article-title>
          .
          <source>IEEE Transactions on Software Engineering</source>
          <year>1993</year>
          ;
          <volume>19</volume>
          :
          <fpage>268</fpage>
          -
          <lpage>76</lpage>
          . https://doi.org/10.1109/32.221137.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Gruber</surname>
            <given-names>TR.</given-names>
          </string-name>
          <article-title>A translation approach to portable ontology specifications</article-title>
          .
          <source>Knowledge Acquisition</source>
          <year>1993</year>
          ;
          <volume>5</volume>
          :
          <fpage>199</fpage>
          -
          <lpage>220</lpage>
          . https://doi.org/10.1006/knac.
          <year>1993</year>
          .
          <volume>1008</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Guarino</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oberle</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Staab</surname>
            <given-names>S</given-names>
          </string-name>
          . What Is an Ontology? In: Staab S,
          <string-name>
            <surname>Studer</surname>
            <given-names>R</given-names>
          </string-name>
          , editors.
          <source>Handbook on Ontologies</source>
          , Berlin, Heidelberg: Springer;
          <year>2009</year>
          , p.
          <fpage>1</fpage>
          -
          <lpage>17</lpage>
          . https://doi.org/10.1007/978-3-
          <fpage>540</fpage>
          -92673-
          <issue>3</issue>
          _
          <fpage>0</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Neuhaus</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hastings</surname>
            <given-names>J</given-names>
          </string-name>
          .
          <article-title>Ontology development is consensus creation, not (merely) representation</article-title>
          .
          <source>AO</source>
          <year>2022</year>
          ;
          <volume>17</volume>
          :
          <fpage>495</fpage>
          -
          <lpage>513</lpage>
          . https://doi.org/10.3233/AO-220273.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20] OWL 2
          <string-name>
            <given-names>Web</given-names>
            <surname>Ontology Language Primer (Second Edition</surname>
          </string-name>
          ) n.d. https://www.w3.org/TR/2012/REC-owl2
          <string-name>
            <surname>-</surname>
          </string-name>
          primer-20121211/ (accessed June 16,
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Chacon</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Straub</surname>
            <given-names>B. Pro</given-names>
          </string-name>
          <string-name>
            <surname>Git</surname>
          </string-name>
          . Apress;
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>