<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>September</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>OxO2 - A SSSOM Mapping Browser for Logically Sound Crosswalks</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Henriette Harmse</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Haider Iqbal</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Helen Parkinson</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>James McLaughlin</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Samples</institution>
          ,
          <addr-line>Phenotypes and Ontologies Team (SPOT), EMBL-EBI, Wellcome Genome Campus, Hinxton, CB10 1SD, Cambridgeshire</addr-line>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>1</volume>
      <fpage>0</fpage>
      <lpage>12</lpage>
      <abstract>
        <p>EMBL-EBI created OxO to enable users to map between datasets that are annotated with diferent ontologies. Mappings identified by the first version of OxO were not necessarily logically sound, lacked important provenance information such as author and reviewer, and could timeout or crash for certain requests. In this paper we introduce OxO2 to address these concerns. Provenance is addressed by implementing SSSOM, a mapping standard that defines provenance for mappings. SSSOM defines the conditions under which logical sound mappings can be derived and is implemented in OxO2 using Nemo, a Datalog rule engine. To ensure reasoning is performant and memory eficient, Nemo implements a number of strategies that ensures OxO2 will be stable for all requests. Due to these changes, OxO2 users will be able to integrate between disparate datasets with greater confidence.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;OxO</kwd>
        <kwd>mapping</kwd>
        <kwd>crosswalk</kwd>
        <kwd>ontologies</kwd>
        <kwd>FAIR Principles</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Maps to</p>
      <p>OMIM:615770</p>
      <p>ATRIAL</p>
      <p>
        FIBRILLATION,
FAMILIAL, 15; ATFB15
ontology concepts. The notion of an xref is intentionally loosely defined as part of the Open Biological
and Biomedical Ontology (OBO) Flat File Format Specification, which has historically been used to
define ontologies in the bioinformatics community [
        <xref ref-type="bibr" rid="ref12 ref13 ref14">12, 13, 14</xref>
        ]. An xref mapping may be used to indicate
an exact match, but it could as well refer to a broader or narrower match. In fact xrefs may have legio
meanings.
      </p>
      <p>
        A key feature of OxO1 is that it enables walking across ontologies – from a concept in one ontology
to a concept in the next ontology – referred to as a crosswalk. This is illustrated in Figure 1, where the
crosswalk between MONDO:0004981 and DOID:1579 is realized by mapping from MONDO:0004981
to EFO:0000275, to OMIM:615770, and finally to DOID:1579. This is referred to as a mapping at a
distance of 3, since mappings between ontologies were performed 3 times. Distances greater than 3 are
not supported in OxO1 [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Since xrefs are not well defined, the meaning of these kinds of crosswalks
across multiple ontologies are also weakly defined.
      </p>
      <p>Despite the loose definition of mappings in OxO1, it supports the following user groups.
Researchers, such as biologists seeking new treatments for heart disease, who may want to study the
phenotype "enlarged heart".</p>
      <p>Resource providers or data analysts integrating datasets annotated with diferent ontologies, such
as ZP and MP.</p>
      <p>
        At EMBL-EBI, OxO1 is used by resources such as European Variation Archive (EVA) [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], the
GenomeWide Association Study (GWAS) Catalog [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] and Europe PMC [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
      </p>
      <p>
        The Simple Standard for Sharing Ontological Mappings (SSSOM) provides precise definitions for
mappings, along with related provenance information, and clearly defines the conditions under which
new mappings can be derived from existing mappings [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. In this paper we introduce OxO2, a
new ontology mapping service that implements SSSOM and enables browsing of mappings between
ontologies with logically sound crosswalks.
      </p>
      <p>In the next section (Section 2), we give a brief overview of SSSOM and explain how it enables
logically sound crosswalks. Section 3 describes the value proposition of OxO2. In Section 4, we
motivate the design decisions of the OxO2 implementation, and in Section 5, we review related SSSOM
implementations. Finally, in Section 6, we discuss the current limitations of our OxO2 implementation
and future developments under consideration.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Background</title>
      <p>In this section, we explain how OxO2 uses Datalog (Section 2.3) to derive logically sound and
complete mappings based on SSSOM chain rules (Section 2.2), in a performant and memory eficient
manner (Section 2.4).</p>
      <sec id="sec-2-1">
        <title>2.1. An Overview of SSSOM</title>
        <p>
          SSSOM [
          <xref ref-type="bibr" rid="ref18 ref19 ref20 ref21">18, 19, 20, 21</xref>
          ] is briefly summarised here, along with an example to illustrate the value
proposition of OxO2.
        </p>
        <p>subject
subject_id = DOID:8567
subject_label = Hodgkin's lymphoma</p>
        <p>predicate
predicate_id = SKOS:exactMatch
mapping_date = 2022-02-23
mapping_set_id = http://w3id.org/mapping_commons/disease-mappings/onto-icd10
mapping_tool = https://github.com/mapping-commons/sssom-py
license =
https://github.com/mapping-commons/mapping-commons.github.io/blob/main/docs/original_license_applies.md</p>
        <p>
          Mappings are defined as a subject mapping via a predicate to an object. In SSSOM, this is
expressed as a subject_id mapping via a predicate_id to an object_id, where the subject_id,
predicate_id and object_id are each expressed as abbreviated Uniform Resource Identifiers (URIs),
called CURIEs [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]. In general all identifiers in SSSOM are CURIEs, where CURIEs can refer to entities
including ontology concepts, database references etc. To define the provenance of a mapping, a mapping
is described using SSSOM standard metadata elements, such as subject_label, author_id (see
Table 1).
        </p>
        <p>
          For ease of use, SSSOM defines a tab-separated values (TSV) table format for representing
mappings. Table 2 defines three mappings, firstly that DOID:8567 maps via SKOS:exactMatch to
NCIT:C9357, secondly that HP:0012189 maps via OWL:equivalentClass to DOID:8567, and lastly
that MONDO:0009348 maps via SKOS:closeMatch to HP:0012189. To define the metadata of a set
of mappings, a mappings TSV file can be prepended with a header in YAML (a human-friendly data
serialization) format [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ]. The corresponding header for Table 1 is provided in Listing 1.
        </p>
        <p>Listing 1: YAML Header that defines the mapping set for the mappings in Table 2
# m a p p i n g _ s e t _ i d : h t t p s : / / f o i s 2 0 2 5 . com / e x a m p l e . mappings . t s v
# c u r i e _ m a p :
# DOID : h t t p : / / p u r l . o b o l i b r a r y . o r g / obo / DOID_
# HP : h t t p : / / p u r l . o b o l i b r a r y . o r g / obo / HP_
# MONDO: h t t p : / / p u r l . o b o l i b r a r y . o r g / obo /MONDO_
# NCIT : h t t p : / / p u r l . o b o l i b r a r y . o r g / obo / NCIT_
# OWL: h t t p : / / www. w3 . o r g / 2 0 0 2 / 0 7 / owl #
# SKOS : h t t p : / / www. w3 . o r g / 2 0 0 4 / 0 2 / s k o s / c o r e #</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Chain Rules</title>
        <p>
          With SSSOM it is possible to derive mappings from existing ones in a way that ensures derived mappings
are logically sound. This is achieved by providing chain rules that define the circumstances under
which mappings can be composed from existing mappings. A rule defines the conditions under which
an action will occur and follows an if-then structure. Rule chaining (or rule chains) refers to the
mechanism where the execution of one rule triggers the execution of subsequent rules. Execution stops
when no more rules are applicable [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ]. SSSOM chain rules are categorized as transitivity rules, rule
chains over exact and equivalent matches, inverse rules and generalization rules. The approach that
SSSOM takes is to provide reasonable defaults for mapping tools. Hence, SSSOM defines transitive rules
for skos:narrowMatch and skos:broadMatch [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ], despite them not being defined as transitive in
the Simple Knowledge Organization System (SKOS) specification [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ]. In total, the SSSOM chain rules
define 22 rules that can be applied to SSSOM mappings, all of which are implemented and used in OxO2.
        </p>
        <p>
          For illustration purposes, we will focus only on chain rules over exact and equivalent matches, labeled
as RCE2 in the SSSOM specification [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ]. RCE2 has two variants, which apply to mappings where
the predicate_id is either SKOS:exactMatch or OWL:equivalentClass. These two variants are
defined in Listings 2 and 3.
        </p>
        <p>− [] −  ,
 − [OWL:equivalentClass] −</p>
        <p>→  − [] − 
 − [] −  ,
 − [SKOS:exactMatch] − 
→  − [] −</p>
        <sec id="sec-2-2-1">
          <title>Listing 2: RCE2-1</title>
        </sec>
        <sec id="sec-2-2-2">
          <title>Listing 3: RCE2-2</title>
          <p>Listing 2 states that when we have CURIEs ,  and , such that  is mapped to  via an arbitrary
predicate_id , and  is mapped to  via the predicate_id OWL:equivalentClass, then we
can derive that  maps to  via the same arbitrary predicate_id . The meaning of Listing 3 follows
in a similar fashion.</p>
          <p>Based on the mappings in Table 2, we can derive the two new mappings shown in Listings 4 and 5, by
applying the chain rules of Listing 2 and 3, respectively. In Listing 4 we derive that MONDO:0009348
(classic Hodgkin lymphoma) has a SKOS:closeMatch with DOID:8567 (Hodgkin’s lymphoma), by
applying the chain rule of Listing 2. By applying the chain rule of Listing 3, Listing 5 derives that
MONDO:0009348 has a SKOS:closeMatch with NCIT:C9357 (Hodgkin Lymphoma), by using both
an existing mapping from Table 2 and the derived mapping from Listing 4.</p>
        </sec>
        <sec id="sec-2-2-3">
          <title>Listing 4: Apply RCE2-1</title>
          <p>MONDO:0009348 − [SKOS:closeMatch] − HP:0012189,
HP:0012189 − [OWL:equivalentClass] − DOID:8567
→ MONDO:0009348 − [SKOS:closeMatch] − DOID:8567</p>
        </sec>
        <sec id="sec-2-2-4">
          <title>Listing 5: Apply RCE2-2</title>
          <p>MONDO:0009348 − [SKOS:closeMatch] − DOID:8567,
DOID:8567 − [SKOS:exactMatch] − NCIT:C9357
→ MONDO:0009348 − [SKOS:closeMatch] − NCIT:C9357</p>
        </sec>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Datalog</title>
        <p>In this section we show that SSSOM chain rules can be expressed as Datalog rules, and hence, all
characteristics of Datalog rules also apply to SSSOM chain rules. OxO2 uses Datalog to derive logically
sound and complete mappings in a performant and memory eficient way.</p>
        <p>
          Datalog is a declarative logic programming language designed for logical inference. A Datalog rule
defines the premises from which a conclusion follows and has an if-then structure similar to chain rules.
A Datalog program consists of Datalog rules. Datalog rules are applied recursively which means that
the result of a rule can be used by other rules to derive new information [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ]. A Datalog rules engine is
an application that, given a set of facts and a set of rules, it can infer new facts in a way that is logically
sound. Since Datalog is declarative, it means that one can state the rules that are applicable, without
having to state how the inferences are to be determined. The Datalog rules engine is responsible for
implementing the method(s) of determining inferences [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ].
        </p>
        <p>
          To ensure that all facts that can be derived from a Datalog program are finite, 2 conditions must hold:
ifrstly, all facts must be constants, and secondly, all variables in the conclusion of a rule, must appear in
the body of a rule. We note that these conditions hold for SSSOM chain rules [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ]:
1. All SSSOM mappings are without variables, as seen from the first mapping in Table 2,
which states that DOID:8567 maps via SKOS:exactMatch to NCIT:C9357 where DOID:8567,
SKOS:exactMatch and NCIT:C9357 are all constants.
2. The second condition holds for the chain rule defined in Listing 2. The conclusion, given after
the →, is  − [] −  where ,  and  are variables. These variables appear in the premises
 − [] −  and  − [OWL:equivalentClass] −  where  and  appear in the first premise
and  in the second premise. Doing a similar check, shows that condition 2 holds for Listing 3 as
well. Note that this condition only applies to variables and not constants. Hence, any constant,
such as SKOS:exactMatch can appear in the conclusion without appearing in any premises. We
note that for all SSSOM chain rules [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ] this condition holds.
        </p>
        <p>
          Datalog has some characteristics [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ] that are important for OxO2.
        </p>
        <p>Termination For a finite set of facts, Datalog inferencing always terminates. Hence, the OxO2
dataloader can assume inferencing is complete when the inferencing algorithm terminates.
Soundness All Datalog inferences are guaranteed to follow logically from the assumed facts. For</p>
        <p>OxO2 this means inferred mappings can be trusted to be logically correct.</p>
        <p>Completeness means that all inferences that can be derived from the facts, are indeed derived. Hence,</p>
        <p>OxO2 is guaranteed to include all mappings that follow logically from asserted mappings.
Favourable time and space computational complexity The output of the Datalog inference
algorithm, and the time it takes to run till it terminates, is polynomial in the size of the input. This
means the time and space needs of Datalog inferencing is limited by a polynomial, such as ,
where  represents the input size and  is a constant. This is a positive result when compared
to time and space needs that can be exponential, e.g., . To illustrate the diference between
polynomial time and exponential time, assume the input size is  = 10 and  = 2, then the
polynomial time is  = 102 = 100, while exponential time is  = 210 = 1024. If we now
change the input size to  = 100, we see that the polynomial time is still reasonable, whereas the
exponential time is not 1. This result indicates that theoretically the Datalog inference algorithm
is feasible.</p>
      </sec>
      <sec id="sec-2-4">
        <title>2.4. Strategies for optimizing Datalog Reasoning</title>
        <p>Datalog reasoning has some challenges when applied to large knowledge graphs and hence a number
of strategies are employed to deal with these challenges. We mention the strategies here that are used
in the implementation of OxO2.</p>
        <p>
          The most basic approach to Datalog inferencing is the Naive Evaluation algorithm. It starts with the
original facts, and in the first step it derives new facts by applying all the rules to the original facts.
Subsequent steps apply the rules to both original facts and derived facts, repeatedly, until no new facts
can be added. This can result in the same inferences being made repeatedly. To avoid this redundancy,
an optimization is to apply subsequent rules only to facts derived in the previous iteration, which is
called Seminaive Evaluation [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ].
        </p>
        <p>
          A related concern is the explanation of inferences, which is known to be exponential in the number of
facts. This is because there can be multiple explanations for a single inference. To avoid this explosion
of the number of explanations, a technique called tracing is used. Instead of recording all possible
explanations, tracing aims to provide only 1 explanation for a given inference. This approach ensures
that the explanation of all inferences is feasible [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ].
        </p>
        <p>
          To ensure fast and compact in-memory data storage, a column-based storage layout is used which
enables eficient compression schemes. However, column-based storage increases the cost of updates
(e.g. when new inferences are made). For this reason each rule application is written to a separate delta
table [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ]. To reduce the cost of combining delta tables, combined deltas are cached [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ].
        </p>
        <p>
          Even with the above optimizations, users can wait a long time when inferences are done in realtime.
Therefore, it is recommended to precompute inferences to improve the user experience [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ].
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Results</title>
      <sec id="sec-3-1">
        <title>3.1. OxO1 Challenges</title>
        <p>
          The key value proposition of OxO1 is that it is able to identify potentially related mappings that could
either map directly or indirectly up to a distance of 3 to other CURIEs. OxO1 has 3 main challenges.
1. The semantics of the mappings it returns are vague, which is exacerbated for mappings at a
distance greater than 1. In OxO1 it is possible to derive mappings that have the form of Listing 6:
1There are many other computational complexity classes besides polynomial and exponential. See [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ] for details.
 − [SKOS:broadMatch] −  ,
 − [SKOS:narrowMatch] − 
→  − [meaning?] − 
2. Returning mappings at a distance of 3 is only feasible when requesting mappings for a small set
of CURIEs. This problem became more accentuated as the number of mappings and ontologies
in OxO1 grew, with queries either timing out, or the OxO1 server crashing. To find mappings
for a CURIE at distance 1, OxO1 queries all mappings in its Neo4J database. As these mappings
are stored in the database, returning these results are highly eficient. For retrieving mappings
at a distance of 2 or 3, OxO1 determines derived mapping at query time using the following
algorithm. It first determines the cross product of all terms in the database. This result is used
to define possible starting and ending nodes, for which all possible paths of length 2 or 3 are
determined, depending on the distance specified. The computational complexity of the cross
product is  × , where  is the number of terms. The computational complexity for determining
all paths depends on the length of the path (or distance) , and the average degree of the nodes of
the graph under consideration, which is denoted by . The degree of a node is determined by
the number of inbound and outbound edges connected to it. The computational complexity for
determining all paths is given by . Hence, the total computational complexity for the OxO1
algorithm is  × 2. At the time of writing these values for OxO1 are as follows:
• the number of terms  = 698 651,
• the average degree of nodes  = 3, and
• the length of the paths to determine, which is also the mapping distance in OxO, where
 = 2 or  = 3.
        </p>
        <p>Hence, the worst case complexity at distance 2 is 32 × 698 6502 = 4.393006402 × 1012 and for
distance 3 it is 33 × 698 6502 = 1.317901921 × 1013. Looking at these calculations it becomes
clear why OxO1 crashed or timeout for large requests.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. How OxO2 address these Challenges</title>
        <p>To address the concerns regarding semantics, OxO2 implements SSSOM fully, and hence all metadata
associated with mappings and mapping sets, are stored in OxO2. Its Application Programming Interface
(API) allows searching across all SSSOM metadata elements (see Table 1). This provides data integrators
with multiple ways in which they can find mappings, thereby increasing their chances to find the data
most suitable to their needs.</p>
        <p>
          In order to ensure inferences are logically sound, OxO2 makes use of a Datalog rule engine to infer
mappings. Hence, OxO2 users can trust that inferred mappings are logically correct. To enable fast
searches across inferred mappings, OxO2 materializes all chain rule inferences as part of its data release
(see Section 2.4). Moreover, there is no longer a restriction on the distance at which mappings can be
returned, or the length of crosswalks that can be identified by OxO2. The derived mappings of Listings 4
and 5 were derived using this approach, and the mapping sets from [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]. Mappings that are derived by
OxO2, are placed in a mapping set with value https://www.ebi.ac.uk/spot/oxo/inferences,
mapping_justification value SEMAPV:MappingChaining and mapping_tool value as OxO2.
        </p>
        <p>SSSOM’s 22 chain rules are helpful to define the precise conditions under which new mappings
can be inferred that are logically sound. But because chain rules can be recursively applied multiple
times, how inferred mappings were derived may not be obvious to humans. To make this information
accessible to users, one needs to identify all the chain rules that have been applied, and the facts that
were used, to derive an inferred mapping. We are able to provide this information by using a Datalog
rule engine, that records this information as part of OxO2’s data release process.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Benefits to Users</title>
        <sec id="sec-3-3-1">
          <title>OxO2 will help our users in the following ways:</title>
          <p>1. A researcher searching for "enlarged heart" will be able to find the MP concept MP:0000274, but
they may be missing ZP:0000532 (increased size of heart) and HP:0001640 (Cardiomegaly).</p>
          <p>OxO2 will help them to uncover ZP:0000532 and HP:0001640.
2. Logical soundness does not necessarily mean biological correctness. Assume for the
moment a mapping was mistakenly added that states that MP:0000274 (enlarged heart) is a
SKOS:exactMatch with MP:0001095 (enlarged trigeminal ganglion). This could result in
inferred mappings that are logically sound, but which are nonsensical from a biological perspective.
A researcher can identify an error like this by verifying the premises of an inference. As an
example, we can verify the inferences of Listings 4 and 5 by verifying the premises in Table 2.
3. Derived mappings in OxO1 were not axiomatised, and hence, the only way researchers could
verify each of the mappings, was to manually validate them. In the case of OxO2, one can assume
that the inferred mappings are logically correct. Any incorrect inferred mappings will only
ever be due to incorrect explicitly stated mappings. Thus, to validate the correctness of derived
mappings, a user can consider reviewing explicitly stated mappings.
4. Complete explanations of inferred mappings will help data integrators to identify the source
of incorrect data, since explanations clearly state the facts that were used to derive inferred
mappings.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Methods</title>
      <p>
        For materializing inferred mappings, OxO2 uses Nemo, an in-memory rule engine [
        <xref ref-type="bibr" rid="ref28 ref33">28, 33</xref>
        ]. Nemo runs
as a command line tool that loads facts and rules, processes them, and, once complete, writes out the
relevant inferences; then the Nemo process terminates. From this perspective Nemo is well aligned
with being run as part of a data release pipeline, as is the case for OxO2. To ensure that inferencing is
fast and memory eficient, Nemo implements the various optimizations mentioned in Section 2.4.
      </p>
      <p>
        Nemo implements Datalog with various extensions, that results in inferencing not necessarily
terminating (Section 2.3). However, OxO2’s use of Nemo is such that it only uses Datalog without any
extensions and, hence for its use case, Nemo terminates [
        <xref ref-type="bibr" rid="ref33">33</xref>
        ].
      </p>
      <p>Currently OxO2 imports 1 160 020 mappings, from which it is able to infer 49 536 mappings.
Inferencing on an Intel Core Ultra 7 165U x 13 laptop, with 32GB RAM and SSD, completes in about 17
min, and uses about 380MB of memory. These results show that it is possible to run an OxO2 dataload on
a personal computer, without necessarily having access to high-performance computing infrastructure.</p>
      <p>For storing mappings, OxO2 uses Solr rather than Neo4J. Our motivation for this change in
architecture is due to there being no need for graph queries in OxO2 – all derived mappings with their
explanations are determined during the OxO2 dataload. All inferred mappings and their explanations
are then stored in Solr. The OxO2 Solr schema design mimics the SSSOM metadata elements with a
core for storing mapping information and core for storing mapping set information. Hence, all SSSOM
metadata elements are searchable and can be accessed via API calls and its frontend. Explanations are
stored along with the mappings in the mapping core.</p>
      <p>The OxO2 dataload and backend are implemented using Java 17 and Spring Boot. The frontend is
implemented in React using Typescript. Styling is implemented using TailWind CSS, which provides
numerous utility classes that limits the need for custom CSS.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Related Tools</title>
      <p>
        In support of SSSOM there are a number of tools as detailed in [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]:
• sssom-py is a Python library for reading, transforming and manipulating SSSOM mappings.
• The Ontology Access Kit (OAK) can provide mappings in the SSSOM format.
• The sssom-java library is a Java library for reading, transforming and manipulating SSSOM
mappings.
• Semantic Reasoning Assembler (SeMRA) is a Python library that allows one to crawl mappings
in various formats, not just SSSOM. It then de-duplicates mappings, infers mappings, derives
associated confidence for inferred mappings, and filters false positive mappings [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ].
      </p>
      <p>These tools are used for the creation and manipulation of SSSOM files. The main purpose of OxO2 is
to make existing mappings and their derived mappings available for users to search and browse.</p>
      <p>The value of SSSOM as a standard for mappings depends on its community adoption. Currently
most mappings in the bioinformatics domain are still embedded within ontologies, which are extracted
by the OLS dataload into SSSOM files that are imported by OxO2. But many of these mappings are
based on xrefs and hence still sufer from the poor semantics we discussed in the Introduction. For
this reason, the tools listed above are complementary to OxO2, in that they enable users to create high
quality mappings that are essential to the success of OxO2.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Discussion and Future</title>
      <p>With the development of OxO2, we are able to address the key limitations of OxO1 in terms of the
semantics of mappings, the meaning of inferred mappings and the length of crosswalks it can support.
It is only through the availability of SSSOM that OxO2 is able to address these concerns.</p>
      <p>
        Currently OxO2 is still in active development and we will prioritise as follows:
1. Currently, the API of OxO2 focuses on bringing new search capabilities to users, with its own
new request and response structures. For our existing users there may be a need to create a
backwards compatibility layer, so as to not disrupt their services when OxO2 is rolled out. Similar
to OLS4 [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], this will be implemented using view classes that transforms the underlying OxO2
data model into the OxO1 data model view.
2. The value proposition of OxO2 is dependent on high quality SSSOM mappings. We need to
identify mappings sets that can be included in OxO2 such as mappings from MONDO to HP
found at [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ] and SeMRA which provides an SSSOM export of their mappings [
        <xref ref-type="bibr" rid="ref36">36</xref>
        ].
3. The explanation implementation is currently a proof of concept and contains only information on
the subject_id, predicate_id, object_id and chain rule that has been applied. To provide
suficient provenance, it also needs to include information such as author_id, reviewer_id,
and the mapping set from which premises came. Currently, explanations are given in a text form.
      </p>
      <p>This needs to be augmented with a graph that can easily be navigated by users.
4. To ensure the high availability of OxO2, its data release pipeline will be rolled out on the
EMBLEBI high-performance computing infrastructure, and its frontend and backend to the Kubernetes
infrastructure.
5. The EMBL-EBI instance of OxO is focussed on the biological domain. To enable other domains to
install their own instances of OxO2, OxO2 will be dockerized.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>J.A.M., H.I., H.P., and H.H. are supported in part by EMBL- EBI Core Funds. J.M., H.I. and H.H are
supported in part by EVORA. The EVORA project has received funding from the European Union’s
HORIZON programme under grant agreement No 101131959.</p>
      <p>For the purpose of Open Access, a CC-BY public copyright licence has been applied to the present
document and will be applied to all subsequent versions up to the Author Accepted Manuscript arising
from this submission.</p>
    </sec>
    <sec id="sec-8">
      <title>Declaration on Generative AI</title>
      <sec id="sec-8-1">
        <title>The author(s) have not employed any Generative AI tools.</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M. D.</given-names>
            <surname>Wilkinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumontier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. J.</given-names>
            <surname>Aalbersberg</surname>
          </string-name>
          , G. Appleton,
          <string-name>
            <given-names>M.</given-names>
            <surname>Axton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Baak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Blomberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-W.</given-names>
            <surname>Boiten</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. B. da Silva</given-names>
            <surname>Santos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. E.</given-names>
            <surname>Bourne</surname>
          </string-name>
          , et al.,
          <article-title>The FAIR Guiding Principles for scientific data management and stewardship</article-title>
          ,
          <source>Scientific data 3</source>
          (
          <year>2016</year>
          )
          <fpage>1</fpage>
          -
          <lpage>9</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>F.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Juty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Goble</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jupp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Parkinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Courtot</surname>
          </string-name>
          ,
          <article-title>Features of a FAIR vocabulary</article-title>
          ,
          <source>Journal of Biomedical Semantics</source>
          <volume>14</volume>
          (
          <year>2023</year>
          )
          <article-title>6</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>M. van Reisen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stokmans</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Basajja</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. O.</given-names>
            <surname>Ong'ayo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Kirkpatrick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Mons</surname>
          </string-name>
          ,
          <article-title>Towards the Tipping Point for FAIR Implementation</article-title>
          ,
          <source>Data Intelligence</source>
          <volume>2</volume>
          (
          <year>2020</year>
          )
          <fpage>264</fpage>
          -
          <lpage>275</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Jupp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Liener</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sarntivijai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Vrousgou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Burdett</surname>
          </string-name>
          , H. E. Parkinson,
          <article-title>OxO-A Gravy of Ontology Mapping Extracts</article-title>
          ., in: ICBO,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Malone</surname>
          </string-name>
          , E. Holloway,
          <string-name>
            <given-names>T.</given-names>
            <surname>Adamusiak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kapushesky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Kolesnikov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Zhukova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Brazma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Parkinson</surname>
          </string-name>
          ,
          <article-title>Modeling sample variables with an Experimental Factor Ontology</article-title>
          ,
          <source>Bioinformatics</source>
          <volume>26</volume>
          (
          <year>2010</year>
          )
          <fpage>1112</fpage>
          -
          <lpage>1118</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>N.</given-names>
            <surname>Vasilevsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Essaid</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Matentzoglu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. L.</given-names>
            <surname>Harris</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Haendel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Robinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Mungall</surname>
          </string-name>
          , Mondo Disease Ontology:
          <article-title>harmonizing disease concepts across the world</article-title>
          , volume
          <volume>2807</volume>
          , eScholarship, University of California,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>L. M.</given-names>
            <surname>Schriml</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. B.</given-names>
            <surname>Munro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Schor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Olley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>McCracken</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Felix</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Baron</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Jackson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Bello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bearer</surname>
          </string-name>
          , et al.,
          <source>The human disease ontology 2022 update, Nucleic acids research</source>
          <volume>50</volume>
          (
          <year>2022</year>
          )
          <fpage>D1255</fpage>
          -
          <lpage>D1261</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S.</given-names>
            <surname>Köhler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gargano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Matentzoglu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. C.</given-names>
            <surname>Carmody</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lewis-Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. A.</given-names>
            <surname>Vasilevsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Danis</surname>
          </string-name>
          , G. Balagura,
          <string-name>
            <given-names>G.</given-names>
            <surname>Baynam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Brower</surname>
          </string-name>
          , et al.,
          <source>The human phenotype ontology in 2021, Nucleic acids research</source>
          <volume>49</volume>
          (
          <year>2021</year>
          )
          <fpage>D1207</fpage>
          -
          <lpage>D1217</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>L.</given-names>
            <surname>Vogt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Strömert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Matentzoglu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Karam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Konrad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Prinz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Baum</surname>
          </string-name>
          ,
          <article-title>Suggestions for extending the FAIR Principles based on a linguistic perspective on semantic interoperability</article-title>
          ,
          <source>Scientific Data</source>
          <volume>12</volume>
          (
          <year>2025</year>
          )
          <fpage>688</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>N.</given-names>
            <surname>Matentzoglu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Bello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Stefancsik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Alghamdi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. V.</given-names>
            <surname>Anagnostopoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhof</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Balk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y. M.</given-names>
            <surname>Bradford</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Bridges</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. J.</given-names>
            <surname>Callahan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Caufield</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Cuzick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. C.</given-names>
            <surname>Carmody</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Caron</surname>
          </string-name>
          , V. de Souza,
          <string-name>
            <given-names>S. R.</given-names>
            <surname>Engel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Fey</surname>
          </string-name>
          , M. Fisher, S. Gehrke,
          <string-name>
            <given-names>C.</given-names>
            <surname>Grove</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hansen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. L.</given-names>
            <surname>Harris</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Harris</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Harris</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ibrahim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. O. B.</given-names>
            <surname>Jacobsen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Köhler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>McMurry</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Munoz-Fuentes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Munoz-Torres</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Parkinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z. M.</given-names>
            <surname>Pendlington</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Pilgrim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. M. C.</given-names>
            <surname>Robb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. N.</given-names>
            <surname>Robinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Seager</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Segerdell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Smedley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Sollis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Toro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Vasilevsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Wood</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Haendel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Mungall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>McLaughlin</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.</surname>
          </string-name>
          Osumi-Sutherland,
          <article-title>The Unified Phenotype Ontology : a framework for crossspecies integrative phenomics</article-title>
          ,
          <source>Genetics</source>
          <volume>229</volume>
          (
          <year>2025</year>
          )
          <article-title>iyaf027</article-title>
          . doi:
          <volume>10</volume>
          .1093/genetics/iyaf027.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>J. McLaughlin</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Lagrimas</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Iqbal</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Parkinson</surname>
          </string-name>
          , H. Harmse,
          <article-title>OLS4: A new Ontology Lookup Service for a growing interdisciplinary knowledge ecosystem</article-title>
          ,
          <source>Bioinformatics</source>
          (
          <year>2025</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>J.</given-names>
            <surname>Day-Richter</surname>
          </string-name>
          ,
          <article-title>The OBO flat file format specification</article-title>
          ,
          <source>version 1</source>
          .2,
          <string-name>
            <given-names>The</given-names>
            <surname>Gene Ontology</surname>
          </string-name>
          (
          <year>2006</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A.</given-names>
            <surname>Laadhar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Abrahão</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Jonquet</surname>
          </string-name>
          ,
          <article-title>Investigating One Million XRefs in Thirthy Ontologies from the OBO World.</article-title>
          , in: ICBO/ODLS,
          <year>2020</year>
          , p.
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>B.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ashburner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Rosse</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Bug</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Ceusters</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. J.</given-names>
            <surname>Goldberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Eilbeck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ireland</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Mungall</surname>
          </string-name>
          , et al.,
          <article-title>The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration</article-title>
          ,
          <source>Nature biotechnology 25</source>
          (
          <year>2007</year>
          )
          <fpage>1251</fpage>
          -
          <lpage>1255</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>T.</given-names>
            <surname>Cezard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Cunningham</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. E.</given-names>
            <surname>Hunt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Koylass</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Saunders</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. F.</given-names>
            <surname>Silva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Tsukanov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Venkataraman</surname>
          </string-name>
          , et al.,
          <article-title>The European Variation Archive: a FAIR resource of genomic variation for all species</article-title>
          ,
          <source>Nucleic acids research</source>
          <volume>50</volume>
          (
          <year>2022</year>
          )
          <fpage>D1216</fpage>
          -
          <lpage>D1220</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>M.</given-names>
            <surname>Cerezo</surname>
          </string-name>
          , E. Sollis,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Ji</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Abid</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. O.</given-names>
            <surname>Bircan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hayhurst</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>John</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mosaku</surname>
          </string-name>
          , et al.,
          <article-title>The NHGRI-EBI GWAS Catalog: standards for reusability, sustainability and diversity</article-title>
          ,
          <source>Nucleic acids research</source>
          <volume>53</volume>
          (
          <year>2025</year>
          )
          <fpage>D998</fpage>
          -
          <lpage>D1005</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>E. P.</given-names>
            <surname>Consortium</surname>
          </string-name>
          ,
          <string-name>
            <surname>Europe</surname>
            <given-names>PMC</given-names>
          </string-name>
          :
          <article-title>a full-text literature database for the life sciences and platform for innovation</article-title>
          ,
          <source>Nucleic acids research</source>
          <volume>43</volume>
          (
          <year>2015</year>
          )
          <fpage>D1042</fpage>
          -
          <lpage>D1048</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>N.</given-names>
            <surname>Matentzoglu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhof</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Bello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Brush</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. J.</given-names>
            <surname>Callahan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. G.</given-names>
            <surname>Chute</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. D.</given-names>
            <surname>Duncan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. T.</given-names>
            <surname>Evelo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Gabriel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Graybeal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gray</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. M.</given-names>
            <surname>Gyori</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Haendel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Harmse</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. L.</given-names>
            <surname>Harris</surname>
          </string-name>
          , I. Harrow,
          <string-name>
            <given-names>H. B.</given-names>
            <surname>Hegde</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. L.</given-names>
            <surname>Hoyt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. T.</given-names>
            <surname>Hoyt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Jiao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Jiménez-Ruiz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jupp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Koehler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Liener</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Long</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Malone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>McLaughlin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>McMurry</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Moxon</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. C. MunozTorres</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.</surname>
            Osumi-Sutherland,
            <given-names>J. A.</given-names>
          </string-name>
          <string-name>
            <surname>Overton</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Peters</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Putman</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Queralt-Rosinach</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Shefchek</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Solbrig</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Thessen</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Tudorache</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Vasilevsky</surname>
            ,
            <given-names>A. H.</given-names>
          </string-name>
          <string-name>
            <surname>Wagner</surname>
            ,
            <given-names>C. J.</given-names>
          </string-name>
          <string-name>
            <surname>Mungall</surname>
          </string-name>
          ,
          <article-title>A Simple Standard for Sharing Ontological Mappings (SSSOM)</article-title>
          ,
          <year>Database 2022</year>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>N.</given-names>
            <surname>Matentzoglu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. E.</given-names>
            <surname>Flack</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Graybeal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. L.</given-names>
            <surname>Harris</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. B.</given-names>
            <surname>Hegde</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. T.</given-names>
            <surname>Hoyt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Toro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. A.</given-names>
            <surname>Vasilevsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Mungall</surname>
          </string-name>
          ,
          <article-title>A Simple Standard for Ontological Mappings 2022: Updates of data model and outlook</article-title>
          , in: OM@ ISWC,
          <year>2022</year>
          , p.
          <fpage>61</fpage>
          -
          <lpage>66</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>N.</given-names>
            <surname>Matentzoglu</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Braun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Caron</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Goutte-Gattat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. M.</given-names>
            <surname>Gyori</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. L.</given-names>
            <surname>Harris</surname>
          </string-name>
          , E. Hartley,
          <string-name>
            <given-names>H. B.</given-names>
            <surname>Hegde</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hertling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. T.</given-names>
            <surname>Hoyt</surname>
          </string-name>
          , et al.,
          <article-title>A simple standard for ontological mappings 2023: updates on data model, collaborations and tooling</article-title>
          .,
          <source>in: OM@ ISWC</source>
          ,
          <year>2023</year>
          , p.
          <fpage>73</fpage>
          -
          <lpage>78</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <article-title>Simple Standard for Sharing Ontological Mappings (SSSOM</article-title>
          ),
          <year>2025</year>
          . URL: https://mapping-commons. github.io/sssom/.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <source>[22] CURIE Syntax 1.0</source>
          ,
          <year>2010</year>
          . URL: https://www.w3.org/TR/curie/.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>Disease</given-names>
            <surname>Mappings</surname>
          </string-name>
          ,
          <year>2024</year>
          . URL: https://github.com/mapping-commons/disease-mappings.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <surname>YAML: YAML Ain't Markup</surname>
            <given-names>Language</given-names>
          </string-name>
          ™,
          <year>2021</year>
          . URL: https://yaml.org/.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>S.</given-names>
            <surname>Abiteboul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Hull</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Vianu</surname>
          </string-name>
          , Foundations of databases, volume
          <volume>8</volume>
          ,
          <string-name>
            <surname>Addison-Wesley</surname>
            <given-names>Reading</given-names>
          </string-name>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>Simple</given-names>
            <surname>Standard for Sharing Ontological Mappings (SSSOM) - Applying Chaining Rules</surname>
          </string-name>
          ,
          <year>2025</year>
          . URL: https://mapping-commons.github.io/sssom/chaining-rules/.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>SKOS</given-names>
            <surname>Simple Knowledge Organization System Namespace Document</surname>
          </string-name>
          ,
          <year>2009</year>
          . URL: https://www.w3. org/
          <year>2009</year>
          /08/skos-reference/skos.html.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>A.</given-names>
            <surname>Ivliev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ellmauthaler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Gerlach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Marx</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Meißner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Meusel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Krötzsch</surname>
          </string-name>
          , Nemo:
          <article-title>First glimpse of a new rule engine</article-title>
          ,
          <source>arXiv preprint arXiv:2308.15897</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>S.</given-names>
            <surname>Ceri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Gottlob</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Tanca</surname>
          </string-name>
          , et al.,
          <article-title>What you always wanted to know about Datalog(and never dared to ask), IEEE transactions on knowledge and data engineering 1 (</article-title>
          <year>1989</year>
          )
          <fpage>146</fpage>
          -
          <lpage>166</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>M.</given-names>
            <surname>Davis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Sigal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. J.</given-names>
            <surname>Weyuker</surname>
          </string-name>
          , Computability, complexity, and
          <article-title>languages: fundamentals of theoretical computer science</article-title>
          , Elsevier,
          <year>1994</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>A.</given-names>
            <surname>Elhalawati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Krötzsch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mennicke</surname>
          </string-name>
          ,
          <article-title>An Existential Rule Framework for Computing WhyProvenance On-Demand for Datalog</article-title>
          , in: G. Governatori, A.-Y. Turhan (Eds.),
          <source>Rules and Reasoning</source>
          , Springer International Publishing, Cham,
          <year>2022</year>
          , p.
          <fpage>146</fpage>
          -
          <lpage>163</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>J.</given-names>
            <surname>Urbani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Jacobs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Krötzsch</surname>
          </string-name>
          ,
          <article-title>Column-oriented datalog materialization for large knowledge graphs</article-title>
          ,
          <source>in: Proceedings of the AAAI Conference on Artificial Intelligence</source>
          , volume
          <volume>30</volume>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>A.</given-names>
            <surname>Ivliev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Gerlach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Meusel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Steinberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Krötzsch</surname>
          </string-name>
          , Nemo:
          <article-title>Your friendly and versatile rule reasoning toolkit</article-title>
          ,
          <source>in: Proceedings of the International Conference on Principles of Knowledge Representation and Reasoning</source>
          , volume
          <volume>21</volume>
          ,
          <year>2024</year>
          , p.
          <fpage>743</fpage>
          -
          <lpage>754</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>C. T.</given-names>
            <surname>Hoyt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Karis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. M.</given-names>
            <surname>Gyori</surname>
          </string-name>
          ,
          <article-title>Assembly and reasoning over semantic mappings at scale for biomedical data integration</article-title>
          ,
          <source>bioRxiv</source>
          (
          <year>2025</year>
          )
          <fpage>2025</fpage>
          -
          <lpage>04</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <article-title>MONDO to HP SSSOM mappings</article-title>
          ,
          <year>2024</year>
          . URL: https://github.com/mapping-commons/ disease-mappings/blob/main/mappings/mondo_hp_lexical.sssom.tsv.
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>SeMRA</given-names>
            <surname>Mappings</surname>
          </string-name>
          ,
          <year>2025</year>
          . URL: https://zenodo.org/records/15208251.
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          <article-title>• OxO2 GitHub repository, • Simple Standard for Sharing Ontological Mappings (SSSOM).</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>