=Paper= {{Paper |id=Vol-3603/workshopOIIDDS1 |storemode=property |title=Ontology Development Strategies and the Infectious Disease Ontology Ecosystem |pdfUrl=https://ceur-ws.org/Vol-3603/workshopOIIDDS1.pdf |volume=Vol-3603 |authors=Giacomo De Colle,Ali Hasanzadeh,John C. Beverley |dblpUrl=https://dblp.org/rec/conf/icbo/ColleHB23 }} ==Ontology Development Strategies and the Infectious Disease Ontology Ecosystem == https://ceur-ws.org/Vol-3603/workshopOIIDDS1.pdf
                         Ontology Development Strategies and the
                         Infectious Disease Ontology Ecosystem
                         Giacomo De Colle 1, Ali Hasanzadeh1 and John C. Beverley 1,2
                         1
                             University at Buffalo, Buffalo, NY, USA
                         2
                             National Center for Ontological Research, Buffalo, NY, USA


                                          Abstract
                                          After motivating a framework for evaluating top-down, middle-out, middle-in, and bottom-up
                                          ontology development strategies, we apply our framework to investigate whether infectious
                                          disease ontologies - specifically, the Virus Infectious Disease Ontology (VIDO) and the
                                          Coronavirus Infectious Disease Ontology (CIDO) - effectively promote semantic
                                          interoperability.

                                          Keywords 1
                                          Top-down, middle-out, bottom-up, middle-in, infectious disease ontologies, CIDO, VIDO

                         1. Introduction
                             Ontologies are developed using numerous strategies. Some follow a top-down strategy, in which
                         classes or categories are devised to constrain lower-level ontology content extending from them [1].
                         Bottom-up strategies tend to begin creating ontology content reflecting a given domain of interest,
                         representing that content with a high degree of fidelity [2]. Data used as a basis for a bottom-up
                         ontology comes in many flavors, e.g. a SQL database, an Excel file, previously existing taxonomies,
                         etc. An ontologist following the bottom-up strategy will attempt to uncover terms and relations
                         implicit in the data, to represent them ontologically. The middle-out strategy aims at attaining the
                         benefits of the preceding strategies, like proximity to the domain and ensuring consistency across
                         lower-level ontologies [3,4]. Middle-out ontologies are developed at some level of abstraction above
                         one or more domains to be modeled, but not as far removed as those that begin with the top-down
                         strategy. Notice that these strategies are distinguished based on the starting point of their development
                         [5]. They are thus distinct from ontology architectures distinguished in terms of their coverage, e.g.
                         top-level ontologies such as the Basic Formal Ontology (BFO) [1,6], DOLCE [7], and YAMATO [8];
                         mid-level ontologies – such as the Common Core Ontologies (CCO) suite [9] or the Industrial
                         Ontology Core [10].
                             While each of these strategies has been discussed in the literature, there has, as of yet, not been a
                         rigorous, fair, comparison provided between them. One of our aims in this article is to provide such a
                         comparison. Another of our aims is to apply the results of our comparison to representative ontologies
                         which follow one of these strategies, specifically, the Coronavirus Infectious Disease Ontology
                         (CIDO) [11,12] and the Virus Infectious Disease Ontology (VIDO) [13], designed according to the
                         top-down strategy. Our evaluation will demonstrate the extent to which these ontologies effectively
                         promote semantic interoperability, a primary goal of ontology development.

                         2. Criteria for Evaluating Ontologies


                         Proceedings of the International Conference on Biomedical Ontologies 2023, August 28th-September 1st, 2023, Brasilia, Brazil
                         1

                         EMAIL: gdecolle@buffalo.edu (A. 1); ahasanza@buffalo.edu (A. 2); johnbeve@buffalo.edu (A. 3)
                         ORCID: 0000-0002-3600-6506 (A. 1); 0009-0003-6686-3319 (A. 2); 0000-0002-1118-1738 (A. 3)
                                       ©️ 2023 Copyright for this paper by its authors.
                                       Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
                                       CEUR Workshop Proceedings (CEUR-WS.org)




CEUR
                  ceur-ws.org
Workshop      ISSN 1613-0073
Proceedings



                                                                                                                                                        184
    We take as a starting point for the identification of evaluative criteria for ontologies, the work of
Denny Vrandečić [14], itself originated from the work of Thomas Gruber. Vrandečić’s evaluating
criteria – displayed in Table 1 – have been used by numerous developers to improve the quality of
their ontologies and knowledge graphs [15-7]. Note, no single ontology can perform well on all these
metrics; indeed, some criteria appear in conflict, such as completeness and conciseness.

Table 1
Vrandečić’s Ontology Evaluation Criteria
         Criterion                                 Description
         Accuracy                 The extent to which an ontology accurately
                               represents the domain within its stated scope.
       Adaptability          The extent to which an ontology can be extended
                                to represent entities in domains outside of its
                                            originally stated scope.
          Clarity             The extent to which an ontology unambiguously,
                               clearly, conveys the meanings of its terms and
                                               relations to users.
      Completeness          The extent to which an ontology includes terms and
                              relations that cover the entire domain within the
                                       intended scope of the ontology.
       Conciseness           The extent to which an ontology is parsimonious,
                             does not include redundant content, or irrelevant
                                                     axioms.
        Coherence             The extent to which an ontology is both logically
                                 consistent and semantically aligned with the
                                           intention of its creators.
   Organizational fitness       The extent to which, within an organizational
                                 context, an ontology is integrated within the
                                                  organization.

    Accuracy follows naturally from the goals of most ontology development, namely, representing a
given domain using a machine-readable controlled language. Accuracy may be determined when
ontology developers and subject-matter experts interact during ontology development, in the interest
of reaching consensus over ontology labels, definitions, logical relationships, etc. [18]. The most
successful ontologies are reused, extended to new domains, and integrated with other ontologies and
knowledge representation projects. An adaptable ontology is one which can be easily extended into a
new domain distinct from that for which it was initially designed. Clarity can be achieved by proper
definition development and documentation. In the absence of definitions, other annotation properties
such as comments, citations, notes, labels, alternative labels, and preferred labels may be used to
promote clarity. A complete ontology will include terms and relations needed to represent any terms
within its scope. Completeness is related to accuracy, as evidenced by the impact completeness may
have on whether competency questions for an ontology can be answered [19]. Adequately answering
competency questions requires that the ontology has adequate coverage of the domain. A concise
ontology will not include unnecessary elements or axioms, which promotes understanding by users
and helps avoid the confusion that might emerge from the presence of, say, many unneeded classes.
Related, a concise ontology will include only those terms and relations that are needed to represent the
domain within its scope, i.e., the minimal set of terms and relations. A coherent ontology will be
logically consistent and will entail as little beyond the intent of its creators as possible. Put another
way, the intended interpretation of the ontology will match as closely as possible the semantic
interpretation generated by model checkers [20] or OWL reasoners [21,22]. Organizational Fitness is
not directly about a given ontology per se but also involves the organizational context in which the
ontology is deployed. An ontology that scores high on organizational fitness is successfully and
consistently deployed in an organizational context, is being maintained and developed, is accessible to
members of the organization, and can be aligned with other ontologies within the organization.




                                                                                                            185
   The standard we use to adjudicate the importance of each of these criteria is, arguably, the primary
aim towards which any good ontology aims, the promotion of semantic interoperability – the ability
of computer systems to exchange data with unambiguous, shared meaning [1,23-4]. The most
important criteria in this perspective seem to be accuracy, coherence, and adaptability. Notice that
coherence cannot be simply reduced to the absence of contradictions in the ontology but is rather the
absence of statements that are semantically, and often implicitly, in conflict. During the development
of VIDO, such a coherence issue was identified with respect to the definition of “organism” used by
the Ontology for Biomedical Investigations (OBI), which included viruses within purview [25].
“Organism”, however, was defined as comprised of cellular entities, despite viruses being acellular.
This is an example of a semantically incoherent statement that was originally not noticed and that
might be more easily avoided by adopting one ontology development strategy over the others.

3. Evaluating Ontology Development Strategies
    The three strategies in our focus are evaluated on a spectrum. The evaluation of these strategies
was carried out by analyzing ontologies that explicitly adopt them, and then testing them against a set
of questions devised to check compliance with the our criteria. In the future, we plan to test these
questions and criteria on a larger scale using empirical methods, i.e. surveys and feedback group
sessions with ontology developers, etc. In what follows, we do not conclude that ontologies borne
from the one strategy necessarily perform better than others with respect to our criteria. Instead, we
suggest that they stand a better chance of doing so than ontologies developed following one of the
other strategies.
    Organizational Fitness
   Bottom-up: This strategy promotes work on the same data set. However, different segments of
       this dataset, when used by various team members, might lack semantic uniformity. This
       disparity becomes apparent when generalizing the embedded knowledge.
   Middle-out: This strategy provides a common starting point for building classes that members
       of an ontology development team can use. Nevertheless, there may be a lack of shared
       understanding across ontologies borne out of this strategy within an organization, in particular
       with respect to higher-level terms and relations such as part of, quality, or process.
   Top-down: While starting from the most general classes and working your way down is a
       suitable way to promote consistency, it is not free of challenges. Deciding how best to define
       such classes across an enterprise requires time and skill that organizations may not have.
    Definitions (Accuracy, Clarity, Coherence)
    The Aristotelian scheme for definition writing is standardly employed by ontology developers
[1,26], i.e., to define class A you identify its parent class B and describe the differentia that
distinguishes instances of A from any other instances of B [27]. Adopting the Aristotelian model
forces the ontologist to identify the relation of the class with the other classes in the hierarchy when
building the definition, and provides a format that is human-readable, consistent, and understandable
for the user if correctly applied.
   Bottom-Up: One cannot adopt the model of writing Aristotelian definitions if following this
       strategy, since there is no top-level ontology from which to identify parent classes.
       Consequently, those following the bottom-up strategy must develop some manner of schema to
       do so. Whatever that schema amounts to, it will not be extending from a top-level ontology,
       which seems a cost.
   Middle-Out: The middle-out strategy may adopt the Aristotelian schema for definitions to
       some extent, i.e., for classes on the lower level. Nevertheless, it is not possible to adopt this
       strategy for classes on the upper level of the hierarchy.
   Top-Down: Insofar as the top-down strategy allows for extensive application of Aristotelean
       definition schema, it scores highly with respect to definitions. By using the Aristotelian method,
       ontology developers can offer definitions by relating them to classes in a top-level ontology.
    Axiomatization (Accuracy, Clarity, Consistency)
    Formal axioms associated with terms and relations in an ontology promote accuracy, clarity, and
consistency, as well as automated checking of such evaluative criteria. Axioms provide machine-




                                                                                                            186
interpretable enforcement of domain and range for relations, ensure disjoint classes do share instances
in common, and in general connect parts of an ontology hierarchy to other parts of that hierarchy.
    Bottom-Up: Axiom development may be challenging when following the bottom-up strategy.
        Suppose a class start time should be related to a class earlier than that itself relates to any
        class representing entities having a duration, such as phosphorylation or dimerization. Intu-
        itively, one might say such classes are all activities or processes of some sort. This is not an ob-
        vious option when following the bottom-up strategy.
    Middle-Out: The middle-out strategy fares better, since axioms can be written starting from a
        broader scoped architecture than that provided by the bottom-up strategy. This allows those
        following a middle-out strategy to leverage the space of possibilities already constrained by the
        placement of lower-level terms within the ontology taxonomy. Put another way, following the
        middle-out strategy might avoid some of the issues raised above by including natively a class-
        like activity, but will at some point lack a way to answer questions further up the taxonomy.
    Top-Down: The top-down strategy appears to fare better than either of the preceding strategies,
        with respect to axiom development. By populating ontology entities downward from existing
        classes and relations, ontology elements have an implicit formal structure inherited from the
        top-down strategy itself. Moreover, because there are an increasing number of logical
        constraints enforced as one proceeds down the taxonomy, the scope of possible axioms that can
        be applied to an ontology element is decreased.
    Reinvention (Accuracy, Organizational Fitness, Clarity, Consistency)
    It is important to determine the extent to which an ontology strategy encourages or discourages
duplication of effort. Duplication is not only dangerous because it wastes time and resources. It also
risks re-creating the mistakes that other ontologists have effectively amended in their efforts.
    Bottom-Up: The bottom-up strategy appears to encourage duplicative effort when viewed from
        the perspective of interoperability. Focusing solely on accurately modeling a domain runs the
        risk of missing the forest for the trees. Put another way, creating terms and relations highly spe-
        cific to a domain, without reflection on how they might relate to existing ontologies, impedes
        interoperability with those other ontologies. This is, indeed, a recipe for creating data silos [1].
    Middle-Out: Middle-out ontologies avoid some of the issues plaguing the bottom-up strategy
        with respect to reinvention, by facilitating an upwards population of ontology content, as
        needed. Nevertheless, this strategy will ultimately run into the same issues at a higher level of
        generality.
    Top-Down: Top-down strategies clearly shine with respect to reinvention. This strategy creates
        a common architecture from which terms and relations extend, and consequently, can be reused
        in other ontologies employing the same top-level architecture. This is, of course, not to say that
        all ontologies designed according to this strategy thereby perform well with respect to reinven-
        tion. Several ontologies, for example, in the Open Biological and Biomedical Ontology (OBO)
        Foundry [28] extend from the top-level BFO but in the absence of collaboration across nearby
        efforts, do so by producing duplicative content.
    Unused Content (Conciseness, Clarity)
    In some cases, ontologies will be developed with placeholder classes or relations, intended to be
connected to data, but which never are. The result is there may be ontology classes that are not used,
potential points of confusion, or perplexity for ontology developers unfamiliar with the intent behind
creating such content.
    Bottom-Up: Bottom-up strategies perform best with respect to this criterion, as ontology con-
        tent is developed on demand, directly from relevant domain data. Bottom-up ontology develop-
        ment encourages creating ontology content that is representative of the domain, and so makes it
        unlikely that ontology developers following this strategy will create empty content.
    Middle-Out: Middle-out strategy followers arguably perform as well as bottom-up strategy fol-
        lowers with respect to this criterion. By keeping an eye on the domain-level and the upper-level,
        this strategy tends to result in ontologies that populate classes which are integrated into analyses
        of the domain in question.
    Top-Down: Top-level strategy results perform worst with respect to this criterion. Constructing
        ontologies from the most general content down runs the risk of including plausible content that
        does not end up being used by domain ontologies extending from the top-level.




                                                                                                               187
   Abstract Representations (Clarity, Conciseness, Accuracy)
   Ontologies sometimes include content that is so general that may lead to confusion by subject-
matter experts, ontology developers, or other users. Such abstract representations may be challenging
for users to understand, which may, in turn, undermine the use of the ontology, or support perceptions
that a given ontology is too difficult to be used for practical purposes.
   Bottom-Up: More than any of the alternative strategies, bottom-up strategies typically avoid
       the inclusion of abstract representations within ontology output. Working so closely at the do-
       main level encourages the creation of ontology content understood by relevant subject-matter
       experts.
   Middle-Out: Similarly, the middle-out strategy promises to avoid the inclusion of many ab-
       stract representations, given its emphasis on domain-level content while attending to top-level
       architecture. Expanding upwards only when needed keeps abstract representations at a mini -
       mum.
   Top-Down: Abstract representations are often found in ontologies designed following a top-
       down strategy, as top-level content must often be rather general. Terms such as continuant or
       predicate are often divorced from the experiences of subject-matter experts, employment of
       them leads to confusion at best and misuse at worst. Consequently, it becomes challenging to
       link these abstract representations to domain-level content, which is indeed one of the main pur -
       poses of developing ontologies.
   Adaptability (Adaptability, Organizational Fitness)
   Ontologies should be extendable to new domains, reflecting new discoveries, scientific
advancements, or novel ways of understanding existing knowledge. We must take care here, however,
when reflecting on what it means for an ontology development strategy to promote adaptability.
Ontologies may be extended vertically - by upward or downward population of content – or
horizontally – by covering new domains not yet represented. The upward and downward population
of content places constraints on the ontology content that can be created consistently. For example,
asserting that instances of process must have some temporal part requires that any subclasses of
process also have some temporal part, thereby narrowing the space of possible extensions. A
horizontal extension is, however, sometimes entirely unconstrained, as when an ontology is developed
outside of existing ontologies.
   Bottom-Up: Strictly speaking, this strategy promotes adaptability solely in the sense of hori-
       zontal extension since it encourages the creation of ontology silos, entirely disconnected from
       other potential ontologies within an enterprise. New terms and relations can be introduced out -
       side the scope of existing ontologies developed following this strategy, without encountering in -
       consistency. This, of course, comes at the cost of interoperability.
   Middle-Out: This strategy fares poorly when adaptability is understood as a vertical extension
       since constraints are applied during the upward and downward population of ontology content.
       Ontology content developed in the upward direction must also bear constraints from the lower-
       level ontology content, and vice versa. The upward and downward aspects of this strategy lead
       it thus compromised on two fronts. Consider, from the Credential Transparency Description
       Language [29]: an address is defined as “particulars describing the location of the place”,
       whereas ‘particular’ refers to details of a description. Such use of ‘particular’ is, however,
       wildly different from the use of ‘particular’ in, say, DOLCE [5], where the term denotes an in -
       dividual. Extending Credential Engine to a domain covered by a DOLCE-based ontology would
       thereby generate inconsistency. On the other hand, where adaptability is understood as horizon-
       tal extension the middle-out strategy permits the development of ontologies with few con-
       straints, with the caveat simply being that at some point such new ontologies will need to con-
       nect with a top-level. In this respect, the middle-out strategy provides flexibility in ontology de -
       sign.
   Top-Down: The top-down strategy encourages constraints on ontology content extended from
       the top-level, and so constrains the range of possible ontology extensions. Put another way,
       when adaptability is understood as a vertical extension, the top-down strategy places constraints
       on a downward population of ontology content and does not permit an upward population. In
       this respect, the top-down strategy fares worse than the bottom-up strategy with respect to per-
       mitting extensions without generating inconsistency, since any downward populated content




                                                                                                               188
       must remain consistent with the content from which it extends. The top-down strategy fares
       worse, moreover, than either of the other strategies when considering adaptability from the hori -
       zontal perspective. This is because any ontology designed following the top-down strategy must
       find parent terms and relations that ultimately have roots in a top-level ontology. In other
       words, when attempting to cover a new domain, ontology developers must find constraints to
       apply to ontology content for this new domain.
   Implicitness (Accuracy, Consistency, Coherence)
   It is important to be able to discern the extent to which an ontology design strategy encourages or
discourages the creation of implicit incoherence.
   Bottom-Up: While the bottom-up strategy offers developers greater flexibility, this approach
       has its challenges when it comes to inviting implicit incoherence. The significant dependence
       on data often results in classes that mirror imperfections in the data. Consequently, terms and
       relations falling out of a bottom-up strategy tend to reflect inherent flaws in the data.
   Middle-Out: Without an explicit structure guiding expansion, ontology developers following
       this strategy may overlook the "implicit rules" essential for coherence. Additionally, the ab-
       sence of a shared foundational understanding can render much of the stored knowledge incon-
       sistent, without obvious methods for detecting such inconsistency. Moreover, transitioning the
       ontology to different domains can be challenging for middle-out strategy ontologies, especially
       when the abstract classes it has developed are too domain-specific. As discussed, the term ad-
       dress in the Credential Transparency Description Language [29] is defined differently than in
       other ontologies, which can lead to confusion when seeking alignment.
   Top-Down: This strategy exhibits advantages and challenges. The top-down strategy equips the
       ontologist with the tools needed to craft robust and rigorous axioms, paired with clear class def -
       initions, reducing the risk of implicit inconsistencies. However, the rigidity of the top-level
       classes can be a limitation. If incoherence arises, rectifying it can be challenging, especially if it
       necessitates changes to foundational classes.

4. Discussion
    This section offers a comprehensive analysis of three ontological strategies, summarizing their
strengths, weaknesses, and unique characteristics with respect to the preceding evaluative criteria. We
aim to provide a holistic view, facilitating a deeper understanding of each method's applicability and
limitations.
    The bottom-up strategy excels in clarity and conciseness, primarily because the ontologies it
produces are more closely aligned with the terminology associated with a given domain. When
conducted effectively, this strategy accurately mirrors domain data structure, ensuring a direct
correlation with the original data's knowledge. However, such close reliance on original datasets can
compromise adaptability and accuracy when applied to slightly varied domains. The bottom-up
strategy’s biggest strength - its proximity to the original data – is also its biggest weakness.
     The middle-out strategy generally outperforms the bottom-up method. Ontology developers
following this strategy tend to avoid getting mired in data details. Nevertheless, the absence of a top-
level set of common terms can lead to implicit commitments in an ontology developed following the
middle-out strategy. Moreover, developers will ultimately be pressed in this strategy to create local,
domain-dependent top-level terminology, that will quite likely impede clarity and adaptability, and
lead to challenges in semantic homogeneity and external interoperability.
     The top-down strategy stands out in terms of coherence, clarity, and completeness. It offers
adaptability, especially when using top-level classes to ensure coherence across domains. However,
its inherent ontological commitments may make it fragile, necessitating careful adaptation to new
domains, and rigorous attention to how ontologies are extended from it. While the approach promotes
ontology alignment and avoids redundancy, it demands a centralized organizational structure and
skilled individuals adept at handling abstract classes.
     The top-down strategy appears to perform best overall with respect to our evaluative criteria, with
the middle-out strategy coming second, and the bottom-up strategy third. While challenges exist, the
top-down method's proactive approach to axiomatization and class building during production can




                                                                                                                189
also mitigate issues that might arise post-deployment. That said, it is important to emphasize that
applying top-down techniques often requires more effort in the development of an ontology.
Compliance to a top-level is not cheap, and the benefits gained are not always obvious when an
ontology is being developed for a specific domain-level modeling task. The tradeoff is – we claim –
best characterized as a difference between short-term and long-term costs. Effort spent upfront
following a top-down strategy saves effort downstream, a cost that artifacts resulting from middle-out
and bottom-up strategies will ultimately have to pay later – when it is more expensive - if they hope to
promote semantic interoperability.

5. The Middle-In Strategy
    Most researchers appear to develop ontologies according to one of the three developmental
strategies discussed thus far, but our analysis of the complexity of ontology development suggests
there is a promising strategy not yet discussed in the literature. The middle-in strategy combines
aspects of the top-down and bottom-up strategies, in a manner distinct from the middle-out strategy.
According to this strategy, ontologists begin with a top-level ontology which is then used as a guide
when exploring data to develop domain-level ontology content. This strategy takes its name from
starting at both the top and bottom levels, then developing ontology content to meet in the middle.
    During the writing of this manuscript, it was discovered that some ontology developers have –
often unknowingly – employed such a strategy or have hinted towards such a strategy. For example,
in 2010 Enrico Francesconi and his team published a paper where they describe a similar method to
build DALOS, a multilingual ontology for the legal domain [30]. More recently, CIDO and VIDO are
the result of the middle-in strategy, as they each use BFO as a top-level ontology, while employing
bottom-up, data-focused, design. For example, they re-use reference ontologies and mid-level
ontologies like CHEBI [31] and OBI [25]. Moreover, CIDO was built starting from the classifications
of real-life, already existing data coming from GISAID, NextStrain, and DrugBank, as well as data
coming from domain-specific literature [32], and VIDO was related to taxonomies such as the
NCBITaxon [33] and based on the Baltimore Classification [34]. Finally, both made use of
connections with domain experts to maintain their terminology and ontological commitments
grounded within the domain. The perks of employing top-down methods are still visible in the quality
of employed definitions and axioms, as well as in the number of external ontologies and domains that
are referred to, suggesting the ontologies are adaptable and extensible.
    CIDO and VIDO score well on all criteria most relevant for semantic interoperability. Starting
with accuracy, the precision of the content of the two stems from the quality and variety of the data
and domain knowledge used as a development basis. Clarity is favored by the terminology respecting
these sources, as well by the terms being defined using the Aristotelian schema. Coherence is
respected through the use of axioms taken or developed starting from the top-level layer provided by
BFO. Adaptability is preserved thanks to native integration with a set of BFO-based ontologies and to
potential coherent integration with all other ontologies that use the same upper-level architecture.
These virtues make CIDO and VIDO stable hubs for long-term development of interoperable
terminologies in the realm of infectious disease representation, and provide confirmation for our
evaluation of middle-in ontologies as best suited to promote semantic interoperability.

6. Conclusion
   Each of the four methodologies presents distinct advantages and challenges. The bottom-up
approach, with its implicit semantics, is apt for smaller projects where team members share a common
understanding and where rapid access to existing well-defined datasets is needed. Conversely, for
larger projects demanding explicit semantics to maintain coherence across multiple contributors,
extract implicit information, or promote reasoning capabilities, the middle-out and top-down
approaches are more appropriate. Both, however, are less appropriate than the middle-in strategy
which concluded our discussion. In the context of infectious disease ontologies, CIDO and VIDO
represent successful endeavors in providing a basis for structuring data. We argue that the quality of
the two ontologies is in part a function of their adoption of a middle-in development strategy. While




                                                                                                           190
the middle-in strategy is akin to the top-down strategy, the former may yield more fragile ontologies
due to its rigorous nature. Proper implementation demands meticulous effort from ontologists in
crafting precise mappings and alignments. As we understand it the primary impediment to
interoperability arises from inconsistent terminology and axioms across domains; this is an issue best
addressed by the middle-in strategy.

    Acknowledgements
   The authors wish to acknowledge the help and comments of Alexander Diehl, Asiyah Lin, the
members of the Spring 2023 UB Logic for Ontology seminar, the PROVO-BFO mapping group, the
2023 ICBO Infectious Disease workshop audience and especially of one of its participants, who
pointed out that empirical tests should accompany our current methodology for testing adherence to
the criteria.

    References
[1] Arp Robert, Spear Andrew D., Smith Barry, “Building Ontologies with Basic Formal Ontology”, MIT
     Press, 2015.
[2] Keet, Maria C. “Introduction to Ontology Engineering”, College Publications, 2020.
[3] Belhoucine, K., Mourchid, M., Mouloudi, A., Mbarki, S., "A Middle-out Approach for Building a Legal
     domain ontology in Arabic," 2020 6th IEEE Congress on Information Science and Technology (CiSt),
     Agadir - Essaouira, Morocco, (2020): 290-295.
[4] Uschold Mike, Grüninger Michael. “Ontologies: Principles, Methods and Applications.”, The Knowledge
     Engineering Review, vol. 11, no. 2, pp. 93–136 (1996)
[5] Raad Joe, Cruz Christophe, “A survey on ontology evaluation methods”, Proceedings of the International
     Conference on Knowledge Engineering and Ontology Development, part of the 7th International Joint
     Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, (2018).
[6] BFO 2020, 2020, URL: https://github.com/BFO-ontology/BFO-2020
[7] Borgo Stefano, Claudio Masolo. “Ontological Foundations of Dolce”, Theory and Applications of
     Ontology: Computer Applications, pp. 279–295, (2022).
[8] Mizoguchi Riichiro, Borgo Stefano, “Yamato: Yet-Another More Advanced Top-Level Ontology”,
     Applied Ontology, vol. 17, no. 1, pp. 211–232, (2022).
[9] CCO, Common Core Ontologies 2023, URL : https://github.com/CommonCoreOntology/
[10] Kulvatunyou, B. , Drobnjakovic, M. , Ameri, F. , Will, C. and Smith, B. (2022), “The Industrial Ontologies
     Foundry (IOF) Core Ontology”, Formal Ontologies Meet Industry (FOMI) 2022, Tarbes, FR, (2022).
[11] Yongqun He, Hong Yu, Edison Ong, Yang Wang, Yingtong Liu, anthony Huffman, Hsin-hui Huang, John
     Beverley, Junguk Hur, Xiaolin Yang, Luonan Chen, Gilbert S. Omenn, Brian Athey and Smith Barry,
     “CIDO, a community-based ontology for coronavirus disease knowledge and data integration, sharing, and
     analysis”, Scientific Data Nature, (2020).
[12] He Yongqun, Yu Hong, Hufman Anthony, Yu Lin Asiyah, Natale Darren A., Beverley John, Zheng Ling,
     Perl Yehoshua, Wang Zhigang, Liu Yingtong, Ong Edison, Wang Yang, Huang Philip, Tran Long, Du
     Jinyang, Shah Zalan, Shah Easheta, Desai Roshan, Huang Hsin‐hui, Tian Yujia, Merrell Eric, Duncan
     William D., Arabandi Sivaram, Schriml Lynn M., Zheng Jie, Masci Anna Maria, Wang Liwei, Liu
     Hongfang, Smaili Fatima Zohra, Hoehndorf Robert, Pendlington Zoë May, Roncaglia Paola, Ye Xianwei,
     Xie Jiangan, Tang Yi‐Wei, Yang Xiaolin, Peng Suyuan, Zhang Luxia, Chen Luonan, Hur Junguk, Omenn
     Gilbert S., Athey Brian and Smith Barry, “A comprehensive update on CIDO: the community-based
     coronavirus infectious disease ontology”, Journal of Biomedical Semantics, (2022).
[13] Beverley, John ; Babcock, Shane ; Smith, Barry ; He, Yongqun ; Merrell, Eric ; Cowell, Lindsay ; Hurley,
     Regina & Duesing, Sebastian (2022). Coordinating Coronavirus Research: The COVID-19 Infectious
     Disease Ontology. Proceedings of the International Conference on Biomedical Ontologies.
[14] Vrandečić Denny, “Ontology Evaluation.” in: Steffen Staab, Rudi Studer (Ed.), Handbook on Ontologies,
     200, pp. 293–313.
[15] Diehl Alexander D., Lee Jamie A., Scheuermann Richard H., Blake Judith A., “Ontology development for
     biological systems: immunology”, Bioinformatics Applications Note, Vol.23 no.7 (2007): 913-915.
[16] Hogan, Aidan, et al. "An empirical survey of linked data conformance.", Journal of Web Semantics 14, pp.
     14-44, (2012).




                                                                                                                  191
[17] Zaveri Amrapali, Rula Anisa, Maurino Andrea, Pietrobon Ricardo, Lehmann Jens, Auer Sören, "Quality
     assessment for linked data: A survey", Semantic Web 7.1 pp. 63-93, (2016).
[18] Neuhaus Fabian, Hastings Janna, “Ontology is Consensus Creation, Not (Merely) Representation, Applied
     Ontology 17, (2022).
[19] Fox Mark, Grüninger Michael, “The Role of Competency Questions in Enterprise Engineering”,
     Proceedings of the IFIP WG5.7 Workshop on Benchmarking - Theory and Practice, (1994).
[20] McCune William, “Mace4 Reference Manual and Guide”, Argonne National Laboratory Technical
     Memorandum n.264, (2003).
[21] Hermit, an OWL 2 reasoner, 2023. URL: http://www.hermit-reasoner.com/
[22] Pellet, an OWL 2 reasoner, 2023. URL: https://github.com/stardog-union/pellet
[23] Abukwaik H, Taibi D, Rombach D (2014) Interoperability-related architectural problems and solutions in
     information systems: a scoping study. In: Avgeriou P, Zdun U (eds) Software architecture. ECSA 2014.
     Lecture notes in computer science, vol 8627. Springer, Cham. https://doi.org/10.1007/978-3-319-09970-
     5_27
[24] Justine Flore Tchouanguem, Mohamed Hedi Karray, Bernard Kamsu Foguem, Camille Magniont, F.
     Henry Abanda, and Barry Smith. 2021. BFO-based ontology enhancement to promote interoperability in
     BIM. Appl. Ontol. 16, 4 (2021), 453–479. https://doi.org/10.3233/AO-210254
[25] Bandrowski A, et al. (2016) The Ontology for Biomedical Investigations. PLoS ONE 11 (4): e0154556.
     doi: 10.1371/journal.pone.0154556.
[26] Otte NJ, Beverley J, Ruttenberg A. (2022) BFO: Basic Formal Ontology. Applied Ontology. 17-43.
[27] Seppälä Selja, Ruttenberg Alan, Schreiber Yonatan, Smith Barry, “Definitions in ontologies”, Cahiers de
     Lexicologie 109 (2), pp.175‐207, (2016).
[28] Smith B, et al. (2007) The OBO Foundry: coordinated evolution of ontologies to support biomedical data
     integration. Nat Biotechnol. 25:1251–1255. doi:10.1038/nbt1346.
[29] Credential Transparency Description Language (CTDL), 2023, Accessed 8-14-23 from
     https://credentialengine.org/credential-transparency/ctdl/
[30] Francesconi Enrico, Montemagni Simonetta, Peters Wim, Tiscornia Daniela, “Integrating a Bottom–UP
     and Top–down Methodology for Building Semantic Resources for the Multilingual Legal Domain”,
     Semantic Processing of Legal Texts, pp. 95–121, (2010).
[31] Hastings J. et al. ChEBI in 2016: Improved services and an expanding collection of metabolites. Nucleic
     acids research 44, D1214-1219, doi:10.1093/nar/gkv1031 (2016).
[32] Yu Hong, Li Li, Huffman Anthony, Beverley John, Hur Junguk, Merrell Eric, Huang Hsin-hui, Wang
     Yang, Liu Yingtong, Ong Edison, Cheng Liang, Zeng Tao, Zhang Jingsong, Li Pengpai, Liu Zhiping,
     Wang Zhigang, Zhang Xiangyan, Ye Xianwei, Handelman Samuel K., Sexton Jonathan, Eaton Kathryn,
     Higgins Gerry, Omenn Gilbert S., Athey Brian, Smith Barry, Chen Luonan and He Yongqun, “A new
     framework for host- pathogen interaction research”, Frontiers in Immunology, (2022)
[33] Federhen S. (2012) The NCBI Taxonomy Database. Nucleic Acids Res. 40:D136-D143.
     doi:10.1093/nar/gkr1178.
[34] Baltimore D. (1971). Expression of Animal Virus Genomes. Bacteriological Reviews. 35, 235-41.




                                                                                                               192