<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Detecting Influences of Ontology Design Patterns in Biomedical Ontologies</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Christian Kindermann</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bijan Parsia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Uli Sattler</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Background on Ontology Design Patterns</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Manchester</institution>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Ontology Design Patterns (ODP) have been proposed to facilitate ontology engineering. Despite numerous conceptual contributions for over more than a decade, there is little empirical work to support the often claimed benefits provided by ODPs. Determining ODP use from ontologies alone (without interviews or other supporting documentation) is challenging as there is no standard (or required) mechanism for stipulating the intended use of an ODP. Instead, we must rely on modelling features which are suggestive of a given ODP's influence. For the purpose of determining the prevalence of ODPs in ontologies, we developed a variety of techniques to detect these features with varying degrees of liberality. Using these techniques, we survey BioPortal with respect to well-known and publicly available repositories for ODPs. Our findings are predominantly negative. For the vast majority of ODPs we cannot find empirical evidence for their use in biomedical ontologies.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Different frameworks for working with patterns in Ontology Engineering have
been proposed [
        <xref ref-type="bibr" rid="ref11 ref16 ref20 ref22 ref24 ref27 ref28 ref8 ref9">8, 9, 11, 16, 20, 22, 24, 27, 28</xref>
        ]. Each framework is based on a different
approach for capturing assumed benefits of patterns and introduces its own
terminology as well as its own notation. While these different approaches bear
similarities to each other in some respects, there have been no efforts towards a
standardisation process. There is also no generally accepted de facto standard for
working with patterns in practise.
      </p>
      <p>A unifying concept for a majority of these proposals is a practical notion
pattern reuse. Such notions often involve prefabricated components expressed in
some representation formalism on the one hand, and operations to manipulate
these components on the other.</p>
      <p>
        Consider the following examples in which a pattern has been proposed to be
reused as
∙ “[. . .] a first-order theory whose axioms are not part of the target knowledge
base, but can be incorporated via renaming of their non-logical symbols [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].”
∙ “[] distinguished ontolog[y].” The basic mechanism for its application is OWL
ontology import in which pattern elements cannot be modified. Otherwise,
common operations for patterns are “clone, specialisation, generalisation,
composition, expansion.” [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ].
∙ “[. . .] an ontology fragment, including directly reusable elements (classes,
properties, etc.) as well as demo-elements that would be replaced by the
user’s own. The directly reusable elements should typically be borrowed from
upper level ontologies [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ].”
      </p>
      <p>Clearly, these ideas of pattern reuse are based on a set of predefined axioms
that may or may not be modified. In the scope of this work, we will restrict our
attention to ODPs of this kind, i.e. ODPs that are captured by a set of axioms or
an OWL ontology. In the following, a set of axioms (with or without variables)
given as part of an ODP that is meant for some notion of reuse, will be referred
to as a reusable component of the ODP. ODPs with reusable components have
been the focus of the academic literature for over a decade and are commonly
classified into two types: Content Ontology Design Patterns (CODP) and Logical
Ontology Design Patterns (LODP).</p>
      <p>
        CODPs are motivated as conceptual modelling solutions featuring a domain
dependant signature, possibly extracted from Upper Level Ontologies to be
applicable across different domains [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. For example, the following axioms have
been proposed as part of the AgentRole pattern which aims to provide a generic
modelling solution for capturing role relationships [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]:
      </p>
      <sec id="sec-1-1">
        <title>Role ⊑ ∃hasTemporalExtent.TemporalExtent</title>
      </sec>
      <sec id="sec-1-2">
        <title>AgentRole ⊑ Role</title>
        <p>∃rolePerformedBy.Agent ⊑ AgentRole</p>
        <p>rolePerformedBy ≡ performsRole−</p>
        <p>
          LODPs on the other hand are motivated as structural modelling solutions
that are domain-independent [
          <xref ref-type="bibr" rid="ref11 ref21">11, 21</xref>
          ]. They are characterised by a set of axioms
containing variables that are to be replaced as needed in the context of some use
case. For example, the LODP Partition describes how a concept can be divided
into distinct, non-overlapping, but covering subconcepts:
        </p>
        <p>≡ 1 ⊔ . . . ⊔ 
 ⊓  ⊑ ⊥ (for all ,  ≤  ∧  ̸= )
(1)
(2)
Here, , 1, . . . ,  are understood as variables for concepts that need to be
replaced. The variable concept  is the divided into covering parts 1, . . . ,  (see
Axiom 1). The parts 1, . . . ,  are made non-overlapping by pairwise disjointness
constraints (see Axioms 2).
3</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Pattern Detection</title>
      <p>The lack of a generally agreed upon notion for ODP reuse poses a challenge
for determining whether an ODP has in fact informed the design of a given
ontology. Different approaches for ODP reuse result in different modelling features
suggestive for a given ODP’s influence. Therefore, we must design a detection
mechanism that accounts for this uncertainty.</p>
      <p>In the scope of this work, we limit our investigation to approaches that are based
on ODPs documented with reusable components (cf. Section 2). Furthermore, we
assume these components to be given in the form of ontologies or more generally
sets of axioms. Given such a component , the problem of detecting modelling
features which are suggestive of the ODP’s influence in a given ontology  can be
reduced to detecting features of  shared with . In the following, we formulate
a list of non-exhaustive criteria that may be used to determine shared features
between  and .
3.1</p>
      <sec id="sec-2-1">
        <title>Detection Techniques</title>
        <p>
          One of the earliest approaches for reusing an ODP’s  proposes to use ontology
imports as the basic mechanism for reuse [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. This approach has been adopted
by the NeOn project [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ], in the context of which a large amount of work has
been carried out that promulgated into the academic literature.
        </p>
        <p>Import containment Detecting whether a given  of some ODP has been
imported in an ontology  comes down to a straightforward analysis of ’s
import declaration. Given our primary concern of detecting an ODP’s influence
without any further qualification, we will generally equate an ontology with its
import closure unless stated otherwise.</p>
        <p>The analysis of ’s import declarations is based on the two ways an ontology
may be imported. Namely, import by name and import by location. Import by
name is performed by interpreting the object of an import declaration as the
name of an ontology in a predefined list of ontology repositories. If the object
of an import declaration can be matched with the name of an ontology in said
repositories, then the ontology is imported. Contrary, import by location is
performed by interpreting the object of an import declaration as a physical
location of an ontology. This location may be a location in the local file system.</p>
        <p>Import by name allows for an unambiguous way to determine whether a given
 has been imported, if its name in some ontology repository is known. Import
by location on the other hand, poses a challenge due to the possibility of arbitrary
renaming of local files. Nevertheless, it is reasonable to assume that the name of
a local file is suggestive of its contents. Hence, it is worthwhile to consider import
declarations as candidates for  reuse if the object of the declaration is lexically
close to the respective ODP’s name.</p>
        <p>These considerations motivate an ImportCheck for an ODP captured as
follows. First, check whether  is imported by name in  (including the import
closure). If  is not found, we test whether the object of any import declaration
in  is lexically similar to the ODP’s name captured by .</p>
        <p>
          Signature overlap It has been proposed to reuse a given  by coping its
contents into a target  [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]. Copying any logical entities in  verbatim will result
in syntactic traces, i.e, ̃︀ ∩ ̃︀ ̸= ∅, where ̃︀ denotes the signature of an ontology,
i.e., its concept, role, and individual names. Hence, we specify an IRICheck that
tests for all logical axioms  ∈  whether the IRI of any  ∈ ̃︀ occurs in . This
occurrence test in  includes checking non-logical parts such as annotations and
entity declarations.
        </p>
        <p>In addition, we specify a NamespaceCheck that tests whether the longest
common prefix of IRIs of many entities in , occurs in .</p>
        <p>
          Lexical variation Apart from approaches to ODP reuse that preserve the
IRIs of elements in , there are proposals allowing for the possibility of a
renaming of copied elements [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. In this case, the reuse of axioms  ∈  can be
identified by some substitution  : ̃︀ → ̃︀ such that  ( ) ∈ . Here, we require
substitutions to respect types, i.e. concepts, roles, and individuals are only mapped
to concepts, roles, and individuals respectively. However, with no information
expressly declaring that  has been reused via some  in , determining whether
 has been reused under some elusive substitution is a challenging task.
        </p>
        <p>Based on the assumption that entities  ∈ ̃︀ exhibit lexical similarities to
their mappings  () ∈ ̃︀, we can generate candidate substitutions. For  ∈ ̃︀,
find a ′ ∈ ̃︀ that is lexically similar. If such a ′ exists for all  ∈ ̃︀, then we
can define a substitution  simply by  ↦→ ′. Given a candidate  , we specify
a SubstitutionContainmentCheck that tests whether  ( ) ∈  holds for all
axioms  ∈ , where  ( ) denotes an axiom in which all substitutions specified
in  have been performed.</p>
        <p>
          Logical variation Besides changing the signature of an ODP’s , there have been
proposals for ODP reuse based on reimplementing aspects of  by analogy [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
In this case, both the logical structure as well as the signature of axioms  ∈ 
may be subject to change. Based on motivations for logical rewritings of  [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ],
we specify a SubstitutionEntailmentCheck that tests whether there exists
some substitution  (generated as above) such that for all  ∈  it holds that
 |=  ( ).
        </p>
        <p>Logical Axiom Agreement In addition to detection techniques searching for
positive evidence of a given ODP’s influence, we may also test for necessary
requirements imposed by some notion of ODP reuse. If these requirements
are not met by some ontology , then we can exclude an ODP’s reuse with
respect to the notion in question. For example, positive evidence for  under
SubstitutionContainmentCheck requires an ontology  to contain structurally
identical axioms to  since a simple renaming of entities in axioms of  does not
affect their logical structure. Thus, if an ontology  does not exhibit at least as
many structurally identical axioms as a given , then we can conclude that 
can not have been reused by a simple renaming of its signature.</p>
        <p>
          The above argument motivates an AxiomTypeCheck that tests whether a given
ontology  contains at least as many axioms of a given type as . The OWL
2 language specification distinguishes between three categories of axiom types:
class expression axioms, object property axioms, and data property axioms. Each
category defines a number of axiom types, e.g., subclass axioms, inverse object
properties, or disjoint data properties [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
        </p>
        <p>Logical Expression Agreement Orthogonal to a structural agreement in
terms of axioms, we can also specify a structural ExpressionCheck that tests
whether certain logical constructs or combination of logical constructs occurring
in a given  are present in an ontology. For example, if a logical constructor, e.g
concept disjunction ⊔, occurs in some expression used in , but there is no such
expression in  (as is often the case for biomedical ontologies conforming to the
EL profile), then certain notions of reusing  can be ruled out.</p>
        <p>
          In the context of this work, we specify expression checks for two logical
structures that seem to be crucial for a fair number of ODPs and LODPs in
particular. These structures are disjoint unions on the one hand, and -ary
relationships on the other hand. We test for the presence of disjoint unions as
specified in [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. Since OWL 2 only allows for binary relationships, ODPs have
been proposed to model -ary relationships by using multiple binary relationships
in combination [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. Hence, we test for the necessary condition of a concept that is
subsumed by at least two role restrictions. We refer to the checks of both of these
structures as DisjointUnionCheck and NAryRelationCheck respectively.
3.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Algorithm</title>
        <p>Most techniques introduced in the previous section involve some form of lexical
comparison between entities of  and . In order to maximise the recall of our
detection mechanism, we employ a threefold procedure for establishing a lexical
similarity between two strings 1 and 2 with an increasing degree of liberality.</p>
        <p>The first part is a strict equality that requires all symbols occurring in 1
to coincide with symbols in 2 at their respective positions. The second part
is a loose string match between 1 and 2 that removes all symbols not in
the Latin alphabet, converts all characters to lower case, and tests for string
containment of 1 in 2. The third part consists of calculating a string similarity
score greater that 0.8 between two strings 1, 2. The similarity score is calculated
by − ℎ(1,2) , where  = max(1.ℎ, 2.ℎ).</p>
        <p />
        <p>A lexical association between two elements 1 ∈ ̃︀ and 2 ∈ ̃︀ is established
by applying the above string comparison procedure to (1) both IRIs of 1 and 2,
(2) both ShortFormIRIs of 1 and 2, (3) 1’s IRI and 2’s annotations, (4) 1’s
ShortFormIRI and 2’s annotations.1</p>
        <p>Using this string comparison procedure for lexical comparisons between
entities of  and  in techniques presented in the previous section, we specify
the following algorithm to detect influences of a given ODP exhibiting lexical
modelling features.</p>
        <p>Algorithm 1: Pattern Detection</p>
        <p>Input : Ontology , Pattern</p>
        <p>Output : Suggestive evidence for influence of  in 
1 if ImportCheck(, ) then
2 return Import declarations in  containing 
3 if IRICheck(, ) then
4 return All  ∈  that account for evidence of the check
5 if NamespaceCheck(, ) then
6 return All  ∈  that account for evidence of the check
7 if AxiomTypeCheck(, ) then
8 if SubstitutionContainmentCheck(, ) then
9 return All  such that  () ∈ 
10 end
11 if SubstitutionEntailmentCheck(, ) then
12 return All  such that  |=  ()
13 end</p>
        <p>For ODPs that only propose a set of axioms with variables to be
instantiated we cannot sensibly apply Algorithm 1. Instead, the only applicaple
detection techniques are the structural AxiomTypeCheck, DisjointUnionCheck, and
NAryRelationCheck.
4</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Methods</title>
      <p>In Section 2, we have characterised the status quo of academic research around
ODPs by a diversity of ideas regarding both the notion of ODPs itself and ODP
reuse. This motivates an investigation of the research question as to how prevalent
ODPs influences in biomedical ontologies are. In the following, we describe our
procedure for answering this question.</p>
      <p>
        Pattern Corpus The most well-known catalogues for ODPs are (1) the ODP
Semantic Web Portal [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and (2) the ODPs Public Catalog [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Both of these
catalogues reflect the focus of the academic literature on CODPs and LODPs
1 We also considered using annotations of entities 1 ∈ ̃︀ to establish a lexical
relationship with 2 ∈ ̃︀. However, we noticed that annotations of 1 for (alternative)
labels are either equal to its ShortFormIRI or slight variations thereof. Since these
variations are already captured by our string comparison procedure, we decided
against using 1’s annotations to guard against spurious matches.
and contain mostly submissions for these two types. We select all ODPs from
catalogues (1) and (2) that satisfy the following criteria:
(i) The pattern is categorised as either an LODP or CODP in catalogue (1).
(ii) The pattern is published together with an ontology as its reusable component
or the pattern is published with an example ontology to demonstrate its
reuse.
(iii) The reusable component or example ontology can be loaded and initialised
with a reasoner by the OWL API.
(iv) A CODP is documented to belong to some biomedical related domain.
      </p>
      <p>This selection procedure results in the selection of 47 out of 155 CODPs from
(1), 4 out of 18 LODPs from (1), and all 16 ODPs from (2). Selected patterns
according to criteria (iv) belong to at least one of the following domains:
Agriculture, Biology, Cartography, Chemistry, Decision-making, Document Management,
Earth Science or Geoscience, Ecology, Event Processing, Explanation, Fishery,
General, Geology, Health-care, Management, Manufacturing, Materials Science,
Organisation, Participation, Parts and Collections, Physics, Planning, Product
Development, Scheduling, Software, Software Engineering, Social Science, Time,
Work-flow.</p>
      <p>
        Ontology Corpus We use a publicly available snapshot of BioPortal from
2017 [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Choosing the data set that contains all ontologies in their original state,
we extract all ontologies from the archive into one folder. Any ontology that could
not be loaded or handled by a reasoner in the OWL API was excluded form the
study. This procedure results in the exclusion of 78 out of 438 ontologies resulting
in a corpus of 360 ontologies.
      </p>
      <p>Experimental Design Our empirical investigation consists of two distinct
experiments.</p>
      <p>In the first experiment, we run Algorithm 1 over all input combinations of
ontologies from the ontology corpus and the 47 CODPs from catalogue (1). This
experiment is designed to provide positive indiciations for influences of ODPs
that exhibit lexical features, namely CODPs.</p>
      <p>In the second experiment, we run the AxiomTypeCheck over all input
combinations of ontologies from the ontology corpus and LODPs from catalogue (1) as well
as ODPs from catalogue (2). We distinguish between two conditions: (a) including
and (b) not including the imports closure of a given ODP. Lastly, we perform the
DisjointUnionCheck, and the NAryRelationCheck over all ontologies from the
ontology corpus. This experiment is designed to probe ontologies for necessary
conditions of several notions of ODP reuse.
5
5.1</p>
    </sec>
    <sec id="sec-4">
      <title>Results</title>
      <sec id="sec-4-1">
        <title>Experiment 1: Detection of CODPs</title>
        <p>Our detection mechanism generates only scant evidence for suggestive influences of
CODPs. The results are summarised in Table 1 (SubstitutionContainmentCheck
and SubstitutionEntailmentCheck are abbreviated by SContainmentCheck
and SEntailmentCheck). Each row reports on the evidence generated by each
subcomponent of Algorithm 1. In the following, we will provide further details on
these results with respect to the used techniques (1)-(5).
(1) The ImportCheck detects only one pattern that was undisputedly reused
by import, namely the AgentRole pattern. Interestingly, this reuse by import
was only detected due to AgentRole being both in the corpus of patterns
and ontologies. Since each ontology is contained in its own import closure,
the detection of AgentRole is as expected. Otherwise, the ImportCheck only
generates candidates for ODP reuse via import by location on the basis of lexical
association. For example, the pattern Region was generated as candidate in the
“Ontology of Geographical Region” since it contained the ontology
“http://www.owlontologies.com/GeographicalRegion.owl” in its import closure.</p>
        <p>(2) The detected reuse of an ODP’s IRIs by the IRICheck are unsurprisingly
owed to the presence of the AgentRole pattern.</p>
        <p>(3) The NamespaceCheck performed with “http://ontologydesignpatterns.org”
results in the detection of 5 entities in 3 different ontologies. In all cases, a “seeAlso”
annotation reference web pages related to ODPs. For example, the object property
“part of” in the “human interaction network ontology” has been annotated with
“rdfs:SeeAlso &lt;http://ontologydesignpatterns.org/wiki/Submissions:PartOf&gt;”.</p>
        <p>(4) The SubstitutionContainmentCheck generated candidate substitutions
for 11 patterns in 46 ontologies. Two out of the ODPs account solely for 26 of the
46 ontologies in which substitutions could be generated. These two ODPS are
TypesOfEntities and GOTop.</p>
        <p>(5) The SubstitutionEntailmentCheck did not result in the generation of
additional candidate substitutions.
5.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Experiment 2: Testing necessary conditions</title>
        <p>The AxiomTypeCheck shows for both conditions (a) and (b) that 75% of ontologies
do not exhibit the necessary number of different axioms types occurring in ODPs
from catalogue (1). Similarly, 80% of ontologies do not contain the necessary
number of different axioms types for the majority (37 out of 47) of ODPs from
catalogue (2).</p>
        <p>However, the NAryRelationCheck reveals that nearly half (168/360) of all
ontologies in the corpus fulfil the tested necessary for the existence of an -ary
relationship. Lastly, the DisjointUnionCheck finds evidence for disjoint unions
in 24 ontologies.
6</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Discussion</title>
      <p>The results of our investigation provide very little support for influences of ODPs
in biomedical ontologies. The negative results of our ImportCheck show that a
given ODP’s component  is not reused in practise as originally envisioned by
the NeOn project. Furthermore, the negative results of our IRICheck indicate
that even parts of reusable components  do not directly influence the ontology
engineering tasks in practise.</p>
      <p>Even though we could not find explicit evidence for any ODP being reused by
import, we did find evidence by the mere presence of the AgentRole pattern in
the corpus of ontologies. Through manual inspection of the original data set for
the used BioPortal snapshot, we notice that the AgentRole pattern is located
in an archive file for the ontology ICPS. This archive also contains another
pattern, namely Person. However, the ontology ICPS has been excluded during
the process of the ontology corpus construction for the study. This observation
raises the question whether our results are skewed by our ontology exclusion
criteria for constructing the ontology corpus. We can invalidate this concern due
to the following. First, we acquire a version of the used BioPortal snapshot in
which each ontology has been merged with its import closure. Then, we treat
all ontologies as simple text files and performed another NamespaceCheck. Still,
there is no positive finding to be reported.</p>
      <p>Inspecting the positive evidence found by the NamespaceCheck, it is quite
clear that practitioners create their own entities instead of reusing IRIs from
ODPs directly. Nevertheless, it remains unclear whether this is owed to a conscious
modelling decision, mere personal preference, lack of know-how, or lack of tool
support for ODPs.</p>
      <p>Yet, there is a caveat with respect to reusing IRIs from ODPs. Some ODPs
published on http://ontologydesignpatterns.org are said to be “extracted
from upper level ontologies”. However, interestingly, their respective reusable
components  are often self-contained ontologies not bearing any relation to
upper level ontologies. This suggests that  is a somehow reimplemented fragment
of the upper level ontology. Clearly, this gets practitioners into the predicament of
choosing between aligning their ontologies to an upper level ontology or an ODP
(if they are so inclined in the first place). Hence, it is possible that practitioners
prefer to work with the original upper level ontology rather than the extracted
ODPs thereof.</p>
      <p>
        Irrespective of any matter of renaming, the findings of our AxiomTypeCheck
suggest that modelling features exhibited by most reusable components of ODPs
are not highly prevalent in ontologies of the biomedical domain. It has been noted
before that an ODP’s required language expressivity is outside of the popular
EL profile many biomedical ontologies conform to [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Moreover, it seems that
a fair amount of published ODPs seem to propose property centric modelling
approaches whereas ontologies in the biomedical domain follow a concept centric
design.
      </p>
      <p>Since a high percentage of ontologies do not contain at least the same number
of axioms or axioms types of a given ODP, it is unsurprising to find a limited
number of candidates under the SubstitutionContainmentCheck. Likewise,
it is equally unsurprising to find a limited number of candidates under the
SubstitutionEntailmentCheck because a fair number of ODPs make use of
modelling techniques that are not expressible in the EL profile to which a lot of
biomedical ontologies conform.</p>
      <p>Given the above observation with respect to axiom types and differences in
language requirements, we considered to relax the conditions of our substitution
checks. Instead of requiring a substitution for all axioms  ∈ , we only require
a substitution for some subset  ⊆  such that  ( ) ∈  holds for all  ∈ .
Essentially, this corresponds to some notion of a partial reuse of . Allowing for
arbitrary subsets  ⊆  results in the generation of a large amount of spurious
data due to our liberal lexical association procedure. Imposing some lower bound
on the size of  is not straightforward as an ODP’s  is often quite small to begin
with. Limiting the search space for lexical associations in the target ontology
 by some heuristics seems to be the most promising approach. For example,
given a match between some  ∈ ̃︀ and ′ ∈ ̃︀, limit the search for further lexical
associations of elements in ̃︀ to the set { ∈  | ′ ∈ ̃︀} and proceed recursively.
However, slight variations in heuristic search strategies result in drastic effects
for the number of generated lexical associations. Overall, generating meaningful
data for partial reuse of a given ODP’s  turns out to be a non-trivial research
endeavour in and of itself.
6.1</p>
      <sec id="sec-5-1">
        <title>Limitations</title>
        <p>
          Despite our intention to maximise the recall of our detection mechanism, there
are a few limitations. Some patterns in our corpus are not intended to be directly
reused via some reusable component . The ODP UpperLevelOntology [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] is
such an example. This pattern motivates to align a given ontology to a chosen
upper level ontology. Since all our detection techniques are agnostic to influences
of upper level ontologies and only target lexical as well as structural modelling
features, the prevalence of ontologies aligned to upper level ontologies is not
determined and our negative results are inconclusive.
        </p>
        <p>Another limitation is the manner in which we try to establish lexical associations
between entities of ODPs and entities of domain ontologies. Entities of ODPs
are arguably of general nature and might not easily associated with domain
specific entities on a purely lexical basis. Instead, one might need to consider
lexical relationships based on hyponyms and hypernyms. However, doing so would
require an overall sophisticated lexical matching procedure to prevent spurious
associations.
6.2</p>
      </sec>
      <sec id="sec-5-2">
        <title>Related Work</title>
        <p>
          Ontology enrichment has motivated one of the first attempts to automatically
identify the use of ODPs in ontologies [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. Here, it is argued that the identification
of partial instantiations of ODPs may allow for ontology refinement by completing
the missing parts of a pattern. The proposed detection mechanism heavily depends
on a lexical association procedure that is based on a number of heuristics. However,
a large scale evaluation of the proposed techniques is left for future work.
WordNet has been considered to provide background knowledge for establishing
lexical associations between entities of ODPs and domain ontologies [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. A
detection mechanism primarily based on lexical string matching is proposed and a
first empirical evaluation is performed. Contrary to our findings, a large number
of frequently detected ODPs is reported. However, the produced results are
described as “probably not reliable”. The background knowledge from WordNet
seems to produce spurious results and skews the data towards patterns including
a certain signature.
        </p>
        <p>
          Query languages such as SPARQL and OPPL have been considered to probe an
ontology for structural aspects of ODPs. Work on detection mechanisms to combine
both lexical and structural aspects of ODPs is still preliminary [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ]. However,
a large scale study using a structural detection mechanism that deliberately
disregards notions of ODP reuse under lexical variations has been conducted [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. A
small number of structurally simple ODPs are reported to be reused in biomedical
ontologies. Otherwise, little evidence for ODPs reuse in biomedical ontologies is
found.
7
        </p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>
        The results of our empirical evaluation corroborate the findings of previous studies
to the extent that there is only scant evidence for influences of ODPs in the
biomedical domain [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. Even liberal notions for ODP reuse which can only be
considered suggestive of a given ODP’s influence do not allow for a different
conclusion. While this negative finding appears unconstructive, we will qualify its
implications in light of the nature of our chosen detection techniques.
      </p>
      <p>The results of our AxiomTypeCheck and DisjointUnionCheck indicate that
modelling solutions proposed by ODPs differ significantly compared with ontologies
authored by practitioners in the biomedical domain. The design of most biomedical
ontologies are concept centric and do not contain a lot of disjoint unions, whereas
the design of ODPs published in catalogues (1) and (2) place an emphasize on
roles and disjoint unions respectively. The lack of evidence under the IRICheck
also shows that ODPs are not partially reused by omission of unwanted axioms.
Moreover, even in cases in which there is some evidence that practitioners in the
biomedical domain are aware of ODPs, they seem to limit the reuse of ODPs to
the realm of annotations as the data collected by our NamespaceCheck suggests.</p>
      <p>Overall, it seems that currently, ODPs do not provide solutions for common
ontology design tasks in the biomedical domain.</p>
      <p>
        These findings can serve as a motivation for a data driven approach to
automatically generate or at least inform the development of practically relevant
ODPs. In such a scenario, detection techniques such as the ones presented in this
paper can serve as some kind of quality measure. The desire for such work has
already been expressed [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <source>BioPortal Snapshot</source>
          <volume>30</volume>
          .
          <fpage>03</fpage>
          .
          <year>2017</year>
          . https://zenodo.org/record/439510# .XKK-Nt-
          <source>YVhE (Accessed on 30.04</source>
          .
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Nary</surname>
          </string-name>
          <article-title>Relationship pattern</article-title>
          . http://odps.sourceforge.net/odp/html/Nary_ Relationship.
          <source>html (Accessed on 24.05</source>
          .
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Ontology</given-names>
            <surname>Design Patterns (ODPs) Public</surname>
          </string-name>
          <article-title>Catalog</article-title>
          . http://odps.sourceforge. net/odp/html/index.
          <source>html (Accessed on 24.05</source>
          .
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Semantic</surname>
          </string-name>
          <article-title>Web portal dedicated to ontology design patterns</article-title>
          . http:// ontologydesignpatterns.
          <source>org (Accessed on 24.05</source>
          .
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Upper</given-names>
            <surname>Level</surname>
          </string-name>
          Ontology Pattern. http://odps.sourceforge.net/odp/html/Upper_ Level_Ontology.
          <source>html (Accessed on 24.05</source>
          .
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <article-title>6. OWL 2 Web Ontology Language</article-title>
          . Structural Specification and
          <string-name>
            <surname>Functional-Style Syntax (Second Edition)</surname>
          </string-name>
          .
          <year>2012</year>
          . http://www.w3.org/TR/owl2-syntax
          <source>/ (Accessed on 24.05</source>
          .
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Eva</given-names>
            <surname>Blomqvist</surname>
          </string-name>
          .
          <article-title>Ontology Patterns: Typology and Experiences from Design Pattern Development</article-title>
          .
          <source>In The Swedish AI Society Workshop May</source>
          <volume>20</volume>
          -21; 2010; Uppsala University, number
          <volume>048</volume>
          , pages
          <fpage>55</fpage>
          -
          <lpage>64</lpage>
          . Linköping University Electronic Press,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>Eva</given-names>
            <surname>Blomqvist</surname>
          </string-name>
          and
          <string-name>
            <given-names>Kurt</given-names>
            <surname>Sandkuhl</surname>
          </string-name>
          .
          <article-title>Patterns in Ontology Engineering-Classification of Ontology Patterns</article-title>
          .
          <source>In ICEIS 2005: proceedings of the Seventh International Conference on Enterprise Information Systems</source>
          , Miami, USA, May
          <volume>25</volume>
          -28,
          <year>2005</year>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Peter</given-names>
            <surname>Clark</surname>
          </string-name>
          .
          <article-title>Knowledge Patterns</article-title>
          .
          <source>In EKAW</source>
          , volume
          <volume>5268</volume>
          of Lecture Notes in Computer Science, pages
          <fpage>1</fpage>
          -
          <lpage>3</lpage>
          . Springer,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Ricardo de Almeida Falbo</surname>
            , Giancarlo Guizzardi, Aldo Gangemi, and
            <given-names>Valentina</given-names>
          </string-name>
          <string-name>
            <surname>Presutti</surname>
          </string-name>
          . Ontology Patterns:
          <article-title>Clarifying Concepts and Terminology</article-title>
          .
          <source>In WOP</source>
          , volume
          <volume>1188</volume>
          <source>of CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>Aldo</given-names>
            <surname>Gangemi</surname>
          </string-name>
          .
          <article-title>Ontology Pesign Patterns for Semantic Web Content</article-title>
          . In International Semantic Web Conference, pages
          <fpage>262</fpage>
          -
          <lpage>276</lpage>
          . Springer,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>Giancarlo</given-names>
            <surname>Guizzardi</surname>
          </string-name>
          .
          <article-title>Theoretical Foundations and Engineering Tools for Building Ontologies as Reference Conceptual Models</article-title>
          .
          <source>Semantic Web</source>
          ,
          <volume>1</volume>
          (
          <issue>1</issue>
          -2):
          <fpage>3</fpage>
          -
          <lpage>10</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Karl</surname>
            <given-names>Hammar</given-names>
          </string-name>
          , Eva Blomqvist, David Carral, Marieke van Erp,
          <string-name>
            <surname>Antske Fokkens</surname>
          </string-name>
          , Aldo Gangemi, Willem Robert van Hage,
          <string-name>
            <surname>Pascal Hitzler</surname>
            , Krzysztof Janowicz, Nazifa Karima, Adila Krisnadhi, Tom Narock, Roxane Segers, Monika Solanki, and
            <given-names>Vojtech</given-names>
          </string-name>
          <string-name>
            <surname>Svátek</surname>
          </string-name>
          .
          <source>Collected Research Questions Concerning Ontology Design Patterns. In Ontology Engineering with Ontology Design Patterns</source>
          , volume
          <volume>25</volume>
          <source>of Studies on the Semantic Web</source>
          , pages
          <fpage>189</fpage>
          -
          <lpage>198</lpage>
          . IOS Press,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>Karl</given-names>
            <surname>Hammar</surname>
          </string-name>
          and
          <string-name>
            <given-names>Valentina</given-names>
            <surname>Presutti</surname>
          </string-name>
          .
          <article-title>Template-Based Content ODP Instantiation</article-title>
          .
          <source>In The 7th Workshop on Ontology and Semantic Web Patterns</source>
          . IOS Press,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Matthew</surname>
            <given-names>Horridge</given-names>
          </string-name>
          , Mikel Egaña Aranguren, Jonathan Mortensen,
          <string-name>
            <given-names>Mark A.</given-names>
            <surname>Musen</surname>
          </string-name>
          , and Natalya Fridman Noy.
          <article-title>Ontology Design Pattern Language Expressivity Requirements</article-title>
          .
          <source>In WOP</source>
          , volume
          <volume>929</volume>
          <source>of CEUR Workshop Proceedings. CEURWS.org</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Chih-Sheng Johnson</surname>
            <given-names>Hou</given-names>
          </string-name>
          , Natalya Fridman Noy, and
          <string-name>
            <given-names>Mark A.</given-names>
            <surname>Musen</surname>
          </string-name>
          .
          <article-title>A TemplateBased Approach Toward Acquisition of Logical Sentences</article-title>
          .
          <source>In Intelligent Information Processing</source>
          , volume
          <volume>221</volume>
          <source>of IFIP Conference Proceedings</source>
          , pages
          <fpage>77</fpage>
          -
          <lpage>89</lpage>
          . Kluwer,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17. Muhammad Tahir Khan and
          <string-name>
            <given-names>Eva</given-names>
            <surname>Blomqvist</surname>
          </string-name>
          .
          <article-title>Ontology Design Pattern DetectionInitial Method and Usage Scenarios</article-title>
          .
          <source>In SEMAPRO 2010, The Fourth International Conference on Advances in Semantic Processing</source>
          , pages
          <fpage>19</fpage>
          -
          <lpage>24</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Jonathan</surname>
            <given-names>Mortensen</given-names>
          </string-name>
          , Matthew Horridge,
          <string-name>
            <given-names>Mark A.</given-names>
            <surname>Musen</surname>
          </string-name>
          , and Natalya Fridman Noy.
          <article-title>Modest Use of Ontology Design Patterns in a Repository of Biomedical Ontologies</article-title>
          .
          <source>In WOP</source>
          , volume
          <volume>929</volume>
          <source>of CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Nadejda</surname>
            <given-names>Nikitina</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Sebastian</given-names>
            <surname>Rudolph</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Sebastian</given-names>
            <surname>Blohm</surname>
          </string-name>
          .
          <article-title>Refining Ontologies by Pattern-Based Completion</article-title>
          .
          <source>In WOP</source>
          , volume
          <volume>516</volume>
          <source>of CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <given-names>Valentina</given-names>
            <surname>Presutti</surname>
          </string-name>
          and
          <string-name>
            <given-names>Aldo</given-names>
            <surname>Gangemi</surname>
          </string-name>
          .
          <article-title>Content Ontology Design Patterns as Practical Building Blocks for Web Ontologies</article-title>
          .
          <source>In International Conference on Conceptual Modeling</source>
          , pages
          <fpage>128</fpage>
          -
          <lpage>141</lpage>
          . Springer,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Valentina</surname>
            <given-names>Presutti</given-names>
          </string-name>
          , Aldo Gangemi, Stefano David, G. Aguado de Cea,
          <string-name>
            <surname>Mari-Carmen</surname>
            Suárez-Figueroa, Elena Montiel-Ponsoda, and
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Poveda</surname>
          </string-name>
          .
          <source>D2</source>
          .
          <article-title>5.1: A Library of Ontology Design Patterns: reusable solutions for collaborative design of networked ontologies</article-title>
          . (Available at: http://www.neon-project.
          <source>org/)</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22. Jacqueline Renée Reich.
          <article-title>Onthological Design Patterns for the Integration of Molecular Biological Information</article-title>
          . In German Conference on Bioinformatics, pages
          <fpage>156</fpage>
          -
          <lpage>166</lpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23. Fabiano Borges Ruy, Cássio Chaves Reginato, Victor Amorim dos Santos, Ricardo de Almeida Falbo, and
          <string-name>
            <given-names>Giancarlo</given-names>
            <surname>Guizzardi</surname>
          </string-name>
          .
          <article-title>Ontology Engineering by Combining Ontology Patterns</article-title>
          .
          <source>In ER</source>
          , volume
          <volume>9381</volume>
          of Lecture Notes in Computer Science, pages
          <fpage>173</fpage>
          -
          <lpage>186</lpage>
          . Springer,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Stefen</surname>
            <given-names>Staab</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Michael</given-names>
            <surname>Erdmann</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Alexander</given-names>
            <surname>Maedche</surname>
          </string-name>
          .
          <article-title>Engineering Ontologies using Semantic Patterns</article-title>
          .
          <source>In OIS@IJCAI</source>
          , volume
          <volume>47</volume>
          <source>of CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>M. C. Suárez-Figueroa</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Brockmans</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Gangemi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Gómez-Pérez</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Lewen</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Presutti</surname>
            , and
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Sabou</surname>
            . D 5.1.1 NeOn
            <given-names>Modelling</given-names>
          </string-name>
          <string-name>
            <surname>Components</surname>
          </string-name>
          . (Available at: http://www.neon-project.
          <source>org)</source>
          ,
          <year>March 2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Ondrej</surname>
            Sváb-Zamazal,
            <given-names>François</given-names>
          </string-name>
          <string-name>
            <surname>Scharfe</surname>
            , and
            <given-names>Vojtech</given-names>
          </string-name>
          <string-name>
            <surname>Svátek</surname>
          </string-name>
          .
          <article-title>Preliminary Results of Logical Ontology Pattern Detection using SPARQL and Lexical Heuristics</article-title>
          .
          <source>In WOP</source>
          , volume
          <volume>516</volume>
          <source>of CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <given-names>Vojtěch</given-names>
            <surname>Svátek</surname>
          </string-name>
          .
          <article-title>Design Patterns for Semantic Web Ontologies: Motivation and Discussion</article-title>
          .
          <source>In In: 7 th Conf. on Business Information Systems (BIS-04)</source>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <given-names>Denny</given-names>
            <surname>Vrandecic</surname>
          </string-name>
          .
          <article-title>Explicit knowledge engineering patterns with macros</article-title>
          .
          <source>In Proceedings of the Ontology Patterns for the Semantic Web Workshop a the ISWC</source>
          <year>2005</year>
          , Galway, Ireland,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>