<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Identifying Missing Hierarchical Relations in SNOMED CT from Logical Definitions Based on the Lexical Features of Concept Names</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Olivier Bodenreider</string-name>
          <email>olivier.bodenreider@nih.gov</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>U.S. National Library of Medicine National Institutes of Health Bethesda</institution>
          ,
          <addr-line>Maryland</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>-Objectives. To identify missing hierarchical relations in SNOMED CT from logical definitions based on the lexical features of concept names. Methods. We first create logical definitions from the lexical features of concept names, which we represent in OWL EL. We infer hierarchical (subClassOf) relations among these concepts using the ELK reasoner. Finally, we compare the hierarchy obtained from lexical features to the original SNOMED CT hierarchy. We review the differences manually for evaluation purposes. Results. Applied to 15,833 disorder and procedure concepts, our approach identified 559 potentially missing hierarchical relations, of which 78% were deemed valid. Conclusions. This lexical approach to quality assurance is easy to implement, efficient and scalable.</p>
      </abstract>
      <kwd-group>
        <kwd>description assurance</kwd>
        <kwd>lexical features</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>SNOMED
CT;
quality</p>
    </sec>
    <sec id="sec-2">
      <title>I. INTRODUCTION</title>
      <p>
        Quality assurance of large biomedical terminologies
remains an active area of research [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. For example, recent
investigations of SNOMED CT have highlighted issues in its
hierarchical structure and demonstrated their detrimental
consequences (e.g., [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]).
      </p>
      <p>
        Both lexical features and logical definitions have been used
for quality assurance purposes. Approaches based on lexical
features generally exploit the presence of specific words in
SNOMED CT terms or contrast sets of words for terms across
concepts to suggest relations among concepts (e.g., [
        <xref ref-type="bibr" rid="ref3 ref4 ref5 ref6">3-6</xref>
        ]). For
example, the concepts Asthma and Acute asthma can be
represented by the sets of words {asthma} and {acute,
asthma}, respectively. Since {asthma} is a proper subset of
{acute, asthma}, the principles of lexical semantics suggest
that Acute asthma is more specific than Asthma [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
Approaches based on logical definitions often rely on a
description logics reasoner for analyzing the facts in the
ontology (e.g., [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]). The logical definitions found in SNOMED
CT are sets of axioms (facts), i.e., logical statements relating
concepts through “roles” (relationships), representing
biomedical knowledge. For example, the axiom “Acute
asthma, Clinical Course, Sudden onset AND/OR short
duration” is part of the logical definition of Acute asthma and
provides a formal representation of the acute aspect of the
disease. Although logical definitions generally rely on
knowledge associated with concepts, we exploit the fact that
such definitions can also be created from lexical features.
      </p>
      <p>The objective of this investigation is to identify missing
hierarchical relations in SNOMED CT from logical definitions
based on the lexical features of concept names. More
specifically, we propose to leverage description logics for
representing the lexical features of concept names and infer
hierarchical relations based on these lexical features with a
reasoner. The hierarchical relations inferred from lexical
features but not present in SNOMED CT are candidates for
missing relations.</p>
    </sec>
    <sec id="sec-3">
      <title>II. BACKGROUND</title>
      <sec id="sec-3-1">
        <title>A. SNOMED CT</title>
        <p>Developed by the International Health Terminology
Standard Development Organization (IHTSDO), SNOMED
CT is the world’s largest clinical terminology. With 320,000
active concepts, it provides broad coverage of clinical
medicine, including findings, diseases, and procedures for use
in electronic medical records [9].</p>
        <p>SNOMED CT provides a preferred name and synonyms for
each concept (“descriptions” in SNOMED CT parlance). The
“fully specified name” is guaranteed to be unique for each
concept and consists of the preferred term followed by a
semantic tag (e.g., Blepharorrhaphy (procedure) (388008)). In
addition to names, all concept have a logical definition, based
on definitional characteristics of the concept (not on the lexical
features of the concept names). For example,</p>
      </sec>
      <sec id="sec-3-2">
        <title>Class: Blepharorrhaphy</title>
        <p>EquivalentTo:</p>
      </sec>
      <sec id="sec-3-3">
        <title>Suture of eyelid</title>
        <p>and (Method some Closure - action)
and (Procedure site - Direct some Structure of
palpebral fissure)</p>
        <p>and (Using device some Surgical suture, device)</p>
        <p>In SNOMED CT, the logical definitions are processed with
a description logic reasoner for consistency validation and to
generate the hierarchical structure by inferring subClassOf
relations among the concepts.</p>
        <p>The version of SNOMED CT used in this work is the U.S.
edition dated March 2016.</p>
      </sec>
      <sec id="sec-3-4">
        <title>B. Description logics</title>
        <p>
          Description logics (DL) are a family of knowledge
representation languages often used as ontology languages, and
defined as a trade-off between expressivity and tractability
[
          <xref ref-type="bibr" rid="ref9">10</xref>
          ]. Reasoners are computer programs that can check the
consistency of the facts asserted in the ontology and infer
relations among ontology classes based on these facts (i.e.,
infer hierarchical (subClassOf) relations).
        </p>
        <p>
          Among the various flavors of DL languages available, the
EL family offers sufficient expressivity for the simple
definitions resulting from lexical features, as well as scalability
to a large number of classes [
          <xref ref-type="bibr" rid="ref10">11</xref>
          ]. The reasoners developed for
EL (e.g., ELK [
          <xref ref-type="bibr" rid="ref11">12</xref>
          ]) offer impressive performance.
        </p>
        <p>As illustrated above, SNOMED CT relies on DL for
representing the logical definitions it provides for its concepts.
It also makes use of a reasoner for testing the consistency of
these definitions across the whole ontology, as well as for
inferring the hierarchy of concepts. In this work, we apply the
reasoner not to the logical definitions provided by SNOMED
CT to represent biomedical knowledge, but rather to the
definitions we generate from the lexical features of the terms of
SNOMED CT concepts.</p>
      </sec>
      <sec id="sec-3-5">
        <title>C. Quality assurance of biomedical ontologies</title>
        <p>
          Approaches to quality assurance in biomedical ontologies
can be classified into lexical, structural and semantic
approaches [
          <xref ref-type="bibr" rid="ref12">13</xref>
          ]. Lexical approaches rely on the lexical
features of terms; structural approaches analyze the
hierarchical structure of ontologies; and semantic approaches
exploit the relations among concepts (including logical
definitions). Examples of lexical and semantic approaches
applied to quality assurance in SNOMED CT were presented
earlier in the introduction. (Structural approaches are less
relevant to this work and will not be discussed here.)
        </p>
        <p>Of note, while DL techniques are generally used in the
context of semantic approaches, in this work, we leverage a DL
reasoner for the implementation of a lexical approach to QA,
since our logical definitions are created on the basis of lexical
features.</p>
        <p>
          The compositionality of terms in biomedical ontologies is
well documented and has been exploited for quality assurance
purposes (e.g., [
          <xref ref-type="bibr" rid="ref13 ref14">14, 15</xref>
          ]). However, Mungall used ad hoc
programming (in Prolog) rather than a DL reasoner to infer
relations among terms. Our approach is also much simpler in
that it only relies on sets of words and only attempts to elicit
hierarchical relations.
        </p>
      </sec>
      <sec id="sec-3-6">
        <title>D. Specific contribution</title>
        <p>The specific contribution of this work is not in leveraging
the compositionality of biomedical terms for suggesting
relations, but rather in proposing a description logics approach
to doing so. While ad hoc programming is usually necessary
for comparing bags of words, our work demonstrates it can
also be supported effectively by a DL reasoner. To our
knowledge, this is the first attempt to generate logical
definitions based on the lexical features of concept names in
SNOMED CT for quality assurance purposes.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>III. METHODS</title>
      <p>Our method for identifying missing hierarchical relations
from SNOMED CT can be summarized as follows. We first
create logical definitions from the lexical features of concept
names, which we represent in the web ontology language,
OWL. We infer hierarchical (subClassOf) relations among
these concepts using a reasoner. Finally, we compare the
hierarchy obtained from lexical features to the original
SNOMED CT hierarchy. We review the differences manually
for evaluation purposes. In this preliminary investigation, we
applied this approach to a significant subset of the Clinical
Finding hierarchy rooted with the concept Disorder of head
(disorder) (118934005) and a smaller subset of the Procedure
hierarchy rooted with the concept Operative procedure on
head (procedure) (89901005).</p>
      <sec id="sec-4-1">
        <title>A. Creating logical definitions based on the lexical features of concept names</title>
        <p>
          For each concept under investigation, we extract the fully
specified name, which consists of the preferred term (e.g.,
“Disorder of head”) followed by a semantic tag in parentheses
(e.g. “disorder”). For each concept C with fully specified name
“w1 w2 … wn (T)”, where {w1, w2, … wn} is the set of words in
the preferred term and where T is the semantic tag, we create a
logical definition of the following form (expressed in the
simplified OWL syntax known as Manchester syntax [
          <xref ref-type="bibr" rid="ref15">16</xref>
          ]):
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Class: C EquivalentTo:</title>
      <p>T
and (has_word some w1)
and (has_word some w2)
…
and (has_word some wn)</p>
      <p>For example, the class definition for the concept Complete
ablepharon (disorder) (708541009) is shown in Fig. 1.</p>
      <p>In practice, we use a simple script to create an OWL file
that contains the class definitions for all the concepts under
investigation. The words “the” and “of”, present in a large
proportion of terms, are omitted when generating the class
definitions.</p>
      <p>
        Of note, the OWL constructs used in these definitions
(namely class equivalence and existential quantification to a
class expression) are compatible with the OWL 2 EL profile
[
        <xref ref-type="bibr" rid="ref10">11</xref>
        ].
      </p>
      <p>Fig. 2. Asserted hierarchy – Ablepharon (disorder) prior to running the
reasoner (no inferred subclasses)</p>
      <sec id="sec-5-1">
        <title>B. Inferring subClassOf relations from lexical features</title>
        <p>
          We load this OWL file in the Protégé ontology editor (5.0
beta), in which we have installed the plugin for the ELK
reasoner [
          <xref ref-type="bibr" rid="ref11">12</xref>
          ], specially optimized for classifying OWL 2 EL
ontologies. Prior to running the reasoner, the SNOMED CT
concepts imported into Protégé appear as a flat list (i.e., with
no hierarchical structure) under the classes created for the
semantic tags (Fig. 2). After ELK has run, inferred subClassOf
axioms among the SNOMED CT concepts have been added to
the ontology and the concepts are no longer displayed as a flat
list (Fig. 3). For example, the three concepts Ablepharon
(disorder) (13401001), Complete ablepharon (disorder)
(708541009), and Partial ablepharon (disorder) (45484000)
are listed under disorder in the asserted hierarchy (Fig. 2), but
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>Complete ablepharon (disorder) and Partial ablepharon</title>
        <p>(disorder) are subclasses of Ablepharon (disorder) in the
inferred hierarchy (Fig. 3).</p>
        <p>Since the subClassOf relations are inferred from lexical
features, we need to filter out complex terms with prepositional
phrases to avoid generating wrong subClassOf relations. For
example, for Dementia due to Parkinson's disease (disorder)
(101421000119107), a subClassOf relation is inferred to both</p>
      </sec>
      <sec id="sec-5-3">
        <title>Dementia (disorder) (52448006) and Parkinson's disease</title>
        <p>(disorder) (49049000). Similarly, for Goniopuncture without
goniotomy (procedure) (202727004), a subClassOf relation is
inferred to both Goniopuncture (procedure) (265293008) and
Goniotomy (procedure) (265292003). While this behavior is
expected from the reasoner, it is not desirable, because</p>
      </sec>
      <sec id="sec-5-4">
        <title>Dementia due to Parkinson's disease (disorder) is not a kind of</title>
      </sec>
      <sec id="sec-5-5">
        <title>Parkinson's disease (disorder) as suggested by the</title>
        <p>prepositional expression “due to”. Similarly, Goniopuncture
without goniotomy (procedure) specifically excludes
Goniotomy (procedure). In practice, to avoid generating such
wrong subClassOf relations, we filter out the relations
generated when the name of the most specific (“child”) concept
contains any of the following words: “and”, “or”, “and/or”,
“with”, “without”, “from”, “due to”, “secondary to”, “except”,
“by”, “after”, “revision” and “ligation for”.</p>
      </sec>
      <sec id="sec-5-6">
        <title>C. Comparing the hierarchy inferred from lexical features to the original hierarchy</title>
        <p>To analyze which relations from the inferred hierarchy are
not already in the original SNOMED CT hierarchy (i.e., the
hierarchy found in the SNOMED CT distribution), we need to
generate these two sets of hierarchical relations and compute
the difference between them. Using Protégé, we export the
inferred subClassOf axioms to a file in RDF format for
comparison to the original hierarchical relations in SNOMED
CT. Using a simple script, we write the original hierarchical
relations in SNOMED CT to RDF for the subhierarchies under
investigation. In practice, because the inferred relations can be
between any two classes, we enrich the original hierarchy with
the transitive closure of subClassOf relations. We load the files
for the two sets of relations, inferred and original, into the
triple store Virtuoso and use a SPARQL query to compute the
set of hierarchical relations from the inferred set that is not part
of the hierarchical relations originally in SNOMED CT
(transitively closed). The SPARQL 1.1 operator MINUS
makes such comparison between two graphs extremely easy.</p>
      </sec>
      <sec id="sec-5-7">
        <title>D. Evaluation</title>
        <p>We manually review for validity a random subset of 100
inferred relations that are not present in the original SNOMED
CT hierarchy (transitively closed).</p>
        <p>IV. RESULTS</p>
      </sec>
      <sec id="sec-5-8">
        <title>A. Creating logical definitions based on the lexical features of concept names</title>
        <p>We created logical definitions based on the lexical features
of concept names for the 12,088 concepts (4871 distinct words)
of the subhierarchy rooted with the concept Disorder of head
(disorder) (118934005) and for the 3795 concepts (1899
distinct words) of the subhierarchy rooted with the concept</p>
      </sec>
      <sec id="sec-5-9">
        <title>Operative procedure on head (procedure) (89901005).</title>
      </sec>
      <sec id="sec-5-10">
        <title>B. Inferring subClassOf relations from lexical features</title>
        <p>Running the ELK reasoner took a few seconds and resulted
in the creation of 7079 inferred subClassOf relations among
the concepts of the subhierarchy rooted with the concept
Disorder of head (disorder). Similarly, 1357 relations were
inferred in the subhierarchy rooted with the concept Operative
procedure on head (procedure).</p>
      </sec>
      <sec id="sec-5-11">
        <title>C. Comparing the hierarchy inferred from lexical features to the original hierarchy</title>
        <p>After subtracting from the inferred subClassOf relations
created by the reasoner those subClassOf relations already
present in the original version of SNOMED CT (transitively
closed), we obtained 1210 inferred subClassOf relations for the</p>
      </sec>
      <sec id="sec-5-12">
        <title>Disorder of head (disorder) hierarchy and 242 inferred</title>
        <p>subClassOf relations for the Operative procedure on head
(procedure) hierarchy. Of these, 469 subClassOf relations for
disorders and 90 for procedures met our criteria for review
(i.e., the name of the child concept does not contain any of the
prepositional and other expressions listed earlier).</p>
      </sec>
      <sec id="sec-5-13">
        <title>D. Evaluation</title>
        <p>The random subset of 100 inferred subClassOf relations we
reviewed comprises 83 disorders and 17 procedures. Overall,
78 relations were deemed valid, 19 invalid and 3 questionable
(i.e., these relations seem to have face validity, but may not be
compliant with SNOMED CT editorial policies). Examples of
such relations are listed in Table I.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>V. DISCUSSION</title>
      <sec id="sec-6-1">
        <title>A. Findings</title>
        <p>As expected, a vast majority of the hierarchical relations
suggested lexically were already present in the original
SNOMED CT hierarchy (transitively closed). Specifically,
only 1210 of the 7079 hierarchical relations for disorders
(17%) and 242 of the 1357 hierarchical relations for procedures
(18%) were not already represented in SNOMED CT.</p>
        <p>However, it was somewhat surprising to us to see that a
large number of potentially missing hierarchical relations had
been generated from this simple technique based on lexical
features. Assuming 80% of the 559 hierarchical relations
generated are correct, we discovered 447 missing hierarchical
relations among the 15,883 concepts under investigation.
Interestingly, the proportion is roughly the same for disorders
and procedures.</p>
        <p>In addition to the evaluation, we performed a cursory
review of the 559 potentially missing hierarchical relations,
among which we identified a few patterns. In 31 cases, the
missing relation was between “carcinoma in situ of &lt;some
anatomical structure&gt;” and “carcinoma of &lt;some anatomical
structure&gt;” (or “&lt;some anatomical structure&gt; carcinoma”), for
example, between Carcinoma in situ of palate (disorder)
(92670007) and Palate carcinoma (disorder) (274084007).
Another such patterns was found in 23 cases between
“congenital &lt;some disorder&gt;” and the unqualified disorder, for
example, between Congenital anterior staphyloma (disorder)
(253230008) and Anterior staphyloma (disorder) (231888000).</p>
      </sec>
      <sec id="sec-6-2">
        <title>B. Technical significance</title>
        <p>The novel aspect of this work is to use a DL approach to
lexical similarity. In practice, it means that no ad hoc
programming is required for identifying partial ordering
relations among sets of words for terms in an ontology
reflecting hierarchical relations among the corresponding
concepts. Instead, logical definitions created from lexical
features can simply be represented in DL formalism and run
through a reasoner to infer the relevant subClassOf relations.
As shown here, this approach is easy to implement, efficient
and scalable. The only programming required is for serializing
the logical definitions in the appropriate DL format.</p>
        <p>Moreover, given that SNOMED CT already uses DL
techniques for representing its logical definitions based on
biomedical knowledge and an EL reasoner for inferring its
hierarchy, it can be expected that the IHTSDO could easily
integrate the lexical approach to quality assurance proposed
here.</p>
        <p>Finally, having two kinds of logical definitions (from
biomedical knowledge and from lexical features) represented
in the same formalism would make it possible to integrate them
into the same framework, for example to test the consistency
between the two kinds of definitions.</p>
      </sec>
      <sec id="sec-6-3">
        <title>C. Limitations and future work</title>
        <p>
          This preliminary investigation is limited to two
subhierarchies of SNOMED CT for diseases and procedures.
However, we also generated definitions and inferred hierarchy
for the whole SNOMED CT and did not notice any scalability
issues. We did not leverage SNOMED CT synonyms for
creating logical definitions, but this should be a natural
extension of this investigation. In future work, we also would
like to normalize terms before creating the definitions, since
normalization is common approach to managing term variation
[
          <xref ref-type="bibr" rid="ref16">17</xref>
          ].
        </p>
        <p>
          This bag-of-word approach to comparing terms tends to
generate more false positives than a linguistically motivated
approach, where the head of the noun phrase would be required
to be the same in two hierarchically related concepts, as we did
in other work [
          <xref ref-type="bibr" rid="ref17">18</xref>
          ]. In fact, many of the errors detected during
the evaluation correspond to cases where the specific term is
linked to a term that does not contain the head of the noun
phrase of the specific term. However, the bag-of-word
approach is much easier to implement than linguistically
motivated approaches, and we showed that false positives can
be mitigated in part by filtering out complex terms.
        </p>
        <p>In this preliminary investigation, we performed a limited
evaluation. Given the encouraging results, we plan to extend
the investigation to the entirety of SNOMED CT, evaluate the
results more thoroughly, and share them with the SNOMED
CT developers at the IHTSDO.</p>
        <p>
          Finally, the lexical approach to quality assurance proposed
here could also complement structural approaches, such as the
lattice-based approach we proposed earlier [
          <xref ref-type="bibr" rid="ref18">19</xref>
          ].
        </p>
      </sec>
      <sec id="sec-6-4">
        <title>D. Generalization</title>
        <p>This approach to identifying missing hierarchical relations
would be applicable not only to the entirety of SNOMED CT,
but to other biomedical ontologies as well. More specifically, it
could be applied to any biomedical ontology for which concept
names and hierarchical relations are available (i.e., most
ontologies). The same approach could also be applied to the
creation of partial mappings.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>ACKNOWLEDGMENT</title>
      <p>This work was supported by the Intramural Research
Program of the NIH, National Library of Medicine. This work
was conducted using the Protégé resource, which is supported
by grant GM10331601 from the National Institute of General
Medical Sciences of the United States National Institutes of
Health. We would like to thank Dr. GQ Zhang for providing
motivation and encouragement for this investigation.
[9] IHTSDO, “SNOMED CT,” 2016.</p>
      <p>TABLE I.</p>
      <p>EXAMPLES OF SUBCLASSOF RELATIONS INFERRED FROM LEXICAL FEATURES
yes
yes
yes
yes
yes
yes</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Geller</surname>
          </string-name>
          , et al., “
          <article-title>Special issue on auditing of terminologies,” J Biomed Inform</article-title>
          , vol.
          <volume>42</volume>
          , no.
          <issue>3</issue>
          ,
          <issue>2009</issue>
          , pp.
          <fpage>407</fpage>
          -
          <lpage>411</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.L.</given-names>
            <surname>Rector</surname>
          </string-name>
          , et al.,
          <article-title>“Getting the foot out of the pelvis: modeling problems affecting use of SNOMED CT hierarchies in practical applications</article-title>
          ,
          <source>” J Am Med Inform Assoc</source>
          , vol.
          <volume>18</volume>
          , no.
          <issue>4</issue>
          ,
          <issue>2011</issue>
          , pp.
          <fpage>432</fpage>
          -
          <lpage>440</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>O.</given-names>
            <surname>Bodenreider</surname>
          </string-name>
          , et al.,
          <article-title>“Assessing the consistency of a biomedical terminology through lexical knowledge,”</article-title>
          <source>Int J Med Inform</source>
          , vol.
          <volume>67</volume>
          , no.
          <issue>1-3</issue>
          ,
          <year>2002</year>
          , pp.
          <fpage>85</fpage>
          -
          <lpage>95</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>K.E.</given-names>
            <surname>Campbell</surname>
          </string-name>
          , et al.,
          <article-title>“A "lexically-suggested logical closure" metric for medical terminology maturity</article-title>
          ,
          <source>” Proc AMIA Symp</source>
          ,
          <year>1998</year>
          , pp.
          <fpage>785</fpage>
          -
          <lpage>789</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>E.</given-names>
            <surname>Mikroyannidi</surname>
          </string-name>
          , et al.,
          <article-title>“Analysing Syntactic Regularities and Irregularities in SNOMED-CT,” J Biomed Semantics</article-title>
          , vol.
          <volume>3</volume>
          , no.
          <issue>1</issue>
          ,
          <issue>2012</issue>
          , pp.
          <fpage>8</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>E.</given-names>
            <surname>Pacheco</surname>
          </string-name>
          , et al.,
          <article-title>“Detecting Underspecification in SNOMED CT concept definitions through natural language processing</article-title>
          ,
          <source>” AMIA Annu Symp Proc</source>
          , vol.
          <year>2009</year>
          ,
          <year>2009</year>
          , pp.
          <fpage>492</fpage>
          -
          <lpage>496</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>D.A.</given-names>
            <surname>Cruse</surname>
          </string-name>
          , Lexical semantics, Cambridge University Press,
          <year>1986</year>
          , p.
          <fpage>xiv</fpage>
          ,
          <volume>310</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>K.</given-names>
            <surname>Dentler</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Cornet</surname>
          </string-name>
          , “
          <article-title>Intra-axiom redundancies in SNOMED CT,”</article-title>
          <source>Artif Intell Med</source>
          , vol.
          <volume>65</volume>
          , no.
          <issue>1</issue>
          ,
          <issue>2015</issue>
          , pp.
          <fpage>29</fpage>
          -
          <lpage>34</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>F.</given-names>
            <surname>Baader</surname>
          </string-name>
          , et al., “Description logics,” Handbook on ontologies,
          <source>International handbooks on information systems, S. Staab and R</source>
          . Studer, eds., Springer,
          <year>2004</year>
          , pp.
          <fpage>3</fpage>
          -
          <lpage>28</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [11]
          <fpage>W3C</fpage>
          , “OWL 2
          <string-name>
            <given-names>Web</given-names>
            <surname>Ontology Language Profiles (Second Edition</surname>
          </string-name>
          ),” 2012; https://www.w3.org/TR/owl2-profiles/#OWL_2_EL.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kazakov</surname>
          </string-name>
          , et al., “
          <article-title>The Incredible ELK: From Polynomial Procedures to Efficient Reasoning with EL Ontologies,”</article-title>
          <source>Journal of Automated Reasoning</source>
          , vol.
          <volume>53</volume>
          , no.
          <issue>1</issue>
          ,
          <issue>2013</issue>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>61</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhu</surname>
          </string-name>
          , et al.,
          <article-title>“A review of auditing methods applied to the content of controlled biomedical terminologies,” J Biomed Inform</article-title>
          , vol.
          <volume>42</volume>
          , no.
          <issue>3</issue>
          ,
          <issue>2009</issue>
          , pp.
          <fpage>413</fpage>
          -
          <lpage>425</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>C.J.</given-names>
            <surname>Mungall</surname>
          </string-name>
          , “
          <article-title>Obol: integrating language and meaning in bioontologies,” Comp Funct Genomics</article-title>
          , vol.
          <volume>5</volume>
          , no.
          <issue>6-7</issue>
          ,
          <year>2004</year>
          , pp.
          <fpage>509</fpage>
          -
          <lpage>520</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>P.V.</given-names>
            <surname>Ogren</surname>
          </string-name>
          , et al., “
          <article-title>The compositional structure of Gene Ontology terms</article-title>
          ,” Pac Symp Biocomput,
          <year>2004</year>
          , pp.
          <fpage>214</fpage>
          -
          <lpage>225</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [16]
          <fpage>W3C</fpage>
          , “https://www.w3.org/TR/owl2-manchester-syntax/,”
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [17]
          <string-name>
            <surname>A.T. McCray</surname>
          </string-name>
          , et al., “
          <article-title>Lexical methods for managing variation in biomedical terminologies</article-title>
          ,
          <source>” Proc Annu Symp Comput Appl Med</source>
          Care,
          <year>1994</year>
          , pp.
          <fpage>235</fpage>
          -
          <lpage>239</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>F.</given-names>
            <surname>Dhombres</surname>
          </string-name>
          and
          <string-name>
            <given-names>O.</given-names>
            <surname>Bodenreider</surname>
          </string-name>
          , “
          <article-title>Interoperability between phenotypes in research and healthcare terminologies--Investigating partial mappings between HPO and SNOMED CT,” J Biomed Semantics</article-title>
          , vol.
          <volume>7</volume>
          ,
          <issue>2016</issue>
          , pp.
          <fpage>3</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>G.Q.</given-names>
            <surname>Zhang</surname>
          </string-name>
          and
          <string-name>
            <given-names>O.</given-names>
            <surname>Bodenreider</surname>
          </string-name>
          , “
          <article-title>Large-scale, Exhaustive Lattice-based Structural Auditing of SNOMED CT,” AMIA Annu Symp Proc</article-title>
          , vol.
          <year>2010</year>
          ,
          <year>2010</year>
          , pp.
          <fpage>922</fpage>
          -
          <lpage>926</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>