<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Quality Assurance Methodology for ChEBI Ontology Focusing on Uncommonly Modeled Concepts</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Hao Liu</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ling Chen</string-name>
          <email>lchen@bmcc.cuny.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ling Zheng</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yehoshua Perl</string-name>
          <email>perl@njit.edu</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>James Geller</string-name>
          <email>geller@njit.edu</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Borough of Manhattan Community New Jersey Institute of Technology, College CUNY Newark</institution>
          ,
          <addr-line>NJ USA New York, NY</addr-line>
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>CSSE Department Monmouth University</institution>
          ,
          <addr-line>West Long Branch, NJ</addr-line>
          <country country="US">USA</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Department of Computer Science</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>-The Chemical Entities of Biological Interest (ChEBI) ontology is an important knowledge source of chemical entities in a biological context. ChEBI is large and complex, making it almost impossible to be error-free, given the scarce resources for quality assurance (QA). We present a methodology to locate concepts in ChEBI with a high probability of being erroneous. An Abstraction Network, which provides a compact summarization of an ontology, supports the methodology. By investigating a sample of ChEBI concepts, we show that uncommonly modeled concepts residing in small units of the Abstraction Network of ChEBI are statistically significantly more likely to have errors than other concepts. The finding may guide ChEBI ontology curators to focus their limited QA resources on such concepts to achieve a better QA yield. Furthermore, this study, combined with previous work, contributes to progress in showing that this methodology can be applied to a whole family of similar ontologies.</p>
      </abstract>
      <kwd-group>
        <kwd>ChEBI</kwd>
        <kwd>chemical ontology</kwd>
        <kwd>chemical concept</kwd>
        <kwd>quality assurance</kwd>
        <kwd>modeling error</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>INTRODUCTION</p>
      <p>
        The Chemical Entities of Biological Interest (ChEBI)
ontology [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] is a large important knowledge source that
facilitates reference to chemical entities within the biological
field. It annotates small distinguishable entities such as atoms,
ions, and polymers and their relationships to each other. ChEBI
has been used to support chemical analysis. For example, a
method for determining optimal semantic similarity and
particularity thresholds was applied to the ChEBI ontology [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        Quality assurance (QA) is an essential part of the ontology
lifecycle to make sure that there is no modeling error in an
ontology [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Errors in an ontology can propagate to its
applications. However, due to limited available QA resources,
it is impossible to perform thorough quality assurance on a large
ontology like ChEBI or on a large hierarchy of an ontology, e.g.,
ChEBI’s Chemical Entity hierarchy with 106,707 concepts (in
July 2017), without automatic/semi-automatic techniques.
      </p>
      <p>
        One practical ontology QA approach is to identify sets of
concepts, which are more likely to have errors than other
concepts. Focusing audits on such sets of concepts can achieve
a high QA yield in terms of the ratio of the number of concepts
with modeling errors to the number of reviewed concepts. Thus,
the question is how can we find such sets of concepts?
The SABOC team has previously demonstrated that
Abstraction Networks are an effective tool to identify sets of
concepts in ontologies that are more likely to have errors [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
An Abstraction Network (AbN) is a compact summary of an
ontology’s content and structure, which is automatically
derived from the ontology. There are different kinds of AbNs
depending on the ontologies’ structure. The Partial-area
taxonomy (introduced in the Background) is an AbN that was
derived for several ontologies [
        <xref ref-type="bibr" rid="ref3 ref5">3, 5</xref>
        ], e.g., NCI Thesaurus (NCIt)
[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], and SNOMED CT [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>A partial-area (explained in the Background section)
represents a group of concepts with the same structure and
semantics, that is, all have the same set of relationships and are
descendants of the same concept. Concepts in partial-areas that
summarize few concepts (“small partial-areas”) expose their
uncommon modeling, which is of special interest to us.</p>
      <p>
        A modeling error in an ontology can be either an omission
error or a commission error. Examples of omissions are missing
a parent concept or missing a lateral relationship. Examples of
commission errors are an incorrect parent or an incorrect lateral
relationship. Omission errors represent knowledge in an
ontology that is not complete, while commission errors mean
that the modeling is wrong. Some ontologies, e.g. NCIt, are
intentionally modeled with incomplete knowledge. Hence,
commission errors are more severe and attract more interest
from ontology curators. A previous study [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] on a random
sample of 400 concepts in the February 2016 release of ChEBI,
found that 41.8% of the concepts exhibited errors. Thus, we
focused in this study on commission errors with the goal to find
a semi-automatic QA technique, which can identify ChEBI
concepts with a higher probability of commission errors.
      </p>
      <p>The purpose of this study is to test whether the partial-area
taxonomy-based QA methodology, focusing on concepts in
small partial-areas, improves the error yield for the large
Chemical Entity hierarchy of ChEBI.</p>
      <p>II.</p>
    </sec>
    <sec id="sec-2">
      <title>BACKGROUND</title>
      <sec id="sec-2-1">
        <title>A. ChEBI</title>
        <p>
          ChEBI [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] is maintained by the European Molecular
Biology Laboratory–European Bioinformatics Institute
(EMBL-EBI). Various applications have been developed based
on the ChEBI ontology. A prediction method [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] was proposed
to utilize information from the ChEBI ontology for identifying
drugs’ target groups. Hill et al. [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] integrated the ChEBI
structural hierarchy into the Gene Ontology to enable data
integration across the biology and chemistry domains.
        </p>
        <p>
          ChEBI is provided in the W3C standard Web Ontology
Language (OWL) and OBO formats. In this study, using the
OWL format, object properties (relationships) are used only in
restrictions. There is a stated version and an inferred version of
OWL files. ChEBI provides the stated version. The reasoner
HermiT [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] was used to get the inferred version of ChEBI.
        </p>
        <p>The ChEBI ontology consists of three hierarchies. In this
study we used concepts from the largest, the Chemical Entity
hierarchy, (98.7% of ChEBI concepts). It annotates chemical
compounds within molecular entities. Besides the IS-A
relationships, ChEBI’s Chemical Entity hierarchy has nine
lateral semantic relationship types, such as has part, has role,
and is conjugate base of. The other two hierarchies are the
Subatomic Particle and the Role hierarchy. The former
hierarchy is mainly used to categorize particles smaller than
atoms, while the Role hierarchy defines the roles of molecular
entities in different contexts. The ChEBI ontology employs a
star rating annotation: “3-star” indicates that a concept was
manually annotated by the ChEBI curator team; “2-star” means
that the concept is annotated by a third party; and “1-star”
usually represents a concept marked as deleted or obsolete.
Thus, “1-star” concepts are excluded from our research.</p>
        <p>QA requests by users of ChEBI are easily made via ChEBI’s
GitHub issue tracking system
(https://github.com/ebichebi/ChEBI/issues). ChEBI’s curators review and verify these
requests. Approved changes are made available in subsequent
releases. For example, a user may report a wrong reference to a
certain compound, or a wrong definition or relationship for a
compound, by creating an issue report on GitHub. These reports
are reviewed on a weekly or monthly basis. If the curators agree
with a request, they make the corresponding changes to ChEBI.
The volume of requests and the response time reflect the limited
QA resources of ChEBI’s curator team. At the time of this
writing, there are 289 accumulated open issues and 3175 closed
requests.</p>
      </sec>
      <sec id="sec-2-2">
        <title>B. Partial-area taxonomy</title>
        <p>
          In our previous research, Abstraction Networks (AbN) have
been proven successful for summarizing and visualizing
ontologies [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], and for supporting quality assurance of
ontologies [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. Different types of AbNs have been developed
for various ontologies. In this study, we use the partial-area
taxonomy AbN. Fig. 1 illustrates the derivation of the
partialarea taxonomy for an excerpt of concepts from ChEBI’s
Chemical Entity hierarchy.
        </p>
        <p>Fig. 1(a) depicts a subhierarchy of 17 concepts (drawn as
ellipses, labeled with their names). The arrows denote IS-A
links. Concepts with the same set of lateral relationships are
grouped together in a dashed bubble, labeled by the common
set of relationships. For example, Atom, Nonmetal atom,
Sblock element atom, Polymer, Ionic Polymer, and Polyanionic
polymer are grouped in the green bubble because they all have
only the has part relationship.</p>
        <p>
          An area taxonomy is an AbN which consists of nodes called
areas and child-of links connecting areas. An area, depicted as
a color-coded box, represents the group of concepts with the
same set of relationships, i.e. in the same bubble in Fig. 1(a).
Fig. 1(b) shows the area taxonomy for Fig. 1(a). An area is
labeled by its set of relationship types. The concepts in the green
bubble are represented compactly by the green area {has part}.
The concept Chemical Entity and its descendants in the grey
bubble are now represented by the area {Ø}. (The symbol Ø
represents the empty set.) Similarly, the concept Hydrogen
atom and its descendants are represented by the area {has part,
has role}. Areas with the same number of relationship types
have the same color and are aligned at the same level. For
example, the areas {has part, has role} and {has part, is
conjugate base of} appear in the third level, colored in blue.
Child-of links (arrows in Fig. 1(b)) are derived based on the
underlying IS-A relationships in ontologies. See [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] for further
details.
        </p>
        <p>
          Areas summarize concepts with the same structure
(relationships). An area may have multiple roots, which are
concepts such that their parents are not in the area. For example,
Atom and Polymer in the green area {has part} are roots, each
imposing its semantics on its descendants in the area. For
example, the descendants of Atom are kinds of atoms and the
descendants of Polymer are kinds of polymers. The partial-area
taxonomy is a refinement of the area taxonomy. A partial-area
is composed of an area root concept and all its descendant
concepts in the same area. The size of a partial-area is the
number of concepts in it. Each partial-area is labeled by the
name of its root concept, expressing its semantics, with its size
in parentheses. Partial-areas are shown as white boxes in Fig.
1(c). For example, Atom (3) is a partial-area in the green area
summarizing three concepts, Atom and its two children, in the
green area. A partial-area taxonomy is an AbN composed of
nodes called partial-areas and hierarchical child-of links
(arrows in Fig. 1(c)) connecting them. The compact
summarization and visual simplification provided by the
partial-area taxonomy, make it easier to identify anomalies in
modeling the ontology. An area taxonomy and a partial-area
taxonomy can be created automatically by a software tool called
Ontology Abstraction Framework (OAF) [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] available in the
NCBO BioPortal.
        </p>
        <p>III.</p>
        <p>METHODS</p>
        <p>As described above, there are two kinds of modeling errors
in an ontology: omission errors and commission errors. Table I
shows one omission error example and five commission error
examples. For example, 3-buten-1-amine in row 1 is missing
the relationship is conjugate base of (an omission error). In row
4, (S)-3-hydroxybutyric acid has an incorrect hierarchical
relationship (a commission error). In this study the term “error”
will be reserved for commission errors, unless otherwise noted,
following the preference of ontology curators described above.</p>
        <p>Given the fact that concepts within the same partial-area
share the same structure and semantics, a partial-area that
accommodates just a few concepts stands out as an “outlier,”
which often needs more QA attention. We consider concepts as
“uncommonly modeled” if they appear as outliers through the
lens of the partial-area taxonomy. The motivation for auditing
concepts in outlier sets is that if a concept is in a small
partialarea while related concepts reside in large partial-areas, this
raises suspicions about the correctness of the modeling of the
concepts in the outlier set. For example, 421 polymer concepts
with the same modeling in one partial-area appear to be
correctly modeled. However, a partial-area with only two
concepts (out of thousands of concepts in the ontology) may
indicate error(s). It is, of course, possible that these two
uncommonly modeled concepts are correct, but there is a higher
possibility that they have errors. Once any errors are corrected,
these concepts may become part of another (larger) partial-area.
We formulate the following hypothesis.</p>
        <p>Hypothesis 1: There exists a threshold value Θ
differentiating small and large partial-areas, such that concepts
in small partial-areas within the partial-area taxonomy for an
ontology have a statistically significantly higher error rate than
concepts in large partial-areas.</p>
        <p>If Hypothesis 1 is confirmed even for one threshold value,
it can be the basis for a QA methodology to guide the ChEBI
ontology curators to focus on concepts in the small partial-areas
whenever the QA resources are limited. To test the above
Hypothesis 1, 500 concepts (0.5%) were randomly selected
from the Chemical Entity hierarchy of ChEBI’s July 2017
release. Concepts were presented in random order to our
domain expert LC for review. LC is a chemistry professor with
substantial experience in chemical ontology auditing. We
analyzed the sizes of partial-areas of these 500 concepts. The
evaluation of Hypothesis 1 depends on the threshold value
differentiating small partial-areas from large ones.</p>
        <p>
          The threshold Θ may be different for different ontologies.
We consider two ways to obtain a threshold value. One is that
the threshold is predefined, based on prior experience from
other ontologies. Then we can conduct a study to test whether
we achieve statistical significance for the error rate difference
between small and large partial-areas. For example, in the study
by Zheng et al. [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] of NCIt’s Neoplasm subhierarchy, small
partial-areas were predefined as partial-areas with up to 10
concepts and large partial-areas were predefined as those with
at least 20 concepts. Partial-areas from 11 to 19 concepts were
considered medium-sized. Zheng et al. demonstrated that NCIt
Neoplasm concepts in small partial-areas have a statistically
significantly higher error rate than such concepts in large
partial-areas.
        </p>
        <p>The other way is to choose the threshold value that
maximizes the error rate difference between small partial-areas
and large ones. Hence, the threshold is determined by the study
results. In this study, we introduce this second method.
Considering the variations in terms of the numbers of concepts
for different partial-area sizes, we use the weighted average
error rate instead of the average error rate to determine the
desired threshold. We call such a threshold an optimizing
threshold, since it optimizes the difference between the
weighted error rates of the two ranges of partial-area sizes. The
weighted average error rate is calculated using formula (1),
where   is the total number of concepts in the partial-areas
with the size i.   is the commission error rate of the reviewed
concepts in partial-areas with the size = i.  ̅ is the weighted
average commission error rate of all reviewed concepts of the
partial-areas with sizes ranging over all existing sizes of small
(large) partial-areas, respectively. Thus, the contribution of the
concepts in the partial-areas of size i, to the weighted average
error rate is   ∗   .</p>
        <p>∑  
 ̅ = ∑   ∗  ()</p>
        <p>∑</p>
        <p>
          We calculated the weighted average error rate for all
possible threshold values and picked the maximizing threshold
as value for Θ. We calculated the two-tailed p-value of Fisher’s
exact test [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] to evaluate the statistical significance for the
optimizing threshold.
        </p>
        <p>IV.</p>
        <p>RESULT</p>
        <p>
          The Chemical Entity hierarchy had 106,707 concepts in the
July 2017 release of ChEBI. There are 27,498 partial-areas in
its partial-area taxonomy, from which an excerpt of 164
partialareas summarizing 92,685 concepts (86.86%) is shown in Fig.
2 created by the OAF tool [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. Fig. 2 summarizes the content
and the structure of most of the Chemical Entity hierarchy. It
captures the “big picture” of what this hierarchy is about by
displaying most of the very large partial-areas. For example, the
hierarchy has 35,141 polyatomic entit(y)(ies) and 4550
carbohydrate derivative(s). Out of the 500 randomly selected
concepts, only 476 concepts were reviewed, excluding 24
concepts with “1-star” that were marked either deleted or
obsolete. There were 164 concepts exhibiting commission
errors (164/476=34.45%). They were posted on the GitHub site
of ChEBI for review by curators.
        </p>
        <p>Table II shows the distribution of concepts and errors in
terms of partial-area sizes in the partial-area taxonomy. For
each partial-area size i, the columns include the number of
concepts   , the number of audited concepts, the number of
concepts with commission errors, and the corresponding error
rate   . For example, there are 25,798 partial-areas of size = 1,
in which 236 concepts were reviewed by LC of which 98
concepts (41.53%) were found to have commission errors.
Similarly, there are nine concepts with commission errors out
of all 20 audited concepts (45.00%) in partial-areas with size 2.</p>
        <p>The last three columns of Table II show the weighted error
rate for the small partial-areas, for the large partial-areas, and
the error rate difference between the two categories according
to the corresponding threshold value equal to the partial-area
size in the corresponding row. For example, in row 2, the
partial-area size 2 is selected as the threshold to distinguish
small partial-areas and large partial-areas. The weighted error
rate for the small partial-area category according to formula (1)
is 41.71%, while the weighted error rate for the large
partialarea category is 26.31%. The error rate difference between them
is 15.41%. From Table II we can see that the maximizing
threshold value is 2. Thus, we choose Θ = 2 for Hypothesis 1.</p>
        <p>Table III is the 2x2 contingency table for the commission
erroneous concepts of the small partial-areas and the large
partial-areas where the threshold value is the maximizing
threshold 2. The count for erroneous concepts and concepts
without errors for partial-areas is calculated using data from
Table II. According to Table II, 236 concepts in the sample are
from partial-areas with size = 1 (row 1) and 20 concepts are
from partial-areas with size = 2 (row 2), thus a total number of
256 (=236+20) concepts from small partial-areas were audited.
There are 107 (=98+9) concepts with commission errors in the
small partial-areas. There are 149 (=256-107) concepts without
commission errors from the small partial-areas. Similarly, we
can calculate the number of erroneous concepts and the number
of concepts without errors for the large partial-areas with size &gt;
2, i.e., 57 and 163.</p>
        <p>
          The two-tailed p-value of Fisher’s exact test [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] is 0.0003
(&lt; 0.05) based on Table III. That is, the difference of the
weighted average error rates of concepts in small and large
partial-areas has statistical significance, and Hypothesis 1 is
confirmed.
        </p>
        <p>Two types of commission errors were reported by our expert:
71 concepts (14.92%) had incorrect hierarchical relationships
and 93 (19.54%) had incorrect relationship targets. Table I
shows examples with their error descriptions. For example, in
row 2, the incorrect target of the relationship is conjugate acid
of for N-acetyl-D-glucosaminyldiphosphodolichol is a concept
with 2- charge. It should have a 1- charge, since only one proton
is removed. In row 5, zorbamycin is a secondary, not a primary
amide, thus it has an incorrect hierarchical relationship error.
The distribution of ChEBI concepts and errors between “2-star”
and “3-star” concepts is in Table IV. It shows a much higher
error rate for “3-star” concepts. Thus, we recommend to start by
auditing “3-star” concepts in small partial-areas.</p>
        <p>We utilized the partial-area taxonomy of ChEBI’s Chemical
Entity hierarchy to explore the QA methodology focusing on
small partial-areas. The results show that a threshold value Θ =
2 maximizes the average error rate difference between small
and large partial-areas. The weighted average error rate for
concepts of small partial-areas of up to 2 concepts is 41.71%.
Hence, if the total of 27,262 concepts of the small partial-areas
would be reviewed, about 11,371 concepts are expected to
require corrections. Thus, if the QA resources are too limited to
review 27,262 concepts, then 41.71% of the concepts from
small partial-areas reviewed are expected to be erroneous.</p>
        <p>
          Ochs et al. [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] and He et al. [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] presented a family-based
QA framework such that one methodology is applicable to a
whole family of structurally similar ontologies. If the same QA
methodology is successful on six out of six ontologies in the
same family, then it will be successful for at least half of the
ontologies in the family. To be considered successful, the error
rate of study concepts should be statistically significantly higher
than for control concepts. Ochs et al. classified 373 BioPortal
ontologies into 81 structural families, according to structural
features of those ontologies for which AbNs can be derived.
        </p>
        <p>
          In the previous study on NCIt’s Biological Process
hierarchy by Hua et al. [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ], we formulated a similar hypothesis
for small partial-areas. Although we reported the error rates for
different partial-area sizes and the error rate of small
partialareas with sizes up to three was higher than that of large
partialareas, we did not calculate the statistical significance.
According to the previously reported error rates, the two-tailed
p-value of Fisher’s exact test is 0.0011, based on Table V.
Hence, Hypothesis 1 for the Biological Process hierarchy of
NCIt is confirmed with statistical significance.
        </p>
        <p>
          The concerns of SNOMED CT users about errors are
documented in [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. In the study of SNOMED CT’s Procedure
hierarchy by Ochs et al. [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ], we obtained a similar result.
Concepts in small partial-areas with sizes up to three have more
errors than large partial-areas. The two-tailed p-value of
Fisher’s exact test is p &lt; 0.019. A study on the NCIt’s Neoplasm
subhierarchy by Zheng et al. [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] reported that concepts in small
partial-areas with sizes ≤ 10 have a statistically significantly
higher error rate than large partial-areas, with p = 0.0113.
        </p>
        <p>
          According to Ochs et al. [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], NCIt’s small Biological
Process and Neoplasm subhierarchies and SNOMED CT’s
large Procedure hierarchy in the previous three studies, and
ChEBI’s Chemical Entity hierarchy in this study, belong to the
same family of 76 ontologies. In summary, there have been four
successful studies, out of the required six, of the QA technique
showing that small partial-areas have statistically significantly
more errors than large partial-areas. However, the threshold that
defines small partial-areas varies. Hence, this study advances
towards the goal of showing that small partial-area-based QA is
applicable to the whole family. If two more such studies will be
successful, then we can make a statement that the small
partialarea-based methodology is applicable to this whole family as
follows. For at least half of the remaining ontologies there exists
a threshold Θ such that the error rate for concepts of small
partial-areas is statistically significantly higher than for large
partial-areas. A substantially different approach to QA using
partial-areas is based on a refinement of the partial-area
taxonomy into the disjoint partial-area taxonomy [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ].
        </p>
        <p>CONCLUSION</p>
        <p>Abstraction Networks of ontologies have been proven to
define a framework for the identification of concept sets that are
expected to have comparatively higher error rates. Small
partial-areas in the partial-area taxonomy derived from an
ontology likely reflect uncommonly modeled concepts in the
ontology. In this paper we tested the QA methodology that
concentrates on auditing the concepts in small partial-areas.</p>
        <p>This study applied the small partial-area-based QA
methodology to the ChEBI ontology. Our analysis revealed that
small partial-areas have statistically significantly more errors
than large partial-areas, with an optimal threshold of two. The
results confirmed that in the ChEBI ontology small partial-areas
with size up to two concepts harbor statistically significantly
more commission errors compared to large partial-areas.
Overall, this approach narrows down the places in the ChEBI
ontology where limited QA efforts should be invested to obtain
a higher QA yield. This study, in combination with three other
previous studies, provides progress toward showing that the
small partial-area-based methodology is successful in
identifying likely errors for a whole family of 76 BioPortal
ontologies.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>ACKNOWLEDGMENT</title>
      <p>Research reported in this publication was partially
supported by the National Cancer Institute of the National
Institutes of Health under Award Number R01CA190779. The
content is solely the responsibility of the authors and does not
necessarily represent the official views of the National Institutes
of Health.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Hastings</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Owen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Dekker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ennis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Kale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Muthukrishnan</surname>
          </string-name>
          , et al.,
          <article-title>"ChEBI in 2016: Improved services and an expanding collection of metabolites,"</article-title>
          <source>Nucleic Acids Res</source>
          , vol.
          <volume>44</volume>
          , pp.
          <fpage>D1214</fpage>
          -
          <lpage>9</lpage>
          , Jan 4
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>C.</given-names>
            <surname>Bettembourg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Diot</surname>
          </string-name>
          , and
          <string-name>
            <given-names>O.</given-names>
            <surname>Dameron</surname>
          </string-name>
          ,
          <article-title>"Optimal Threshold Determination for Interpreting Semantic Similarity and Particularity: Application to the Comparison of Gene Sets and Metabolic Pathways Using GO and ChEBI,"</article-title>
          <source>PLoS One</source>
          , vol.
          <volume>10</volume>
          , p.
          <fpage>e0133579</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>H.</given-names>
            <surname>Min</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Perl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Halper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Geller</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>"Auditing as part of the terminology design life cycle,"</article-title>
          <source>J Am Med Inform Assoc</source>
          , vol.
          <volume>13</volume>
          , pp.
          <fpage>676</fpage>
          -
          <lpage>90</lpage>
          , Nov-Dec
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Halper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Gu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Perl</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Ochs</surname>
          </string-name>
          ,
          <article-title>"Abstraction networks for terminologies: Supporting management of "big knowledge","</article-title>
          <source>Artif Intell Med</source>
          , vol.
          <volume>64</volume>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>16</lpage>
          , May
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Halper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Min</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Perl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K. A.</given-names>
            <surname>Spackman</surname>
          </string-name>
          ,
          <article-title>"Structural methodologies for auditing SNOMED," J Biomed Inform</article-title>
          , vol.
          <volume>40</volume>
          , pp.
          <fpage>561</fpage>
          -
          <lpage>81</lpage>
          ,
          <year>Oct 2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6] S. de Coronado,
          <string-name>
            <given-names>M. W.</given-names>
            <surname>Haber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Sioutos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Tuttle</surname>
          </string-name>
          , and
          <string-name>
            <given-names>L. W.</given-names>
            <surname>Wright</surname>
          </string-name>
          ,
          <article-title>"NCI Thesaurus: using science-based terminology to integrate cancer research results," Stud Health Technol Inform</article-title>
          , vol.
          <volume>107</volume>
          , pp.
          <fpage>33</fpage>
          -
          <lpage>7</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M. Q.</given-names>
            <surname>Stearns</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Price</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. A.</given-names>
            <surname>Spackman</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>"SNOMED clinical terms: overview of the development process and project status,"</article-title>
          <source>Proc AMIA Symp</source>
          , pp.
          <fpage>662</fpage>
          -
          <lpage>6</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>H.</given-names>
            <surname>Yumak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Halper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Perl</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Elhanan</surname>
          </string-name>
          ,
          <article-title>"A Quality-Assurance Study of ChEBI,"</article-title>
          in ICBO/BioCreative,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>EMBL-EBI</surname>
          </string-name>
          .
          <article-title>Chemical Entities of Biological Interest</article-title>
          . Available: https://www.ebi.ac.uk/chebi/
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Y. F.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. H.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , K. Y. Feng,
          <string-name>
            <given-names>H. P.</given-names>
            <surname>Li</surname>
          </string-name>
          , et al.,
          <article-title>"Prediction of drugs target groups based on ChEBI ontology,"</article-title>
          <source>Biomed Res Int</source>
          , vol.
          <year>2013</year>
          , p.
          <fpage>132724</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>D. P.</given-names>
            <surname>Hill</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Adams</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Bada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Batchelor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. Z.</given-names>
            <surname>Berardini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Dietze</surname>
          </string-name>
          , et al.,
          <article-title>"Dovetailing biology and chemistry: integrating the Gene Ontology with the ChEBI chemical ontology,"</article-title>
          <source>BMC Genomics</source>
          , vol.
          <volume>14</volume>
          , p.
          <fpage>513</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>B.</given-names>
            <surname>Glimm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Horrocks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Motik</surname>
          </string-name>
          , G. Stoilos, and
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>"HermiT: An OWL 2 Reasoner,"</article-title>
          <string-name>
            <given-names>J.</given-names>
            <surname>Autom</surname>
          </string-name>
          . Reason., vol.
          <volume>53</volume>
          , pp.
          <fpage>245</fpage>
          -
          <lpage>269</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Halper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Perl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Ochs</surname>
          </string-name>
          , and
          <string-name>
            <given-names>L.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <article-title>"Taxonomy-Based Approaches to Quality Assurance of Ontologies," J Healthc Eng</article-title>
          , vol.
          <year>2017</year>
          , p.
          <fpage>3495723</fpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>C.</given-names>
            <surname>Ochs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Geller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Perl</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Musen</surname>
          </string-name>
          ,
          <article-title>"A unified software framework for deriving, visualizing, and exploring abstraction networks for ontologies," J Biomed Inform</article-title>
          , vol.
          <volume>62</volume>
          , pp.
          <fpage>90</fpage>
          -
          <lpage>105</lpage>
          ,
          <year>Aug 2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>L.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Min</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Geller</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Perl</surname>
          </string-name>
          ,
          <article-title>"Auditing National Cancer Institute thesaurus neoplasm concepts in groups of high error concentration,"</article-title>
          <source>Applied Ontology</source>
          , vol.
          <volume>12</volume>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>18</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>R. A</surname>
          </string-name>
          . Fisher,
          <article-title>"Statistical methods for research workers,"</article-title>
          in Breakthroughs in Statistics, ed: Springer,
          <year>1992</year>
          , pp.
          <fpage>66</fpage>
          -
          <lpage>70</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>C.</given-names>
            <surname>Ochs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Geller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Perl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Hripcsak</surname>
          </string-name>
          , et al.,
          <article-title>"Utilizing a structural meta-ontology for family-based quality assurance of the BioPortal ontologies," J Biomed Inform</article-title>
          , vol.
          <volume>61</volume>
          , pp.
          <fpage>63</fpage>
          -
          <lpage>76</lpage>
          ,
          <year>Jun 2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>Z.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Ochs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Agrawal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Perl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Zeginis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Tarabanis</surname>
          </string-name>
          , et al.,
          <article-title>"A family-based framework for supporting quality assurance of biomedical ontologies in BioPortal,"</article-title>
          <source>AMIA Annu Symp Proc</source>
          , vol.
          <year>2013</year>
          , pp.
          <fpage>581</fpage>
          -
          <lpage>90</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>H.</given-names>
            <surname>Min</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Perl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Halper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Geller</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>"Auditing as Part of the Terminology Design Life Cycle,"</article-title>
          <source>J Am Med Inform Assoc</source>
          , vol.
          <volume>13</volume>
          , pp.
          <fpage>676</fpage>
          -
          <lpage>690</lpage>
          , Nov-Dec
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>G.</given-names>
            <surname>Elhanan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Perl</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Geller</surname>
          </string-name>
          ,
          <article-title>"A survey of SNOMED CT direct users, 2010: impressions and preferences regarding content and quality,"</article-title>
          <source>J Am Med Inform Assoc</source>
          , vol.
          <volume>18</volume>
          <issue>Suppl 1</issue>
          , pp.
          <fpage>i36</fpage>
          -
          <lpage>44</lpage>
          ,
          <year>Dec 2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>C.</given-names>
            <surname>Ochs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Perl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Geller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Halper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Gu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          , et al.,
          <article-title>"Scalability of abstraction-network-based quality assurance to large SNOMED hierarchies,"</article-title>
          <source>AMIA Annu Symp Proc</source>
          , vol.
          <year>2013</year>
          , pp.
          <fpage>1071</fpage>
          -
          <lpage>80</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Halper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Perl</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Geller</surname>
          </string-name>
          ,
          <article-title>"Abstraction of complex concepts with a refined partial-area taxonomy of SNOMED," J Biomed Inform</article-title>
          , vol.
          <volume>45</volume>
          , pp.
          <fpage>15</fpage>
          -
          <lpage>29</lpage>
          ,
          <year>Feb 2012</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>