<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Measuring the importance of annotation granularity to the detection of semantic similarity between phenotype profiles</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Prashanti Manda</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>James P. Balhoff</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Todd J. Vision</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Biology, University of North Carolina at Chapel Hill</institution>
          ,
          <addr-line>NC</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>RTI International</institution>
          ,
          <addr-line>NC</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>-In phenotype annotations curated from the biological and medical literature, considerable human effort must be invested to select ontological classes that capture the expressivity of the original natural language descriptions, and finer annotation granularity can also entail higher computational costs for particular reasoning tasks. Do coarse annotations suffice for certain applications? Here, we measure how annotation granularity affects the statistical behavior of semantic similarity metrics. We use a randomized dataset of phenotype profiles drawn from 57,051 taxon-phenotype annotations in the Phenoscape Knowledgebase. We compared query profiles having variable proportions of matching phenotypes to subject database profiles using both pairwise and groupwise Jaccard (edge-based) and Resnik (node-based) semantic similarity metrics, and compared statistical performance for three different levels of annotation granularity: entities alone, entities plus attributes, and entities plus qualities (with implicit attributes). All four metrics examined showed more extreme values than expected by chance when approximately half the annotations matched between the query and subject profiles, with a more sudden decline for pairwise statistics and a more gradual one for the groupwise statistics. Annotation granularity had a negligible effect on the position of the threshold at which matches could be discriminated from noise. These results suggest that coarse annotations of phenotypes, at the level of entities with or without attributes, may be sufficient to identify phenotype profiles with statistically significant semantic similarity.</p>
      </abstract>
      <kwd-group>
        <kwd>ontology</kwd>
        <kwd>phenotype</kwd>
        <kwd>curation</kwd>
        <kwd>annotation granularity</kwd>
        <kwd>semantic similarity</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>I. INTRODUCTION</title>
      <p>
        To make phenotype descriptions in the biological and
medical literature amenable to large-scale discovery and
computation, a variety of efforts have been launched to convert
such descriptions into logical expressions using ontologies and
to integrate them into the larger ecosystem of online, open
biological information resources [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Typically, this involves
curation and annotation of phenotypes in the Entity-Quality
(EQ) formalism [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], which is widely used by model
organism communities for representation of gene phenotypes [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
The EQ formalism has more recently been adopted by the
Phenoscape project to curate phenotypes from the literature
that are reported to vary among evolutionary lineages [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] with
the goal of linking them to gene phenotypes and generating
hypotheses about the genetic bases of evolutionary transitions
[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>In the EQ approach, an entity represents a biological
object, e.g. an anatomical structure, an anatomical space, or
a biological process, while a quality represents a trait or
property that an entity possesses, e.g, shape, color, or size.
Curators often create complex logical expressions called
postcompositions by combining ontology terms, relations, and
spatial properties from multiple ontologies in different ways
to create entities and qualities that adequately represent
phenotypic descriptions. For example, “big supraorbital bone” is
represented as E: supraorbital bone (UBERON 0004747), Q:
enlarged size (PATO˙0000586). A more complex description
such as “parietal fused with supraoccipital bone” is
represented by relating the two affected entities, supraorbital bone
(UBERON 0004747) and parietal (UBERON 2001997) using
the quality fused with (PATO 0000642).</p>
      <p>
        Annotation of phenotypes at this level of ontological detail
is time consuming and expensive [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Annotating evolutionary
phenotypes at the finest level of granularity often requires
curators to create new ontology terms and request those terms
to be added to the ontology. Coarse annotation removes the
need for ontology development by limiting curators to a small
set of attribute level qualities already present in the ontology.
Reducing the effort on curatorial tasks such as ontology
development and data preparation improves the annotation rate
from two characters per hour to 14 characters per hour [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
Thus coarse annotation can be part of an efficient annotation
workflow, and permit larger datasets to be curated for
equivalent resources. In addition, reasoning over the combinatorial
entity and quality ontology space for EQ annotations poses a
serious computational challenge.
      </p>
      <p>
        Given these competing considerations, what level of
annotation granularity is optimal? The answer may depend on the
particular application. For Phenoscape, a major goal is to be
able to find sets of phenotypes that show greater semantic
similarity than would be expected by chance when comparing sets
of phenotypes from different biological domains (e.g. those
observed in evolutionary lineages versus those induced by
genetic manipulations in the laboratory) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. When comparing
phenotypes with such different biological origins, we would
not expect to see congruence in fine detail for a variety of
reasons. For instance, even if the same or homologous genes
have contributed to the two profiles, independent changes to
those genes may underpin the phenotypes, they may be in
lineages for which the genetic networks have diverged, and
there may have been considerable evolutionary modification
of the phenotype since its first origin. Even if two biological
phenotypes are identical, the way in which the phenotypes
are observed and described by independent researchers may
lead to natural language descriptions, and thus profiles of
annotations, that are quite different. With such weak matches,
do finer annotations enable similarities to be detected, or are
finer annotations superfluous or even distracting?
      </p>
      <p>
        To explore this issue, we have conducted experiments to
test the statistical sensitivity of semantic similarity at
varying annotation granularity. Our approach involves simulating
phenotype profiles by sampling from real annotations drawn
from the Phenoscape Knowledgebase [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. We measured
similarity between profiles that shared all, some or none of
their annotations, with the remainder drawn randomly from
the population of annotations. We assessed the decline of
semantic similarity to the point at which it could no longer
be discriminated from random chance. This was done for four
different semantic similarity statistics, and for three levels of
annotation granularity.
      </p>
      <p>II.</p>
      <p>METHODS</p>
      <sec id="sec-1-1">
        <title>A. Semantic similarity metrics</title>
        <p>
          The four semantic similarity statistics we have chosen
represent extremes along two different dimensions by which
semantic similarity metrics vary [
          <xref ref-type="bibr" rid="ref10 ref7 ref8 ref9">7–10</xref>
          ]. Edge-based semantic
similarity metrics use the distance between terms in the
ontology as a measure of similarity. Node-based measures use
the Information Content of the annotations to the terms being
compared and/or their least common subsumer. The similarity
metrics we have chosen are based on Jaccard (edge-based) and
Resnik (node-based) similarity, which are popular in biological
applications (e.g. [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]). For each, we have one version that
summarizes the distribution of pairwise similarities between
two sets of annotations, and another that calculates a groupwise
score directly.
        </p>
        <p>
          1) Jaccard similarity: The Jaccard similarity (sJ ) of two
classes (A, B) in an ontology is defined as the ratio of the
number of classes in the intersection of their subsumers over
the number of classes in their union of their subsumers [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ].
sJ (A; B) = jS(A) \ S(B)j
        </p>
        <p>jS(A) [ S(B)j
where S(A) is the set of classes that subsume A.</p>
        <p>
          2) Resnik similarity: The Information Content of ontology
class A, denoted I(A) is defined as the negative logarithm of
the proportion of profiles annotated to that class f (A) out of
T profiles in total.
Since the minimum value of I(A) is zero, at the root of the
ontology, while the maximum value is log(1=T ), we can
compute a Normalized Information Content (In) with range
[
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ]
        </p>
        <p>In(A) =</p>
        <p>I(A)
log(1=T )
The Resnik similarity (sR) of two ontology classes is defined
as the Normalized Information Content of the least common
subsumer (LCS) of the two classes.</p>
        <p>sR(A; B) = In(LCS(A; B))</p>
      </sec>
      <sec id="sec-1-2">
        <title>B. Profile similarity</title>
        <p>A set of ontology-based phenotype annotations is called a
phenotype profile. When comparing two profiles, X and Y ,
where each has at least one, and potentially many annotations,
we could either summarize all the pairwise combinations
of annotations, or we could compute a groupwise similarity
measure directly as a function of graph overlap.</p>
        <p>1) Best Pairs: Pairwise approaches summarize the
distribution of pairwise Jaccard or Resnik similarity scores between
annotations in the two profiles. Here we use the Best Pairs
score. For each annotation in X, the best scoring match in
Y is determined, and the median of the jXj resultant values
is taken. Similarly, for each annotation in Y , the best scoring
match in X is determined, and the median of the jY j values
is taken. The Best Pairs score pz(X; Y ) is the mean of these
two medians. The index z can be used to denote whether the
pairwise values are Resnik (z = R) or Jaccard (z = J ).</p>
        <p>pz(X; Y ) = (1=2)[bz(X; Y ) + bz(Y; X)]
where
n
sz(Xi; Yj )
o
bz(X; Y ) =</p>
        <p>median
i2f1:::jXjg;j=argmax sz(Xi;Yj)</p>
        <p>j=1:::jY j
Note that, as defined, pz(X; Y ) = pz(Y; X).</p>
        <p>2) Groupwise: Groupwise approaches compare profiles
directly based on set operations or graph overlap.</p>
        <p>The Groupwise Jaccard similarity of profiles X and Y ,
gJ (X; Y ), is defined as the ratio of the number of classes
in the intersection to the number of classes in the union of the
two profiles
gJ (X; Y ) = jC(X) \ C(Y )j</p>
        <p>jC(X) [ C(Y )j
where C(X) is the set of classes belonging to X plus their
subsumers.</p>
        <p>Similarly, the Groupwise Resnik similarity of profiles X
and Y , gR(X; Y ), is defined as the ratio of the normalized
information content summed over all nodes in the intersection
of X, Y to the information content summed over all nodes in
the union.</p>
        <p>P
gR(X; Y ); = P
t2fC(X)\C(Y )g In(t)
t2fC(X)[C(Y )g In(t)
where C(X) is defined as above.</p>
        <p>
          The Phenoscape Knowledgebase contains a dataset of 661
taxa with 57,051 evolutionary phenotypes, which are
phenotypes that have been inferred to vary among the taxon’s
immediate descendents [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. A simulation dataset of subject
profiles having the same size distribution of annotations per
taxon was created by permutation of the taxon labels.
        </p>
      </sec>
      <sec id="sec-1-3">
        <title>D. Simulating profile ‘decay’</title>
        <p>To simulate decay of profile similarity, five query profiles
of size ten were randomly selected from the simulated dataset.
For each, there is one profile among the set of subjects for
which each annotation has a one-to-one perfect match. For
each of the five profiles, ten progressively decayed profiles
were obtained by iteratively replacing one of the original
annotations with an annotation randomly selected from among
the 57,051 available (Figure II-D). Thus, for each original
profile, there is a profile in which one original annotation has
been replaced with random annotation, another in which two
have been replaced, and so on, through to a fully decayed
profile in which all original annotations have been replaced
with a random one. To characterize the noise distribution for
each metric in the absence of semantic similarity, we also
generated 5,000 profiles of size ten by drawing annotations
randomly from among the 57,051 available. These profiles
would not be expected to have more than nominal similarity
with any of the simulated subject profiles.</p>
      </sec>
      <sec id="sec-1-4">
        <title>E. Adjusting annotation granularity</title>
        <p>
          The evolutionary phenotypes available from Phenoscape
have been annotated with both entities and qualities, and
the intermediate level of attribute is implicit in the quality
annotation due the structure of the PATO quality ontology
[
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. In order to measure semantic similarity for three levels of
Best Pairs
        </p>
        <p>Groupwise
Jaccard
Resnik
1.0
granularity: entity only (E), entity-attribute (EA), and
entityquality (EQ), we used three different phenotype ontologies,
one for each granularity level, containing phenotype concepts
combining terms from Uberon (entities) and PATO (attributes
and qualities). In each evaluation, annotations in the query
profiles and the simulated database will match at the granularity
level available in the generated phenotype ontology.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>III. RESULTS AND DISCUSSION</title>
      <p>We measured semantic similarity between each of the five
query profiles and their decay series to all 661 profiles in
the subject database. This was done for each of the four
semantic similarity metrics (Best Pairs and Groupwise variants
of Jaccard and Resnik metrics) and for each of the three
granularity levels (E: Entity only, EA: Entity-Attribute, and
EQ: Entity-Quality). The results are shown in Figure 2). For
ease of interpretation, we take the upper 99:9% of the
similarity distribution for random profile matches as an arbitrary
threshold for comparing the sensitivity of the different series.
All series cross this threshold when approximately half of
the annotations have been replaced, with a sudden decline
in similarity for the Best Pairs statistics and a more gradual
decline for the groupwise statistics. While the differences
in sensitivity among the annotation granularity levels are
subtle, the annotations of intermediate granularity (EA) have
marginally greater sensitivity for all four statistics.</p>
      <p>
        The sharp decline in similarity under the Best Pairs statistics
at approximately 50% decay can be understood as a result
of summarizing the pairwise distribution with the median. In
future work, we aim to explore how the sensitivity of pairwise
statistics might be tuned by using different percentiles. Given
the relatively flat performance of the Best Pairs statistics when
decay was under 50%, we suggest that groupwise statistics are
likely to provide greater discrimination between true matches
of varying quality and thus better for rank ordering the
outcome of semantic similarity searches, e.g. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Our results
also illustrate how difficult it can be to statistically discriminate
weakly matching profiles from noise, something which has
received relatively little consideration in many applications of
semantic similarity search to date.
      </p>
      <p>
        The relatively minor differences in statistical
performance with varying annotation granularity, with EA showing
marginally greater sensitivity, has implications both for the
process of generating annotations and the implementation of
semantic similarity computation. As noted in the Introduction,
annotation to EA requires considerably less human curation
effort than EQ, and is almost identical in effort to curation
to E. Restricting annotation granularity to EA may also ease
the challenge of speeding of curation through machine-aided
natural language processing, e.g. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
      <p>
        Second, the computational expense of measuring semantic
similarity can be prohibitive for fine-grained annotations due
to an explosion in the number of classes required for reasoning
when annotations draw from multiple ontologies [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. If the
inclusion of qualities does not improve sensitivity, that opens
up the possibility of conducting fast, scalable on-the-fly
webbased semantic searches at coarser annotation levels.
      </p>
      <p>One of the contributions of this work is in introducing a
framework for evaluating the statistical sensitivity of semantic
similarity metrics. Nonetheless, the results reported here are
specific to one particular model for the decay of similarity
between two profiles, in which some portion of annotations
that match perfectly while others do not match at all. We
recognize a need to explore other models, especially ones
where pairs of annotations may match imperfectly. We also
propose that other evaluation criteria should be examined
to more fully understand the trade-offs involved in building
datasets with a particular level of annotation granularity.</p>
      <p>IV.</p>
      <p>ACKNOWLEDGEMENTS</p>
      <p>We thank W. Dahdul, T.A. Dececchi, N. Ibrahim and
L. Jackson for curation of the original dataset, along
with the larger community of ontology contributors and
data providers (http://phenoscape.org/wiki/Acknowledgments#
Contributors), and useful feedback from P. Mabee, H. Lapp,
W. Dahdul, and other members of the Phenoscape team. This
work was funded by the National Science Foundation
(DBI1062542).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Deans</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. E.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Huala</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. S.</given-names>
            <surname>Anzaldo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ashburner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhoff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. C.</given-names>
            <surname>Blackburn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Blake</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. G.</given-names>
            <surname>Burleigh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Chanet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. D.</given-names>
            <surname>Cooper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Courtot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Cssz</surname>
          </string-name>
          , H. ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Dahdul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Das</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. T.</given-names>
            <surname>Dececchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Dettai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Diogo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. E.</given-names>
            <surname>Druzinsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumontier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. M.</given-names>
            <surname>Franz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Friedrich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. V.</given-names>
            <surname>Gkoutos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Haendel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. J.</given-names>
            <surname>Harmon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. F.</given-names>
            <surname>Hayamizu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. M.</given-names>
            <surname>Hines</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ibrahim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. M.</given-names>
            <surname>Jackson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Jaiswal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>James-Zorn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Khler</surname>
          </string-name>
          , G. Lecointre,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lapp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Lawrence</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. Le</given-names>
            <surname>Novre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. G.</given-names>
            <surname>Lundberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Macklin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Mast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. E.</given-names>
            <surname>Midford</surname>
          </string-name>
          , I. Mik,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Mungall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Oellrich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Osumi-Sutherland</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Parkinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Ramrez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Richter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. N.</given-names>
            <surname>Robinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ruttenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. S.</given-names>
            <surname>Schulz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Segerdell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. C.</given-names>
            <surname>Seltmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Sharkey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. D.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Specht</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Squires</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. W.</given-names>
            <surname>Thacker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Thessen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Fernandez-Triana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Vihinen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. D.</given-names>
            <surname>Vize</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Vogt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. E.</given-names>
            <surname>Wall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. L.</given-names>
            <surname>Walls</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Westerfeld</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Wharton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. S.</given-names>
            <surname>Wirkner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. B.</given-names>
            <surname>Woolley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Yoder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Zorn</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Mabee</surname>
          </string-name>
          , “
          <article-title>Finding our way through phenotypes</article-title>
          ,”
          <source>PLoS Biology</source>
          , vol.
          <volume>13</volume>
          , no.
          <issue>1</issue>
          , p.
          <fpage>e1002033</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>G. V.</given-names>
            <surname>Gkoutos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. C.</given-names>
            <surname>Green</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.-M. Mallon</surname>
            ,
            <given-names>J. M.</given-names>
          </string-name>
          <string-name>
            <surname>Hancock</surname>
            , and
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Davidson</surname>
          </string-name>
          , “
          <article-title>Using ontologies to describe mouse phenotypes</article-title>
          ,
          <source>” Genome Biology</source>
          , vol.
          <volume>6</volume>
          , no.
          <issue>1</issue>
          , p.
          <fpage>R8</fpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Mungall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. V.</given-names>
            <surname>Gkoutos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. L.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Haendel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. E.</given-names>
            <surname>Lewis</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Ashburner</surname>
          </string-name>
          , “
          <article-title>Integrating phenotype ontologies across multiple species</article-title>
          ,
          <source>” Genome Biology</source>
          , vol.
          <volume>11</volume>
          , no.
          <issue>1</issue>
          , p.
          <fpage>R2</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>W. M.</given-names>
            <surname>Dahdul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhoff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Engeman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Grande</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. J.</given-names>
            <surname>Hilton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Kothari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lapp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. G.</given-names>
            <surname>Lundberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. E.</given-names>
            <surname>Midford</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. J.</given-names>
            <surname>Vision</surname>
          </string-name>
          , M. Westerfield, and
          <string-name>
            <given-names>P. M.</given-names>
            <surname>Mabee</surname>
          </string-name>
          , “
          <article-title>Evolutionary characters, phenotypes and ontologies: curating data from the systematic biology literature</article-title>
          ,”
          <source>PLoS One</source>
          , vol.
          <volume>5</volume>
          , no.
          <issue>5</issue>
          , p.
          <fpage>e10708</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>P.</given-names>
            <surname>Manda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhoff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lapp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mabee</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T. J.</given-names>
            <surname>Vision</surname>
          </string-name>
          , “
          <article-title>Using the phenoscape knowledgebase to relate genetic perturbations to phenotypic evolution,” Genesis</article-title>
          , vol.
          <volume>53</volume>
          , no.
          <issue>8</issue>
          , pp.
          <fpage>561</fpage>
          -
          <lpage>571</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>W.</given-names>
            <surname>Dahdul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. A.</given-names>
            <surname>Dececchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ibrahim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lapp</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Mabee</surname>
          </string-name>
          , “
          <article-title>Moving the mountain: analysis of the effort required to transform comparative anatomy into computable anatomy</article-title>
          ,
          <source>” Database</source>
          , vol.
          <year>2015</year>
          , p.
          <fpage>bav040</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>C.</given-names>
            <surname>Pesquita</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Faria</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. O.</given-names>
            <surname>Falcao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Lord</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F. M.</given-names>
            <surname>Couto</surname>
          </string-name>
          , “
          <article-title>Semantic similarity in biomedical ontologies,” PLoS Computational Biology</article-title>
          , vol.
          <volume>5</volume>
          , no.
          <issue>7</issue>
          , p.
          <fpage>e1000443</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>R. M.</given-names>
            <surname>Othman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Deris</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R. M.</given-names>
            <surname>Illias</surname>
          </string-name>
          , “
          <article-title>A genetic similarity algorithm for searching the gene ontology terms and annotating anonymous protein sequences</article-title>
          ,
          <source>” Journal of Biomedical Informatics</source>
          , vol.
          <volume>41</volume>
          , no.
          <issue>1</issue>
          , pp.
          <fpage>65</fpage>
          -
          <lpage>81</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>J. Z.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Du</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Payattakool</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. Y.</given-names>
            <surname>Philip</surname>
          </string-name>
          , and
          <string-name>
            <surname>C.-F. Chen</surname>
          </string-name>
          , “
          <article-title>A new method to measure the semantic similarity of go terms</article-title>
          ,
          <source>” Bioinformatics</source>
          , vol.
          <volume>23</volume>
          , no.
          <issue>10</issue>
          , pp.
          <fpage>1274</fpage>
          -
          <lpage>1281</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>X.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Pang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lin</surname>
          </string-name>
          , and
          <string-name>
            <surname>Z.-M. Pei</surname>
          </string-name>
          , “
          <article-title>Improving the measurement of semantic similarity between gene ontology terms and gene products: insights from an edge-and ic-based hybrid method</article-title>
          ,”
          <source>PloS One</source>
          , vol.
          <volume>8</volume>
          , no.
          <issue>5</issue>
          , p.
          <fpage>e66745</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>N. L.</given-names>
            <surname>Washington</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Haendel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Mungall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ashburner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Westerfield</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S. E.</given-names>
            <surname>Lewis</surname>
          </string-name>
          , “
          <article-title>Linking human diseases to animal models using ontology-based phenotype annotation</article-title>
          ,”
          <source>PLoS Biology</source>
          , vol.
          <volume>7</volume>
          , no.
          <issue>11</issue>
          , p.
          <fpage>e1000247</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Mistry</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Pavlidis</surname>
          </string-name>
          , “
          <article-title>Gene ontology term overlap as a measure of gene functional similarity,” BMC Bioinformatics</article-title>
          , vol.
          <volume>9</volume>
          , no.
          <issue>1</issue>
          , p.
          <fpage>327</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>H.</given-names>
            <surname>Cui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Dahdul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. T.</given-names>
            <surname>Dececchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ibrahim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mabee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhoff</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Gopalakrishnan</surname>
          </string-name>
          , “
          <article-title>Charaparser+eq: Performance evaluation without gold standard</article-title>
          ,
          <source>” Proceedings of the Association for Information Science and Technology</source>
          , vol.
          <volume>52</volume>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>P.</given-names>
            <surname>Manda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Mungall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhoff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lapp</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Vision</surname>
          </string-name>
          , “
          <article-title>Investigating the importance of anatomical homology for cross-species phenotype comparisons using semantic similarity</article-title>
          ,
          <source>” Biocomputing</source>
          <year>2016</year>
          , pp.
          <fpage>132</fpage>
          -
          <lpage>143</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>