<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Reasoning over anatomical homology in the Phenoscape KB</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Paula M. Mabee</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Wasila M. Dahdul</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Christopher J. Mungall</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>James P. Balhoff</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hilmar Lapp</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Todd J. Vision</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Center for Genomic and Computational Biology Duke University Durham</institution>
          ,
          <addr-line>NC</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Biology University of North Carolina at Chapel Hill</institution>
          <addr-line>Chapel Hill, NC</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Department of Biology University of South Dakota Vermillion</institution>
          ,
          <addr-line>SD</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Environmental Genomics and Systems Biology Lawrence Berkeley National Laboratory Berkeley</institution>
          ,
          <addr-line>CA</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Renaissance Computing Institute University of North Carolina at Chapel Hill</institution>
          <addr-line>Chapel Hill, NC</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <fpage>7</fpage>
      <lpage>10</lpage>
      <abstract>
        <p>- The Phenoscape project (www.phenoscape.org) has semantically annotated the features of species from the comparative literature, enabling links between candidate genes and novel species phenotypes for which they might be responsible. To enable discovery of homologous phenotypes and associated genes, we incorporated machine-reasoning with knowledge about homology into the Phenoscape Knowledgebase (KB). We show that with homology reasoning enabled, the results of database queries can be expanded to incorporate shared evolutionary history. We discuss the challenges in developing a logical model of homology assertions and implications for database queries, as well as theoretical entailment and practical performance tradeoffs between alternative models.</p>
      </abstract>
      <kwd-group>
        <kwd>homology</kwd>
        <kwd>reasoning</kwd>
        <kwd>evolution</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>anatomy
ontology;
phenotypes;</p>
    </sec>
    <sec id="sec-2">
      <title>I. INTRODUCTION</title>
      <p>
        The enormous volume of biological data that has become
available to researchers has brought with it a rapidly expanding
taxonomic range represented by the data. Because different
taxa can possess similar features due to shared ancestry, the
incorporation of homology in connecting, aggregating, and
analyzing data has become increasingly important. For
example, without explicit incorporation of homology, the
results of user queries for species phenotypes or candidate
genes may be incomplete. The Phenoscape Project [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ] has
semantically annotated the features of species from the
comparative literature, enabling links between novel species
phenotypes and candidate genes that may underlie them. To
enable discovery of homologous phenotypes and associated
      </p>
      <p>Funding provided by National Science Foundation ABI Innovation
collaborative grants (1661529, 1661356, 1661456, 1661516) and an ABI
Development grant (1062542).</p>
    </sec>
    <sec id="sec-3">
      <title>II. ANNOTATION OF HOMOLOGY ASSERTIONS</title>
      <p>
        Homology assertions for both historical and serial
homology of vertebrate skeletal elements were extracted from
the comparative literature for teleost fishes and early
sarcopterygians [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], and from the developmental genetic
literature. We constructed these assertions using anatomy terms
from the Uberon anatomy ontology [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and taxon terms from
the Vertebrate Taxonomy Ontology [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], resulting in a total of
98 homology assertions pertaining to skeletal anatomy.
Attribution for each homology statement was recorded, and the
type of evidence (e.g., positional, developmental) provided by
the author supporting or contradicting the homology assertion
was annotated with terms from the Evidence and Conclusion
Ontology [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The most common type of evidence for or
against homology cited by authors in the collection of
homology assertions were based on development (27
statements), followed by morphological similarity (26
statements), position (20 statements), and gene expression
(14 homology statements). Some author statements (5) cited
evidence traceable to a different publication, whereas some (6
statements) did not cite traceable evidence. The collection of
homology assertions was incorporated in the Phenoscape KB,
which currently contains over 600,000 annotated phenotypes
for vertebrate taxa from 185 comparative morphological
studies.
      </p>
    </sec>
    <sec id="sec-4">
      <title>III. HOMOLOGY REASONING MODELS</title>
      <p>We have taken an exploratory approach toward resolving
the most effective way to enable machine reasoning on
historical and serial homology across anatomical structures.
Specifically, we explored the reasoning ramifications of two
OWL models of homology that we have developed. In the first
model, classes of homologous entities are represented using
reciprocal existential property restrictions. In the second
model, an OWL individual is introduced that represents the
ancestral structure from which all instances of two classes of
homologous structures are descended. Using the collection of
homology assertions and a sample of fin/limb phenotypes from
the KB, we evaluated each model against the expected
outcomes for a set of queries formulated as competency
questions relevant to research in developmental biology,
comparative anatomy, and evolution. We discuss these results
and the implementation of homology reasoning in the KB.</p>
    </sec>
    <sec id="sec-5">
      <title>IV. CONCLUSIONS</title>
      <p>We have enabled homology reasoning in the Phenoscape
KB, where it allows discovery of homologous structures. With
homology reasoning incorporated, computational tools can
now access the results of reasoning across evolutionary history.
Although the model we select and implement in the KB
satisfies basic reasoning, we expect that it can and will be
optimized for different purposes, and as computational
methods to represent uncertainty evolve.</p>
    </sec>
    <sec id="sec-6">
      <title>ACKNOWLEDGMENT</title>
      <p>We are grateful to the many collaborators who have
contributed data and expertise to the Phenoscape Project.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>R. C.</given-names>
            <surname>Edmunds</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Su</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhoff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. F.</given-names>
            <surname>Eames</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W M.</given-names>
            <surname>Dahdul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lapp</surname>
          </string-name>
          , et al. (
          <year>2016</year>
          )
          <article-title>Phenoscape: Identifying Candidate Genes for Evolutionary Phenotypes</article-title>
          .
          <source>Mol Biol Evol</source>
          .
          <volume>33</volume>
          :
          <fpage>13</fpage>
          -
          <lpage>24</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>P. M.</given-names>
            <surname>Mabee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhoff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. M.</given-names>
            <surname>Dahdul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lapp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. E.</given-names>
            <surname>Midford</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. J.</given-names>
            <surname>Vision</surname>
          </string-name>
          , et al. (
          <year>2012</year>
          ).
          <volume>500</volume>
          ,000 fish phenotypes:
          <article-title>The new informatics landscape of evolutionary and developmental skeletal biology</article-title>
          .
          <source>J Applied Ichthy</source>
          <volume>28</volume>
          (
          <issue>3</issue>
          ):
          <fpage>300</fpage>
          -
          <lpage>305</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T. A.</given-names>
            <surname>Dececchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhoff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lapp</surname>
          </string-name>
          ,
          <string-name>
            <surname>P. M. Mabee.</surname>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Toward synthesizing our knowledge of morphology: using ontologies and machine reasoning to extract presence/absence evolutionary phenotypes across studies</article-title>
          .
          <source>Syst Biol</source>
          <volume>64</volume>
          (
          <issue>6</issue>
          ):
          <fpage>936</fpage>
          -
          <lpage>952</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Haendel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhoff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. B.</given-names>
            <surname>Bastian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. C.</given-names>
            <surname>Blackburn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Blake</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Comte</surname>
          </string-name>
          , et al. (
          <year>2014</year>
          ).
          <article-title>Uberon: Unification of multi-species vertebrate anatomy ontologies for comparative biology</article-title>
          .
          <source>J Biomed Semantics. 5:21</source>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>P. E.</given-names>
            <surname>Midford</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. A.</given-names>
            <surname>Dececchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhoff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. M.</given-names>
            <surname>Dahdul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ibrahim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lapp</surname>
          </string-name>
          , et al. (
          <year>2013</year>
          ).
          <article-title>The Vertebrate Taxonomy Ontology: A framework for reasoning across model organism and species phenotypes</article-title>
          .
          <source>J Biomed Semantics</source>
          .
          <volume>4</volume>
          :
          <fpage>34</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Chibucos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Mungall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Balakrishnan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. R.</given-names>
            <surname>Christie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. P.</given-names>
            <surname>Huntley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>White</surname>
          </string-name>
          , et al. (
          <year>2014</year>
          ).
          <article-title>Standardized description of scientific evidence using the Evidence Ontology (ECO)</article-title>
          .
          <source>Database</source>
          .
          <year>2014</year>
          :bau075
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>