<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Reasoning over anatomical homology in the Phenoscape KB</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Paula</forename><forename type="middle">M</forename><surname>Mabee</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Biology</orgName>
								<orgName type="institution">University of South Dakota Vermillion</orgName>
								<address>
									<region>SD</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">James</forename><forename type="middle">P</forename><surname>Balhoff</surname></persName>
							<affiliation key="aff1">
								<orgName type="institution">Renaissance Computing Institute University of North Carolina at</orgName>
								<address>
									<addrLine>Chapel Hill</addrLine>
									<settlement>Chapel Hill</settlement>
									<region>NC</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Wasila</forename><forename type="middle">M</forename><surname>Dahdul</surname></persName>
							<affiliation key="aff2">
								<orgName type="department">Department of Biology</orgName>
								<orgName type="institution">University of South Dakota Vermillion</orgName>
								<address>
									<region>SD</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Hilmar</forename><surname>Lapp</surname></persName>
							<affiliation key="aff3">
								<orgName type="department">Center for Genomic and Computational Biology</orgName>
								<orgName type="institution">Duke University Durham</orgName>
								<address>
									<region>NC</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Christopher</forename><forename type="middle">J</forename><surname>Mungall</surname></persName>
							<affiliation key="aff4">
								<orgName type="department">Environmental Genomics and Systems Biology</orgName>
								<orgName type="institution" key="instit1">Lawrence</orgName>
								<orgName type="institution" key="instit2">Berkeley National Laboratory Berkeley</orgName>
								<address>
									<region>CA</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Todd</forename><forename type="middle">J</forename><surname>Vision</surname></persName>
							<affiliation key="aff5">
								<orgName type="department">Department of Biology</orgName>
								<orgName type="institution">University of North Carolina at</orgName>
								<address>
									<addrLine>Chapel Hill</addrLine>
									<settlement>Chapel Hill</settlement>
									<region>NC</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Reasoning over anatomical homology in the Phenoscape KB</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">46EDA1FEA91279E878F4DA0015C07296</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T05:06+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>homology</term>
					<term>anatomy ontology</term>
					<term>phenotypes</term>
					<term>reasoning</term>
					<term>evolution</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The Phenoscape project (www.phenoscape.org) has semantically annotated the features of species from the comparative literature, enabling links between candidate genes and novel species phenotypes for which they might be responsible. To enable discovery of homologous phenotypes and associated genes, we incorporated machine-reasoning with knowledge about homology into the Phenoscape Knowledgebase (KB). We show that with homology reasoning enabled, the results of database queries can be expanded to incorporate shared evolutionary history. We discuss the challenges in developing a logical model of homology assertions and implications for database queries, as well as theoretical entailment and practical performance tradeoffs between alternative models.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I. INTRODUCTION</head><p>The enormous volume of biological data that has become available to researchers has brought with it a rapidly expanding taxonomic range represented by the data. Because different taxa can possess similar features due to shared ancestry, the incorporation of homology in connecting, aggregating, and analyzing data has become increasingly important. For example, without explicit incorporation of homology, the results of user queries for species phenotypes or candidate genes may be incomplete. The Phenoscape Project <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b1">2]</ref> has semantically annotated the features of species from the comparative literature, enabling links between novel species phenotypes and candidate genes that may underlie them. To enable discovery of homologous phenotypes and associated genes, we incorporated homology reasoning in the Phenoscape Knowledgebase (KB) (kb.phenoscape.org). One of the difficulties in rendering homology knowledge amenable to reasoning is that statements of homology are hypotheses, and in some cases homology assertions regarding the same anatomical structures can be in conflict. Thus, we represent homology assertions separate from a core anatomy ontology as annotations in spreadsheet form with evidence and attribution. The annotations are transformed into OWL axioms according to a model with the desired entailments, and a user can choose whether or not to include hypotheses of homology in reasoning. We explore the ramifications of different logical models of homology and use a series of competency questions to evaluate the performance of each model.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>II. ANNOTATION OF HOMOLOGY ASSERTIONS</head><p>Homology assertions for both historical and serial homology of vertebrate skeletal elements were extracted from the comparative literature for teleost fishes and early sarcopterygians <ref type="bibr" target="#b2">[3]</ref>, and from the developmental genetic literature. We constructed these assertions using anatomy terms from the Uberon anatomy ontology <ref type="bibr" target="#b3">[4]</ref> and taxon terms from the Vertebrate Taxonomy Ontology <ref type="bibr" target="#b4">[5]</ref>, resulting in a total of 98 homology assertions pertaining to skeletal anatomy. Attribution for each homology statement was recorded, and the type of evidence (e.g., positional, developmental) provided by the author supporting or contradicting the homology assertion was annotated with terms from the Evidence and Conclusion Ontology <ref type="bibr" target="#b5">[6]</ref>. The most common type of evidence for or against homology cited by authors in the collection of homology assertions were based on development (27 statements), followed by morphological similarity (26 statements), position (20 statements), and gene expression (14 homology statements). Some author statements (5) cited evidence traceable to a different publication, whereas some (6 statements) did not cite traceable evidence. The collection of homology assertions was incorporated in the Phenoscape KB, which currently contains over 600,000 annotated phenotypes for vertebrate taxa from 185 comparative morphological studies.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>III. HOMOLOGY REASONING MODELS</head><p>We have taken an exploratory approach toward resolving the most effective way to enable machine reasoning on historical and serial homology across anatomical structures. Specifically, we explored the reasoning ramifications of two OWL models of homology that we have developed. In the first model, classes of homologous entities are represented using reciprocal existential property restrictions. In the second model, an OWL individual is introduced that represents the ancestral structure from which all instances of two classes of homologous structures are descended. Using the collection of homology assertions and a sample of fin/limb phenotypes from the KB, we evaluated each model against the expected outcomes for a set of queries formulated as competency questions relevant to research in developmental biology, comparative anatomy, and evolution. We discuss these results and the implementation of homology reasoning in the KB.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>IV. CONCLUSIONS</head><p>We have enabled homology reasoning in the Phenoscape KB, where it allows discovery of homologous structures. With homology reasoning incorporated, computational tools can now access the results of reasoning across evolutionary history. Although the model we select and implement in the KB satisfies basic reasoning, we expect that it can and will be optimized for different purposes, and as computational methods to represent uncertainty evolve.</p></div>			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0">Proceedings of the 9th International Conference on Biological Ontology (ICBO 2018), Corvallis, Oregon, USA</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>ACKNOWLEDGMENT</head><p>We are grateful to the many collaborators who have contributed data and expertise to the Phenoscape Project.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Phenoscape: Identifying Candidate Genes for Evolutionary Phenotypes</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">C</forename><surname>Edmunds</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Su</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">P</forename><surname>Balhoff</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">F</forename><surname>Eames</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">M</forename><surname>Dahdul</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Lapp</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Mol Biol Evol</title>
		<imprint>
			<biblScope unit="volume">33</biblScope>
			<biblScope unit="page" from="13" to="24" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">500,000 fish phenotypes: The new informatics landscape of evolutionary and developmental skeletal biology</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">M</forename><surname>Mabee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">P</forename><surname>Balhoff</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">M</forename><surname>Dahdul</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Lapp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">E</forename><surname>Midford</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">J</forename><surname>Vision</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J Applied Ichthy</title>
		<imprint>
			<biblScope unit="volume">28</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="300" to="305" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Toward synthesizing our knowledge of morphology: using ontologies and machine reasoning to extract presence/absence evolutionary phenotypes across studies</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">A</forename><surname>Dececchi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">P</forename><surname>Balhoff</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Lapp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">M</forename><surname>Mabee</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Syst Biol</title>
		<imprint>
			<biblScope unit="volume">64</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="936" to="952" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Uberon: Unification of multi-species vertebrate anatomy ontologies for comparative biology</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Haendel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">P</forename><surname>Balhoff</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">B</forename><surname>Bastian</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">C</forename><surname>Blackburn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Blake</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Comte</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J Biomed Semantics</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page">21</biblScope>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">The Vertebrate Taxonomy Ontology: A framework for reasoning across model organism and species phenotypes</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">E</forename><surname>Midford</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">A</forename><surname>Dececchi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">P</forename><surname>Balhoff</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">M</forename><surname>Dahdul</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Ibrahim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Lapp</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J Biomed Semantics</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="page">34</biblScope>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Standardized description of scientific evidence using the Evidence Ontology (ECO)</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">C</forename><surname>Chibucos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">J</forename><surname>Mungall</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Balakrishnan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">R</forename><surname>Christie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">P</forename><surname>Huntley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>White</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Database</title>
		<imprint>
			<biblScope unit="page">75</biblScope>
			<date type="published" when="2014">2014. 2014</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
