<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">FoodOn: A Semantic Ontology Approach for mapping Foodborne Disease Metadata</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Dalia</forename><forename type="middle">A</forename><surname>Alghamdi</surname></persName>
							<email>dalia.alghamdi@bccdc.ca</email>
						</author>
						<author>
							<persName><forename type="first">Damion</forename><forename type="middle">M</forename><surname>Dooley</surname></persName>
						</author>
						<author>
							<persName><forename type="first">Gurinder</forename><surname>Gosal</surname></persName>
						</author>
						<author>
							<persName><forename type="first">Emma</forename><forename type="middle">J</forename><surname>Griffiths</surname></persName>
						</author>
						<author>
							<persName><forename type="first">Fiona</forename><forename type="middle">S L</forename><surname>Brinkman</surname></persName>
						</author>
						<author>
							<persName><forename type="first">William</forename><forename type="middle">W L</forename><surname>Hsiao</surname></persName>
						</author>
						<author>
							<affiliation key="aff0">
								<orgName type="department">BC Center for Disease Control</orgName>
								<address>
									<addrLine>655 W 12th Ave</addrLine>
									<postCode>V5Z 4R4</postCode>
									<settlement>Vancouver</settlement>
									<region>BC</region>
									<country key="CA">Canada</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff1">
								<orgName type="institution">University of British Columbia</orgName>
								<address>
									<addrLine>2329 West Mall</addrLine>
									<postCode>V6T 1Z4</postCode>
									<settlement>Vancouver</settlement>
									<region>BC</region>
									<country key="CA">Canada</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff2">
								<orgName type="institution">University</orgName>
								<address>
									<addrLine>8888</addrLine>
									<postCode>V5A 1S6</postCode>
									<settlement>Burnaby</settlement>
									<region>BC</region>
									<country key="CA">Canada</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">FoodOn: A Semantic Ontology Approach for mapping Foodborne Disease Metadata</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">BBFF5068F738D27102988CF38C7C2A85</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T02:13+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The FoodOn Food Ontology contains standardized terms and a facetbased classification scheme for describing food products, processing and environments. Mapping of foodborne pathogen isolate source information (descriptors of the contaminated materials and locations) to the FoodOn standard can facilitate data sharing and integration between multijurisdictional health and regulatory agencies utilizing disparate software platforms and data dictionaries. Faster and more efficient sharing of information is critical for tracking and controlling outbreaks of foodborne disease at local, national and international levels. This work describes mapping procedures which can be utilized by organizations and software developers to better enable interoperability between foodborne pathogen surveillance and outbreak management systems.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>INTRODUCTION</head><p>Globalization of food networks increases opportunities for the spread of foodborne pathogens beyond borders and jurisdictions, with major impacts on global health and economies <ref type="bibr" target="#b0">(Altekruse &amp; Swerdlow et al.,1996;</ref><ref type="bibr">World Health Organization, 2008)</ref>. Whole genome sequencing (WGS) provides the highest resolution evidence for identifying, typing and matching foodborne pathogen isolates from different sources. WGS results must be combined with source information to be meaningfully interpreted for regulatory and health interventions, outbreak investigation, and risk assessment. Isolate metadata (source of a pathogen) is critical for determining mode of disease transmission, sources of exposure and risk, susceptible populations, geographical distribution and more. Public health and regulatory agencies not only use different analytical platforms to track and resolve outbreaks, but implement different data dictionaries and free text descriptions for describing isolates and exposures. The most important factor in reducing the number of preventable cases of disease is timeliness of investigations and responses, which is negatively impacted by the timeconsuming re-coding and manual curation required for translating non-standardized information between systems and agencies. To address the interoperability problem, it is important to relate similar concepts or relations from one agency, information management system or jurisdiction, to another. Mapping terms using an ontology represents a very powerful solution for standardizing and integrating heterogeneous data.</p><p>FoodOn (http://foodon.org) is an ontology resource that aims to model the food domain, which includes knowledge about food and food-related human activities, such as agriculture, medicine, food safety inspection, shopping patterns and sustainable development <ref type="bibr" target="#b2">(Griffiths et al., 2016)</ref>. Mapping of foodborne pathogen isolate metadata to FoodOn can provide a means for standardizing, translating, and communicating this critical contextual information between health agencies and platforms in a timely fashion. Here we describe a semi-automated method derived from mapping metadata from the widely used online microbial MLST typing platform Enterobase, which can be broadly applied to other use cases.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>FOODON DESIGN PRINCIPLES</head><p>Although there are several existing indexing systems directly or indirectly related to food and food-borne illness, including those maintained by Health Canada, the US Department of Agriculture, and the UN's Food and Agriculture Organization, they have been built for different purposes and so differences in their architecture hinder interoperability. To provide a more comprehensive view of food safety, data from these various sources must be integrated. In a concerted effort to solve this semantic interoperability problem, the OBOFoundry.org family of ontologies was established in 2007 in order to provide a comprehensive set of vocabularies in the biomedical domain. FoodOn, built largely on a longstanding American and European facetbased food indexing system called LanguaL (http://langual.org), provides a list of over 2,000 plant and animal food ingredient terms, as well as a supplemental list of over 9,000 indexed food products. Facets include fields for describing food processing, cooking and preservation, as well as source ingredient anatomy, taxonomy, geography and cultural heritage. The aim of FoodOn is to develop an international standard for describing properties of food related to agriculture, animal husbandry, collection, distribution, preservation, culinary use, consumption and food safety. FoodOn was accepted into the OBOFoundry in 2017.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>FOODON MAPPING AND DATA HARMONIZATION</head><p>Microbial Multilocus Sequence Typing (MLST) is a technique used to classify and identify pathogenic strains for outbreak investigation and surveillance of contamination.</p><p>Enterobase is a widely used online platform enabling MLST analysis of enteric pathogens such as E. coli, Salmonella, Shigella, Yersinia and Moraxella. Enterobase contains &gt;100,000 genomes, along with their source metadata, encompassing food, anatomical and clinical, as well as environmental domains. Based on curation and mapping of Enterobase isolate metadata, we have developed a semi-automated ontology mapping system that will enable mapping of food safety metadata to FoodOn food products and processing environments according to the following steps: (1) Syntactic analysis, where each categorical term in a single free text entry will be separated. (2) Semantic mapping of each categorical term according one of the following rules: (a) mapping to similar concept; (b) mapping to similar ancestors; (c) mapping to similar relations; (d) combining several matching techniques. (3) structural mapping, where the items are mapped to a corresponding subclass in the reference hierarchy. Non-interactive ontology matching tool can be evaluated using recall and precision (J. <ref type="bibr">Euzenat and P. Shvaiko., 2010)</ref>. They are measured based on comparing the expected results with the results of the evaluated system. Precision measure the ration of the correct matched terms over the total number of the matched terms, on the other hand, recall measures the ratio of the correct matched of the total number of expected terms to be matched. Logically, one can say that the precision can evaluate the correctness of the evaluated system and recall can evaluate the completion of the evaluated system <ref type="bibr" target="#b2">(Euzenat., 2007)</ref>. Here, we will evaluate our mapping accuracy by sub-sampling randomly a set of 500 records from the Enterobase and evaluate the suitability of the terms assigned using recall and precision methods. We will further manually review all the terms that cannot be manually mapped to an ontological term and add the terms to appropriate ontologies. The goal of our exercise is to minimize manually intervention when annotating food-related sample sources using FoodOn. Integrating genomic profiling of foodborne pathogens with a food descriptor framework will help reduce barriers for knowledge exchange among research communities, government risk managers and health providers.</p></div>		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>ACKNOWLEDGEMENTS</head><p>We thank Mark Achtman, Nabil-Fareed Alikhan, Mark Pallen, Martin Sergeant, Zhemin Zhou for sharing the Enterobase.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">The changing epidemiology of foodborne disease</title>
		<author>
			<persName><forename type="first">S</forename><surname>Altekruse</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Swerdlow</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Am J Med Sci</title>
		<imprint>
			<biblScope unit="volume">311</biblScope>
			<biblScope unit="page" from="23" to="29" />
			<date type="published" when="1996">1996</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m">World Health Organization, Foodborne disease outbreaks: guidelines for investigation and control</title>
				<meeting><address><addrLine>Geneva</addrLine></address></meeting>
		<imprint>
			<publisher>WHO Press</publisher>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Semantic precision and recall for ontology alignment evaluation</title>
		<author>
			<persName><forename type="first">E</forename><surname>Griffiths</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Dooley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">L</forename><surname>Buttigieg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Hoehndorf</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Brinkman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Hsiao</surname></persName>
		</author>
		<author>
			<persName><surname>Jérôme Euzenat</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 20th international joint conference on Artifical intelligence (IJCAI&apos;07)</title>
				<editor>
			<persName><forename type="first">Rajeev</forename><surname>Sangal</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Harish</forename><surname>Mehta</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><forename type="middle">K</forename><surname>Bagga</surname></persName>
		</editor>
		<meeting>the 20th international joint conference on Artifical intelligence (IJCAI&apos;07)<address><addrLine>San Francisco, CA, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Morgan Kaufmann Publishers Inc</publisher>
			<date type="published" when="2007">2016. 2007. 2007</date>
			<biblScope unit="page" from="348" to="353" />
		</imprint>
	</monogr>
	<note>Ontology Matching</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
