<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">OMOP-2-OPMI: Ontologization of OMOP CDM using OPMI to support clinical data interoperability and analysis</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Long</forename><surname>Tran</surname></persName>
							<email>longtr@umich.edu</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Michigan</orgName>
								<address>
									<settlement>Ann Arbor</settlement>
									<region>MI</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Yongqun</forename><surname>He</surname></persName>
							<email>yongqunh@med.umich.edu</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Michigan</orgName>
								<address>
									<settlement>Ann Arbor</settlement>
									<region>MI</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">OMOP-2-OPMI: Ontologization of OMOP CDM using OPMI to support clinical data interoperability and analysis</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">EB3F240B282858A3CFAD0D6CE5B29ACA</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:04+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>OMOP</term>
					<term>Common Data Model</term>
					<term>ontology</term>
					<term>OPMI</term>
					<term>adverse events</term>
					<term>COVID-19</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The OMOP Common Data Model (CDM) has been widely used as an open community data standard in observational data integration and analysis. However, it still has its drawbacks including weak semantics and interoperability with other CDMs. In this study, we report our ontologization of the OMOP CDM elements and the semantic relations among the elements using the Ontology of Precision Medicine and Investigation (OPMI). A total of 165 terms from 15 OMOP CDM tables has been mapped to OPMI, with 46 terms newly generated with OPMI namespace and the other terms reported from OBO reference ontologies. An Omop2Opmi.owl file was also generated by extracting the OMOP CDM related terms and relations from OPMI. Three categories of use cases are reported, using the ontology-level OMOP CDM element standardization and data integration, adverse event (AE) modeling, and COVID-19 clinical data studies. Following the Ontology of Adverse Events (OAE) definition, we developed a generalizable OMOP-AE model that transforms the OMOP data to systematically define, identify, and analyze specific adverse events following some medical interventions that include Drug/Device Exposure and Procedure Occurrence in OMOP. Overall, OMOP-2-OPMI complements and empower OMOP CDM for enhanced clinical data standardization, sharing, interoperability, and analysis.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) is an open community data standard that aims to allow for systematic analysis of disparate observational databases <ref type="bibr" target="#b0">[1]</ref>. With the CDM, the data contained in those databases can be transformed into a common format with a common representation. OMOP CDM has been widely used to support the standardization of various electronic medical records (EMR) and administrative claims within and outside the United States. Billions of patient records have been standardized using OMOP CDM. Recently, OMOP CDM has become an established data CEUR Workshop Proceedings (CEUR-WS.org) model used by the National COVID Cohort Collaborative (N3C, https://ncats.nih.gov/n3c). As of May 2022, the N3C data enclave has stored the records of 14 million persons, including over 5 million COVID+ cases. Based on the N3C data use design, the COVID-19 clinical data warehouse data dictionary used in N3C is based on OMOP CDM, and the other data formats need to be aligned with the OMOP CDM in order to be entered and used in the N3C data enclave. Therefore, the OMOP CDM has clearly played a significant role in the data standardization and integration.</p><p>Still the OMOP CDM has its own drawbacks <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b2">3]</ref>. One drawback is its weak semantics in that OMOP CDM does not provide robust semantic relations among CDM elements. Basically, the OMOP CDM provides the schema structure of a standardized a relational database that includes over 10 tables, which has an inherent weakness in terms of representing the relations among terms from different tables. As a result, the layout of OMOP and how it is set up to document patients' conditions could lead to ambiguities, inaccurate representations and erroneous counting <ref type="bibr" target="#b1">[2]</ref>. Another drawback is that OMOP CDM does not inherently provide systematic interoperability with other CDMs such as National Patient-Centered Clinical Research Network (PCORnet) <ref type="bibr" target="#b3">[4]</ref> and Clinical Data Interchange Standards Consortium (CDISC) <ref type="bibr" target="#b4">[5]</ref>. In the N3C data integration, the COVID-19 data formulated with other CDMs are required to be harmonized based on OMOP CDM version 5.3 <ref type="bibr" target="#b5">[6]</ref>, which is separately conducted and difficult to achieve robust interoperability and scalability.</p><p>Ontology can be a solution to solve the above drawbacks <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b6">7]</ref>. In the 2018 OHDSI Symposium, we proposed a strategy of ontological representation of the OMOP CDM using the OBO framework <ref type="bibr" target="#b2">[3]</ref>. In addition to the core OMOP CDM model, the OMOP system also includes many standardized clinical terminologies that can be used under the OMOP CDM framework to collaboratively support observational data standardization and integration. In the 2020 OHDSI Symposium, Callahan et al. reports their development of the OMOP2OBO, a health system-wide program of the integration and alignment between OMOP's standardized clinical terminologies and eight OBO biomedical ontologies spanning diseases, phenotypes, anatomical entities, cell types, organisms, chemicals, metabolites, hormones, vaccines, and proteins <ref type="bibr" target="#b6">[7]</ref>. As of the end of May 2022, the OMOP2OBO mapping program has collected 92,367 OMOP Conditions, 8,615 Drug Exposure ingredients, and 3,827 Measurements (10,673 measurement test results) terms <ref type="bibr" target="#b7">[8]</ref>. OMOP2OBO allows its users to construct their own sets of omop2obo mappings.</p><p>Among &gt;100 ontologies in the Open Biomedical Ontology (OBO) library, the Ontology of Precision Medicine and Investigation (OPMI) is an ontology in the domain of precision medicine and investigation <ref type="bibr" target="#b8">[9,</ref><ref type="bibr" target="#b9">10]</ref>. Following the OBO ontology principles (e.g., openness and collaboration, OPMI reuses many terms of existing reference ontologies and include many of its own terms in the field of clinical and translational precision medicine, supporting nonredundant and interoperable ontology development <ref type="bibr" target="#b10">[11]</ref>. OPMI has been developed and used to support the Kidney Precision Medicine Project <ref type="bibr" target="#b8">[9,</ref><ref type="bibr" target="#b9">10]</ref>. We have been using the OPMI to model and represent the core OMOP CDM elements and relations among the elements <ref type="bibr" target="#b2">[3]</ref>.</p><p>This manuscript reports our usage and extension of the OPMI to ontologize the OMOP CDM elements and the relations among these elements, and how such OMOP-2-OPMI ontologization supports systematic clinical data interoperability, sharing, and integration.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Methods</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">OMOP CDM resource used in the study</head><p>The OMOP version 5.4 was used in our OPMI mapping. First, we obtained terms and their annotations from the OMOP CDM version 5.4 resource <ref type="bibr" target="#b11">[12]</ref>. The Athena software program (https://athena.ohdsi.org/) is the tool used to search OMOP CDM terms and related terms from OMOP-associated terminologies.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">OMOP-2-OPMI development strategy</head><p>The OPMI ontology is used as the default ontology platform for the ontology mapping and new term generation of the OMOP CDM elements and semantic relations among the elements. In general, the eXtensible Ontology Development (XOD) strategy <ref type="bibr" target="#b12">[13]</ref>, including the methods of ontology term reuse, semantic alignment, ontology design pattern, and community extensibility, were used for the OPMI mapping. Specifically, all those OMOP CDM element terms were first searched in Ontobee <ref type="bibr" target="#b13">[14]</ref>. For those terms existing in reference OBO ontologies that map to the OMOP CDM elements, Ontofox <ref type="bibr" target="#b14">[15]</ref> was used to import those terms to OPMI (if the import has not been done before). For those OMOP elements that cannot be mapped to any OBO reference ontology, we generated new terms and defined them with OPMI namespace based on specific ontology design patterns. The OPMI ontology editing was performed using Protege-OWL editor <ref type="bibr" target="#b15">[16]</ref>, and the ontology reasoning was conducted using the Hermit reasoner <ref type="bibr" target="#b16">[17]</ref>. All the terms are aligned under the upper-level Basic Formal Ontology (BFO) <ref type="bibr" target="#b17">[18]</ref>. Meanwhile, we have discussed our project design in different scenarios, and community feedback and comments were obtained to adjust our definitions and design.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Download and license</head><p>The OMOP-2-OPMI GitHub web page is: https://github.com/OPMI/OMOP-2-OPMI. The source code of the Omop2Opml.owl file is openly available at this GitHub website for downloading. The OWL file is generated primarily by extracting the OMOP CDM-related terms and associated relations from the OPMI using Ontofox <ref type="bibr" target="#b14">[15]</ref>. Considering the usage of OPMI as the platform for the OMOP CDM mapping, the OMOP-2-OPMI source page is designated as a repository under the general OPMI organization in GitHub.</p><p>Meanwhile, the OMOP-2-OPMI repository has also stored related data files including our cleanup spreadsheets of the mapping details available at: https://github.com/OPMI/OMOP-2-OPMI/tree/main/docs.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4.">Use case studies</head><p>Three use cases are developed and discussed in this study. Specifically, the first use case is about the OMOP data standardization and inference. The second use case is the development of an adverse event model based on the OMOP CDM logic and available data formats. The third use case is the usage of OMOP-2-OPMI to study N3C COVID-19 related clinical data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Results</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">General OMOP CDM ontologization architecture</head><p>Figure <ref type="figure" target="#fig_0">1</ref> represents the hierarchical structure of the OMOP-2-OPMI, which is the ontologization of the OMOP CDM using the OPMI as the ontology platform. Specifically, all the terms are aligned under the Basic Formal Ontology (BFO) <ref type="bibr" target="#b17">[18]</ref>, an ISO-approved upper level ontology <ref type="bibr" target="#b18">[19]</ref>. BFO includes two branches: continuants and occurrents. Continuants cover time-independent entities including material entities, quality, realizable entities such as disposition, and information content entities. Occurrents are time-dependent entities including temporal region and processes. All the OMOP CDM elements can be categorized under these two categories (Figure <ref type="figure" target="#fig_0">1</ref>). BFO has been used by over 300 ontologies. The alignment with BFO allows us to integrate our ontology with the large number of other ontologies, supporting data interoperability. Figure <ref type="figure" target="#fig_1">2</ref> is a simplified high level OMOP-2-OPMI ontology design pattern (ODP) that covers the major elements in 11 OMOP tables. Specifically, the person (usually here it refers to patient in OMOP) is centric to the ODP. The person participates in five medical occurrences (i.e., visit/condition/procedure occurrences, and drug/device exposure) and the observation process, which are all under BFO:process (Figure <ref type="figure" target="#fig_0">1</ref>). The observation happens during a specific observation period. The person is also the target of measurement. A specimen derives from some organ or tissue of the person. The person has different phenotypes, and death is a specific phenotype (Figure <ref type="figure" target="#fig_1">2</ref>). </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">OMOP-2-OPMI statistics</head><p>A total of 165 terms from 15 OMOP CDM tables has been mapped to OPMI, with 46 terms newly generated with OPMI namespace and the other terms reported from OBO reference ontologies. In addition to the 11 tables listed in Figure <ref type="figure" target="#fig_1">2</ref>, the other four tables are Care Site, Payer Plan Period, Episode, and Location, which are not included in Figure <ref type="figure" target="#fig_1">2</ref> to simplify that figure. Table <ref type="table" target="#tab_0">1</ref> lists ontology mapped CDM element terms from 10 representative OMOP tables.</p><p>Our current mapping primarily covers those clinical data tables and health system data tables. We have not yet included the Metadata Tables, Vocabulary Tables, Standardized Derived Tables except for Episode, and the Cost table which belongs in the Health Economics Data Tables category. These missing tables do not directly involve clinical investigation, which is our current focus. Also as shown in Table <ref type="table" target="#tab_1">2</ref>, many terms are not mapped to ontology. Most of these missing terms are various "source value" or source concept ID terms. Throughout OMOP CDM, there are similar terms representing various source concepts and source values. In the OMOP structure, a source concept set organizes terms into groups called source value sets. A value set (e.g., 'procedure_source_value') is a set of codes whose context and usage are defined by one or more code systems in which the clinical data came from. However, the organization of value sets is not often ontology-based. In most cases, we have decided to not incorporate terms for "source concept" and "source value" sets until we figure out a place for these terms to make sense ontologically within OPMI. In our ontologization, we have also included specific source value terms as seen in Table <ref type="table" target="#tab_0">1</ref> and detailed later in the manuscript. Note: *13/19 represents that 13 out of 19 OMOP CDM terms in the specific category have been mapped to terms in the OPMI ontology. The unmapped terms are primarily those terms related to "source value". More terms in the visit/condition occurrences are mapped because some specific source value terms are ontologized.</p><p>In addition to source values or source concept IDs, there are also many terms in OMOP CDM not yet ontologized. The reasons of such imcompleteness include the lack of necessity of many terms, and the complexity of many other terms in terms of ontology modeling. We will continue this work later, ideally by involving more collaboration and discussion with the ontology and clinical informatics communities. Next we will focus on a few major ontology modeling topics to show how we model and ontologize the OMOP CDM elements.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.">Ontologization of OMOP medical occurrences</head><p>By examining the OMOP CDM elements, we found that five OMOP tables can be categorized under an ontology class called 'medical occurrence', which is defined as a process event that a patient experiences over a period of time (Figure <ref type="figure" target="#fig_2">3</ref>). These five OMOP tables are: 'condition occurrence', 'device exposure', 'drug exposure', 'procedure occurrence', and 'visit occurrence' (Figure <ref type="figure" target="#fig_2">3</ref>). In two of the five OMOP tables, Visit Occurrence and Condition Occurrence, in addition to mapping the elements in original tables (Table <ref type="table" target="#tab_0">1</ref>), we also added some terms from the supporting OMOP vocabularies for developing a complete semantic model. In the case of Visit Occurrence, the extra terms are due to the ontologization of 11 types of visit occurrences (e.g., 'emergency room visit', 'home visit') that are originally not defined in OMOP's CDM model and instead are from the supporting OMOP vocabularies identified on the Athena program. We have ontologized such terms under 'visit occurrence' (OPMI) (Figure <ref type="figure" target="#fig_2">3</ref>). These terms represent the overarching types of encounters between a person and the healthcare system, which are adopted in most healthcare systems worldwide.</p><p>In the case of Condition Occurrence, the extra 22 terms come from the incorporation of medical condition statuses (e.g., 'admission diagnosis', 'cause of death', and 'confirmed diagnosis'), which were defined by OMOP and searchable in Athena. In OMOP, a medical condition status denotes the stages of a patient's diagnosis, not the actual state of the disease by itself. OPMI represents these medical condition statuses in two strategies. First, OPMI includes a term called 'medical condition status' under the 'status' term, which is a subclass of BFO:'realizable entity'. In this classification, a medical diagnosis status, such as admission diagnosis, represents a patient diagnosis status such as the status of diagnosis at the time when the patient is admitted to the hospital.</p><p>We have also adopted the OGMS:diagnosis classification and defines various diagnosis types under the OGMS:diagnosis (Figure <ref type="figure" target="#fig_4">4</ref>).  Meanwhile, the diagnosis clinical data type vs the diagnosis medical condition status are closely related. In OPMI, we propose to generate a relation term called 'has status content', which represents a relation between a status and an information content entity where the status has its content information defined by the information content entity. For example, we can define an axiom that links a diagnosis status to a diagnosis data item:</p><p>'</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>admission diagnosis status': 'has status content' some 'admission diagnosis'</head><p>However, such duplicated representation may not be needed. It is possible to just define 'admission diagnosis status' and remove the term 'admission diagnosis'. We will examine more use cases and discuss with the ontology and medical informatics communities on this regard.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4.">Ontologization of temporal date/time in OMOP</head><p>To ontologically represent various entities denoting time that can be found throughout OMOP, we have mapped 24 temporal terms from 6 tables. The OMOP tables that have temporal terms ontologized are Visit Occurrence, Device Exposure, Drug Exposure, Procedure Occurrence, Condition Occurrence, and Person. For all tables but Person, the entities are ontologized with temporal terms for -start date, -start datetime, -end date, and -end datetime. Meanwhile, temporal terms related to the Person table are instead ontologized with more familiar terms which are 'birth datetime', 'day of birth', 'month of birth', and 'year of birth'. All temporal terms are grouped under a higher level term for a better organizational purpose (e.g., 'visit start date/datetime', 'end date/datetime' are all grouped under 'visit temporal region') (Figure <ref type="figure" target="#fig_5">5</ref>). </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.5.">Ontologization of entity identifiers in OMOP CDM</head><p>In OMOP, fields with the suffix "_id_" usually denote identifiers, which function as primary keys in their respective OMOP tables along with other supporting entities (e.g., person_id in Person table). These identifiers can also be used as foreign keys to connect other related OMOP tables (e.g., person_id to connect Provider and Care Site tables).</p><p>OPMI has ontologized OMOP CDM related identifiers under the class of 'centrally registered identifier', a subclass under 'information content entity'. Example identifiers defined include 'person ID', 'care site identifier', 'clinical measurement identifier', 'DEA identifier' and 'National Provider Identifier'. These identifiers identify assets belonging to different but centrally registered local databases.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.6.">Ontologization of provenance records in OMOP</head><p>In OMOP, most entities from various tables have their own "type_concept" terms, which indicate the provenance, or the source of the record in which it comes from. For instance, drug exposure entries could come from either prescriptions list or self-reported by patients, the provenance of which can differ from a patient's measurement records.</p><p>In OPMI, type_concepts are mapped as various terms under 'provenance of record', a class under 'information content entity'. So far, we have generated 12 terms for the provenance of records for 12 corresponding entities of OMOP CDM tables. The provenance of records is dedicated for each corresponding OMOP entity since the sources of the entries can vary across different fields.</p><p>Meanwhile, OPMI also defines most of the records for the OMOP provenance purposes under 'electronic health record', such as 'electronic medical visit record,' 'electronic death record,' 'electronic device record,' etc. (Figure <ref type="figure" target="#fig_6">6</ref>). The users can choose the usage of these electronic health records as the sources of the data collected to the OMOP database. Note that not all the provenance records are electronic health records (EHR). For example, in addition to the record from an EHR system, the measurement record might also come from an insurance claim, registry, or other sources. Next, we will focus on the description of three use cases of the OMOP-2-OPMI approach.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.7.">Use case 1: Ontologylevel data standardization</head><p>The first use case is rooted in the nature of ontology. As an open access ontology following the OBO ontology development principles, OMOP-2-OPMI provides the standard representation and definitions of the OMOP CDM mapped terms and the axioms among these terms. The OMOP-2-OPMI ontology terms can be used to support standardized clinical data representation and annotation. The semantic relations among the OMOP CDM terms and their associated other terms provide solid semantic associations, which addresses the OMOP CDM drawback of weak semantics.</p><p>The ontologized terms are also interoperable. For example, the Coronavirus Infectious Disease Ontology (CIDO), a biomedical ontology in the domain of coronavirus diseases [20], has imported the OMOP-2-OPMI ontology contents. The contents of OMOP-2-OPMI fit seamlessly with the other CIDO contents, providing another demonstration of the ontology-supported knowledge and data interoperability, sharing, and integration. It is also possible to use the some ontology terms for mapping to the other CDMs such as PCORnet <ref type="bibr" target="#b3">[4]</ref> and CDISC <ref type="bibr" target="#b4">[5]</ref>, which will be explored in the future.</p><p>Such interoperable ontology representation also supports data and knowledge inferencing. This is also rooted from the nature of ontology. The following two other use cases provide such demonstrations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.8.">Use case 2: Adverse event modeling and analysis</head><p>Another use case of the OMOP CDM ontologization is the modeling of adverse events (AEs) post medical intervention. The OMOP CDM does not include AE per se. However, by specific modeling, we can find the OMOP CDM data can be processed to support specific AE identification and analysis.</p><p>Figure <ref type="figure" target="#fig_7">7</ref> is a general OMOP-AE ontology design pattern, which follows the AE definition by the Ontology of Adverse Events (OAE) <ref type="bibr" target="#b19">[21]</ref>. According to the OAE, an adverse event (AE) is a pathological bodily process that occurs following some medical intervention <ref type="bibr" target="#b19">[21]</ref>. In order to model AEs with OMOP data, we need to identify the medical intervention vs. adverse events to be mapped in OMOP. By examining all the five medical occurrence types defined in OMOP, only three of them are considered as medical interventions: Drug Exposure, Device Exposure, and Procedure Occurrence (e.g., surgical procedure). Vaccination can be considered as a special drug exposure.</p><p>Note that the visit occurrence and condition occurrence are regarded as natural occurrence events without medical intervention. Based on the AE definition, contracting a natural infection is not an AE since the patient does not receive an adverse outcome after a medical intervention. However, the condition occurrence may include conditions of different phenotypes that are the outcomes of specific adverse events (Figure <ref type="figure" target="#fig_7">7</ref>). Our original OPMI conference proceeding paper presented a use case study of identifying and analyzing the acute kidney injury (AKI) AE following heart surgery <ref type="bibr" target="#b8">[9]</ref>. Using OHDSI data provided by the IQVIA Pharmetric Plus database, our OHDSI cohort study identified a total of 15,548 patients that fulfilled our predefined model of AKI AE following heart surgery. Specific patterns were identified. For example, 72% of the identified patients were male and 28% were female patients. Over 78% of these AE cases occurred in patients aged greater than 55 years old. Many phenotypes, such as coronary arteriosclerosis, kidney disease, pain, dyspnea, hyperlipidemia, and Type II diabetes, were found in these patients as well <ref type="bibr" target="#b8">[9]</ref>.</p><p>Our OMOP AE model is a very general model in that it can be used to study specific adverse event profiles following various medical interventions including different drug/medicine exposure and procedure occurrence. We are currently applying such a strategy to design a pattern for identifying and analyzing the vaccine and drug AEs in COVID-19 patients using the N3C data. Note that if a patient contracted COVID-19 in a natural environment, the patient has a condition, which is not an adverse event (because an AE is always associated with medical intervention). However, the occurrence of new phenotypes after medical treatment on these COVID-19 patients are considered AEs.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.9.">Use case 3: COVID-19 clinical data standardization, modeling, and analysis</head><p>In addition to the import of the OMOP-2-OPMI to CIDO and the study of COVID-19 associated AE modeling and analysis as described above, we are also applying the OMOP-2-OPMI for more COVID-19 clinical data modeling and analysis. Two data resources for our OMOP-2-OPMI based studies are the literature reports and N3C clinical data.</p><p>One specific use case is the study of the relation between the COVID-19 infection and the increased risk for kidney diseases. For example, acute kidney injury (AKI) is a significant complication of COVID-19. The incidence of AKI in hospitalized patients varies from 0.5% to 75%. The mortality rate for patients with kidney disease is also significantly higher than the general infected population. However, the big variation of AKI incidence in COVID-19 patients appears to depend on many factors such as race, region, and disease severity. The N3C cohort data is being used to detect, compare, and analyze the occurrences of kidney disease following COVID-19 infection. The OMOP-2-OPMI model, together with the OMOP2OBO, can be used to support data modeling, integration, and analysis. The integrated data can also be further used for machine learning tool development for kidney disease prediction following COVID-19 prediction. We have registered for an N3C program to perform related research.</p><p>Another use case in this category is the application of OMOP-2-OPMI and CIDO for secondary literature data analysis and knowledge representation. There have been a big number of COVID-19 studies reported in the literature, many of which involve the usage of OMOP CDM model. For example, one study examined the association between immune dysfunction and COVID-19 breakthrough infection after SARS-CoV-2 vaccination in the US using N3C data <ref type="bibr" target="#b20">[22]</ref>. The N3C data and the results out of the data analysis can both be modeled, annotated, and represented using ontology including our OMOP-2-OPMI and CIDO.</p><p>The above two studies are currently ongoing and we expect to have more specific results available in near future.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Discussion</head><p>This manuscript has made two main contributions. First, we report our systematic survey and ontologization of the OMOP CDM elements using the OPMI ontology. The Omop-2Opmi.owl file is the OWL file that includes only the OMOP CDM-related ontology terms, their directly associated terms (e.g., their parent terms), and the semantic relations between these terms that are presented as ontology axioms. Second, we presented three categories of use cases of our OMOP CDM ontologization, including ontologylevel OMOP CDM element standardization and inferencing, adverse event modeling and analysis, and COVID-19 clinical data studies. Overall, our systematic ontologization of the OMOP CDM complements and empowers the OMOP CDM system, providing a new way of supporting systematic clinical data interoperability, sharing, and integration.</p><p>A similar and related system is OMOP2OBO, a systematic mapping tool that maps OMOP related terms to OBO ontologies <ref type="bibr" target="#b6">[7]</ref>. The terms mapped in OMOP2OBO cover 8 OBO ontologies, including Cell Ontology (CL), ChEBI chemical entity ontology, Human Phenotype Ontology (HP), MONDO disease Ontology, NCBI Taxonomy Ontology (NCBITaxon), Protein Ontology (PR), Uberon anatomy ontology, and Vaccine Ontology (VO). While OMOP2OBO includes the mapping of over 100,000 terms in the OMOP terminology system, it does not cover the OMOP CDM elements in the over 10 basic OMOP tables. Instead, OMOP-2-OPMI focuses on the core OMOP CDM level mapping and representation. In addition to ontology term mapping, since many high level terms in OMOP CDM are not yet represented in OBO ontologies, we have taken extensive effort to generate many new terms in OPMI. We have also generated ontological relations among these OMOP CDM elements using the OPMI ontology platform. Overall, OMOP2OBO and OMOP-2-OPMI are complementary in that they map and integrate OMOP data from different aspects.</p><p>There are still many issues to consider in our ontologization. For example, we presented two types of methods for representing medical condition statuses and two types of methods of representing provenance records in our work. Since most medical condition statuses are different types of diagnosis, such status representations can be defined under "status", which is defined as a BFO:'realizable entity', or under OGMS:diagnosis, which is basically a type of clinical data item. Similarly, for the provenance records, they can be represented under provenance itself or under electronic health record. The ICBO-2022 conference will provide us a discussion platform to discuss the pros and cons of different representation styles.</p><p>Several use cases are introduced in this article. We demonstrated the development of a new OMOP-based adverse event model based on the OMOP CDM data structure. Such an OMOP-AE model can be used to support various specific AE studies, including the modeling of adverse event cases post COVID-19 vaccination (or drug admin) using N3C data. In addition to the AKI AE study following heart surgery <ref type="bibr" target="#b8">[9]</ref>, we are currently applying the OMOP AE model for more COVID-19 related AE studies. Furthermore, we can develop new models to apply OMOP CDM to study other such as long COVID and the effects of different variables to the disease outcomes.</p><p>One future project is to map the CDM terms from other systems, including PCORnet <ref type="bibr" target="#b3">[4]</ref> and CDISC <ref type="bibr" target="#b4">[5]</ref>, to the OPMI ontology using the same OMOP-2-OPMI development strategy. These different CDMs are overlapped. For example, There are similarities between the organizations of OMOP and PCORnet CDMs, evidenced by the overlaps of certain tables such as Demographic, Procedures, or Condition <ref type="bibr">[23]</ref>. When all these CDM elements and relations are mapped to the same OPMI structure, we can integrate all the data using different CDMs, leading to compatible and interoperable clinical and observational data standardization and integration. A recent study reports the development of an ETL tool for converting the PCORnet CDM into OMOP CDM to facilitate the COVID-19 data integration <ref type="bibr" target="#b22">[24]</ref>. It is possible to apply our ontology approach to enhance such an ETL tool.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: OMOP-2-OPMI top level hierarchical structure and representative terms. Ontology names are highlighted with different colors. Ontology-mapped OMOP terms are also provided.</figDesc><graphic coords="3,72.00,454.84,451.00,180.10" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: General ontology design pattern that links CDM elements from 11 OMOP tables. Note one box covers five OMOP occurrence/exposure tables. Mapped ontology terms are also labeled.</figDesc><graphic coords="4,72.00,106.50,207.50,139.10" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Modeling of 5 medical occurrence categories and 11 specific visit occurrences.</figDesc><graphic coords="5,330.17,211.67,167.45,169.85" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head></head><label></label><figDesc>According to the Ontology for General Medical Science (OGMS), diagnosis (OGMS_0000073) is a subclass of clinical data item and represents the conclusion of a diagnostic process. Based on the OMOP classification, OPMI has defined different categories of diagnosis, including 'admission diagnosis', 'primary diagnosis', 'secondary diagnosis', and 'death diagnosis', etc. (Figure 4). These specific diagnosis types are commonly used at the clinical setting. The classification of these diagnosis types facilitates the clinical data annotations.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: Modeling of different medical diagnosis under the OGMS:diagnosis, which is a subclass of clinical data item.</figDesc><graphic coords="6,107.95,375.57,159.75,194.24" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Figure 5 :</head><label>5</label><figDesc>Figure 5: OPMI modeling of date and time used in OMOP CDM.</figDesc><graphic coords="7,111.27,72.00,139.65,214.90" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_6"><head>Figure 6 :</head><label>6</label><figDesc>Figure 6: OPMI modeling of different records used as data provenance in OMOP CDM.Next, we will focus on the description of three use cases of the OMOP-2-OPMI approach.</figDesc><graphic coords="7,329.65,362.93,168.47,136.60" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_7"><head>Figure 7 :</head><label>7</label><figDesc>Figure 7: General OMOP-AE model based on OMOP-2-OPMI. The red boxes represent OMOP tables and their mapped ontology terms. The black boxes are added ontology representation to fill up the gaps for adverse event modeling. *, OMOP uses SMOMED-CT concepts for disease or symptom representation. These can be mapped to Human Phenotype Ontology (HP) terms.</figDesc><graphic coords="8,304.80,109.95,216.75,167.60" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 . CDM terms from 10 representative OMOP tables mapped to OPMI Selected OMOP tables Mapped OMOP terms Mapped Ontology Term Examples</head><label>1</label><figDesc></figDesc><table><row><cell>PERSON</cell><cell>13/19*</cell><cell>person ID (OPMI_0000470), gender (PATO_0001894), year of birth</cell></row><row><cell></cell><cell></cell><cell>(OPMI_0000473), race (NCIT_C17049)</cell></row><row><cell>PROVIDER</cell><cell>9/13</cell><cell>care provider (OPMI_0000163), National Provider Identifier (OPMI_0000503),</cell></row><row><cell></cell><cell></cell><cell>DEA identifier (OPMI_0000504)</cell></row><row><cell>SPECIMEN</cell><cell>6/15</cell><cell>specimen ID (OBI_0001616), date of specimen collection (OBIB_0000714),</cell></row><row><cell></cell><cell></cell><cell>anatomical structure (UBERON_0000061)</cell></row><row><cell>VISIT</cell><cell>26/17</cell><cell>visit occurrence (OPMI_0000482), visit start date (OPMI_0000487), preceding</cell></row><row><cell>OCCURRENCE</cell><cell></cell><cell>visit occurrence (OPMI_0000492)</cell></row><row><cell>PROCEDURE</cell><cell>13/16</cell><cell>procedure (NCIT_C25218), procedure start date (OPMI_0000508), procedure</cell></row><row><cell>OCCURRENCE</cell><cell></cell><cell>end date (OPMI_0000510)</cell></row><row><cell>DRUG</cell><cell>18/23</cell><cell>drug exposure (OPMI_0000572), drug product (DRON_00000005) drug</cell></row><row><cell>EXPOSURE</cell><cell></cell><cell>exposure start time (OPMI_0000565)</cell></row><row><cell>CONDITION</cell><cell>38/16</cell><cell>condition occurrence (OPMI_0000527), medical condition status</cell></row><row><cell>OCCURRENCE</cell><cell></cell><cell>(OPMI_0000533), admission diagnosis status (OPMI_0000542)</cell></row><row><cell>DEVICE</cell><cell>7/15</cell><cell>device exposure (OPMI_0000554), device (OBI_0000968), device exposure</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 . Ontology mapping of OMOP CDM terms by element types types OMOP terms OMOP mapped percent mapped</head><label>2</label><figDesc></figDesc><table><row><cell>_id</cell><cell>23</cell><cell>19</cell><cell>82.61%</cell></row><row><cell>_date</cell><cell>34</cell><cell>27</cell><cell>79.41%</cell></row><row><cell>_concept_id</cell><cell>41</cell><cell>29</cell><cell>70.73%</cell></row><row><cell>_concept_name</cell><cell>30</cell><cell>16</cell><cell>53.33%</cell></row><row><cell>_source_concept_id</cell><cell>17</cell><cell>1</cell><cell>5.88%</cell></row><row><cell>_source_value</cell><cell>34</cell><cell>1</cell><cell>2.94%</cell></row><row><cell>Total</cell><cell>179</cell><cell>93</cell><cell>51.96%</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Acknowledgements</head><p>We acknowledge the Kidney Precision Medicine Project (KPMP) project supported by NIH-NIDDK grant: 1U2CDK114886, and a COVID-19 research grant from the Michigan Medicine-Peking University Health Sciences Center Joint Institute for Clinical and Translational Research (U072807). We appreciate the discussion and comments from the ontology and OMOP societies including Dr. Asiyah Yu Lin and Dr. Andrew Williams.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Feasibility and utility of applications of the common data model to multiple, disparate observational health databases</title>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">A</forename><surname>Voss</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Makadia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Matcho</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Ma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Knoll</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Schuemie</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J Am Med Inform Assoc</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="page" from="553" to="564" />
			<date type="published" when="2015-05">May 2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">A Realism-Based View on Counts in OMOP&apos;s Common Data Model</title>
		<author>
			<persName><forename type="first">W</forename><surname>Ceusters</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Blaisure</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Stud Health Technol Inform</title>
		<imprint>
			<biblScope unit="volume">237</biblScope>
			<biblScope unit="page" from="55" to="62" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Ontological representation of OMOP CDM using the OBO framework</title>
		<author>
			<persName><forename type="first">Y</forename><surname>He</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Ong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Zheng</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">presented at the 2018 OHDSI Symposium</title>
				<meeting><address><addrLine>Bethesda North Marriott, Bethesda, MD</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">PCORnet: turning a dream into reality</title>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">S</forename><surname>Collins</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">L</forename><surname>Hudson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">P</forename><surname>Briggs</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">S</forename><surname>Lauer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J Am Med Inform Assoc</title>
		<imprint>
			<biblScope unit="volume">21</biblScope>
			<biblScope unit="page" from="576" to="577" />
			<date type="published" when="2014-08">Jul-Aug 2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Current applications and future directions for the CDISC Operational Data Model standard: A methodological review</title>
		<author>
			<persName><forename type="first">S</forename><surname>Hume</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Aerts</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Sarnikar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Huser</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J Biomed Inform</title>
		<imprint>
			<biblScope unit="volume">60</biblScope>
			<biblScope unit="page" from="352" to="362" />
			<date type="published" when="2016-04">Apr 2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<ptr target="https://ncats.nih.gov/files/OMOP_CDM_COVID.pdf" />
		<title level="m">COVID-19 Clinical Data Warehouse Data Dictionary Based on OMOP Common Data Model Specifications Version 5</title>
				<imprint>
			<biblScope unit="volume">3</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">OMOP2OBO: Semantic Integration of Standardized Clinical Terminologies to Power Translational Digital Medicine Across Health Systems</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">J</forename><surname>Callahan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">M</forename><surname>Wyrwa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">A</forename><surname>Vasilevsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">N</forename><surname>Robinson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">2020 OHDSI Symposium, Virtual meeting</title>
				<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">OMOP2OBO</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">J</forename><surname>Callahan</surname></persName>
		</author>
		<ptr target="https://github.com/callahantiff/OMOP2OBO" />
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">OPMI: the Ontology of Precision Medicine and Investigation and its support for clinical data and metadata representation and analysis</title>
		<author>
			<persName><forename type="first">Y</forename><surname>He</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Ong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Schaub</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Dowd</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">F</forename><surname>O'toole</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Siapos</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The 10th International Conference on Biomedical Ontology (ICBO-2019)</title>
				<meeting><address><addrLine>Buffalo, NY, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019-08-02">July 30 -August 2. 2019</date>
			<biblScope unit="page" from="1" to="10" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Modelling kidney disease using ontology: insights from the Kidney Precision Medicine Project</title>
		<author>
			<persName><forename type="first">E</forename><surname>Ong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">L</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Schaub</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">F</forename><surname>O'toole</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Steck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">Z</forename><surname>Rosenberg</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nat Rev Nephrol</title>
		<imprint>
			<biblScope unit="volume">16</biblScope>
			<biblScope unit="page" from="686" to="696" />
			<date type="published" when="2020-11">Nov 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration</title>
		<author>
			<persName><forename type="first">B</forename><surname>Smith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Ashburner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Rosse</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Bug</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Ceusters</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nat Biotechnol</title>
		<imprint>
			<biblScope unit="volume">25</biblScope>
			<biblScope unit="page" from="1251" to="1255" />
			<date type="published" when="2007-11">Nov 2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<ptr target="http://ohdsi.github.io/CommonDataModel/cdm54.html" />
		<title level="m">OMOP CDM</title>
				<imprint>
			<biblScope unit="volume">5</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">The eXtensible ontology development (XOD) principles and tool implementation to support ontology interoperability</title>
		<author>
			<persName><forename type="first">Y</forename><surname>He</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Xiang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Zheng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Overton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Ong</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J Biomed Semantics</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="issue">3</biblScope>
			<date type="published" when="2018-01-12">Jan 12 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Ontobee: A linked ontology data server to support ontology term dereferencing, linkage, query and integration</title>
		<author>
			<persName><forename type="first">E</forename><surname>Ong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Xiang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Zheng</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nucleic Acids Res</title>
		<imprint>
			<biblScope unit="volume">45</biblScope>
			<biblScope unit="page" from="D347" to="D352" />
			<date type="published" when="2017-01-04">Jan 04 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">OntoFox: webbased support for ontology reuse</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Xiang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Courtot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">R</forename><surname>Brinkman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ruttenberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>He</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">BMC Res Notes</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="page" from="1" to="12" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">The Protégé project: A look back and a look forward. AI Matters</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Musen</surname></persName>
		</author>
		<idno type="DOI">10.1145/2557001.25757003</idno>
	</analytic>
	<monogr>
		<title level="j">Association of Computing Machinery Specific Interest Group in Artificial Intelligence</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<title/>
		<author>
			<persName><forename type="first">Owl</forename><surname>Hermit</surname></persName>
		</author>
		<author>
			<persName><surname>Reasoner</surname></persName>
		</author>
		<ptr target="http://hermit-reasoner.com/" />
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<title level="m" type="main">Building Ontologies with Basic Formal Ontology</title>
		<author>
			<persName><forename type="first">R</forename><surname>Arp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Smith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">D</forename><surname>Spear</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2015">2015</date>
			<publisher>MIT Press</publisher>
			<pubPlace>Cambridge, MA, USA</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">CIDO, a communitybased ontology for coronavirus disease knowledge and data integration, sharing, and analysis</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Huffman</surname></persName>
		</author>
		<idno>ISO/IEC 21838-2:2021</idno>
		<ptr target="https://www.iso.org/standard/74572.html" />
	</analytic>
	<monogr>
		<title level="m">Information technology -Top-level ontologies (TLO) -Part 2: Basic Formal Ontology (BFO)</title>
				<imprint>
			<date type="published" when="2020-06-12">Jun 12 2020</date>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page">181</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">OAE: The Ontology of Adverse Events</title>
		<author>
			<persName><forename type="first">Y</forename><surname>He</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Sarntivijai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Xiang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Guo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Zhang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J Biomed Semantics</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page">29</biblScope>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<title level="m" type="main">Association Between Immune Dysfunction and COVID-19</title>
		<author>
			<persName><forename type="first">J</forename><surname>Sun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Zheng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Madhira</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">L</forename><surname>Olex</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">J</forename><surname>Anzalone</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Vinson</surname></persName>
		</author>
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Breakthrough Infection After SARS-CoV-2 Vaccination in the US</title>
		<ptr target="https://pcornet.org/wp-content/uploads/2022/01/PCORnet-Common-Data-Model-v60-2020_10_221.pdf" />
	</analytic>
	<monogr>
		<title level="j">JAMA Intern Med</title>
		<imprint>
			<biblScope unit="volume">182</biblScope>
			<biblScope unit="page" from="153" to="162" />
			<date type="published" when="2022-02-01">Feb 1 2022</date>
		</imprint>
	</monogr>
	<note>PCORnet Common Data Model (CDM) Specification, Version 6.0</note>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Developing an ETL tool for converting the PCORnet CDM into the OMOP CDM to facilitate the COVID-19 data integration</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Zong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Wen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">J</forename><surname>Stone</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Knaack</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J Biomed Inform</title>
		<imprint>
			<biblScope unit="volume">127</biblScope>
			<biblScope unit="page">104002</biblScope>
			<date type="published" when="2022-03">Mar 2022</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
