<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Med2RDF: Semantic Biomedical Knowledge-base and APIs for the Clinical Genome Medicine</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Mayumi</forename><surname>Kamada</surname></persName>
							<email>mkamada@kuhp.kyoto-u.ac.jp</email>
							<affiliation key="aff0">
								<orgName type="institution">Kyoto University</orgName>
								<address>
									<addrLine>54 Shogoin, Sakyo-ku</addrLine>
									<postCode>606-8397</postCode>
									<settlement>Kyoto</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Toshiaki</forename><surname>Katayama</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Database Center for Life Science</orgName>
								<address>
									<addrLine>178-4-4 Wakashiba, Kashiwa-shi</addrLine>
									<postCode>277-0871</postCode>
									<settlement>Chiba</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Shuichi</forename><surname>Kawashima</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Database Center for Life Science</orgName>
								<address>
									<addrLine>178-4-4 Wakashiba, Kashiwa-shi</addrLine>
									<postCode>277-0871</postCode>
									<settlement>Chiba</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ryosuke</forename><surname>Kojima</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Kyoto University</orgName>
								<address>
									<addrLine>54 Shogoin, Sakyo-ku</addrLine>
									<postCode>606-8397</postCode>
									<settlement>Kyoto</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Masahiko</forename><surname>Nakatsui</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Kyoto University</orgName>
								<address>
									<addrLine>54 Shogoin, Sakyo-ku</addrLine>
									<postCode>606-8397</postCode>
									<settlement>Kyoto</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Yasushi</forename><surname>Okuno</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Kyoto University</orgName>
								<address>
									<addrLine>54 Shogoin, Sakyo-ku</addrLine>
									<postCode>606-8397</postCode>
									<settlement>Kyoto</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Med2RDF: Semantic Biomedical Knowledge-base and APIs for the Clinical Genome Medicine</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">88A498C1BA1C9B6E77D464A91171F7C3</idno>
					<idno type="DOI">10.12688/f1000research.18238.1</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T05:33+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Med2RDF</term>
					<term>Database integration</term>
					<term>APIs</term>
					<term>clinical genome medicine</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>For clinical interpretation of genomic variants, it is necessary to aggregate knowledge from public databases and literatures. To construct an integrated knowledge-base for interpretation, we have developed RDF versions of major biomedical databases in the Med2RDF project. This resource uses the originally developed med2rdf ontology covering core concepts ranging from genomes, genes, transcripts, variations, diseases, to evidence, common to the supported databases. We currently provide converters for 19 public databases that are required to interpret disease relevance. We stored most of the resulting RDF data in our SPARQL endpoint and are currently developing APIs to utilize the RDF data for accelerating application development for genomic medicine.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Genomic medicine aims to provide an appropriate medical treatment policy based on individual genetic background. However, many of genomic variants identified by genome sequence analysis are unclear in relation to mechanism of disease and often do not lead to clinical determination. These variants are called variants of uncertain significance (VUS) and the interpretation of these variants is a bottleneck of genomic medicine. To clarify the disease relevance of VUS, in addition to specialized knowledge in each disease domain, comprehensive interpretation of enormous amounts of information in the literature and public databases is needed. Thus, in Med2RDF project, we have tackled to integrate knowledge required to the clinical interpretation utilizing Resource Description Framework (RDF).</p><p>To date, major life science databases have been developed and provided as RDF data thanks to the community efforts [1]. Our Med2RDF is an addition of biomedical databases to this collaboration. We provide converters for MedGen, HGNC, ClinVar, dbSNP, dbVar ExAC, gnomAD, dbNSFP, dbscSNV, HiNT, INstruct, ICGC, TCGA, CIViC, COSMIC, CCLE, GDSC, OpenTG-Gates and DGIdb at the Med2RDF GitHub repository<ref type="foot" target="#foot_0">1</ref> and have stored the resulting RDF datasets at our SPARQL endpoint<ref type="foot" target="#foot_1">2</ref> .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Med2RDF ontology and API development</head><p>Along with the development of RDF data, we have developed the med2rdf ontology covering core concepts ranging from genomes, genes, transcripts, variations, diseases, to evidence, common to the supported databases (Fig <ref type="figure" target="#fig_0">1)</ref>. This ontology enables us to integrate heterogeneous datasets by improving the interoperability, and one can utilize any combination of data in a standardized manner. Moreover, we are currently developing APIs that encapsulate the SPARQL query with the help of the SPARQList<ref type="foot" target="#foot_2">3</ref> with which users can develop applications for the clinical genome medicine with ease. This will also help researchers to apply machine learning methods to Med2RDF data for the clinical interpretation of VUS obtained from clinical sequencing.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig 1 .</head><label>1</label><figDesc>Fig 1. A schematic representation of the Med2RDF ontology commonly used in the Med2RDF datasets to improve interoperability.</figDesc><graphic coords="2,175.40,253.57,258.18,193.65" type="bitmap" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://github.com/med2rdf</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">http://sparql.med2rdf.org/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">https://github.com/dbcls/sparqlist</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">https://doi.org/10.12688/f1000research.18238.1</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgements</head><p>. This research is supported by the Program for an Integrated Database of Clinical and Genomic Information from Japan Agency for Medical Research and development, AMED. Reference. 1. Katayama T, Kawashima S, Micklem G et al.: BioHackathon series in 2013 and 2014, F1000Research 2019, 8:1677 4</p></div>
			</div>

			<div type="references">

				<listBibl/>
			</div>
		</back>
	</text>
</TEI>
