<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>LOD Surfer API: A Web API for LOD Sur ng Using Class-Class Relationships in Life Sciences</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Atsuko Yamaguchi</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kouji Kozaki</string-name>
          <email>kozaki@ei.sanken.osaka-u.ac.jp</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yasunori Yamamoto</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hiroshi Masuya</string-name>
          <email>hiroshi.masuya@riken.jp</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Norio Kobayashi</string-name>
          <email>norio.kobayashi@riken.jp</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Advanced Center for Computing and Communication (ACCC), RIKEN</institution>
          ,
          <addr-line>2-1 Hirosawa, Wako, Saitama, 351-0198</addr-line>
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>BioResource Center (BRC)</institution>
          ,
          <addr-line>RIKEN 3-1-1, Koyadai, Tsukuba, Ibaraki, 305-0074</addr-line>
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Database Center for Life Science (DBCLS), Research Organization of Information and Systems</institution>
          ,
          <addr-line>178-4-4 Wakashiba, Kashiwa, Chiba, 277-0871</addr-line>
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>The Institute of Scienti c and Industrial Research (ISIR), Osaka University</institution>
          ,
          <addr-line>8-1 Mihogaoka, Ibaraki, Osaka, 567-0047</addr-line>
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Linked Open Data (LOD) is being increasingly used when publishing life science databases. To facilitate the exible use of such databases, we employ a method that uses federated query search along a path of class-class relationships. To demonstrate our strategy, we implemented a prototype system accessible via a web API as a preliminary trial. We have been collecting SPARQL Builder Metadata (SBM) that describe a data schema for each SPARQL endpoint. Employing the SBM, we constructed a graph of the class-class relationships in LOD. Using this graph, our system can provide the information required to construct a federated search query, such as the SPARQL endpoints that include a class{class relationship, and paths of the class-class relationships between two classes.</p>
      </abstract>
      <kwd-group>
        <kwd>Linked Open Data</kwd>
        <kwd>class{class relationships</kwd>
        <kwd>database integration</kwd>
        <kwd>federated query search</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        LOD Surfer is a search system that discovers data along paths of class{class
relationships over life sciences Linked Open Data (LOD) provided by di erent
SPARQL endpoints. A user can interactively obtain desired data by inputting
data or class, and selecting an output class and a path of class{class
relationships between the classes. In a previous study [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], we developed a system called
SPARQL Builder (http://www.sparqlbuilder.org/) that allows users to build
a SPARQL query for a SPARQL endpoint without thorough understanding of
the RDF. For this system, we have been collecting metadata that describes a
data schema for each SPARQL endpoint. These metadataa are called SPARQL
Builder Metadata (SBM). By applying our SPARQL endpoint crawler program
to the SPARQL endpoints obtained from YummyData [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], we collected SBM for
76 datasets from 43 SPARQL endpoints (as of July 2017).
      </p>
      <p>In this study, we implemented a web API for LOD Surfer as a preliminary
trial, using technologies developed for SPARQL Builder such as SBM and class
graphs. This enables exible data acquisition from multiple SPARQL endpoints
for users who do not have detailed information for data schema over LOD.
2</p>
      <p>LOD Surfer API
The LOD Surfer API
is designed to
easily develop a search
system, such as LOD
Surfer. Using this
API, a list of SPARQL
endpoints, classes
reachable using links from
a given class, and Fig. 1. Overview of the architecture of the LOD Surfer API
paths between two
given classes can be obtained in JSON format along with a federated SPARQL
query and result obtained for a given path by a simple HTTP GET request.
Figure 1 shows the overview of the architecture of the LOD Surfer API. From the
SBM collected by the LOD crawler, a single graph with 14147 nodes that
corresponds to the classes appearing in LOD is constructed. Using this graph, the
output generator can e ciently compute the output of this API. For example, from
the connected components of the graph, a list of classes reachable from a given
class can be instantly obtained. A typical example of the usage for this API is
obtaining related data typed by some other classes (e.g., Disease, Protein, etc) from
various datasets for a set of IDs typed by a class (e.g. Gene). The code for this
web API can be accessed at https://github.com/LODSurfer/lodsurfer-api.
This system is an early prototype and will be improved to make it more practical
for users. In addition, the development for an application using this API will be
the focus of our future work.</p>
      <p>Acknowledgments This work was supported by JSPS KAKENHI grant
numbers 17K00434, 17K00424 and 17H01789 and by the National Bioscience
Database Center (NBDC) of the Japan Science and Technology Agency (JST).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Yamaguchi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kozaki</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lenz</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yamamoto</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Masuya</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kobayashi</surname>
          </string-name>
          , N.:
          <article-title>Semantic Data Acquisition by Traversing Class-Class Relationships Over Linked Open Data</article-title>
          .
          <source>The 6th Joint International Semantic Technology Conference (JIST</source>
          <year>2016</year>
          ), LNCS
          <volume>10055</volume>
          ,
          <issue>136</issue>
          {
          <fpage>151</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Yamamoto</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yamaguchi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Splendiani</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <string-name>
            <surname>Umaka-Yummy Data</surname>
          </string-name>
          :
          <article-title>A Place to Facilitate Communication between Data Providers and Consumers</article-title>
          .
          <source>9th International Conference Semantic Web Applications and Tools for Life Sciences, CEUR Workshop Proceedings 1795</source>
          ,
          <year>2016</year>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>