<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Reconstruction of Hierarchical Structures in a Thesaurus-based Ontology through Curation by Domain Experts</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Database Center for Life Science, Research Organization of Information and Systems</institution>
          ,
          <addr-line>Kashiwa, Chiba</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Faculty of Information and Communication Engineering, Osaka Electro-Communication University</institution>
          ,
          <addr-line>Neyagawa, Osaka</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>National Agriculture and Food Research Organization</institution>
          ,
          <addr-line>Tsukuba, Ibaraki</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>National Bioscience Database Center, Japan Science and Technology Agency</institution>
          ,
          <addr-line>Tokyo</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Tatsuya Kushida</institution>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>Toyama University of International Studies, Toyama, Toyama</institution>
          ,
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <abstract>
        <p>We reconstructed hierarchical structures of Interlinking Ontology for Biological Concepts by life-science experts converting the skos:broader and dcterms:subject relations to rdfs:subClassOf relations. As a result, we could efficiently create more than 125,000 rdfs:subClassOf in the ontology, and we expect that the relations would be leveraged for inferring biological molecules' functions by the inheritance approach.</p>
      </abstract>
      <kwd-group>
        <kwd>Expert curation</kwd>
        <kwd>Life science</kwd>
        <kwd>Ontology</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Interlinking Ontology for Biological Concepts (IOBC) is
developed from Japan Scientific and Technological Thesaurus (JST
thesaurus). It contains approximately 120,000 terms related to life
science and chemical science. So far we sub-classified the
skos:related relations within the IOBC to 35 relations such as
SIO:000225 (has function) [1]. Consequently, we could infer
biological and chemical functions to biological molecules using the
sub-classified and sub-class-of relations of IOBC [2]. However
the hierarchical structure has been described using skos:broader
and dcterms:subject without distinguishing some ontological
relationships such as sub-class-of, and is-part-of because it was
developed from the thesaurus. In this study, we attempted to
reconstruct the hierarchical relations for the purpose of more rigorously
describing the relationships among biological concepts and
inferring biological molecules’ functions at scale.</p>
    </sec>
    <sec id="sec-2">
      <title>Methods</title>
      <p>In order to improve the hierarchical structures of IOBC, four
lifesciences experts attempted the followings. (1) The experts
considered separating each “skos:broader” relation into each of
“rdfs:subClassOf”, and “SIO:000068(is part of)” because they
were represented by skos:broader in IOBC. In addition, (2) the
experts considered converting each “dcterms:subject” relation to
“rdfs:subClassOf,” because some upper concepts were
represented by dcterms:subject instead of rdfs:subClassOf in IOBC.
Furthermore, they examined whether there was more appropriate
term upper concept for each rdfs:subClassOf in IOBC. As the
result, the experts defined 35 newly categorized terms as upper
concepts. The newly defined terms include “biological
phenomenon, process, and state,” “substance”, etc.</p>
      <p>In the above reconstruction processes, each relation was decided
by 3 experts’ majority rule, and discussion, and the ontological
consistency was finally confirmed by an expert.</p>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>We separated 147,460 skos:broader into 110,582
rdfs:subClsssOf, and 135 SIO:000068 (is part of) (1-1 and 1-2 in Table.1).
Meanwhile, we converted 118,355 dcterms:subject to 15,092
rdfs:subClassOf (2 in Table.1) . This ontology is available in
BioPortal (http://purl.bioontology.org/ontology/IOBC).
Table.1 Reconstruction examples of the hierarchical structures
# Before reconstruction After reconstruction
1-1 erythrocyte aggregation (skos: erythrocyte aggregation (rdfs:subClassO
broader) hemagglutination f) hemagglutination
1-2 antigen recognition (skos:broa antigen recognition (SIO:000068 (is part
der) antigen-antibody reaction of)) antigen-antibody reaction
2 cell aggregation (dcterms:subje cell aggregation (rdfs:subClassOf) biolog
ct) Immunology ical phenomenon, process, and state</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>The expert curation greatly and exactly increasing the number of
rdfs:subClassOf, and SIO:000068 (is part of) relations in the
IOBC, we can more largely and accurately discover new
functions, and roles of biological concepts such as chemical
compounds and gene products using the is-a hierarchy and the part of
structure, and combinating the IOBC, with other ontologies, such
as ChEBI.</p>
    </sec>
    <sec id="sec-5">
      <title>Address for correspondence</title>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>