<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Efficient construction of a new ontology for life sciences by sub- classifying related terms in the Japan Science and Technology Agency thesaurus</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Tatsuya Kushida</string-name>
          <email>kushida@biosciencedbc.jp</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kouji Kozaki</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yuka Tateisi</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Katsutaro Watanabe</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Takeshi Masuda</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Katsuji Matsumura</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Takahiro Kawamura</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Toshihisa Takagi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dept. Biological Sciences, Grad. School of Science, The Univ. of Tokyo</institution>
          ,
          <addr-line>Tokyo</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Dept. of Information Planning, Japan Science and Technology Agency</institution>
          ,
          <addr-line>Tokyo</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>National Bioscience Database Center, Japan Science and Technology Agency</institution>
          ,
          <addr-line>Tokyo</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>The Institute of Scientific and Industrial Research</institution>
          ,
          <addr-line>Osaka Univ., Ibaraki</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>We are developing a new ontology for life sciences that can be used to interlink biological concepts from various categories with approximately 10,000 concepts and 31 types of relations. We create these relations by subclassifying the related terms (RT) that are used in the thesaurus of Japan Science and Technology Agency (JST) for associating concepts along with the broader and narrower terms. In this study, we describe an efficient ontological development method based on the JST thesaurus in terms of the majority decision of a panel of life-sciences experts. Three trained curators sub-classified 2850 RTs into 31 types of relations an improved version of Hozo ontology editor. We evaluated the results and confirmed high precision (0.93) and recall (0.83). Finally, a manager adjudicated the results by the curators and decided on 2850 relations. We conclude that the RT subclassification was efficiently conducted and the method is both effective and practical.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>1.1</p>
    </sec>
    <sec id="sec-2">
      <title>INTRODUCTION</title>
      <sec id="sec-2-1">
        <title>Japan Science and Technology Agency (JST) thesaurus</title>
        <p>JST thesaurus being developed by JST is one of the largest
scientific and technological thesauri. It contains 24.5
thousand concepts across a wide range of scientific and
technological fields including the Life Sciences, Mechanics,
Physics, Industrial Chemistry, Environmental Science, and
Metallography. The JST thesaurus includes approximately
10,000 life-sciences concepts and its associated dictionary
includes approximately 80,000 life-sciences concepts; some
of which have links to MeSH
(https://www.ncbi.nlm.nih.gov/mesh) terms. The concepts
are structured using broader terms, narrower terms, and
related terms (RT), and they are mainly used for the purpose
of indexing scientific literature (http://jglobal.jst.go.jp/en/).</p>
      </sec>
      <sec id="sec-2-2">
        <title>Background</title>
        <p>To elucidate the mechanisms of biological phenomena, it is
important to interpret what occurs at each level of molecules,
cells, tissues, organs, and individuals and to define the
relations among them. In this type of situation, specific
biological ontologies, thesauri, and databases such as Gene
Ontology (http://www.geneontology.org/) are used for
interpreting data from high-throughput experiments including
NextGeneration Sequencing and microarray. These ontologies
and databases have already proven to be essential
knowledge bases for assisting in understanding mechanisms
related to biological phenomena.</p>
        <p>Although relations among such biological phenomena
and gene products are being vigorously collected in Gene
Ontology, there are almost no other ontologies, thesauri, or
databases that have arranged relations among different
categories and levels of biological phenomena such as the
relationship between cellular and individual-level phenomena.
One of the strong points of the JST thesaurus is that it
widely collects information on the relations among biological
concepts in different categories of the life-sciences field. For
example, the JST thesaurus directly relates
thromboembolism categorized into a disease to platelet aggregation
categorized into a cellular phenomenon by using RT. In the
thesaurus, information about concepts associated with RT is
curated by experienced biological experts and can provide
more reliable results than information based on
cooccurrence among literature achieved by machine curation.
1.3</p>
      </sec>
      <sec id="sec-2-3">
        <title>Refined JST thesaurus (ontology)</title>
        <p>The original thesaurus of JST is mainly used for the purpose
of indexing scientific literatures and extending retrieval
terms, and it includes a wide range of terms, but the RTs
among them are not rigorous. We aim to sub-classify the
RTs and develop a new inter-linking ontology for biological
concepts to solve the problems. It might become possible to
describe more detailed and rigorous biological relations
such as gene products that positively regulate cellular
physiological phenomena to disease. We assign concepts of
existing standard ontologies in the life-sciences field such as
Semanticscience Integrated Ontology
(http://sio.semanticscience.org) to the sub-classified
relations to improve the versatility, reusability, and extensibility.
Moreover, we plan to open the thesaurus to the public to
perform an analysis of biological experimental data and
assist in elucidating the mechanism of biological
phenomena.</p>
        <p>
          Furthermore, we will provide an information retrieval
system in which the refined JST thesaurus (a new ontology)
is implemented. This will not only allow researchers to
investigate retrieval results from connections between
concepts but also discover new knowledge according to
inference or intelligent exploration. For example, by using the
sub-classified relations such as the has function and
precedes we can discover CLEC2 and thrombin as gene
products strongly related to thromboembolism and exclude gene
products distantly related to thromboembolism such as
PRKCH that is connected to it through two RTs
          <xref ref-type="bibr" rid="ref2">(Fig. 1 &amp;
Kushida et al., 2016)</xref>
          .
2
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>RELATED WORKS</title>
      <p>
        Examples of the ontological development from thesauri and
other language resources include YAGO. YAGO is
constructed by unifying the categories and the infoboxes that
are automatically extracted from Wikipedia with synsets of
WordNet in a rule-based and heuristic method
        <xref ref-type="bibr" rid="ref8">(Suchanek et
al., 2007)</xref>
        . In Life Sciences, the examples include the
conversion from thesaurus of agriculture and its related
concepts (AGROVOC) into the ontology. In this project, the
refining RT in more specific relation and the modeling using
OWL are conducted
        <xref ref-type="bibr" rid="ref7">(Soergel et al., 2004)</xref>
        .
      </p>
      <p>
        Conversely, it is argued that merely specifying the
relation of the thesaurus is insufficient for ontology construction
        <xref ref-type="bibr" rid="ref1">(Kless et al., 2016)</xref>
        . Thus it is necessary to carefully design
the structure of the relationship between concepts to convert
the thesaurus into more a rigorous and solid ontology. In
this study, by sub-classifying RT without defining the
rigorous structure of concepts and the axiom, we aim to engineer
a hybrid between thesaurus and ontology that has aspects of
both forms. It is our future work to solve the differences
between thesaurus and ontology and to explicitly define
each of them, as pointed out by
        <xref ref-type="bibr" rid="ref1">Kless et al. (2016)</xref>
        .
      </p>
      <p>
        The examples using the crowdsourcing include
ontological alignment
        <xref ref-type="bibr" rid="ref6">(Sarasua et al., 2012)</xref>
        and the ontology’s
development and maintenance
        <xref ref-type="bibr" rid="ref4">(Mortensen et al., 2013)</xref>
        .
Mortensen et al. investigated crowdsourcing’s performance
for validating the relations among concepts in SNOMED
CT (2015) and Gene Ontology (2016) and for validating the
effects of the combination of crowdsourcing with medical
experts’ curation. LEGO
(http://geneontology.org/page/connecting-annotations-legomodels) is an ongoing project where modeling semantic
relations among biological processes, molecular functions,
cellular components, and the related gene products is
performed using expert crowdsourcing. The objective and
approach are similar to that of the refining RT in JST
thesaurus such as arranging biological relations by experts.
3
      </p>
    </sec>
    <sec id="sec-4">
      <title>PAST APPROACH AND RESULTS</title>
      <p>
        In this section, we summarize our past study
        <xref ref-type="bibr" rid="ref2">(Kushida et al.,
2016)</xref>
        and evaluate the validity of the method in detail.
3.1
      </p>
      <sec id="sec-4-1">
        <title>Method of the sub-classification in 2016</title>
        <p>We sub-classified 2065 RTs that made up approximately
42% (2065/4815) of all RTs in the life-sciences category in
the JST thesaurus until March 2016. Four life-sciences
experts including three curators were in charge of the practical
implementation and sub-classification while one person (the
manager) was in charge of the management and control of
the sub-classification. The three curators had prior
experience in indexing the JST thesaurus for scientific literature
from 3 to &gt;10 years although they were not experienced in
handling ontologies. Conversely, the manager had
experience in developing life-sciences ontologies.</p>
        <p>The work was performed using the graphical ontology
editor Hozo (http://www.hozo.jp/). The three curators
subclassified each RT to ten types of relations, namely,
“subClassOf,” “has part,” “is part of,”
“has function,” “is function of,” “has quality,” “is quality
of,” and “antonym” along with RT, following the guideline
which had been created by the manager and had contained
the definition of ten types of relations and the information of
typical use examples.</p>
        <p>When the three curators attempted to sub-classify an RT
into a relation and it was agreed by three curators, we
named it “the relation agreed by three curators (III-2016).”
Likewise, when the sub-classified relation was agreed by
two curators, we named it “the relation agreed by two
curators (II-2016).”</p>
        <p>Next, the manager confirmed whether each of the
relations (III-2016 and II-2016) was correct or not. The
following cases were used: case 1: when it was judged to be
correct, the relation was determined as a result of the
subclassification; case 2: when it was judged to be incorrect, an
appropriate relation was decided by the manager in
consultation with the three curators; and case 3: when the relations
which three curators had proposed were split (we named this
situation as “Split-2016”), an appropriate relation was
decided by the manager in consultation with the three curators.
We defined these relations decided by the process of the
above three cases as “Correct relations.” The number of
relations of III-2016, II-2016, and Split-2016 was 1453
(70.4%), 580 (28.1%), and 32 (1.5%), respectively.
3.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Method of evaluation</title>
        <p>To quantitatively evaluate the validity of the RT
subclassifying method based on the majority decision, we
calculated the precision and recall of III-2016 and II-2016.</p>
        <p>Precision was calculated as the quotient of “the number
of correct relations in the relations that were agreed by three
or two curators (true positive)” divided by “the number of
the relations that were agreed by three or two curators (true
positive + false positive).”</p>
        <p>Recall was calculated as the quotient of “the number of
correct relations in the relations that were agreed by three or
two curators (true positive)” divided by “the number of
correct relations (true positive + false negative).” We only
calculated recall for each relation but did not calculate recall
for the sum of each relation. This is because in the case of
the sum of relations, the denominator value of the
calculation formula of recall will be equal to the denominator value
of the precision; thus, the recall and precision values will be
same. Therefore, we calculated the average of the recall for
each relation instead of calculating the recall of the sum of
each relation.</p>
        <p>“Concentration rate” was defined to be an index of the
degree of the answer tendency by curators. The
concentration rate was calculated as the quotient of “the number of
the relations that were agreed by three or two curators (true
positive + false positive)” divided by “the number of correct
relations (true positive + false negative).” These results
were interpreted by an ontologist and life-science experts.
3.3</p>
        <sec id="sec-4-2-1">
          <title>Error analysis of the III-2016 and II-2016</title>
          <p>The precision of the sum of the relations in III-2016 (0.79)
is higher than that in II-2016 (0.51) (Table 1). In III-2016
and II-2016, the precision of RT was somewhat low (0.78
and 0.33) while the recall was high (1 and 0.93 respectively).
Conversely, in III-2016 and II-2016, the precision of other
relations except for RT such as “has part” (1 and 0.83) and
“has function” (1 and 0.95) were high, and the recall of “has
part” (0.04 and 0.11) and “has function” (0.23 and 0.46)
were low. The concentration rate of RT in III-2016 and
II2016 were 1.28 and 2.63, respectively, and the values were
more than that of other relations. This meant that the
curator’s answers seemed to be biased toward the RT.</p>
          <p>Then, we examined the occurrence tendency of errors in
III-2016 and II-2016 and observed that the total number of
errors was 312 and 285, in which the number of errors
relating to RT was 308 and 256, and the rate of errors relating to
RT were 98.7% (308/312) and 89.8% (256/285),
respectively. These results suggest that the three curators were unable
to properly sub-classify RT into each relation such as “has
part” and “has function.” One reason for this might be that
the curators did not fully understand the definitions and the
usage of each relation. We considered that to solve this
problem, it was necessary to revise the guideline for the
subclassification of RT to enhance curator training and to
extend the graphical ontology editor Hozo as a curation tool.</p>
        </sec>
      </sec>
      <sec id="sec-4-3">
        <title>Adding new relations as candidates for the</title>
      </sec>
      <sec id="sec-4-4">
        <title>RT sub-classifying</title>
        <p>
          After finishing the sub-classification, in consultation with
curators we decided to add 21 new relations as candidates,
namely “synonym,” “is connected to,” “precedes,”
“succeeds,” “has role,” “is role of,” “has phenotype,” “is a
Phenotype Of,” “has output,” “output of,” “is similar to,” “has
creator,” “is creator of,” “has provider,” “is provider of,”
“transforms into,” “is transformed from,” “is located in,” “is
location of,” “regulate,” and “is regulated by.” We
conducted the re-sub-classification using these 31 relations
including the original ten relations
          <xref ref-type="bibr" rid="ref2">(Kushida et al., 2016)</xref>
          .
4
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>IMPROVEMENT OF RT SUB-CLASSIFYING</title>
      <p>Based on the results of the sub-classification conducted in
2016, we attempted to establish the process of efficiently
developing an ontology from the JST thesaurus.
4.1</p>
      <sec id="sec-5-1">
        <title>Revision of the guideline of RT subclassifying and executing curators training</title>
        <p>In addition to the definitions and usages of the 31 relations,
we described the “domain” data that refers to the scope of a
subject of each relation and the “range” that refers to the
scope of an object of each relation in the guideline for the
sub-classification of the RT (Table 2).</p>
        <p>Furthermore, to fully understand each relation that is
assigned in the sub-classification and to acquire basic
knowledge about ontologies, the three curators participated
in discussion about the creation of the guideline and have
undertaken training in the sub-classification process using
past data over a two-month period</p>
        <sec id="sec-5-1-1">
          <title>Extending the ontology editor tool Hozo for RT sub-classifying</title>
          <p>By accepting the proposals of curators, we improved the
ontology editor Hozo to be able to input the first and second
candidate relations. It was mandatory to input the first
candidate relation and voluntary to input the second candidate
relation.
By following the new revised guidelines and using the
extended Hozo tool, we sub-classified 2850 RT that was
approximately 58% (2850/4815) of all of RT in the
lifesciences category in JST thesaurus until March 2017. This
was conducted by the same three trained curators and one
manager as before.</p>
          <p>When we sub-classified RT and in the first candidates a
relation was agreed by three curators, we named the relation
“1st-III.” Likewise, we named the relation “1st-II:2nd-III,”
when in the first candidates, a relation was agreed by two
curators, and in the first and second candidates, a relation
was agreed by three curators. Moreover, when in the first
candidates, a relation was agreed by two curators, and in the
first and second candidates a relation was agreed by two
curators, we named the relation “1st-II:2nd-II.” In the case
that the first candidate’s relations proposed by the three
curators were split and in the first and second candidates a
relation was agreed by two curators, we named the relation
“1st-Split:2nd-II.” When in both of the first candidates and
the second candidate’s relations which the three curators
proposed were split, we named the relation
“1st-Split:2ndSplit” (Fig. 2).</p>
          <p>Next, the manager confirmed whether each of the agreed
relations (of 1st-III, 1st-II:2nd-III, 1st-II:2nd-II and
1st-Split:2ndII) were correct or not, and case 1: when it was judged to be
correct, the relation was determined as a result of the
subclassification, case 2: when it was judged to be incorrect, an
appropriate relation was decided by the manager in
consultation with three curators, and case 3: when the relations
which three curators had proposed were split, namely
1stSplit:2nd-Split, an appropriate relation was decided by the
manager in consultation with three curators likewise.
6
Both the precision scores and the recall scores of each
relation were high (see Table 3). In comparison with the results
of III-2016 which corresponded to 1st-III, both of the
precision and the recall of 1st-III was higher than those of
III2016, especially the recall of 1st-III was considerably higher
than that of III-2016, e.g., “has function” and “has part”
(Table 1 &amp; 3).</p>
          <p>Then, we investigated error occurrence. As a result, the
total number of errors was 28 in which the number of errors
relating to “subClassOf” was the most (18 errors, 64.3%
(18/28)) of all. Examples included a relationship between
“Carbon Cycle” and “biogeochemical cycle,” and the
correct relation was “is part of.” Conversely, the number of
errors relating to RT (6 errors, 21.4% (6/28)) that was the
most in III-2016 (308 errors, 98.7% (308/312)) greatly
decreased.
The precision and the recall of each relation in 1st-II:2nd-III
were as high as or slightly lower than that in 1st-III (Table 3).
The total number of errors was ten, the number of errors
relating to RT was the most (6 errors, 60.0% (6/10)) of all.
Examples included a relationship between “Ascaride” and
“parasite,” and the correct relation was “has role.”
The precision of each relation was more than 0.83 except for
“is similar to” (0.33 (2/6)), “has provider” (0 (0/1)), and “is
provider of” (0 (0/1)) (Table 3). The recall of each relation
was more than 0.89 except for “has creator” (0 (0/2)), “is
creator of” (0 (0/2)), “has function” (0.63 (20/32)), “is
function of” (0.64 (21/33)), “has role” (0.30 (10/57)), and “is
role of” (0.30 (10/57)).</p>
          <p>The total number of errors was 144 in which the number
of errors relating to “RT” was the most (88 errors, 61.1%
(88/144)) of all. Examples included the relationship between
“Blattaria” and “insanitary insect” and the correct relation
was “has role” and included the relationship between “body
cavity camera” and “celioscopy” and the correct relation
was “has function.”
The precision of each relation was more than 0.80 except for
“is location of” (0.33 (1/3)) and “is located in” (0.33 (1/3))
(Table 3). The recall of each relation was more than 0.75
except for “has creator” (0 (0/2)) and “is creator of” (0
(0/2)). The total number of errors was six in which the
number of errors relating to “is location of” was the most (4
errors, 66.7% (4/6)) of all. Examples included a relationship
between “oil seed” and “Plant oils” and the correct relation
was “is creator of.”
6.5</p>
        </sec>
      </sec>
      <sec id="sec-5-2">
        <title>Summary of the precision</title>
        <p>We named the sum of 1st-III and 1st-II:2nd-III “III-2017,”
and we obtained the precision of III-2017 (0.97) and the
average of recall (0.86) (Fig. 2). Moreover, we named the
sum of 1st-II:2nd-II and 1st-Split:2nd-II “II-2017,” and we
obtained the precision score of II-2017 (0.87) and the
average of recall (0.84). III-2017 meant the sum of the relations
agreed in first and second candidates by three curators, and
II-2017 meant the sum of relations agreed in first and
second candidates by two curators. We summarize this as
follows (Fig. 2 &amp; Table 1),
 The precision of III-2017 (0.97) was higher than that of</p>
        <p>II-2017 (0.87).
 The precision of III-2016 (0.79) was higher than that of</p>
        <p>II-2016 (0.51) (Section 3.3 &amp; Table 1).
 The precision of III-2017 (0.97) was higher than that of</p>
        <p>III-2016 (0.79).
 The precision of II-2017 (0.87) was higher than that of
II-2016 (0.51).</p>
        <p>As a result, we confirmed that the precision was
improved by using the results of relations agreed by three
curators such as III-2017 and III-2016, and the modified method
in 2017 such as III-2017 and II-2017.
6.6</p>
      </sec>
      <sec id="sec-5-3">
        <title>Summary of the recall</title>
        <p>We compared the average recall of III-2017 and II-2017
with that of III-2016 and II-2016 and summarized the
observations as follows (Fig. 2 &amp; Table 1),
 The recall of III-2017 (0.86) was as much as that of
II2017 (0.84).
 The recall of III-2016 (0.37) was as much as that of
II2016 (0.36) (Table 1).
 The recall of III-2017 (0.86) was higher than that of
III2016 (0.37).
 The recall of II-2017 (0.84) was higher than that of
II2016 (0.36).</p>
        <p>As a result, we confirmed that the recall was improved
by using the modified method in 2017 such as III-2017 and
II-2017. Conversely, we did not recognize that the recall
would be considerably improved by using the information
of the relations agreed by the three curators.
7</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>EVALUATION OF METHOD IN 2017</title>
      <p>To evaluate effects of the sub-classification using the
second candidate information, we compared the precision and
recall of the sub-classification using both of the first and
second relations information with that using the first
candidate information only. We named relations agreed by two or
three curators in both of the first and second candidates
“All-2017.” Namely, it included 1st-III, 1st-II:2nd-III,
1stII:2nd-II, and 1st-Sprit:2nd-II (Fig. 2). We also named
relations agreed by two or three curators in first candidate
“1st2017.” Namely, it included 1st-III and 1st-II:2nd-II.</p>
      <p>We calculated the precision and recall of All-2017 and
1st-2017. As a result, there was not much difference in the
precision and the recall between All-2017 (P = 0.93, R =
0.85) and that of 1st-2017 (P = 0.93, R = 0.85). We did not
recognize the advantage of using the second candidate
information in either the precision or the recall.</p>
      <p>Nevertheless, we confirmed that the number of relations
which the three curators disagreed on in the first candidate
(238 relations, we named it “1st-Split”) was reduced to the
number of 1st-Split:2nd-Split (186 relations) by the second
candidate’s information (Fig. 2). The process in which the
manager suggested appropriate relations for those on which
the three curators disagreed was laborious and
timeconsuming. Therefore, we considered that utilizing the
second candidate’s information to reduce the number of
relations disagreed on by the three curators might be effective in
terms of reducing this burden.</p>
      <p>Furthermore, with the three curators, we examined
whether inputting the second candidate relation in addition
to the first candidate was difficult or not. We discovered that
this was not much of a burden and allowing the curators to
input the second candidate relation could contribute to
reducing the time needed to narrow down the choice to just
one candidate and to relieve work stress.
8</p>
    </sec>
    <sec id="sec-7">
      <title>PUBLICATION</title>
      <p>Currently, we are preparing to make a new ontology
developed by using the improved method which is open to the
public. Until the preparation is finished, the ontology and
related results including the guidelines and the curation data
are available for evaluation purposes only. If this is required,
please contact the corresponding author. Furthermore, we
are also currently examining the usage of one of the
Creative Commons licenses for the publication; the public
SPARQL endpoint is now being prepared and is planned for
release. Moreover, we intend to submit the ontology to
BioPortal (http://bioportal.bioontology.org/).
9</p>
    </sec>
    <sec id="sec-8">
      <title>CONCLUSIONS</title>
      <p>We described a method of efficiently constructing a new
life-science ontology from an existing scientific and
technological thesaurus by a small panel of experts, which
comprised three curators and one manager. In the three cases
within the RT sub-classification process, the manager
confirmed whether each of the relations that had been agreed by
the curators was correct or not. The following cases were
used. In case 1, when it was judged to be correct, the
relation was determined as a result of the sub-classification. In
case 2, when it was judged to be incorrect, an appropriate
relation was decided by the manager in consultation with
three curators. Finally, in case 3, when the relations that the
three curators had proposed were split, an appropriate
relation was decided by the manager in consultation with three
curators. However, we realize that cases 2 and 3 are
laborious and time-consuming. We assume that it is important to
reduce the number of cases 2 and 3 as much as possible to
efficiently perform RT sub-classification. The rate of case 2
and 3 for 2016 and 2017 was 30.5% (629/2065) and 13.3%
(380/2850), respectively. We confirmed that the proportion
was reduced by less than half by performing the RT
subclassification following the revised guidelines in
combination with the Hozo ontology editor operated by trained
curators.</p>
      <p>From our interviews and discussions with the curators,
we realized that the domain and range information of each
relation described in the guidelines were of practical use for
appropriately selecting relations in the RT sub-classification.
Specifically, this information could potentially be useful for
curators lacking full experience in developing ontologies.</p>
      <p>Although we consider that life-science experts need to
sub-classify the relations among biological concepts, we
will attempt to evaluate the validity of the crowdsourcing in
the RT sub-classification process and the effect of cost
reduction using crowdsourcing in our future research.</p>
    </sec>
    <sec id="sec-9">
      <title>ACKNOWLEDGEMENTS</title>
      <p>This work was supported by an operating grant from the
Japan Science and Technology Agency and JSPS
KAKENHI Grant Number JP17H01789.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Kless</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jansen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Milton</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>A content-focused method for re-engineering thesauri into semantically adequate ontologies using OWL</article-title>
          .
          <source>Semantic Web</source>
          ,
          <volume>7</volume>
          (
          <issue>5</issue>
          ),
          <fpage>543</fpage>
          -
          <lpage>576</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Kushida</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Masuda</surname>
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tateisi</surname>
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Watanabe</surname>
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matsumura</surname>
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kawamura</surname>
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kozaki</surname>
            <given-names>K.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Takagi</surname>
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Refining JST thesaurus and discussing the effectiveness in life science research</article-title>
          .
          <source>Proc. of IESD2016.</source>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Mortensen</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Minty</surname>
            ,
            <given-names>E.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Januszyk</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sweeney</surname>
            ,
            <given-names>T.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rector</surname>
            ,
            <given-names>A.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Noy</surname>
            ,
            <given-names>N.F.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Musen</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Using the wisdom of the crowds to find critical errors in biomedical ontologies: a study of SNOMED CT</article-title>
          .
          <source>J Am Med Inform Assoc.</source>
          ,
          <volume>22</volume>
          ,
          <fpage>640</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Mortensen</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Musen</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Noy</surname>
            ,
            <given-names>N.F.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Crowdsourcing the verification of relationships in biomedical ontologies</article-title>
          .
          <source>AMIA Annu Symp Proc. 16</source>
          ,
          <fpage>1020</fpage>
          -
          <lpage>9</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Mortensen</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Telis</surname>
            ,
            <given-names>N</given-names>
          </string-name>
          , Hughey,
          <string-name>
            <given-names>J.J</given-names>
            ,
            <surname>Fan-Minogue</surname>
          </string-name>
          , H.,
          <string-name>
            <surname>Van Auken</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumontier</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Musen</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Is the crowd better as an assistant or a replacement in ontology engineering? An exploration through the lens of the Gene Ontology</article-title>
          .
          <source>J Biomed Inform</source>
          .
          <volume>60</volume>
          ,
          <fpage>199</fpage>
          -
          <lpage>209</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Sarasua</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Simperl</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Noy</surname>
            ,
            <given-names>N.F.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>CrowdMAP: Crowdsourcing Ontology Alignment with Microtasks</article-title>
          .
          <source>11th ISWC.</source>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Soergel</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lauser</surname>
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liang</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fisseha</surname>
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Keizer</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Katz</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2004</year>
          ).
          <article-title>Reengineering Thesauri for New Applications: the AGROVOC Example</article-title>
          .
          <article-title>New Applications of KOS</article-title>
          .,
          <volume>4</volume>
          (
          <issue>4</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>26</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Suchanek F.M.</given-names>
            ,
            <surname>Kasneci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            , and
            <surname>Weikum</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.</surname>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>YAGO-A Core of Semantic Knowledge Unifying WordNet and Wikipedia</article-title>
          .
          <source>Proc 16th International Conference on WWW. 697-706.</source>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Viera</surname>
            ,
            <given-names>A.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garrett</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          , (
          <year>2005</year>
          ).
          <article-title>Understanding interobserver agreement: the kappa statistic</article-title>
          .
          <source>Fam Med</source>
          ,
          <volume>37</volume>
          (
          <issue>5</issue>
          ),
          <fpage>360</fpage>
          -
          <lpage>363</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>